Difference between revisions of "Msigdb v2 release notes"
(16 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
− | + | [http://www.broadinstitute.org/gsea/ GSEA Home] | | |
+ | [http://www.broadinstitute.org/gsea/downloads.jsp Downloads] | | ||
+ | [http://www.broadinstitute.org/gsea/msigdb/ Molecular Signatures Database] | | ||
+ | [http://www.broadinstitute.org/cancer/software/gsea/wiki/index.php/Main_Page Documentation] | | ||
+ | [http://www.broadinstitute.org/gsea/contact.jsp Contact] <br /> | ||
+ | <br /> | ||
+ | Details on how the gene set databases were generated is provided below.<br /> | ||
+ | <br /> | ||
+ | <h3><span style="font-weight: bold; color: rgb(255, 0, 0);">C1 (Positional gene sets)</span></h3> | ||
+ | Cytogenetic locations were parsed from hugo (October 2006) and Unigene(build 197). When there were conflicts, the Unigene entry was used.<br /> | ||
+ | <br /> | ||
+ | <h3><span style="font-weight: bold; color: rgb(255, 0, 0);">C2 (Curated gene sets)</span></h3> | ||
+ | C2 sets were curated from several sources including:<br /> | ||
+ | <p style="line-height: 150%;" class="MsoNormal"> </p> | ||
+ | <em style=""><u><span style="font-family: Arial;">Online pathway databases</span></u></em><em style=""><span style="font-family: Arial;">: </span></em>Several online resources provide catalogs of well studied metabolic and signaling pathways as well as functional categories of genes. We downloaded gene sets from 12 such databases into our system.<br /> | ||
+ | <br /> | ||
+ | <table cellspacing="0" cellpadding="0" border="1" style="border: medium none ; border-collapse: collapse; width: 872px; height: 602px;" class="MsoTableGrid"> | ||
+ | <tbody> | ||
+ | <tr style="height: 13.2pt;"> | ||
+ | <td width="271" valign="top" style="border: 1pt solid windowtext; padding: 0in 5.4pt; width: 203.4pt; height: 13.2pt; font-weight: bold;"> | ||
+ | <p>Name</p> | ||
+ | </td> | ||
+ | <td width="374" valign="top" style="border-style: solid solid solid none; border-color: windowtext windowtext windowtext -moz-use-text-color; border-width: 1pt 1pt 1pt medium; padding: 0in 5.4pt; width: 280.3pt; height: 13.2pt; font-weight: bold;"> | ||
+ | <p>URL/Reference</p> | ||
+ | </td> | ||
+ | </tr> | ||
+ | <tr style="height: 13.2pt;"> | ||
+ | <td width="271" valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 203.4pt; height: 13.2pt;">BioCarta<br /> | ||
+ | </td> | ||
+ | <td width="374" valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 280.3pt; height: 13.2pt;"> [http://www.biocarta.com http://www.biocarta.com]</td> | ||
+ | </tr> | ||
+ | <tr style="height: 13.2pt;"> | ||
+ | <td width="271" valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 203.4pt; height: 13.2pt;"> Signaling pathway database</td> | ||
+ | <td width="374" valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 280.3pt; height: 13.2pt;"> [http://www.grt.kyushu-u.ac.jp/spad/menu.html http://www.grt.kyushu-u.ac.jp/spad/menu.html]</td> | ||
+ | </tr> | ||
+ | <tr style="height: 14.25pt;"> | ||
+ | <td width="271" valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 203.4pt; height: 14.25pt;"> Signaling gateway</td> | ||
+ | <td width="374" valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 280.3pt; height: 14.25pt;">[http://www.signaling-gateway.org/ http://www.signaling-gateway.org/]<br /> | ||
+ | </td> | ||
+ | </tr> | ||
+ | <tr style="height: 27.45pt;"> | ||
+ | <td width="271" valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 203.4pt; height: 27.45pt;"> Signal transduction knowledge environment</td> | ||
+ | <td width="374" valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 280.3pt; height: 27.45pt;"> [http://stke.sciencemag.org/ http://stke.sciencemag.org/]</td> | ||
+ | </tr> | ||
+ | <tr style="height: 27.45pt;"> | ||
+ | <td width="271" valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 203.4pt; height: 27.45pt;"> Human protein reference database</td> | ||
+ | <td width="374" valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 280.3pt; height: 27.45pt;"> [http://www.hprd.org/ http://www.hprd.org/]</td> | ||
+ | </tr> | ||
+ | <tr style="height: 13.2pt;"> | ||
+ | <td width="271" valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 203.4pt; height: 13.2pt;"> GenMAPP</td> | ||
+ | <td width="374" valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 280.3pt; height: 13.2pt;">[http://www.genmapp.org/ http://www.genmapp.org/]<br /> | ||
+ | </td> | ||
+ | </tr> | ||
+ | <tr style="height: 14.25pt;"> | ||
+ | <td width="271" valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 203.4pt; height: 14.25pt;"> KEGG</td> | ||
+ | <td width="374" valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 280.3pt; height: 14.25pt;"> [http://www.genome.jp/kegg/ http://www.genome.jp/kegg/]</td> | ||
+ | </tr> | ||
+ | <tr style="height: 13.2pt;"> | ||
+ | <td width="271" valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 203.4pt; height: 13.2pt;"> Gene ontology</td> | ||
+ | <td width="374" valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 280.3pt; height: 13.2pt;">[http://www.geneontology.org http://www.geneontology.org]</td> | ||
+ | </tr> | ||
+ | <tr style="height: 41.7pt;"> | ||
+ | <td width="271" valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 203.4pt; height: 41.7pt;"> Sigma-Aldrich pathways</td> | ||
+ | <td width="374" valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 280.3pt; height: 41.7pt;"> [http://www.sigmaaldrich.com/Area_of_Interest/Biochemicals/Enzyme_Explorer/Key_Resources.html http://www.sigmaaldrich.com/Area_of_Interest/Biochemicals/Enzyme_Explorer/Key_Resources.html]</td> | ||
+ | </tr> | ||
+ | <tr style="height: 13.2pt;"> | ||
+ | <td width="271" valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 203.4pt; height: 13.2pt;"> Gene arrays, BioScience Corp</td> | ||
+ | <td width="374" valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 280.3pt; height: 13.2pt;">[http://www.superarray.com/ http://www.superarray.com/]</td> | ||
+ | </tr> | ||
+ | <tr style="height: 27.45pt;"> | ||
+ | <td width="271" valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 203.4pt; height: 27.45pt;"> Human cancer genome anatomy consortium</td> | ||
+ | <td width="374" valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 280.3pt; height: 27.45pt;"> [http://cgap.nci.nih.gov/<br /> | ||
+ | http://cgap.nci.nih.gov/]</td> | ||
+ | </tr> | ||
+ | <tr style="height: 13.2pt;"> | ||
+ | <td width="271" valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 203.4pt; height: 13.2pt;"> NetAffx</td> | ||
+ | <td width="374" valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 280.3pt; height: 13.2pt;"> [http://www.affymetrix.com/index.affx http://www.affymetrix.com/index.affx]</td> | ||
+ | </tr> | ||
+ | </tbody> | ||
+ | </table> | ||
+ | <br /> | ||
+ | <p style="line-height: 150%;" class="MsoNormal"> </p> | ||
+ | <em style=""><u><span style="font-family: Arial;"><br /> | ||
+ | Biomedical literature</span></u></em><em style=""><span style="font-family: Arial;">: </span></em><span style="font-family: Arial;">Over the past few years, microarray studies have identified signatures of several important biological and clinical states (e.g. cancer metastasis, stem cell characteristics, drug resistance). These gene sets are valuable biological results. Unfortunately, because gene sets are typically published as tables in a paper, the </span><span style="line-height: 150%; font-family: Arial;">important biological findings they represent are not easily accessible to computational tools. Our first goal was to convert published gene sets into an electronic form. Towards this we compiled a list of microarray articles with published gene expression signatures. From each article, we extracted one or more gene set from tables in the main text or supplementary information. Notably, our focus was on capturing the identity (e.g. gene symbol, GenBank accession) of all members in a gene set rather than on relationships between individual genes. </span><span style="font-family: Arial;">Currently the process of curating a gene set from the literature is largely manual. In this report we include a collection of 1181 gene sets curated in this manner from 343 distinct PubMed accessions.</span> <br /> | ||
+ | <br /> | ||
+ | <h3><span style="font-weight: bold; color: rgb(255, 0, 0);">C3 (sequence motif gene sets)</span></h3> | ||
+ | <p style="line-height: 150%;" class="MsoNormal"><span style="font-family: Arial; color: black;">We compiled gene sets on the basis of<span style=""> </span>shared regulatory motifs from a recently published comparative analysis of the Human, Mouse, Rat and Dog genomes </span><!--[if supportFields]><span | ||
+ | style='font-family:Arial;color:black'><span style='mso-element:field-begin'></span><span | ||
+ | style='mso-spacerun:yes'> </span>ADDIN EN.CITE | ||
+ | <EndNote><Cite><Author>Xie</Author><Year>2005</Year><RecNum>342</RecNum><record><database | ||
+ | name='genesets_db.enl' path='C:\Local\xdev\active\msigdb\DOC_PUSH\genesets_db.enl'>genesets_db.enl</database><source-app | ||
+ | name='EndNote' version='8.0'>EndNote</source-app><rec-number>342</rec-number><ref-type | ||
+ | name='Journal | ||
+ | Article'>17</ref-type><contributors><authors><author><style | ||
+ | face='normal' font='default' size='100%'>Xie, X.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Lu, | ||
+ | J.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Kulbokas, E. | ||
+ | J.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Golub, T. R.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Mootha, | ||
+ | V.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Lindblad-Toh, | ||
+ | K.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Lander, E. S.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Kellis, | ||
+ | M.</style></author></authors></contributors><auth-address><style | ||
+ | face='normal' font='default' size='100%'>Broad Institute of MIT and Harvard, | ||
+ | Cambridge, Massachusetts 02141, USA.</style></auth-address><titles><title><style | ||
+ | face='normal' font='default' size='100%'>Systematic discovery of regulatory | ||
+ | motifs in human promoters and 3&apos; UTRs by comparison of several | ||
+ | mammals</style></title><secondary-title><style | ||
+ | face='normal' font='default' size='100%'>Nature</style></secondary-title></titles><periodical><full-title><style | ||
+ | face='normal' font='default' | ||
+ | size='100%'>Nature</style></full-title></periodical><pages><style | ||
+ | face='normal' font='default' | ||
+ | size='100%'>338-45</style></pages><volume><style | ||
+ | face='normal' font='default' size='100%'>434</style></volume><number><style | ||
+ | face='normal' font='default' | ||
+ | size='100%'>7031</style></number><keywords><keyword><style | ||
+ | face='normal' font='default' size='100%'>3&apos; Untranslated | ||
+ | Regions/*genetics</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Animals</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Base | ||
+ | Sequence</style></keyword><keyword><style face='normal' | ||
+ | font='default' size='100%'>Comparative Study</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Conserved | ||
+ | Sequence/genetics</style></keyword><keyword><style | ||
+ | face='normal' font='default' | ||
+ | size='100%'>Dogs</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Gene Expression | ||
+ | Profiling</style></keyword><keyword><style face='normal' | ||
+ | font='default' | ||
+ | size='100%'>Humans</style></keyword><keyword><style | ||
+ | face='normal' font='default' | ||
+ | size='100%'>Mammals/*genetics</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Mice</style></keyword><keyword><style | ||
+ | face='normal' font='default' | ||
+ | size='100%'>MicroRNAs/genetics/metabolism</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Molecular Sequence | ||
+ | Data</style></keyword><keyword><style face='normal' | ||
+ | font='default' size='100%'>Organ Specificity</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Promoter Regions | ||
+ | (Genetics)/*genetics</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Rats</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Research Support, U.S. | ||
+ | Gov&apos;t, P.H.S.</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Sequence | ||
+ | Alignment</style></keyword></keywords><dates><year><style | ||
+ | face='normal' font='default' | ||
+ | size='100%'>2005</style></year><pub-dates><date><style | ||
+ | face='normal' font='default' size='100%'>Mar | ||
+ | 17</style></date></pub-dates></dates><accession-num><style | ||
+ | face='normal' font='default' | ||
+ | size='100%'>15735639</style></accession-num><urls><related-urls><url><style | ||
+ | face='normal' font='default' size='100%'>http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=15735639<span | ||
+ | style='mso-spacerun:yes'> | ||
+ | </span></style></url></related-urls></urls></record></Cite></EndNote><span | ||
+ | style='mso-element:field-separator'></span></span><![endif]--><span style="font-family: Arial; color: black;">(Xie, Lu et al. 2005)</span><!--[if supportFields]><span | ||
+ | style='font-family:Arial;color:black'><span style='mso-element:field-end'></span></span><![endif]--><span style="font-family: Arial; color: black;">. This database consists of 837 motifs sets including 222 microRNA target gene sets.<br /> | ||
+ | </span></p> | ||
+ | <h3><span style="font-family: Arial; color: black;"><span style="font-weight: bold; color: rgb(255, 0, 0);">C4 (computed gene sets)</span></span></h3> | ||
+ | <p><span style="font-family: Arial;">We mined 4 expression compendia datasets for correlated gene sets by searching for neighbors (i.e. genes with similar expression profiles across a compendium) of <span style=""> </span>380 cancer associated genes </span><!--[if supportFields]><span | ||
+ | style='font-family:Arial'><span style='mso-element:field-begin'></span><span | ||
+ | style='mso-spacerun:yes'> </span>ADDIN EN.CITE | ||
+ | <EndNote><Cite><Author>Brentani</Author><Year>2003</Year><RecNum>184</RecNum><record><database | ||
+ | name='genesets_db.enl' path='C:\Local\xdev\active\msigdb\DOC_PUSH\genesets_db.enl'>genesets_db.enl</database><source-app | ||
+ | name='EndNote' | ||
+ | version='8.0'>EndNote</source-app><rec-number>184</rec-number><ref-type | ||
+ | name='Journal Article'>17</ref-type><contributors><authors><author><style | ||
+ | face='normal' font='default' size='100%'>Brentani, | ||
+ | H.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Caballero, O. | ||
+ | L.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Camargo, A. A.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>da Silva, A. | ||
+ | M.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>da Silva, W. A., | ||
+ | Jr.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Dias Neto, | ||
+ | E.</style></author><author><style face='normal' font='default' | ||
+ | size='100%'>Grivet, M.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Gruber, | ||
+ | A.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Guimaraes, P. E.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Hide, | ||
+ | W.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Iseli, | ||
+ | C.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Jongeneel, C. V.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Kelso, | ||
+ | J.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Nagai, M. | ||
+ | A.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Ojopi, E. P.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Osorio, E. | ||
+ | C.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Reis, E. | ||
+ | M.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Riggins, G. J.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Simpson, A. | ||
+ | J.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>de Souza, | ||
+ | S.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Stevenson, B. J.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Strausberg, R. | ||
+ | L.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Tajara, E. | ||
+ | H.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Verjovski-Almeida, | ||
+ | S.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Acencio, M. | ||
+ | L.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Bengtson, M. | ||
+ | H.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Bettoni, | ||
+ | F.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Bodmer, W. | ||
+ | F.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Briones, M. | ||
+ | R.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Camargo, L. | ||
+ | P.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Cavenee, | ||
+ | W.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Cerutti, J. | ||
+ | M.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Coelho Andrade, L. | ||
+ | E.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Costa dos Santos, P. | ||
+ | C.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Ramos Costa, M. | ||
+ | C.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>da Silva, I. T.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Estecio, M. | ||
+ | R.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Sa Ferreira, | ||
+ | K.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Furnari, F. B.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Faria, M., | ||
+ | Jr.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Galante, P. A.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Guimaraes, G. | ||
+ | S.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Holanda, A. | ||
+ | J.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Kimura, E. T.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Leerkes, M. | ||
+ | R.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Lu, | ||
+ | X.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Maciel, R. M.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Martins, E. | ||
+ | A.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Massirer, K. | ||
+ | B.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Melo, A. S.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Mestriner, C. | ||
+ | A.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Miracca, E. | ||
+ | C.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Miranda, L. L.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Nobrega, F. | ||
+ | G.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Oliveira, P. | ||
+ | S.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Paquola, A. C.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Pandolfi, J. | ||
+ | R.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Campos Pardini, M. | ||
+ | I.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Passetti, F.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Quackenbush, | ||
+ | J.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Schnabel, | ||
+ | B.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Sogayar, M. | ||
+ | C.</style></author><author><style face='normal' font='default' | ||
+ | size='100%'>Souza, J. E.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Valentini, S. | ||
+ | R.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Zaiats, A. | ||
+ | C.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Amaral, E. | ||
+ | J.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Arnaldi, L. | ||
+ | A.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>de Araujo, A. | ||
+ | G.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>de Bessa, S. | ||
+ | A.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Bicknell, D. | ||
+ | C.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Ribeiro de Camaro, M. | ||
+ | E.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Carraro, D. | ||
+ | M.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Carrer, | ||
+ | H.</style></author><author><style face='normal' font='default' | ||
+ | size='100%'>Carvalho, A. F.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Colin, | ||
+ | C.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Costa, | ||
+ | F.</style></author><author><style face='normal' font='default' | ||
+ | size='100%'>Curcio, C.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Guerreiro da Silva, I. | ||
+ | D.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Pereira da Silva, | ||
+ | N.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Dellamano, M.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>El-Dorry, | ||
+ | H.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Espreafico, E. | ||
+ | M.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Scattone Ferreira, A. | ||
+ | J.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Ayres Ferreira, | ||
+ | C.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Fortes, M. A.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Gama, A. | ||
+ | H.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Giannella-Neto, | ||
+ | D.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Giannella, M. L.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Giorgi, R. | ||
+ | R.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Goldman, G. | ||
+ | H.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Goldman, M. H.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Hackel, | ||
+ | C.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Ho, P. | ||
+ | L.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Kimura, E. M.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Kowalski, L. | ||
+ | P.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Krieger, J. | ||
+ | E.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Leite, L. C.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Lopes, | ||
+ | A.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Luna, A. | ||
+ | M.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Mackay, A.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Mari, S. | ||
+ | K.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Marques, A. | ||
+ | A.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Martins, W. K.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Montagnini, | ||
+ | A.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Mourao Neto, | ||
+ | M.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Nascimento, A. L.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Neville, A. | ||
+ | M.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Nobrega, M. | ||
+ | P.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>O&apos;Hare, M. | ||
+ | J.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Otsuka, A. | ||
+ | Y.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Ruas de Melo, A. | ||
+ | I.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Paco-Larson, M. | ||
+ | L.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Guimaraes Pereira, | ||
+ | G.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Pesquero, J. | ||
+ | B.</style></author><author><style face='normal' font='default' | ||
+ | size='100%'>Pessoa, J. G.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Rahal, | ||
+ | P.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Rainho, C. | ||
+ | A.</style></author><author><style face='normal' font='default' | ||
+ | size='100%'>Rodrigues, V.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Rogatto, S. | ||
+ | R.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Romano, C. | ||
+ | M.</style></author><author><style face='normal' font='default' | ||
+ | size='100%'>Romeiro, J. G.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Rossi, B. | ||
+ | M.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Rusticci, | ||
+ | M.</style></author><author><style face='normal' font='default' | ||
+ | size='100%'>Guerra de Sa, R.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Sant&apos; Anna, S. | ||
+ | C.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Sarmazo, M. | ||
+ | L.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Silva, T. C.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Soares, F. | ||
+ | A.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Sonati Mde, | ||
+ | F.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>de Freitas Sousa, | ||
+ | J.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Queiroz, | ||
+ | D.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Valente, V.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Vettore, A. | ||
+ | L.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Villanova, F. | ||
+ | E.</style></author><author><style face='normal' | ||
+ | font='default' size='100%'>Zago, M. A.</style></author><author><style | ||
+ | face='normal' font='default' size='100%'>Zalcberg, | ||
+ | H.</style></author></authors></contributors><auth-address><style | ||
+ | face='normal' font='default' size='100%'>Laboratorio de Genetica Molecular | ||
+ | do Cancer, Departmento de Radiologia, Universidade de Sao Paulo, Travessa da | ||
+ | Rua Dr. Ovideo Pires de Campos S/N, 4deg, | ||
+ | Brazil.</style></auth-address><titles><title><style | ||
+ | face='normal' font='default' size='100%'>The generation and utilization of a | ||
+ | cancer-oriented representation of the human transcriptome by using expressed | ||
+ | sequence tags</style></title><secondary-title><style | ||
+ | face='normal' font='default' size='100%'>Proc Natl Acad Sci U S | ||
+ | A</style></secondary-title></titles><periodical><full-title><style | ||
+ | face='normal' font='default' size='100%'>Proc Natl Acad Sci U S A</style></full-title></periodical><pages><style | ||
+ | face='normal' font='default' | ||
+ | size='100%'>13418-23</style></pages><volume><style | ||
+ | face='normal' font='default' | ||
+ | size='100%'>100</style></volume><number><style | ||
+ | face='normal' font='default' size='100%'>23</style></number><keywords><keyword><style | ||
+ | face='normal' font='default' size='100%'>Chromosome | ||
+ | Mapping</style></keyword><keyword><style face='normal' | ||
+ | font='default' size='100%'>Databases, | ||
+ | Genetic</style></keyword><keyword><style face='normal' | ||
+ | font='default' size='100%'>*Expressed Sequence | ||
+ | Tags</style></keyword><keyword><style face='normal' | ||
+ | font='default' size='100%'>*Gene Expression Regulation, | ||
+ | Neoplastic</style></keyword><keyword><style face='normal' | ||
+ | font='default' | ||
+ | size='100%'>Humans</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Neoplasms/*genetics/metabolism</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Polymorphism, Single | ||
+ | Nucleotide</style></keyword><keyword><style face='normal' | ||
+ | font='default' size='100%'>*Proteome</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>RNA, | ||
+ | Messenger/*metabolism</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Research Support, Non-U.S. | ||
+ | Gov&apos;t</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Tissue Distribution</style></keyword><keyword><style | ||
+ | face='normal' font='default' size='100%'>Variation | ||
+ | (Genetics)</style></keyword></keywords><dates><year><style | ||
+ | face='normal' font='default' size='100%'>2003</style></year><pub-dates><date><style | ||
+ | face='normal' font='default' size='100%'>Nov | ||
+ | 11</style></date></pub-dates></dates><accession-num><style | ||
+ | face='normal' font='default' | ||
+ | size='100%'>14593198</style></accession-num><urls><related-urls><url><style | ||
+ | face='normal' font='default' size='100%'>http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=14593198 | ||
+ | </style></url></related-urls></urls></record></Cite></EndNote><span | ||
+ | style='mso-element:field-separator'></span></span><![endif]--><span style="font-family: Arial;">(Brentani, Caballero et al. 2003)</span><!--[if supportFields]><span | ||
+ | style='font-family:Arial'><span style='mso-element:field-end'></span></span><![endif]--><span style="font-family: Arial;">. Neighborhoods with <25 genes at a Pearson correlation threshold of 0.8 were omitted yielding 427 sets. This category of the database is identical to that previously reported in <span style=""> </span></span><!--[if supportFields]><span | ||
+ | style='font-family:Arial'><span style='mso-element:field-begin'></span><span | ||
+ | style='mso-spacerun:yes'> </span>ADDIN EN.CITE | ||
+ | <EndNote><Cite><Author>Subramanian</Author><Year>2005</Year><RecNum>369</RecNum><record><database | ||
+ | name="genesets_db.enl" path="C:\Local\xdev\active\msigdb\DOC_PUSH\genesets_db.enl">genesets_db.enl</database><source-app | ||
+ | name="EndNote" | ||
+ | version="8.0">EndNote</source-app><rec-number>369</rec-number><ref-type | ||
+ | name="Journal | ||
+ | Article">17</ref-type><contributors><authors><author><style | ||
+ | face="normal" font="default" | ||
+ | size="100%">Subramanian, | ||
+ | A.</style></author><author><style face="normal" | ||
+ | font="default" size="100%">Tamayo, | ||
+ | P.</style></author><author><style face="normal" | ||
+ | font="default" size="100%">Mootha, V. | ||
+ | K.</style></author><author><style face="normal" | ||
+ | font="default" size="100%">Mukherjee, | ||
+ | S.</style></author><author><style face="normal" | ||
+ | font="default" size="100%">Ebert, B. | ||
+ | L.</style></author><author><style face="normal" | ||
+ | font="default" size="100%">Gillette, M. | ||
+ | A.</style></author><author><style face="normal" | ||
+ | font="default" size="100%">Paulovich, | ||
+ | A.</style></author><author><style face="normal" | ||
+ | font="default" size="100%">Pomeroy, S. | ||
+ | L.</style></author><author><style face="normal" | ||
+ | font="default" size="100%">Golub, T. | ||
+ | R.</style></author><author><style face="normal" | ||
+ | font="default" size="100%">Lander, E. | ||
+ | S.</style></author><author><style face="normal" | ||
+ | font="default" size="100%">Mesirov, J. | ||
+ | P.</style></author></authors></contributors><auth-address><style | ||
+ | face="normal" font="default" size="100%">Broad | ||
+ | Institute of Massachusetts Institute of Technology and Harvard, 320 Charles | ||
+ | Street, Cambridge, MA | ||
+ | 02141.</style></auth-address><titles><title><style | ||
+ | face="normal" font="default" size="100%">From | ||
+ | the Cover: Gene set enrichment analysis: A knowledge-based approach for | ||
+ | interpreting genome-wide expression | ||
+ | profiles</style></title><secondary-title><style | ||
+ | face="normal" font="default" size="100%">Proc | ||
+ | Natl Acad Sci U S A</style></secondary-title></titles><periodical><full-title><style | ||
+ | face="normal" font="default" size="100%">Proc | ||
+ | Natl Acad Sci U S | ||
+ | A</style></full-title></periodical><pages><style | ||
+ | face="normal" font="default" | ||
+ | size="100%">15545-50</style></pages><volume><style | ||
+ | face="normal" font="default" | ||
+ | size="100%">102</style></volume><number><style | ||
+ | face="normal" font="default" | ||
+ | size="100%">43</style></number><dates><year><style | ||
+ | face="normal" font="default" | ||
+ | size="100%">2005</style></year><pub-dates><date><style | ||
+ | face="normal" font="default" size="100%">Oct | ||
+ | 25</style></date></pub-dates></dates><accession-num><style | ||
+ | face="normal" font="default" | ||
+ | size="100%">16199517</style></accession-num><urls><related-urls><url><style | ||
+ | face="normal" font="default" | ||
+ | size="100%">http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=16199517<span | ||
+ | style='mso-spacerun:yes'> </span></style></url></related-urls></urls></record></Cite></EndNote><span | ||
+ | style='mso-element:field-separator'></span></span><![endif]--><span style="font-family: Arial;">(Subramanian, Tamayo et al. 2005)</span><!--[if supportFields]><span | ||
+ | style='font-family:Arial'><span style='mso-element:field-end'></span></span><![endif]--><span style="font-family: Arial;">.</span></p> |
Latest revision as of 02:08, 25 September 2016
GSEA Home |
Downloads |
Molecular Signatures Database |
Documentation |
Contact
Details on how the gene set databases were generated is provided below.
Contents
C1 (Positional gene sets)
Cytogenetic locations were parsed from hugo (October 2006) and Unigene(build 197). When there were conflicts, the Unigene entry was used.
C2 (Curated gene sets)
C2 sets were curated from several sources including:
Online pathway databases: Several online resources provide catalogs of well studied metabolic and signaling pathways as well as functional categories of genes. We downloaded gene sets from 12 such databases into our system.
Name |
URL/Reference |
BioCarta |
http://www.biocarta.com |
Signaling pathway database | http://www.grt.kyushu-u.ac.jp/spad/menu.html |
Signaling gateway | http://www.signaling-gateway.org/ |
Signal transduction knowledge environment | http://stke.sciencemag.org/ |
Human protein reference database | http://www.hprd.org/ |
GenMAPP | http://www.genmapp.org/ |
KEGG | http://www.genome.jp/kegg/ |
Gene ontology | http://www.geneontology.org |
Sigma-Aldrich pathways | http://www.sigmaaldrich.com/Area_of_Interest/Biochemicals/Enzyme_Explorer/Key_Resources.html |
Gene arrays, BioScience Corp | http://www.superarray.com/ |
Human cancer genome anatomy consortium | [http://cgap.nci.nih.gov/ http://cgap.nci.nih.gov/] |
NetAffx | http://www.affymetrix.com/index.affx |
Biomedical literature: Over the past few years, microarray studies have identified signatures of several important biological and clinical states (e.g. cancer metastasis, stem cell characteristics, drug resistance). These gene sets are valuable biological results. Unfortunately, because gene sets are typically published as tables in a paper, the important biological findings they represent are not easily accessible to computational tools. Our first goal was to convert published gene sets into an electronic form. Towards this we compiled a list of microarray articles with published gene expression signatures. From each article, we extracted one or more gene set from tables in the main text or supplementary information. Notably, our focus was on capturing the identity (e.g. gene symbol, GenBank accession) of all members in a gene set rather than on relationships between individual genes. Currently the process of curating a gene set from the literature is largely manual. In this report we include a collection of 1181 gene sets curated in this manner from 343 distinct PubMed accessions.
C3 (sequence motif gene sets)
We compiled gene sets on the basis of shared regulatory motifs from a recently published comparative analysis of the Human, Mouse, Rat and Dog genomes (Xie, Lu et al. 2005). This database consists of 837 motifs sets including 222 microRNA target gene sets.
C4 (computed gene sets)
We mined 4 expression compendia datasets for correlated gene sets by searching for neighbors (i.e. genes with similar expression profiles across a compendium) of 380 cancer associated genes (Brentani, Caballero et al. 2003). Neighborhoods with <25 genes at a Pearson correlation threshold of 0.8 were omitted yielding 427 sets. This category of the database is identical to that previously reported in (Subramanian, Tamayo et al. 2005).