MSigDB v2.5 Release Notes

From GeneSetEnrichmentAnalysisWiki

Latest revision as of 02:37, 25 September 2016


This page describes the changes made to the gene set collections for Release 2.5 of the Molecular Signatures Database (MSigDB).


C1: Positional gene sets

No changes were made.
For a description of this collection, see the Browse Collections page (http://www.broad.mit.edu/gsea/msigdb/collections.jsp).
 

C2: Curated gene sets (+205)

Gene sets from two sources were added to the C2 collection:

  • CGP: chemical and genetic perturbations (+5 gene sets). Gene sets from Bild et al. (Nature 439, 353–357, 2006), based on microarray analysis of expression profiles of key oncogenes in a model system where expression of these oncogenes transformed otherwise quiescent cells.
  • CP: canonical pathways (+200 gene sets). Gene sets from the KEGG PATHWAY (http://www.genome.jp/kegg/) database of molecular interaction and reaction networks for metabolism, various cellular processes, and human diseases.

In addition, broken links to GenMAPP have been corrected in the External links section of the gene set page for every gene set derived from GenMAPP.
 

C3: Motif gene sets

No changes were made.
For a description of this collection, see the Browse Collections page (http://www.broad.mit.edu/gsea/msigdb/collections.jsp).
 

C4: Computational gene sets (+456)

C4 now contains two subcollections:

  • CGN: cancer gene neighborhoods (+0 gene sets). Gene sets defined by expression neighborhoods centered on 380 cancer-associated genes (Brentani, Caballero et al. 2003). This is the C4 collection from the previous MSigDB release.
  • CM: cancer modules (+456 gene sets). Gene sets identical to the modules described in Segal et al. (Nature Genetics 36, 1090–1098, 2004). Gene sets in this subcollection consist of transcriptionally coregulated genes that share a common function and were found to be significantly deregulated in tumors. Starting with a list of 2,849 gene sets from a variety of resources such as Gene Ontology, KEGG, and others, the authors extracted 456 statistically significant regulatory modules from a large collection of published microarray data spanning 22 tumor types. This is an entirely new subcollection.

 

C5: GO gene sets (+1454)

Gene sets in this new collection are derived from the controlled vocabulary of the Gene Ontology (GO) project (The Gene Ontology Consortium, "Gene Ontology: tool for the unification of biology," Nature Genet. 25: 25-29, 2000; http://www.geneontology.org/). Each gene set is named for a GO term and contains the genes annotated with that term.

This collection is divided into three subcollections:

  • CC: GO Cellular component (+233 gene sets). Gene sets derived from the Cellular Component Ontology.
  • MF: GO Molecular function (+396 gene sets). Gene sets derived from the Molecular Function Ontology.
  • BP: GO Biological process (+825 gene sets). Gene sets derived from the Biological Process Ontology.

 GSEA users: Gene set enrichment analysis identifies gene sets consisting of co-regulated genes; GO gene sets are based on ontologies and do not generally consist of co-regulated genes.

 

For more information

For complete descriptions of all collections, or to download the updated gene sets, go to the Browse Collections page (http://www.broad.mit.edu/gsea/msigdb/collections.jsp).
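
The totals in the collection headings above can be cross-checked against the per-subcollection counts. The following is a minimal sketch of that arithmetic in Python; the dictionary names (ADDED, HEADING_TOTALS) and the function collection_totals are illustrative only and are not part of MSigDB or GSEA. The counts are transcribed by hand from these notes.

```python
# Gene sets added in Release 2.5, transcribed from the notes above.
# Keys are collection and subcollection abbreviations; values are added counts.
ADDED = {
    "C1": {},                                  # no changes
    "C2": {"CGP": 5, "CP": 200},
    "C3": {},                                  # no changes
    "C4": {"CGN": 0, "CM": 456},
    "C5": {"CC": 233, "MF": 396, "BP": 825},
}

# Totals as stated in the collection headings of these notes.
HEADING_TOTALS = {"C1": 0, "C2": 205, "C3": 0, "C4": 456, "C5": 1454}


def collection_totals(added):
    """Sum the subcollection counts for each collection."""
    return {name: sum(subs.values()) for name, subs in added.items()}


totals = collection_totals(ADDED)

for name, total in totals.items():
    # Each computed total should match the figure in the heading.
    status = "ok" if total == HEADING_TOTALS[name] else "MISMATCH"
    print(f"{name}: +{total} ({status})")

print(f"All collections: +{sum(totals.values())}")
```

If a later release note changes any subcollection count, updating ADDED and HEADING_TOTALS and rerunning the script flags any heading that no longer matches its parts.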