Difference between revisions of "MSigDB v2022.1.Mm Release Notes"

From GeneSetEnrichmentAnalysisWiki
Jump to navigation Jump to search
Line 20: Line 20:
  
 
<h2>MH: mouse-ortholog hallmark gene sets</h2>
 
<h2>MH: mouse-ortholog hallmark gene sets</h2>
 +
The MSigDB Hallmarks collection is being made available in an orthology converted form to aid in initial exploratory analysis of mouse datasets utilizing orthology mappings to MGI IDs provided by the Mouse Genome Informatics (MGI) institute at The Jackson Laboratory.
  
 
<h2>M1: positional gene sets</h2>
 
<h2>M1: positional gene sets</h2>
 +
Ensembl IDs for genes were retrieved from the cytogenetic band annotations provided in Ensembl 102 release, corresponding to the GRCm38 assembly as cytogentic band annotatations for GRCm39 are not presently available.
  
 
<h2>M2: curated gene sets</h2>
 
<h2>M2: curated gene sets</h2>
 +
<h3>M2:CGP</h3>
 +
932 gene sets consisting of:
 +
<ul>
 +
<li>869 miscellaneous gene sets derived from studies originally conducted in mouse models were copied from the Human C2:CGP subcollection and are included in M2:CGP in their native mouse namespace</li>
 +
<li>21 gene sets describing the mouse turmor models and xenografts curated by Jill Recla of the Mouse Genome Informatics (MGI) institute at The Jackson Laboratory</li>
 +
<li>42 gene sets describing neurogenic fates and cortical patterning from mouse experiments provided by Robert Hevner</li>
 +
</ul>
 +
 +
<h3>M2:CP</h3>
 +
The initial release of the mouse C2:CP collection contains:
 +
<ul>
 +
<li>252 gene sets from the BioCarta mouse database</li>
 +
<li>1249 gene sets from the Reactome mouse database</li>
 +
<li>186 gene sets from the WikiPathways mouse dabase</li>
 +
</ul>
  
 
<h2>M3: regulatory target gene sets</h2>
 
<h2>M3: regulatory target gene sets</h2>
 +
<ul>
 +
<li>Transcription factor target gene sets from the Gene Transcription Regulation Database (GTRD) corresponding to experiments performed using mouse ChIP-seq experiments</li>
 +
<li>miRNA target gene sets from computationally predicted mouse gene targets of miRNAs using the MirTarget algorithm. Data was curated from [http://mirdb.org miRDB v6.0] target predictions with MirTarget scores >80 (high confidence predictions). miRNAs catalogued in miRDB v6.0 are derived from miRBase v22 (March 2018).
 +
</li>
 +
</ul>
  
 
<h2>M5: ontology gene sets</h2>
 
<h2>M5: ontology gene sets</h2>
  
 
<h2>M8: cell type signature gene sets</h2>
 
<h2>M8: cell type signature gene sets</h2>
 +
Two initial groups of gene sets are being provided in this initial release
 +
<ul>
 +
<li>38 gene sets derived from cell identity signatures from the <span class="plainlinks">[https://oncoscape.v3.sttrcancer.org/atlas.gs.washington.edu.mouse.rna/mouse Mouse Organogenesis Cell Atlas (MOCA)]</span></li>
 +
<li>176 gene sets derived from cell aging signatures fromo the <span class="plainlinks">[http://tabula-muris-senis.ds.czbiohub.org/ Tabula Muris Senis, or 'Mouse Ageing Cell Atlas']</span></li>
 +
</ul>
  
 
<h2>CHIP file release</h2>
 
<h2>CHIP file release</h2>

Revision as of 18:02, 7 September 2022

GSEA Home | Downloads | Molecular Signatures Database | Documentation | Contact

Important Notices

This page describes updates made to the Molecular Signatures Database for release 2022.1. This release introduces several major changes to previous conventions. MSigDB is now split into two major divisions; a series of gene set collections that are provided in the namespace of human gene symbols, and a series of gene set collections that are provided in the namespace of mouse gene symbols. As such the versioning convention of MSigDB has changed to adopt the format Year.Release.Species. This initial release in the new format is versioned 2022.1.Hs for the human collections and 2022.1.Mm for the mouse collections. Likewise, CHIP files have been updated to reflect this convention, as well as the specific series of collections (i.e. human or mouse) that they are targeted towards.

Note that in order to access the MSigBD mouse collections through the GSEA UI, the latest version of GSEA (4.3.0) is required.

MSigDB v2022.1 is based on gene annotation data from Ensembl Release 107 (Jul 2022).

Initial Release of Mouse Collections (MSigDB v2022.1.Mm)

The initial release of the MSigDB Mouse Collections contains the following 6 collections, with some collection numbers reserved for future development. Please see the Collection Details Page for collection-specific general information.

MH: mouse-ortholog hallmark gene sets

The MSigDB Hallmarks collection is being made available in an orthology converted form to aid in initial exploratory analysis of mouse datasets utilizing orthology mappings to MGI IDs provided by the Mouse Genome Informatics (MGI) institute at The Jackson Laboratory.

M1: positional gene sets

Ensembl IDs for genes were retrieved from the cytogenetic band annotations provided in Ensembl 102 release, corresponding to the GRCm38 assembly as cytogentic band annotatations for GRCm39 are not presently available.

M2: curated gene sets

M2:CGP

932 gene sets consisting of:

  • 869 miscellaneous gene sets derived from studies originally conducted in mouse models were copied from the Human C2:CGP subcollection and are included in M2:CGP in their native mouse namespace
  • 21 gene sets describing the mouse turmor models and xenografts curated by Jill Recla of the Mouse Genome Informatics (MGI) institute at The Jackson Laboratory
  • 42 gene sets describing neurogenic fates and cortical patterning from mouse experiments provided by Robert Hevner

M2:CP

The initial release of the mouse C2:CP collection contains:

  • 252 gene sets from the BioCarta mouse database
  • 1249 gene sets from the Reactome mouse database
  • 186 gene sets from the WikiPathways mouse dabase

M3: regulatory target gene sets

  • Transcription factor target gene sets from the Gene Transcription Regulation Database (GTRD) corresponding to experiments performed using mouse ChIP-seq experiments
  • miRNA target gene sets from computationally predicted mouse gene targets of miRNAs using the MirTarget algorithm. Data was curated from miRDB v6.0 target predictions with MirTarget scores >80 (high confidence predictions). miRNAs catalogued in miRDB v6.0 are derived from miRBase v22 (March 2018).

M5: ontology gene sets

M8: cell type signature gene sets

Two initial groups of gene sets are being provided in this initial release

CHIP file release

  • MSigDB 2022.1.Mm gene annotations and gene mapping CHIP files are being provided utilizing data from Ensembl 107.
  • Gene orthology annotations for mapping human and rat genes to their best match mouse orthologs are being provided utilizing information from Alliance of Genome Resources orthology database release 5.2.1 (2022-07-15)