Difference between revisions of "Msigdb v2 release notes"

Revision as of 12:14, 24 March 2008

<a href="http://www.broad.mit.edu/gsea/">GSEA Home</a> | <a href="http://www.broad.mit.edu/gsea/downloads.jsp">Downloads</a> | <a href="http://www.broad.mit.edu/gsea/msigdb/">Molecular Signatures Database</a> | Documentation | <a href="http://www.broad.mit.edu/gsea/contact.jsp">Contact</a>

Details on how the gene set databases were generated are provided below.
This information is also available on the <a href="http://www.broad.mit.edu/gsea/msigdb/">Molecular Signatures Database</a> page.

Name	URL/Reference
BioCarta	http://www.biocarta.com
Signaling pathway database	http://www.grt.kyushu-u.ac.jp/spad/menu.html
Signaling gateway	http://www.signaling-gateway.org/
Signal transduction knowledge environment	http://stke.sciencemag.org/
Human protein reference database	http://www.hprd.org/
GenMAPP	http://www.genmapp.org/
KEGG	http://www.genome.jp/kegg/
Gene ontology	http://www.geneontology.org
Sigma-Aldrich pathways	http://www.sigmaaldrich.com/Area_of_Interest/Biochemicals/Enzyme_Explorer/Key_Resources.html
Gene arrays, BioScience Corp	http://www.superarray.com/
Human cancer genome anatomy consortium	http://cgap.nci.nih.gov/
NetAffx	http://www.affymetrix.com/index.affx

Biomedical literature: Over the past few years, microarray studies have identified signatures of several important biological and clinical states (e.g. cancer metastasis, stem cell characteristics, drug resistance). These gene sets are valuable biological results. Unfortunately, because gene sets are typically published as tables in a paper, the important biological findings they represent are not easily accessible to computational tools. Our first goal was to convert published gene sets into an electronic form. To this end, we compiled a list of microarray articles with published gene expression signatures. From each article, we extracted one or more gene sets from tables in the main text or supplementary information. Notably, our focus was on capturing the identity (e.g. gene symbol, GenBank accession) of all members in a gene set rather than on relationships between individual genes. Currently, the process of curating a gene set from the literature is largely manual. In this report we include a collection of 1181 gene sets curated in this manner from 343 distinct PubMed accessions.

C3 (sequence motif gene sets)

We compiled gene sets on the basis of shared regulatory motifs from a recently published comparative analysis of the Human, Mouse, Rat and Dog genomes (Xie, Lu et al. 2005). This database consists of 837 motif sets, including 222 microRNA target gene sets.

C4 (computed gene sets)

We mined four expression compendia for correlated gene sets by searching for neighbors (i.e. genes with similar expression profiles across a compendium) of 380 cancer-associated genes (Brentani, Caballero et al. 2003). Neighborhoods with fewer than 25 genes at a Pearson correlation threshold of 0.8 were omitted, yielding 427 sets. This category of the database is identical to that previously reported in Subramanian, Tamayo et al. (2005).
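
The neighborhood search described above can be illustrated with a minimal Python sketch. It is not the code used to build C4. The function names (pearson, neighborhoods), the dictionary layout of the expression profiles, the choice to exclude the seed gene from its own neighborhood, and the use of >= at the threshold are all assumptions made for illustration; only the 0.8 correlation threshold and the 25-gene minimum come from the text.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length expression profiles."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        # A flat profile has no defined correlation; treat it as uncorrelated.
        return 0.0
    return cov / sqrt(var_x * var_y)


def neighborhoods(profiles, seed_genes, threshold=0.8, min_size=25):
    """Map each seed gene to the genes correlated with it at or above threshold.

    profiles: dict of gene name -> list of expression values (hypothetical layout).
    Seeds whose neighborhood has fewer than min_size genes are omitted.
    Assumption: the seed itself is not counted as a member of its neighborhood.
    """
    result = {}
    for seed in seed_genes:
        if seed not in profiles:
            continue  # seed gene is not measured in this compendium
        seed_profile = profiles[seed]
        neighbors = [
            gene
            for gene, profile in profiles.items()
            if gene != seed and pearson(seed_profile, profile) >= threshold
        ]
        if len(neighbors) >= min_size:
            result[seed] = neighbors
    return result


# Toy data: four made-up genes measured in four samples.
toy_profiles = {
    "SEED": [1, 2, 3, 4],
    "A": [2, 4, 6, 8],  # perfectly correlated with SEED
    "B": [4, 3, 2, 1],  # perfectly anti-correlated with SEED
    "C": [1, 2, 3, 5],  # strongly correlated with SEED
}
kept = neighborhoods(toy_profiles, ["SEED"], threshold=0.8, min_size=2)
dropped = neighborhoods(toy_profiles, ["SEED"], threshold=0.8, min_size=3)
```

With the toy data, kept maps SEED to the neighbors A and C, while dropped is empty because the neighborhood falls below the minimum size. Running the same function once per compendium, with the defaults of 0.8 and 25, mirrors the filtering step described in the text.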

@@ Line 1: / Line 1: @@
 <a href="http://www.broad.mit.edu/gsea/">GSEA Home</a> |  <a href="http://www.broad.mit.edu/gsea/downloads.jsp">Downloads</a> | <a href="http://www.broad.mit.edu/gsea/msigdb/">Molecular Signatures Database</a> | Documentation | <a href="http://www.broad.mit.edu/gsea/contact.jsp">Contact</a>  <br />
 <br />
-Details on how the gene set databases were generated is provided below:<br />
+Details on how the gene set databases were generated is provided below.<br />
+This information is also available on the <a href="http://www.broad.mit.edu/gsea/msigdb/">Molecular Signatures Database</a> page.<br />
 <br />
 <h3><span style="font-weight: bold; color: rgb(255, 0, 0);">C1 (Positional gene sets)</span></h3>
@@ Line 8: / Line 9: @@
 <h3><span style="font-weight: bold; color: rgb(255, 0, 0);">C2 (Curated gene sets)</span></h3>
 C2 sets were curated from several sources including:<br />
-<p style="line-height: 150%;" class="MsoNormal">  </p>
+<p class="MsoNormal" style="line-height: 150%;">  </p>
 <em style=""><u><span style="font-family: Arial;">Online pathway databases</span></u></em><em style=""><span style="font-family: Arial;">: </span></em>Several online resources provide catalogs of well studied metabolic and signaling pathways as well as functional categories of genes. We downloaded gene sets from 12 such databases into our system.<br />
 <br />
-<table cellspacing="0" cellpadding="0" border="1" style="border: medium none ; border-collapse: collapse; width: 872px; height: 602px;" class="MsoTableGrid">
+<table cellspacing="0" cellpadding="0" border="1" class="MsoTableGrid" style="border: medium none ; border-collapse: collapse; width: 872px; height: 602px;">
      <tbody>
          <tr style="height: 13.2pt;">
@@ Line 76: / Line 77: @@
 </table>
 <br />
-<p style="line-height: 150%;" class="MsoNormal">  </p>
+<p class="MsoNormal" style="line-height: 150%;">  </p>
 <em style=""><u><span style="font-family: Arial;"><br />
 Biomedical literature</span></u></em><em style=""><span style="font-family: Arial;">: </span></em><span style="font-family: Arial;">Over the past few years, microarray studies have identified signatures of several important biological and clinical states (e.g. cancer metastasis, stem cell characteristics, drug resistance). These gene sets are valuable biological results. Unfortunately, because gene sets are typically published as tables in a paper, the </span><span style="line-height: 150%; font-family: Arial;">important biological findings they represent are not easily accessible to computational tools. Our first goal was to convert published gene sets into an electronic form. Towards this we compiled a list of microarray articles with published gene expression signatures. From each article, we extracted one or more gene set from tables in the main text or supplementary information. Notably, our focus was on capturing the identity (e.g. gene symbol, GenBank accession) of all members in a gene set rather than on relationships between individual genes. </span><span style="font-family: Arial;">Currently the process of curating a gene set from the literature is largely manual. In this report we include a collection of 1181 gene sets curated in this manner from 343 distinct PubMed accessions.</span> <br />
 <br />
 <h3><span style="font-weight: bold; color: rgb(255, 0, 0);">C3 (sequence motif gene sets)</span></h3>
-<p style="line-height: 150%;" class="MsoNormal"><span style="font-family: Arial; color: black;">We compiled gene sets on the basis of<span style="">&nbsp; </span>shared regulatory motifs from a recently published comparative analysis of the Human, Mouse, Rat and Dog genomes </span><!--[if supportFields]><span
+<p class="MsoNormal" style="line-height: 150%;"><span style="font-family: Arial; color: black;">We compiled gene sets on the basis of<span style="">&nbsp; </span>shared regulatory motifs from a recently published comparative analysis of the Human, Mouse, Rat and Dog genomes </span><!--[if supportFields]><span
 style='font-family:Arial;color:black'><span style='mso-element:field-begin'></span><span
 style='mso-spacerun:yes'> </span>ADDIN EN.CITE
