Msigdb dtd

From GeneSetEnrichmentAnalysisWiki
Revision as of 12:47, 24 March 2006 by Hkuehn (talk | contribs)
Jump to navigation Jump to search


Note: To include the dtd in Wiki, the angle brackets were removed.

xml version="1.0" encoding="UTF-8"<br /><br />    	This is the DTD for MSigDB.<br />	It describes a format for a database of gene sets that captures<br />	both the content of a gene set (i.e gene members) as well as annotation about<br />	the gene set.<br />	@VERSION 1.0 May 2006<br />	@AUTHOR Aravind Subramanian<br /><br />DOCTYPE MSIGDB <br />ELEMENT MSIGDB GENESET*          	-- top level container<br />ATTLIST MSIGDB<br />  	NAME	CDATA	#REQUIRED	-- name of the database<br />  	VERSION	CDATA	#REQUIRED	-- version of the database<br /><br /><br />ELEMENT GENESET                  	-- a single gene set<br />ATTLIST GENESET<br />	STANDARD_NAME		CDATA	#REQUIRED	-- an english name for the gene set<br />	SYSTEMATIC_NAME		CDATA	#REQUIRED	-- a systematized identifier<br />	ORGANISM		CDATA	#REQUIRED	-- organism in which the geneset was generated<br />	EXTERNAL_DETAILS_URL	CDATA	    	        -- 3rd party url for more info<br />	CHIP			CDATA	#REQUIRED	-- platform on which the geneset was generated<br />	CATEGORY_CODE		CDATA	(C1|C2|C3|C4)<br />	SUB_CATEGORY_CODE	CDATA	(CYTO|PATH|ONTO|CURT|PERT|MODL|CLIN|REPM|TFAC|NEGH)	<br />	CONTRIBUTOR		CDATA	#REQUIRED	-- name of person/database that curated the set<br />	PMID			CDATA			-- PubMed ID if available<br />	GEOID			CDATA			-- GEO ID if available<br />	DESCRIPTION		CDATA			-- a full detailed desc or abstract<br />	DESCRIPTION_BRIEF	CDATA	#REQUIRED	-- a brief 1 line description<br />	%MESH			CDATA			-- keywords and MESH identifiers<br />	%MEMBERS		CDATA	#REQUIRED	-- Genes in the native format<br />	%MEMBERS_SYMBOLIZED	CDATA	#REQUIRED	-- Genes after converting to gene symbols<br /><br />