Difference between revisions of "Annotation Dependencies"


Revision as of 15:52, 18 June 2010

geWorkbench can currently read only annotation files that follow the Affymetrix annotation file format. The table below lists, for each geWorkbench component, the column headers of the data types it requires.

All components require "Probe Set ID".

Component                         Columns required
CNKB                              Entrez Gene
Gene Ontology                     Gene Ontology Biological Process, Gene Ontology Cellular Component, Gene Ontology Molecular Function
Marker Annotations                (none listed)
Sequence Retrieval (EBI)          (none listed)
Sequence Retrieval (Santa Cruz)   (none listed)
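
The requirements in the table can be checked mechanically before loading a file. The following is a minimal sketch in Python, not part of geWorkbench itself: the names REQUIRED_BY_ALL, REQUIRED_COLUMNS and missing_columns are hypothetical and chosen only for illustration, and components whose row lists no columns are treated as needing only "Probe Set ID".

```python
# Column required by every component, as stated above.
REQUIRED_BY_ALL = ["Probe Set ID"]

# Per-component requirements transcribed from the table.
# Components whose row lists no columns are left out of this mapping
# (assumption: they need nothing beyond "Probe Set ID").
REQUIRED_COLUMNS = {
    "CNKB": ["Entrez Gene"],
    "Gene Ontology": [
        "Gene Ontology Biological Process",
        "Gene Ontology Cellular Component",
        "Gene Ontology Molecular Function",
    ],
}


def missing_columns(headers, component):
    """Return the required columns for `component` that are absent from `headers`."""
    present = {h.strip() for h in headers}
    required = REQUIRED_BY_ALL + REQUIRED_COLUMNS.get(component, [])
    return [col for col in required if col not in present]
```

For example, missing_columns(["Probe Set ID", "Gene Symbol"], "CNKB") returns ["Entrez Gene"], which indicates that the file lacks the column CNKB depends on.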


All Affymetrix annotation file column headers:

  • Probe Set ID
  • GeneChip Array
  • Species Scientific Name
  • Annotation Date
  • Sequence Type
  • Sequence Source
  • Transcript ID(Array Design)
  • Target Description
  • Representative Public ID
  • Archival UniGene Cluster
  • UniGene ID
  • Genome Version
  • Alignments
  • Gene Title
  • Gene Symbol
  • Chromosomal Location
  • Unigene Cluster Type
  • Ensembl
  • Entrez Gene
  • SwissProt
  • EC
  • OMIM
  • RefSeq Protein ID
  • RefSeq Transcript ID
  • FlyBase
  • AGI
  • WormBase
  • MGI Name
  • RGD Name
  • SGD accession number
  • Gene Ontology Biological Process
  • Gene Ontology Cellular Component
  • Gene Ontology Molecular Function
  • Pathway
  • InterPro
  • Trans Membrane
  • QTL
  • Annotation Description
  • Annotation Transcript Cluster
  • Transcript Assignments
  • Annotation Notes
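
To see which of these headers a particular annotation file actually contains, the header row can be read directly. The sketch below is illustrative only and makes two assumptions that should be checked against the file at hand: the file is comma-separated, and any metadata lines before the header row start with "#". The function name read_annotation_headers is hypothetical.

```python
import csv
import io


def read_annotation_headers(text, comment_prefix="#"):
    """Return the column headers from the first non-blank, non-comment line of CSV text."""
    # Skip blank lines and lines starting with the comment prefix
    # (assumption: metadata lines precede the header row and start with "#").
    lines = (
        line
        for line in io.StringIO(text)
        if line.strip() and not line.startswith(comment_prefix)
    )
    reader = csv.reader(lines)
    header = next(reader, [])  # empty list if no header row is found
    return [name.strip() for name in header]
```

Given text whose first non-comment line is "Probe Set ID","Gene Symbol","Entrez Gene", the function returns ["Probe Set ID", "Gene Symbol", "Entrez Gene"]. That list can then be compared with the headers above or with the columns a component requires.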