Difference between revisions of "Annotation Dependencies"
Revision as of 15:52, 18 June 2010
geWorkbench can currently read only annotation files that follow the Affymetrix annotation file format. The table below lists the column headers that each geWorkbench component requires.
All components require "Probe Set ID".
| Component | Columns required |
|---|---|
| CNKB | Entrez Gene |
| Gene Ontology | Gene Ontology Biological Process, Gene Ontology Cellular Component, Gene Ontology Molecular Function |
| Marker Annotations | |
| Sequence Retrieval (EBI) | |
| Sequence Retrieval (Santa Cruz) | |
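
The requirements in the table can be checked against a file before it is loaded. The following is a minimal sketch, not geWorkbench's own code. It assumes the annotation file is comma-separated with quoted fields and that any lines before the header row start with `#`. The names `REQUIRED_COLUMNS`, `read_header`, and `missing_columns` are hypothetical, and the sample header is made up for illustration. Components with no columns listed in the table are left out.

```python
import csv
import io

# Required columns per component, copied from the table above.
# Components with no columns listed in the table are omitted.
REQUIRED_COLUMNS = {
    "CNKB": ["Entrez Gene"],
    "Gene Ontology": [
        "Gene Ontology Biological Process",
        "Gene Ontology Cellular Component",
        "Gene Ontology Molecular Function",
    ],
}

# Every component also needs this column.
ALWAYS_REQUIRED = ["Probe Set ID"]


def read_header(lines):
    """Return the column headers from the first non-blank, non-comment line."""
    # Assumption: lines that start with '#' are comments preceding the header.
    data_lines = (ln for ln in lines if ln.strip() and not ln.startswith("#"))
    return next(csv.reader(data_lines), [])


def missing_columns(header, component):
    """List the columns a component needs that are absent from the header."""
    present = set(header)
    needed = ALWAYS_REQUIRED + REQUIRED_COLUMNS[component]
    return [col for col in needed if col not in present]


# Made-up sample input: one comment line, then a header row.
sample = io.StringIO(
    "# hypothetical comment line\n"
    '"Probe Set ID","Gene Symbol","Entrez Gene"\n'
)
sample_header = read_header(sample)
cnkb_missing = missing_columns(sample_header, "CNKB")
go_missing = missing_columns(sample_header, "Gene Ontology")
```

With this sample header, `cnkb_missing` is empty, while `go_missing` lists the three Gene Ontology columns.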
All column headers in an Affymetrix annotation file:
- Probe Set ID
- GeneChip Array
- Species Scientific Name
- Annotation Date
- Sequence Type
- Sequence Source
- Transcript ID(Array Design)
- Target Description
- Representative Public ID
- Archival UniGene Cluster
- UniGene ID
- Genome Version
- Alignments
- Gene Title
- Gene Symbol
- Chromosomal Location
- Unigene Cluster Type
- Ensembl
- Entrez Gene
- SwissProt
- EC
- OMIM
- RefSeq Protein ID
- RefSeq Transcript ID
- FlyBase
- AGI
- WormBase
- MGI Name
- RGD Name
- SGD accession number
- Gene Ontology Biological Process
- Gene Ontology Cellular Component
- Gene Ontology Molecular Function
- Pathway
- InterPro
- Trans Membrane
- QTL
- Annotation Description
- Annotation Transcript Cluster
- Transcript Assignments
- Annotation Notes
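
Because every component requires "Probe Set ID", a natural way to use these headers is to index each row by that column and keep only the columns a component needs. The following is a minimal sketch under the same assumptions as before: a comma-separated file with quoted fields, where lines before the header row start with `#`. The function name `load_annotations` and the sample rows are hypothetical and are not taken from geWorkbench or from a real annotation file.

```python
import csv
import io


def load_annotations(lines, columns):
    """Map each Probe Set ID to the requested columns of its row."""
    # Assumption: lines that start with '#' are comments preceding the header.
    data_lines = (ln for ln in lines if ln.strip() and not ln.startswith("#"))
    reader = csv.DictReader(data_lines)
    table = {}
    for row in reader:
        # A column that is absent from the file yields an empty string.
        table[row["Probe Set ID"]] = {col: row.get(col, "") for col in columns}
    return table


# Made-up sample input: one comment line, a header row, and two data rows.
sample = io.StringIO(
    "# hypothetical comment line\n"
    '"Probe Set ID","Gene Symbol","Entrez Gene"\n'
    '"probe_1","GENE1","1"\n'
    '"probe_2","GENE2","2"\n'
)
annotations = load_annotations(sample, ["Gene Symbol", "Entrez Gene"])
```

Here `annotations["probe_1"]` holds the Gene Symbol and Entrez Gene values from the first data row.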