Difference between revisions of "Annotation Dependencies"
| Line 1: | Line 1: | ||
| + | geWorkbench currently can only read in annotation files based on the Affymetrix annotation file format. Following is a list of the column headers for data types required by various geWorkbench components. | ||
| − | + | All components require "Probe Set ID". | |
{| border="1" cellpadding="5" cellspacing="0" style="margin:10px;" | {| border="1" cellpadding="5" cellspacing="0" style="margin:10px;" | ||
| Line 9: | Line 10: | ||
|- | |- | ||
|CNKB | |CNKB | ||
| − | | | + | |Entrez Gene |
|- | |- | ||
|Gene Ontology | |Gene Ontology | ||
| − | | | + | |Gene Ontology Biological Process, Gene Ontology Cellular Component, Gene Ontology Molecular Function |
|- | |- | ||
| Line 28: | Line 29: | ||
|} | |} | ||
| + | |||
| + | |||
| + | All Affy headers: | ||
| + | |||
| + | * Probe Set ID | ||
| + | * GeneChip Array | ||
| + | * Species Scientific Name | ||
| + | * Annotation Date | ||
| + | * Sequence Type | ||
| + | * Sequence Source | ||
| + | * Transcript ID(Array Design) | ||
| + | * Target Description | ||
| + | * Representative Public ID | ||
| + | * Archival UniGene Cluster | ||
| + | * UniGene ID | ||
| + | * Genome Version | ||
| + | * Alignments | ||
| + | * Gene Title | ||
| + | * Gene Symbol | ||
| + | * Chromosomal Location | ||
| + | * Unigene Cluster Type | ||
| + | * Ensembl | ||
| + | * Entrez Gene | ||
| + | * SwissProt | ||
| + | * EC | ||
| + | * OMIM | ||
| + | * RefSeq Protein ID | ||
| + | * RefSeq Transcript ID | ||
| + | * FlyBase | ||
| + | * AGI | ||
| + | * WormBase | ||
| + | * MGI Name | ||
| + | * RGD Name | ||
| + | * SGD accession number | ||
| + | * Gene Ontology Biological Process | ||
| + | * Gene Ontology Cellular Component | ||
| + | * Gene Ontology Molecular Function | ||
| + | * Pathway | ||
| + | * InterPro | ||
| + | * Trans Membrane | ||
| + | * QTL | ||
| + | * Annotation Description | ||
| + | * Annotation Transcript Cluster | ||
| + | * Transcript Assignments | ||
| + | * Annotation Notes | ||
Revision as of 15:52, 18 June 2010
geWorkbench currently can only read in annotation files based on the Affymetrix annotation file format. Following is a list of the column headers for data types required by various geWorkbench components.
All components require "Probe Set ID".
| Component | Columns required |
|---|---|
| CNKB | Entrez Gene |
| Gene Ontology | Gene Ontology Biological Process, Gene Ontology Cellular Component, Gene Ontology Molecular Function |
| Marker Annotations | |
| Sequence Retrieval (EBI) | |
| Sequence Retrieval (Santa Cruz) |
All Affy headers:
- Probe Set ID
- GeneChip Array
- Species Scientific Name
- Annotation Date
- Sequence Type
- Sequence Source
- Transcript ID(Array Design)
- Target Description
- Representative Public ID
- Archival UniGene Cluster
- UniGene ID
- Genome Version
- Alignments
- Gene Title
- Gene Symbol
- Chromosomal Location
- Unigene Cluster Type
- Ensembl
- Entrez Gene
- SwissProt
- EC
- OMIM
- RefSeq Protein ID
- RefSeq Transcript ID
- FlyBase
- AGI
- WormBase
- MGI Name
- RGD Name
- SGD accession number
- Gene Ontology Biological Process
- Gene Ontology Cellular Component
- Gene Ontology Molecular Function
- Pathway
- InterPro
- Trans Membrane
- QTL
- Annotation Description
- Annotation Transcript Cluster
- Transcript Assignments
- Annotation Notes