Difference between revisions of "Annotation Dependencies"
(6 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
+ | ==Annotation data required by certain components== | ||
+ | geWorkbench currently can only read in annotation files based on the Affymetrix annotation file format. Following is a list of the column headers for data types required by various geWorkbench components. | ||
+ | * All components require "'''Probe Set ID'''". | ||
{| border="1" cellpadding="5" cellspacing="0" style="margin:10px;" | {| border="1" cellpadding="5" cellspacing="0" style="margin:10px;" | ||
Line 6: | Line 9: | ||
!Component | !Component | ||
!Columns required | !Columns required | ||
+ | |||
+ | |- | ||
+ | |CNKB | ||
+ | |Entrez Gene | ||
+ | |||
+ | |- | ||
+ | |Gene Ontology | ||
+ | |Gene Ontology Biological Process, Gene Ontology Cellular Component, Gene Ontology Molecular Function | ||
+ | |||
+ | |- | ||
+ | |Marker Annotations | ||
+ | | | ||
+ | |||
+ | |- | ||
+ | |Sequence Retrieval (EBI) | ||
+ | | | ||
+ | |||
+ | |- | ||
+ | |Sequence Retrieval (Santa Cruz) | ||
+ | | | ||
+ | |||
|} | |} | ||
+ | |||
+ | |||
+ | ==List of All Affy headers== | ||
+ | |||
+ | * Probe Set ID | ||
+ | * GeneChip Array | ||
+ | * Species Scientific Name | ||
+ | * Annotation Date | ||
+ | * Sequence Type | ||
+ | * Sequence Source | ||
+ | * Transcript ID(Array Design) | ||
+ | * Target Description | ||
+ | * Representative Public ID | ||
+ | * Archival UniGene Cluster | ||
+ | * UniGene ID | ||
+ | * Genome Version | ||
+ | * Alignments | ||
+ | * Gene Title | ||
+ | * Gene Symbol | ||
+ | * Chromosomal Location | ||
+ | * Unigene Cluster Type | ||
+ | * Ensembl | ||
+ | * Entrez Gene | ||
+ | * SwissProt | ||
+ | * EC | ||
+ | * OMIM | ||
+ | * RefSeq Protein ID | ||
+ | * RefSeq Transcript ID | ||
+ | * FlyBase | ||
+ | * AGI | ||
+ | * WormBase | ||
+ | * MGI Name | ||
+ | * RGD Name | ||
+ | * SGD accession number | ||
+ | * Gene Ontology Biological Process | ||
+ | * Gene Ontology Cellular Component | ||
+ | * Gene Ontology Molecular Function | ||
+ | * Pathway | ||
+ | * InterPro | ||
+ | * Trans Membrane | ||
+ | * QTL | ||
+ | * Annotation Description | ||
+ | * Annotation Transcript Cluster | ||
+ | * Transcript Assignments | ||
+ | * Annotation Notes |
Latest revision as of 16:04, 18 June 2010
Annotation data required by certain components
geWorkbench currently can only read in annotation files based on the Affymetrix annotation file format. Following is a list of the column headers for data types required by various geWorkbench components.
- All components require "Probe Set ID".
Component | Columns required |
---|---|
CNKB | Entrez Gene |
Gene Ontology | Gene Ontology Biological Process, Gene Ontology Cellular Component, Gene Ontology Molecular Function |
Marker Annotations | |
Sequence Retrieval (EBI) | |
Sequence Retrieval (Santa Cruz) |
List of All Affy headers
- Probe Set ID
- GeneChip Array
- Species Scientific Name
- Annotation Date
- Sequence Type
- Sequence Source
- Transcript ID(Array Design)
- Target Description
- Representative Public ID
- Archival UniGene Cluster
- UniGene ID
- Genome Version
- Alignments
- Gene Title
- Gene Symbol
- Chromosomal Location
- Unigene Cluster Type
- Ensembl
- Entrez Gene
- SwissProt
- EC
- OMIM
- RefSeq Protein ID
- RefSeq Transcript ID
- FlyBase
- AGI
- WormBase
- MGI Name
- RGD Name
- SGD accession number
- Gene Ontology Biological Process
- Gene Ontology Cellular Component
- Gene Ontology Molecular Function
- Pathway
- InterPro
- Trans Membrane
- QTL
- Annotation Description
- Annotation Transcript Cluster
- Transcript Assignments
- Annotation Notes