Difference between revisions of "Sequence Retriever"


Revision as of 00:32, 25 April 2006



Background

geWorkbench contains a number of modules that allow DNA or protein sequences to be analyzed. Sequences can be loaded from a local disk as a FASTA format file, or can be retrieved from a database. Here we discuss retrieval of sequences from the network.
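For readers unfamiliar with the FASTA format mentioned above, the following is a minimal sketch in Python of how such a file is laid out and read. It is an illustration only and is not geWorkbench code; the function name `parse_fasta` and the example records are hypothetical.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a list of (header, sequence) tuples.

    A record starts with a '>' header line; the following lines, up to the
    next header, hold the sequence and are joined together.
    """
    records = []
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    # Close the final record.
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


# Hypothetical two-record example; the sequences are made up.
example = """>gene1 hypothetical example record
ACGTACGT
TTGA
>gene2
GGCC
"""
records = parse_fasta(example)
for name, seq in records:
    print(name, len(seq))
```

Each record is a header line followed by one or more sequence lines, so a sequence that is wrapped across several lines is rejoined into a single string.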

For this example, we will start with the group of markers selected in an exercise similar to that shown in the Hierarchical Clustering tutorial.

We will download the sequence within ±2000 bp of the transcription start site of each gene. This region may contain regulatory elements such as transcription factor binding sites.
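The window arithmetic can be sketched as follows. This is a rough illustration under stated assumptions, not a description of how geWorkbench computes the region: the function `promoter_window`, the 1-based inclusive coordinate convention, and the clamping to chromosome bounds are all hypothetical choices, and strand is ignored.

```python
def promoter_window(tss, upstream=2000, downstream=2000, chrom_length=None):
    """Return (start, end) of a window around a transcription start site.

    Coordinates are assumed to be 1-based and inclusive. The window is
    clamped so it never starts before position 1 and, when chrom_length
    is given, never runs past the end of the chromosome.
    """
    start = max(1, tss - upstream)
    end = tss + downstream
    if chrom_length is not None:
        end = min(end, chrom_length)
    return start, end


# Hypothetical TSS positions, chosen only to exercise the clamping.
print(promoter_window(10000))
print(promoter_window(500))
print(promoter_window(9000, chrom_length=10000))
```

A gene whose start site lies close to either end of a chromosome yields a shorter window under this convention, so retrieved sequences need not all have the same length.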

Retrieving the sequences

Press the Get Sequence button to download the sequences.


[Image: T SequenceRetriever ClusterTree.png]


The retrieved sequences are placed in the Project Folder. Note that when this entry is selected, the modules supporting sequence analysis will appear.


[Image: T ProjectFolder ClusterSeqs.png]

Viewing retrieved sequences

In addition to the Sequence Retriever component, the Sequence and Promoter visualizers can display the retrieved sequences.