Difference between revisions of "Tutorials"

Line 3: Line 3:
  
 
__NOTOC__
 
__NOTOC__
 +
  
 
The tutorials shown on this page provide a quick introduction to the most important features of geWorkbench.  Additional information can be found in the User Guide and in the Online Help section of the program.  
 
The tutorials shown on this page provide a quick introduction to the most important features of geWorkbench.  Additional information can be found in the User Guide and in the Online Help section of the program.  
Line 9: Line 10:
  
 
==Using the basic framework of geWorkbench: Projects, files and data==
 
==Using the basic framework of geWorkbench: Projects, files and data==
 +
 +
===[[QuickStart | Quick Start]]===
 +
A quick jump into the most important topics for learning to use geWorkbench.
  
 
===[[Tutorial - Basics | Basics]]===
 
===[[Tutorial - Basics | Basics]]===
A brief introduction to the use of geWorkbench.
+
An introduction to the use of geWorkbench.
  
 
===[[Component_Configuration_Manager | Component Configuration Manager]]===
 
===[[Component_Configuration_Manager | Component Configuration Manager]]===
Add or remove particular components to geWorkbench.
+
Customize geWorkbench to your needs.  geWorkbench comes initially configured with only basic components installed.  Use the CCM to load additional available modules.
  
===[[Tutorial - Data | Tutorial Data]]===
+
===[[Projects| Projects]]===
Downloadable data used in the tutorials.
+
The Project Folders component is where data is loaded and analysis results are stored.
  
===[[Tutorial - Data Subsets | Data Subsets]]===
+
===[[Tutorial - Project Details | Project Details]]===
Shows how subsets of both markers and arrays can be defined for targeted analysis.
+
Describes components use to record details of calculations and datasets.
  
===[[Tutorial - Projects and Data Files | Projects and Data Files]]===
+
===[[Tutorial - Local Data Files | Local Data Files]]===
Creating projects, loading microarray data files, merging multiple datasets into one, and saving.
+
Covers loading data from files on your local computer.
  
===[[Tutorial - Project Details | Project Details]]===
+
===[[File_Formats | File Formats]]===
 +
Details of several different file formats supported by geWorkbench.
  
 +
===[[Tutorial - Remote Data Sources | Remote Data Sources (caArray)]]===
 +
How to download microarray data from caArray.  geWorkbench can download "derived" data sets from caArray.
  
===[[Tutorial - Local Data Files | Local Data Files]]===
+
===[[Data_Subsets - Arrays | Data Subsets - Arrays]] ===
How to load a data file from disk.
+
How to create and use sets of arrays for controlling data analysis.
  
===[[Tutorial - Remote Data Sources | Remote Data Sources (caArray)]]===
+
===[[Data_Subsets - Markers| Data Subsets - Markers]] ===
How to download microarray data from caArray.
+
How to create and use sets of markers for controlling data analysis.
  
 
===[[Tutorial - Viewing a Microarray Dataset | Viewing a Microarray Dataset]]===
 
===[[Tutorial - Viewing a Microarray Dataset | Viewing a Microarray Dataset]]===
Line 42: Line 49:
 
===[[Normalization|Normalization]]===
 
===[[Normalization|Normalization]]===
 
geWorkbench provides numerous methods for normalizing microarray data.
 
geWorkbench provides numerous methods for normalizing microarray data.
 +
 +
===[[Tutorial - Data | Tutorial Data]]===
 +
Downloadable data used in the tutorials.
 +
  
  
 
==Individual analysis and visualization components==
 
==Individual analysis and visualization components==
  
===[[Tutorial - Analysis| Analysis]]===
+
 
 +
===[[Tutorial_-_Analysis | Analysis Framework]]===
 
Most analysis routines are located in the command area located in the lower right quadrant of geWorkbench.  This section describes a common framework for saving parameter settings that these components share.  
 
Most analysis routines are located in the command area located in the lower right quadrant of geWorkbench.  This section describes a common framework for saving parameter settings that these components share.  
  
Line 72: Line 84:
  
  
===[[Tutorial - Differential Expression | Differential Expression]]===
+
===[[Tutorial - Differential Expression | Differential Expression (t-test)]]===
 
Several variants of the t-test are available.
 
Several variants of the t-test are available.
  
 
===[[Tutorial - EVD | Expression Value Distribution]]===
 
===[[Tutorial - EVD | Expression Value Distribution]]===
 
View and manipulate a histogram of the distribution of expression values for each array.
 
View and manipulate a histogram of the distribution of expression values for each array.
 
  
 
===[[Tutorial_-_Gene_Ontology_Term_Over-representation | Gene Ontology Term Over-representation]]===
 
===[[Tutorial_-_Gene_Ontology_Term_Over-representation | Gene Ontology Term Over-representation]]===
 
Finds Gene Ontology terms that are over-represented in a list of genes of interest.
 
Finds Gene Ontology terms that are over-represented in a list of genes of interest.
 +
 +
===[[Tutorial_-_genSpace | genSpace]]===
 +
  
 
===[[Tutorial - Grid Services | Grid Services]]===
 
===[[Tutorial - Grid Services | Grid Services]]===
Line 91: Line 105:
 
===[[Tutorial - Jmol | Jmol]]===
 
===[[Tutorial - Jmol | Jmol]]===
 
Jmol is a molecular structure viewer for viewing PDB format files.
 
Jmol is a molecular structure viewer for viewing PDB format files.
 +
 +
 +
===[[Tutorial - Mark-Us | Mark-Us]]===
 +
  
 
===[[Tutorial - Marker Annotations | Marker Annotations]]===
 
===[[Tutorial - Marker Annotations | Marker Annotations]]===
 
Marker annotations can be retrieved, including BioCarta pathway diagrams.
 
Marker annotations can be retrieved, including BioCarta pathway diagrams.
 +
 +
 +
===[[Tutorial - Master Regulator Analysis | Master Regulator Analysis]]===
 +
  
 
===[[Tutorial - MatrixREDUCE | MatrixREDUCE]]===
 
===[[Tutorial - MatrixREDUCE | MatrixREDUCE]]===
Line 107: Line 129:
 
Search a set of sequences against a promoter database.
 
Search a set of sequences against a promoter database.
  
 +
 +
===[[Tutorial - Pudge | Pudge]]===
  
 
===[[Tutorial - Sequence Retrieval | Sequence Retrieval]]===
 
===[[Tutorial - Sequence Retrieval | Sequence Retrieval]]===

Revision as of 12:35, 25 June 2010

Home | Quick Start | Basics | Menu Bar | Preferences | Component Configuration Manager | Workspace | Information Panel | Local Data Files | File Formats | caArray | Array Sets | Marker Sets | Microarray Dataset Viewers | Filtering | Normalization | Tutorial Data | geWorkbench-web Tutorials

Analysis Framework | ANOVA | ARACNe | BLAST | Cellular Networks KnowledgeBase | CeRNA/Hermes Query | Classification (KNN, WV) | Color Mosaic | Consensus Clustering | Cytoscape | Cupid | DeMAND | Expression Value Distribution | Fold-Change | Gene Ontology Term Analysis | Gene Ontology Viewer | GenomeSpace | genSpace | Grid Services | GSEA | Hierarchical Clustering | IDEA | Jmol | K-Means Clustering | LINCS Query | Marker Annotations | MarkUs | Master Regulator Analysis | (MRA-FET Method) | (MRA-MARINa Method) | MatrixREDUCE | MINDy | Pattern Discovery | PCA | Promoter Analysis | Pudge | SAM | Sequence Retriever | SkyBase | SkyLine | SOM | SVM | T-Test | Viper Analysis | Volcano Plot




The tutorials shown on this page provide a quick introduction to the most important features of geWorkbench. Additional information can be found in the User Guide and in the Online Help section of the program.

All data sets used in the tutorials are available from the download area of our site.

Using the basic framework of geWorkbench: Projects, files and data

Quick Start

A quick jump into the most important topics for learning to use geWorkbench.

Basics

An introduction to the use of geWorkbench.

Component Configuration Manager

Customize geWorkbench to your needs. geWorkbench comes initially configured with only basic components installed. Use the CCM to load additional available modules.

Projects

The Project Folders component is where data is loaded and analysis results are stored.

Project Details

Describes components use to record details of calculations and datasets.

Local Data Files

Covers loading data from files on your local computer.

File Formats

Details of several different file formats supported by geWorkbench.

Remote Data Sources (caArray)

How to download microarray data from caArray. geWorkbench can download "derived" data sets from caArray.

Data Subsets - Arrays

How to create and use sets of arrays for controlling data analysis.

Data Subsets - Markers

How to create and use sets of markers for controlling data analysis.

Viewing a Microarray Dataset

Survey of geWorkbench visualiztion tools for microarray data.

Filtering

geWorkbench provides numerous methods for filtering microarray data.

Normalization

geWorkbench provides numerous methods for normalizing microarray data.

Tutorial Data

Downloadable data used in the tutorials.


Individual analysis and visualization components

Analysis Framework

Most analysis routines are located in the command area located in the lower right quadrant of geWorkbench. This section describes a common framework for saving parameter settings that these components share.

ANOVA

How to set up and run Analysis of Variance.

ARACNE

Formal method for reverse Engineering - microarray datasets can be analyzed for interactions between genes. Now includes new ARACNe2, which implements the much faster Adaptive Partitioning algorithm and accurate parameter estimation.

BLAST

Submits BLAST jobs to the NCBI server and displays and allows further interaction with alignment results.

Cellular Networks KnowledgeBase

The CNKB component queries a database of protein-protein and protein-DNA interactions maintained at Columbia University.


Classification

Color Mosaic

Displays expression results as a heat map.

Cytoscape

Cytoscape is used to display network interaction diagrams (from adjacency matrices). It features two-way interaction with the geWorkbench Markers component.


Differential Expression (t-test)

Several variants of the t-test are available.

Expression Value Distribution

View and manipulate a histogram of the distribution of expression values for each array.

Gene Ontology Term Over-representation

Finds Gene Ontology terms that are over-represented in a list of genes of interest.

genSpace

Grid Services

A number of geWorkbench data analysis components have been implemented as services on the National Cancer Institute's caGrid. caGrid is an infrastructure component of the NCI's caBIG(R) program.


Hierarchical Clustering

geWorkbench implements its own agglomerative hierarchical clustering algorithm.

Jmol

Jmol is a molecular structure viewer for viewing PDB format files.


Mark-Us

Marker Annotations

Marker annotations can be retrieved, including BioCarta pathway diagrams.


Master Regulator Analysis

MatrixREDUCE

MINDy

Pattern Discovery

Upstream seqeunce can be analyzed for conserved sequence patterns.

Promoter Analysis

Search a set of sequences against a promoter database.


Pudge

Sequence Retrieval

Genomic sequences for markers can be retrieved for further analysis.


SOM

Clustering using Self-Organizing Maps.


Coming Soon

Tutorials for a number of components are under development, including:

  • Expression Profiles
  • Sequence (Viewer)
  • genSpace
  • SkyBase
  • SkyLine
  • Online Help
  • NetBoost

Gene Pattern components:

  • PCA (GenePattern) - Analysis and Viewer
  • SVM 3.0 (GenePattern) - Analysis and Viewer - include, we need to develop online help and tutorial (Aris).