Revision as of 16:07, 14 February 2006

Overview

(who uses this, why / what for? background)

geWorkbench is an open-source bioinformatics platform that offers a comprehensive and extensible collection of tools for the management, analysis, visualization and annotation of biomedical data.

Benefits include:

Integration with existing bioinformatics modules for analysis and visualization.
Support for a variety of genomic data including microarrays, sequences, pathways, networks, alignments and phenotypes.
Access to remote servers and clusters for the performance of computationally intensive calculations.
Access to analyses with biological annotations from the National Cancer Institute.
Flexible import options: allows users to merge files from various sources.
Community: (describe this aspect)
Developer benefit: (insert description; plugins)

Tutorials

The following (insert description)

Getting Started

Starting the application
GUI elements
- Panels
- Navigation

Loading Data

Data formats

Working with Marker and Phenotype Panels

Creating panels

We can now assign phenotypes to each chip. We will place the phenotypes in the default group; however, you can create new phenotype groups by clicking the New button at the lower left of the Phenotype Panel.

Here we select and label arrays in the Phenotype Panel which contain samples from the congestive cardiomyopathy disease state...

Next, we can similarly label the remaining arrays as "Normal". We have also checked the boxes to mark these groups of arrays as "Active". Various analysis and visualization components can be set to use or display only the activated arrays or markers.
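The effect of the "Active" check boxes can be pictured with a minimal sketch. It is not geWorkbench code: the `panels` dictionary, the array names and the `active_arrays` helper are all hypothetical, and only illustrate how a component might restrict itself to activated arrays.

```python
# Hypothetical phenotype panels: each has an "active" flag and a list of arrays.
panels = {
    "Cardiomyopathy": {"active": True, "arrays": ["a1", "a2"]},
    "Normal": {"active": True, "arrays": ["a3", "a4"]},
    "Unlabeled": {"active": False, "arrays": ["a5"]},
}


def active_arrays(panels):
    """Return the arrays that belong to panels whose "active" flag is set."""
    return [
        array
        for panel in panels.values()
        if panel["active"]
        for array in panel["arrays"]
    ]


selected = active_arrays(panels)
```

Here `selected` holds only the arrays from the two activated panels; the array in the inactive panel is left out.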

For statistical tests such as the t-test, the Case and Control groups can be specified. This is done by left-clicking on the thumbtack icon in front of the phenotype name. Here we are specifying the disease arrays as the "Case". The remaining "Normal" arrays are labeled "Control" by default.

A red thumbtack indicates the arrays have been specified as "Case".
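To show what the Case/Control assignment feeds into, here is a minimal sketch of a two-sample t statistic for a single marker. It assumes the unequal-variance (Welch) form; which variant geWorkbench applies is not stated here. The function name `welch_t` and the expression values are made up for illustration.

```python
import math
import statistics


def welch_t(case, control):
    """Return Welch's t statistic and its approximate degrees of freedom."""
    n1, n2 = len(case), len(control)
    m1, m2 = statistics.mean(case), statistics.mean(control)
    v1, v2 = statistics.variance(case), statistics.variance(control)
    # Squared standard error of the difference between the two group means.
    se2 = v1 / n1 + v2 / n2
    t = (m1 - m2) / math.sqrt(se2)
    # Welch-Satterthwaite approximation of the degrees of freedom.
    df = se2 ** 2 / ((v1 / n1) ** 2 / (n1 - 1) + (v2 / n2) ** 2 / (n2 - 1))
    return t, df


# Hypothetical expression values of one marker in each group.
case = [8.1, 7.9, 8.4, 8.0]
control = [6.9, 7.2, 7.0, 7.1]
t, df = welch_t(case, control)
```

A positive `t` means the marker is on average higher in the "Case" arrays; swapping the two groups only flips the sign.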

Visualize Panels

Here we select the relative display type.

Returning to the Open File dialog as we did before by right-clicking on the project entry, we will select the "cardiomyopathy.exp" file we previously saved...

This results in the following color display of the array data for the first array....

Visualize Gene Expression

Visualization tools provide a view of the chip(s) under investigation and can be used to assess the quality of the data. The phenotype and gene panels can be used to limit what is displayed. The images can be saved and exported.

The Microarray View can be used to inspect each separate microarray using the scroll bar.

(insert image)

The Tabular Microarray Panel can be used to see data in spreadsheet format. One row is created per individual marker/probe and one column per microarray.

(insert image)
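The row/column layout of the tabular view can be sketched as a small data structure. The marker names, array names and values below are hypothetical, and the helpers `column` and `to_tsv` are illustrative only, not part of geWorkbench.

```python
# Hypothetical data: one row per marker, one column per microarray.
arrays = ["array_1", "array_2", "array_3"]
table = {
    "marker_A": [7.2, 7.5, 6.9],
    "marker_B": [3.1, 2.8, 3.4],
}


def column(table, arrays, array_name):
    """Return {marker: value} for a single microarray (one column)."""
    j = arrays.index(array_name)
    return {marker: row[j] for marker, row in table.items()}


def to_tsv(table, arrays):
    """Render the table as tab-separated text with a header row."""
    header = "\t".join(["marker"] + arrays)
    lines = [
        "\t".join([marker] + [str(value) for value in row])
        for marker, row in table.items()
    ]
    return "\n".join([header] + lines)


second_array = column(table, arrays, "array_2")
text = to_tsv(table, arrays)
```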

Color Mosaic: Heat maps for microarray expression data, organized by phenotypic or gene groupings.

(insert image)

Expression Profiles: A line graph of gene expression profiles across several arrays/hybridizations.

(insert image)

Expression Value Distribution (EVD): A distribution plot of marker expression values across one or more microarrays.

(insert image)
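The idea behind a distribution plot can be sketched as a simple binning step. This is an illustration only: the bin width, the sample values and the function name `value_distribution` are assumptions, and the binning geWorkbench itself uses is not described here.

```python
import math
from collections import Counter


def value_distribution(values, bin_width=1.0):
    """Count expression values per bin; each key is the lower edge of a bin."""
    counts = Counter(math.floor(v / bin_width) * bin_width for v in values)
    return dict(sorted(counts.items()))


# Hypothetical expression values pooled from one or more microarrays.
values = [0.2, 0.7, 1.1, 1.9, 2.5, 2.6, 2.8]
dist = value_distribution(values)
```

Plotting the bin edges against their counts gives the shape of the distribution.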

Scatter Plot: A pairwise (array vs. array and marker vs. marker) comparison and plotting of expression values.

(insert image)

Filter and Normalize Data

Normalize

Normalization can be used to decrease the effects of systematic differences across a set of experiments. In geWorkbench, normalization replaces the original values with the normalized values. Available geWorkbench normalization methods:

log2 transformation
Threshold Normalizer
Array-based centering
Missing Value Computation
Mean-Variance normalizer
Marker-based centering
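Several of the methods listed above can be sketched in a few lines. The definitions here are common textbook forms and are assumptions, not geWorkbench's documented behavior: centering subtracts the mean, and mean-variance normalization is applied per marker. The matrix has one row per marker and one column per array; the function names and the data are hypothetical. The Threshold Normalizer and Missing Value Computation are not covered.

```python
import math
import statistics


def log2_transform(matrix):
    """Replace every (positive) value with its base-2 logarithm."""
    return [[math.log2(v) for v in row] for row in matrix]


def center_markers(matrix):
    """Marker-based centering: subtract each row's mean from that row."""
    out = []
    for row in matrix:
        mu = statistics.mean(row)
        out.append([v - mu for v in row])
    return out


def center_arrays(matrix):
    """Array-based centering: subtract each column's mean from that column."""
    col_means = [statistics.mean(col) for col in zip(*matrix)]
    return [[v - m for v, m in zip(row, col_means)] for row in matrix]


def mean_variance_normalize(matrix):
    """Scale each row to mean 0 and sample standard deviation 1."""
    out = []
    for row in matrix:
        mu = statistics.mean(row)
        sd = statistics.stdev(row)  # a constant row has sd 0 and would fail here
        out.append([(v - mu) / sd for v in row])
    return out


# Hypothetical raw intensities: 2 markers (rows) x 3 arrays (columns).
raw = [[1.0, 2.0, 4.0], [4.0, 8.0, 16.0]]
logged = log2_transform(raw)
```

Each function returns a new matrix, so in this sketch the caller decides whether to overwrite the original values.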

Clustering Gene Expression Data

Hierarchical Clustering
Self Organizing Map (SOM)

Differential Expression

T Test
Multi Test
- Volcano Plot
- Color Mosaic

Regulatory Network

Reverse Engineering
Cytoscape

Integrated Annotation Information

Enrichment Analysis

GO Term
- GoMiner

Sequence Analysis

Sequence Retrieval
Sequence Homology Analysis
- BLAST
- Other

Pattern Discovery

Position Histogram


geWorkbench

Difference between revisions of "User:Daly"
