geWorkbench

1 Overview
2 geWorkbench Configuration
- 2.1 Component Configuration Manager
3 Workspace and Data Management
4 Microarray Data Displays
5 Gene and Pathway Annotations
- 5.1 Marker Annotations
- 5.2 Marker Annotations - BioCarta Pathways
6 Statistical tests, clustering and classification
7 Sequence Analysis / Pattern Discovery
8 Network Discovery and Visualization
9 Molecular Structure

Overview

The geWorkbench graphical interface with a CNKB query result dislayed in Cytoscape.

geWorkbench Configuration

Component Configuration Manager

Individual components can be loaded as needed.

Workspace and Data Management

Workspace

Data sets are loaded into the Workspace. Individual analysis results are stored under their parent dataset.

Array/Phenotypes

Sets of arrays can be defined and included or excluded from particular analyses (via checkboxes). Sets can be individually marked as belonging to case (here indicated with a red thumbtack) or control groups.

Markers

Sets of arrays can be defined and included or excluded from particular analyses (via checkboxes).

In addition, routines such as ANOVA or t-test return lists of significant markers to the Markers component.

Retrieval from caArray

Microarray data can be retrieved directly from instances of caArray.

Microarray Data Displays

Microarray Viewer

The Microarray Viewer displaying marker values for selected array.

Tabular Microarray Viewer

The Tabular Microarray Viewer displays expression values in spreadsheet format.

CEL Image Viewer

Allows viewing of Affyemtrix CEL files.

Color Mosaic

The Color Mosaic component displaying a result from ANOVA analysis. It can also directly display the loaded expression data, or subsets of that data created using Marker and Array sets.

Expression Profile

Expression Profile plotting values for selected markers and arrays. Individual values can be seen by hovering over a desired data point.

Expression Value Distribution

The dataset has been quantile normalized and log2 transformed.

Scatter Plot

Compare multiple markers or arrays with the standard Scatter Plot analysis.

Array vs Array

Marker vs Marker

Gene and Pathway Annotations

Marker Annotations

Retrieve and display gene and pathway information from bioDBNet.

Marker Annotations - BioCarta Pathways

Displays BioCarta images retrieved via bioDBNet.

Statistical tests, clustering and classification

Gene Ontology Term Over-representation Analysis

T Test

Volcano Plot

A t-test result display on a "volcano plot": Log significance vs log fold change.

Color Mosaic

The t-test result can also be displayed in the Color Mosaic component.

(Visualization preference setting: Relative)

Analysis of Variance (ANOVA)

Detects markers for which a statistically significant difference exists in a data set containing multiple classes of samples.

Color Mosaic View

(Visualization preference setting: Relative)

Tabular View

Hierarchical Clustering Dendrogram

A Dendrogram displays the results of the Hierarchical clustering analysis.

SOM Clustering

Self Ordered Map clustering results are displayed as series of expression profiles corresponding to discovered groupings.

Consensus Clustering (GenePattern)

Classification (GenePattern)

GenePattern components that perform classification on microarray datasets have been adapted to geWorkbench.

K-Nearest Neighbors (KNN)

Example of classifier result

The classifiers return groups of markers to the Markers component:

Sequence Analysis / Pattern Discovery

Sequence Retriever

Retrieve genomic and protein sequences for selected markers. Retrieved sequences can be individually selected and added to the project as new data nodes.

BLAST Queries

The Sequence Alignment component submits BLAST jobs to the NCBI server and displays the results such that individual hits can be used in further analysis steps.

Pattern Discovery

Use the SPLASH algorithm to discover sparse amino or nucleic acid patterns in a loaded sequence.

Motif discovery and display

The Pattern Discovery component itself with results displayed in the sequence viewer.

Position Histogram

The Pattern Discovery component with results displayed as histogram of support for selected discovered motifs across the sequence data set. Support indicates what fraction of the seqeunces are matched by the motif at a within a sliding window about a given location.

Promoter

Individual motifs from the JASPAR Transcription Factor Binding Profile Database can be scanned against loaded genomic sequences.

Motif selection and Logo display

Result of a scan against a single sequence

Sequence-level display of match

MatrixREDUCE

MatrixREDUCE is a tool for inferring the binding specificity and nuclear concentration of transcription factors from microarray data.

Network Discovery and Visualization

Cytoscape - ARACNe Network display

The adjacency matrix generated by an ARACNe network reverse engineering run displayed in Cytoscape.

Cellular Network Knowledge Base (CNKB)

Results of queries against the CNKB can be filtered based on confidence values using the throttle graph.

Query results and Throttle Graph

CNKB query results displayed in Cytoscape

Display of CNKB interactions in Cytoscape.

Master Regulator Analysis

Molecular Structure

JMOL Structure Viewer

JMOL is a viewer for PDB protein structure files.

Mark-Us - Protein Functional Annotation

Mark-Us is a web server to assist the assessment of the biochemical function for a given protein structure. MarkUs identifies related protein structures and sequences, detects protein cavities, and calculates the surface electrostatic potentials and amino acid conservation profile.

PUDGE

Pudge is a server for the prediction of the 3D structures of proteins. While the server can be run without any user intervention, it is primarily designed to be interactive and to allow functional information to be used as a guide to the modeling.

PUDGE allows a pipeline of modeling and evaluation steps, depicted below, to be set up and run.

Pudge has a number of output types, the following illustrates a sequence alignment:

Screenshots

Contents

Overview

geWorkbench Configuration

Component Configuration Manager

Workspace and Data Management

Workspace

Array/Phenotypes

Markers

Retrieval from caArray

Microarray Data Displays

Microarray Viewer

Tabular Microarray Viewer

CEL Image Viewer

Color Mosaic

Expression Profile

Expression Value Distribution

Scatter Plot

Array vs Array

Marker vs Marker

Gene and Pathway Annotations

Marker Annotations

Marker Annotations - BioCarta Pathways

Statistical tests, clustering and classification

Gene Ontology Term Over-representation Analysis

T Test

Volcano Plot

Color Mosaic

Analysis of Variance (ANOVA)

Color Mosaic View

Tabular View

Hierarchical Clustering Dendrogram

SOM Clustering

Consensus Clustering (GenePattern)

Classification (GenePattern)

K-Nearest Neighbors (KNN)

Example of classifier result

Sequence Analysis / Pattern Discovery

Sequence Retriever

BLAST Queries

Pattern Discovery

Motif discovery and display

Position Histogram

Promoter

Motif selection and Logo display

Result of a scan against a single sequence

Sequence-level display of match

MatrixREDUCE

Network Discovery and Visualization

Cytoscape - ARACNe Network display

Cellular Network Knowledge Base (CNKB)

Query results and Throttle Graph

CNKB query results displayed in Cytoscape

Master Regulator Analysis

Molecular Structure

JMOL Structure Viewer

Mark-Us - Protein Functional Annotation

PUDGE

Search

Personal tools

Tools