ANOVA

Revision as of 13:14, 28 July 2009 by Smith (talk | contribs) (Overview)

Home | Quick Start | Basics | Menu Bar | Preferences | Component Configuration Manager | Workspace | Information Panel | Local Data Files | File Formats | caArray | Array Sets | Marker Sets | Microarray Dataset Viewers | Filtering | Normalization | Tutorial Data | geWorkbench-web Tutorials

Analysis Framework | ANOVA | ARACNe | BLAST | Cellular Networks KnowledgeBase | CeRNA/Hermes Query | Classification (KNN, WV) | Color Mosaic | Consensus Clustering | Cytoscape | Cupid | DeMAND | Expression Value Distribution | Fold-Change | Gene Ontology Term Analysis | Gene Ontology Viewer | GenomeSpace | genSpace | Grid Services | GSEA | Hierarchical Clustering | IDEA | Jmol | K-Means Clustering | LINCS Query | Marker Annotations | MarkUs | Master Regulator Analysis | (MRA-FET Method) | (MRA-MARINa Method) | MatrixREDUCE | MINDy | Pattern Discovery | PCA | Promoter Analysis | Pudge | SAM | Sequence Retriever | SkyBase | SkyLine | SOM | SVM | T-Test | Viper Analysis | Volcano Plot


Overview

The geWorkbench ANOVA component utilizes a one-way analysis of variance calculation from TIGR's MEV package. At least three groups of arrays must be specified by defining and activating them in the Arrays/Phenotypes component. For each chosen marker the routine determines if, at the specified level of significance, any difference exists in expression values between any of the groups (the null hypothesis is that there is no difference between the groups). Several basic methods of multiple testing correction are offered. The analysis does not indicate between which groups the difference is found, only that one exists.

Those markers for which a significant difference is found are placed into a new set in the Markers component called "Significant Genes". The results are also display as a heat map in the Color Mosaic component.

Loading the example data

This example uses the Bcell-100.zip available and further described on the Download page. Briefly, this dataset is composed of 100 Affymetrix HG-U95Av2 arrays on which various B-cell lines, both normal and cancerous, were analyzed. Thus it explores a potentially wide variety of expression phenotypes. In order to see the gene names associated with the probesets, one must also download the correct annotation file for this array from the Affymetrix Netaffx website.

  1. Obtain the Bcell-100.exp dataset. If downloaded as a zip file, unzip it to a convenient directory.
  2. (Optional) Obtain the annotation file for the HG-U95Av2 array type from Affymetrix. The name will be similar to "HG_U95Av2.na28.annot.csv", where na28 is the version number.
  3. Load the Bcell-100.exp dataset into geWorkbench as type "Affymetrix File Matrix". (See Loading from File).
  4. When prompted, and if desired, load the annotation file.

Choosing array groups

The Bcell-100 dataset comes with predefined sets of arrays. In the Arrays/Phenotypes component (at lower left in the geWorkbench GUI), choose the group in the pulldown menu called "Class". Check each of the four sets of arrays to activate them as shown in the figure below.

T ANOVA array groups.png


Setting up ANOVA Parameters

T ANOVA setup.png


ANOVA result display

T ANOVA ColorMosaic.png



T ANOVA significant genes set.png