Difference between revisions of "T-test"

Line 4: Line 4:
 
__TOC__
 
__TOC__
  
 +
==Overview==
 +
 +
Note - as of 5/22/2006 this page is being redrafted, should be complete on 5/23/2006.
  
 
In this tutorial, you will:  
 
In this tutorial, you will:  
  
* Get acquainted with the t-Test and Multi t-Test
+
* Discuss the background for using Student's t-Test to evaluate data.
* Apply a t-Test and Multi t-Test
+
* Load a predefined matrix format microarray dataset.
 
+
* Select classes of arrays to compare.
 
+
* Apply a t-Test.
 +
* Apply a multi t-test.
 +
* Visualize the results in the Color Mosaic and Volcano Plot components.
 +
* Note the creation of resulting marker lists and visualization objects.
  
Before you can continue, geworkbench should be running.  Load the data as described in [[Tutorial - Projects and Data Files]].
+
==Preparation==
  
 +
The file "webmatrix_quantile_log2_dev1_mv0.exp" is contained in the downloadable zip archive tutorial_data.zip.  See the Download area.
  
 +
For tips on loading data files, see the section [[Tutorial - Projects and Data Files]].
  
==t-Test==
 
  
A t-Test analysis can be used to identify markers with statistically significant differential expression between sets of microarrays. The t-test determines, for each marker, if there is a significant difference between the two groups (case and control). To perform this analysis, you must classify the sets, set the analysis parameters and view the results in the visualization components. A detailed description of the t-Test parameters is described in online help.
 
  
 +
==t-Test==
  
===Classify the Sets===
+
A t-Test analysis can be used to identify markers with statistically significant differential expression between two sets of microarrays. The t-test determines, for each marker, if there is a significant difference between the two groups (case and control). To perform this analysis, you must classify the sets, set the analysis parameters and view the results in the visualization components. A detailed description of the t-Test parameters is also available in online help. The implementation in geWorkbench offers several options for multiple testing correction and evaluation of the test statistic.
  
This process has already been described in [[Tutorial - Data Subsets]].  Briefly,
+
===Classification===
  
1. Mark the '''Cardio''' phenotype a 'Case'. By default, sets are marked as control.  Sets classified Case are shown with a red thumbtack icon.
+
The desired sets arrays should be activated in the Arrays/Phenotypes component.
* Right-click on '''Cardio''' phenotype.
 
* Select '''Classification'''>'''Case'''. 
 
2. Activate the arrays '''Normal''' and '''CCMP''' by selecting the checkboxes next to the set name.
 
  
 +
The t-test requires two groups of microarrays to compare.  geWorkbench distinguishes the two groups by one being labeled as "Case".  By default, all others are labeled control.  Note that in the Arrays/Phenotypes component, more than one set of arrays can be marked "Case".  All remaining (activated) arrays will then be in the "Control" group.
  
  
[[Image:T_Arrays_SetCase.png]]
 
  
 
===Set Analysis Parameters===
 
===Set Analysis Parameters===
Line 41: Line 45:
 
* P-Value Parameters tab: p-values based on t-distribution. Note that the default alpha (critical p-value) is set to 0.01.
 
* P-Value Parameters tab: p-values based on t-distribution. Note that the default alpha (critical p-value) is set to 0.01.
 
* Degree of Freedom tab: Welch approximation - unequal group variances.  
 
* Degree of Freedom tab: Welch approximation - unequal group variances.  
 
[[Image:Ttest.gif]]<br>
 
 
  
  
 
===t-Test Results===
 
===t-Test Results===
 +
T_multi-t-test_ProjectFolders_result.png
  
 +
T_multi-t-test_Set_selection_BCELL.png
  
 +
T_t-test_Arrays_case-set_BCELL_webm_qldm.png
  
{|style="border: 1px solid lightGray"
+
T_t-test_bonferroni_BCELL_webm_qldm.png
!|| ||
 
|-
 
|-
 
|  Markers which met the significance test are included in a new Marker Set called “Significant Genes”.  || [[Image:E_ttestgpanel.png]]
 
|-
 
|-
 
| Ancillary dataset is created in the project window. ||  [[Image:Ed_ttestproj.png]]
 
|-
 
|}
 
 
 
  
The values of the t-Test can be seen in the Color Mosaic panel and the Volcano Plot.
+
T_t-test_colormosaic_BCELL_webm_qldm.png
  
 +
T_t-test_dof.png
  
{|style="border: 1px solid lightGray"
+
T_t-test_Markers_BCELL_result.png
!VOLCANO PLOT||COLOR MOSAIC||
 
|-
 
|-|-
 
|-
 
|-
 
| [[Image:Vplot.png]]  ||  [[Image:Ed_cm.png]]
 
|-
 
|-
 
|-
 
| Clicking on any of the spots highlights the marker selected in the Marker component.  * Insert another description || 
 
* The label to the right displays the Significance value ( lower the value, most likely different) and gene name for the displayed genes. The genes are displayed in ascending order by Significance Value.
 
  
* Gene height and width values can be altered to modify the display.
+
T_t-test_ProjectFolders_result.png
  
* The intensity slider is used to modify the intensity of the color codings.  
+
T_t-test_p-values.png
  
* Accession: Includes the accesion number in the label.
+
T_t-test_set_Case_BCELL_webm_qldm.png
  
* Printer Icon: Prints the displayed image.  
+
T_t-test_Set_selection_BCELL.png
  
* Display: Must be toggled on to display data.
+
T_t-test_volcano_BCELL_webm_qldm.png
  
*  ''Pat, Abs, Ratio and Overlapping Pages Icons: These are not relevant to the t-Test display.''
 
  
|-
 
|-
 
|}
 
  
 
==Multi t-test==
 
==Multi t-test==
  
*The Multi t-test component allows more than two groups to be compared simultaneously.  Its shows each Marker Set that has been defined.  It will compare all selected sets against all other selected sets.
+
*The Multi t-test component allows more than two groups to be compared simultaneously.  Its shows each Array Set that has been defined.  It will compare in pairwise fashion all selected sets.  
  
*A step-down Bonferonni type correction is used to account for multiple testing.
+
* A step-down Bonferonni type correction is used to account for multiple testing of markers.
*Results can be viewed in the Volcano Plot and in the Color Mosaic components.
+
* As currently implemented, it does not correct for multiple pairwise t-tests being run on the various array sets.
 +
* Results can be viewed in the Volcano Plot and in the Color Mosaic components.
  
  

Revision as of 18:00, 22 May 2006

Home | Quick Start | Basics | Menu Bar | Preferences | Component Configuration Manager | Workspace | Information Panel | Local Data Files | File Formats | caArray | Array Sets | Marker Sets | Microarray Dataset Viewers | Filtering | Normalization | Tutorial Data | geWorkbench-web Tutorials

Analysis Framework | ANOVA | ARACNe | BLAST | Cellular Networks KnowledgeBase | CeRNA/Hermes Query | Classification (KNN, WV) | Color Mosaic | Consensus Clustering | Cytoscape | Cupid | DeMAND | Expression Value Distribution | Fold-Change | Gene Ontology Term Analysis | Gene Ontology Viewer | GenomeSpace | genSpace | Grid Services | GSEA | Hierarchical Clustering | IDEA | Jmol | K-Means Clustering | LINCS Query | Marker Annotations | MarkUs | Master Regulator Analysis | (MRA-FET Method) | (MRA-MARINa Method) | MatrixREDUCE | MINDy | Pattern Discovery | PCA | Promoter Analysis | Pudge | SAM | Sequence Retriever | SkyBase | SkyLine | SOM | SVM | T-Test | Viper Analysis | Volcano Plot



Overview

Note - as of 5/22/2006 this page is being redrafted, should be complete on 5/23/2006.

In this tutorial, you will:

  • Discuss the background for using Student's t-Test to evaluate data.
  • Load a predefined matrix format microarray dataset.
  • Select classes of arrays to compare.
  • Apply a t-Test.
  • Apply a multi t-test.
  • Visualize the results in the Color Mosaic and Volcano Plot components.
  • Note the creation of resulting marker lists and visualization objects.

Preparation

The file "webmatrix_quantile_log2_dev1_mv0.exp" is contained in the downloadable zip archive tutorial_data.zip. See the Download area.

For tips on loading data files, see the section Tutorial - Projects and Data Files.


t-Test

A t-Test analysis can be used to identify markers with statistically significant differential expression between two sets of microarrays. The t-test determines, for each marker, if there is a significant difference between the two groups (case and control). To perform this analysis, you must classify the sets, set the analysis parameters and view the results in the visualization components. A detailed description of the t-Test parameters is also available in online help. The implementation in geWorkbench offers several options for multiple testing correction and evaluation of the test statistic.

Classification

The desired sets arrays should be activated in the Arrays/Phenotypes component.

The t-test requires two groups of microarrays to compare. geWorkbench distinguishes the two groups by one being labeled as "Case". By default, all others are labeled control. Note that in the Arrays/Phenotypes component, more than one set of arrays can be marked "Case". All remaining (activated) arrays will then be in the "Control" group.


Set Analysis Parameters

  1. From the Analysis Panel, select T-Test Analysis.
  2. Populate the below parameters values and click on Analyze.
  • Alpha-corrections tab: Just Alpha.
  • P-Value Parameters tab: p-values based on t-distribution. Note that the default alpha (critical p-value) is set to 0.01.
  • Degree of Freedom tab: Welch approximation - unequal group variances.


t-Test Results

T_multi-t-test_ProjectFolders_result.png

T_multi-t-test_Set_selection_BCELL.png

T_t-test_Arrays_case-set_BCELL_webm_qldm.png

T_t-test_bonferroni_BCELL_webm_qldm.png

T_t-test_colormosaic_BCELL_webm_qldm.png

T_t-test_dof.png

T_t-test_Markers_BCELL_result.png

T_t-test_ProjectFolders_result.png

T_t-test_p-values.png

T_t-test_set_Case_BCELL_webm_qldm.png

T_t-test_Set_selection_BCELL.png

T_t-test_volcano_BCELL_webm_qldm.png


Multi t-test

  • The Multi t-test component allows more than two groups to be compared simultaneously. Its shows each Array Set that has been defined. It will compare in pairwise fashion all selected sets.
  • A step-down Bonferonni type correction is used to account for multiple testing of markers.
  • As currently implemented, it does not correct for multiple pairwise t-tests being run on the various array sets.
  • Results can be viewed in the Volcano Plot and in the Color Mosaic components.


References

t-test [1]