Difference between revisions of "Fold Change"

(References)
(Overview)
 
(One intermediate revision by the same user not shown)
Line 8: Line 8:
  
 
Two methods are provided to calculate fold change.  The component also allows either calculation to be carried out starting with either linear or log2-transformed data.
 
Two methods are provided to calculate fold change.  The component also allows either calculation to be carried out starting with either linear or log2-transformed data.
 +
 +
 +
'''Note''' - Despite the flexibility offered by this component, the most relevant calculation for log2 transformed input data is the "Difference of average log2 values".
  
 
[[Image:Fold_Change_Analysis.png]]
 
[[Image:Fold_Change_Analysis.png]]
Line 75: Line 78:
 
==References==
 
==References==
  
A comparison of fold-change and the t-statistic for microarray data analysis (2007) Witten  and Tibshirani [http://www-stat.stanford.edu/~tibs/ftp/FCTComparison.pdf | link to paper].
+
A comparison of fold-change and the t-statistic for microarray data analysis (2007) Witten  and Tibshirani [http://www-stat.stanford.edu/~tibs/ftp/FCTComparison.pdf link to paper].

Latest revision as of 15:31, 13 March 2015

Home | Quick Start | Basics | Menu Bar | Preferences | Component Configuration Manager | Workspace | Information Panel | Local Data Files | File Formats | caArray | Array Sets | Marker Sets | Microarray Dataset Viewers | Filtering | Normalization | Tutorial Data | geWorkbench-web Tutorials

Analysis Framework | ANOVA | ARACNe | BLAST | Cellular Networks KnowledgeBase | CeRNA/Hermes Query | Classification (KNN, WV) | Color Mosaic | Consensus Clustering | Cytoscape | Cupid | DeMAND | Expression Value Distribution | Fold-Change | Gene Ontology Term Analysis | Gene Ontology Viewer | GenomeSpace | genSpace | Grid Services | GSEA | Hierarchical Clustering | IDEA | Jmol | K-Means Clustering | LINCS Query | Marker Annotations | MarkUs | Master Regulator Analysis | (MRA-FET Method) | (MRA-MARINa Method) | MatrixREDUCE | MINDy | Pattern Discovery | PCA | Promoter Analysis | Pudge | SAM | Sequence Retriever | SkyBase | SkyLine | SOM | SVM | T-Test | Viper Analysis | Volcano Plot


Overview

Fold change is used to compare the expression of genes between two sets of arrays, e.g. case and control sets.

This component allows a threshold for fold-change to be set. Markers that exceed the threshold are placed into new sets in the Markers component - one for those with positive fold-change, the other for negative (further described below).

Two methods are provided to calculate fold change. The component also allows either calculation to be carried out starting with either linear or log2-transformed data.


Note - Despite the flexibility offered by this component, the most relevant calculation for log2 transformed input data is the "Difference of average log2 values".

Fold Change Analysis.png

Parameters

Criterion

The fold-change threshold that must be met for a marker to be included in the positive or negative fold-change set. This number must be greater than or equal to zero.

The criterion is not adjusted based on the type of calculation. For the ratio method, a fold-change criterion of 4 is comparable in scale to a criterion of 2 for the average log2 method.

Input Data Format

To correctly calculate the chosen fold-change value, the component must know if the data is linear or log2 transformed. This must be specified by the user.

  • Linear
  • Log2-transformed

Calculation Method

Ratio

  • The ratio of case to control is calculated with linear values. Log2 data will first be transformed to linear.
  • For each marker, calculate Avg(Case arrays) / Avg(Control arrays).
  • Ratios less than 1 are inverted and given a negative sign, e.g. 0.25 is reported as -4.

For the ratio calculation, for any given marker, the numerator must be postive or zero, and the denominator must be positive. If either condition is not met, the marker will be skipped an no fold-change calculated for it. The user will be warned about the first 5 markers that are skipped.

Difference of average log2 values

  • Calculated with log2 values. Linear data will be transformed to log2 (shown in formula below).
  • Because of the log2 transformation of linear data, no linear value should be less than or equal to zero. If such a value is encountered, a warning will be issued and the analysis terminated.
  • For each marker, calculate Avg( log2(Case arrays) ) - Avg( log2(Control arrays) ).
  • Here, if the Case arrays have lower expression than the Control, the negative sign results directly.


Tranformation Notice

The below figures show that when the data is to be transformed, the transformation is indicated below the parameter selections.

  • Log2 Transform:


Fold Change Parameters cross1.png


  • Antilog2 Transform:


Fold Change Parameters cross2.png


If the data was not actually log2 transformed, taking the anti-log can cause an arithematic overflow. This is caught and reported to the user, and the analysis terminated.

Example and Results

Setup

  • At least two set of markers must be activated, and at least one marked as Case and one marked as Control.


Fold Change Analysis Arrays.png


We will use the default Fold Change paramter settings:


Fold Change Analysis Settings.png

Results

Separate marker sets are created in the Markers component for those markers with positive and negative fold change. The set names are created as "Fold_change_Pos#.#" and "Fold_change_Neg#.#, where #.# indicates the fold-change threshold criterion that was met.

Fold Change Analysis Result.png

References

A comparison of fold-change and the t-statistic for microarray data analysis (2007) Witten and Tibshirani link to paper.