Master Regulator Analysis

Revision as of 10:49, 15 July 2009 by Smith (talk | contribs) (Overview)

Home | Quick Start | Basics | Menu Bar | Preferences | Component Configuration Manager | Workspace | Information Panel | Local Data Files | File Formats | caArray | Array Sets | Marker Sets | Microarray Dataset Viewers | Filtering | Normalization | Tutorial Data | geWorkbench-web Tutorials

Analysis Framework | ANOVA | ARACNe | BLAST | Cellular Networks KnowledgeBase | CeRNA/Hermes Query | Classification (KNN, WV) | Color Mosaic | Consensus Clustering | Cytoscape | Cupid | DeMAND | Expression Value Distribution | Fold-Change | Gene Ontology Term Analysis | Gene Ontology Viewer | GenomeSpace | genSpace | Grid Services | GSEA | Hierarchical Clustering | IDEA | Jmol | K-Means Clustering | LINCS Query | Marker Annotations | MarkUs | Master Regulator Analysis | (MRA-FET Method) | (MRA-MARINa Method) | MatrixREDUCE | MINDy | Pattern Discovery | PCA | Promoter Analysis | Pudge | SAM | Sequence Retriever | SkyBase | SkyLine | SOM | SVM | T-Test | Viper Analysis | Volcano Plot



Overview

The goal of Master Regulator Analysis (MRA)is to identify transcription factors (TFs) which control the regulation of a set of target genes (TGs) that demonstrate significant differential expression across two cellular phenotypes, e.g. “Case” and “Control” in a microarray dataset. Differential expression is measured using a simple t-test. Sets of genes putatively controlled by each TF (each TF's regulon) are obtained from an adjacency matrix (interaction network) calculated by ARACNe or other source prior to MRA.

The dataset from which the adjacency matrix is derived would not necessarily be the same one used for the t-test. An ARACNe run requires a dataset which explores many different expression phenotypes of a particular cell type, whereas a differential expression experiment compares only two classes.

MRA then calculates for each TF, using Fisher's Exact test, whether there is greater overlap between the set of the TF's target genes and the set of differentially expressed genes than would be expected by chance.

The types of data which will be used in the MRA then are:

  1. A microarray dataset appropriate for examining differential gene expression using a t-test.
  2. A list of putative transcription factors which are to be tested against the differentially expressed genes.
  3. An interaction network in the form of an ARACNe adjacency matrix. It should contain the results of an ARACNe run including, as hub markers, at least all of the transcription factors that will be tested in MRA.

Parameters and Settings

Load Network

The network consists of an adjacency matrix generated by ARACNe.

  • From File - load an adjacency matrix generated by an external run of ARACNe.
  • From Project - load an ARACNe adjacency matrix from a result node in the Project Folders component.

Transcription Factors

  • From File - Load a comma-separate list of transcription factors from a file.
  • From Sets - Use a set defined in the Markers component as the list of transcription factors.

Significance Treshold

  • T-test p-value (alpha) - The cutoff p-value by which to establish whether a particular marker shows a significant difference in expression between the two groups. (Note that multiple testing corrections are offered on the t-test parameters tab).


T MRA Setup.png


The parameter settings for the MRA t-test are shown in the figure below. These parameters are the same as those described in the t-test component tutorial


T MRA t-test.png


Running MRA

  1. Once all the parameters described above are set as desired, press the Analyze button. The t-test followed by the Fisher's Exact tests will be carried out.
  2. A table and graphic showing transcription factors for whose interactions significant overlap with the set of differentially expressed genes was found will be displayed.