MINDy

Revision as of 12:44, 9 December 2008 by Smith (talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

MINDY

The MINDY algorithm uses gene expression data to quantify modulation effects on the regulatory activity of a transcription factor gene (TF) of interest over a set of target genes. The general workflow includes 1) selection of modulator, hub TF and target genes and 2) defining (result) marker sets in the Table and List panel and displaying results for a single modulator and its target transcription factor(s) in the Heat Map panel. This help topic describes executing [#_Run_MINDY MINDY analysis] and [#_View_MINDY_Results viewing the results.]

Setting the Main Parameters

T Mindy Main.png

Modulators List - (From File or From Set) - The list of candidate modulators can either be loaded from a file as a comma separated list, or a set of markers from the Markers component can be selected.

Hub Marker - Select as the Hub Marker a transcription factor gene. The Hub marker can be entered directly in the text field, or the most recently selected marker in the Markers component will be used. Note that even if you directly type in a marker name, it will be replaced if any selection is made in the Markers component, either in the list or in the set "Selection".

Target List - (All Markers, From File or From Set) - The target list should be composed of genes thought to be regulated by the Hub Marker transcription factor. The list of target markers can be loaded from a file containing a comma separated list, or a set of markers from the Markers component can be selected. Alternatively, All Markers can be selected.

(Note - the "All Markers" checkbox at the bottom of the Analysis component should not be used in the MINDY component).

Setting the Advanced Parameters

T Mindy Advanced.png

Sample per Condition (%) - MINDY calculated the difference in mutual information for the TF-Target interaction between the set where the modulator gene is most expressed (+) and the set where the modulator gene is least expressed (-). This parameter specifies the percentage of the available samples to include in each group. E.g. 35% means that the top and bottom 35% of a list of samples ranked by expression would be used.

Conditional (MINDY) - measures the mutual information (MI) of the TF and its target gene across the subset of samples in which the modulator gene is most (or least) expressed.

Mutual Info- Threshold for a conditional mutual information (MI) estimate to be considered statistically different from zero. For example, a value of 0.20 filters out target genes with a conditional MI score of less than 0.20. By default, no threshold is set (MI = 0).

P-Value - Significance level for a conditional mutual information (MI) estimate to be considered statistically different from zero. This is a value between 0 and 1, with 1 indicating no threshold. By default, the value is 1. (Note - P-values are not currently available in MINDY, so this option is disabled).

Correction - None or Bonferroni - correct for multiple testing.

Unconditional (ARACNE)- measures the MI of the TF and its target gene across all samples.

Mutual Info - Threshold for an unconditional mutual information (MI) estimate to be considered statistically different from zero. For example, a value of 0.20 filters out target genes with an unconditional MI score of less than 0.20. By default, no threshold is set (MI = 0).

P-value - Significance level for an unconditional mutual information (MI) estimate to be considered statistically different from zero. This is a value between 0 and 1, with 1 indicating no threshold. By default, the value is 1.

Correction - None or Bonferroni - correct for multiple testing.

DPI Tolerance - The Data Processing Inequality can be used to remove the effects of indirect interactions, e.g. if TF1->TF2->Target, DPI can be used to remove the indirect action of TF1 on the target. The DPI tolerance specifies the degree of sampling error to be accepted, as with a finite sample size an exact value can not be calculated.

The DPI tolerance is normally between 0 and 0.15 since values larger than 0.15 yields higher false positives.

See Margolin et al. 2006 for further details on use of DPI.

DPI Target List - The DPI target list can be used to limit the ARACNE calculation to transcriptional networks. It is used to screen out spurious regulatory interaction signals of genes that are tightly coexpressed but are not in a regulatory relationship to each other, for example genes for two proteins that are in a physical complex and hence always produced in the same amounts. A comma-separated list can be typed in, or it can be loaded from an external file. If used , the DPI Target List should contain all markers that are annotated as transcription factors. Signalling proteins could also be included.

Running a MINDY Analysis

1. Select a microarray set node in the Project Folder.

2. In the analysis pane (lower right of the application), select MINDY Analysis.

3. In the Main tab, populate the Modulators List by selecting a set of markers defined in the Markers component, or load a list from a file.

4. Populate the Target List textbox by selecting the choice "All Markers", or by selecting a set of markers defined in the Markers component, or by loading a list from a file.

5. Populate the Hub Gene textbox to designate the TF gene by (1) typing the marker name (as displayed in the Markers component) or (2) in the Selection Area (lower left of the application) Marker Tab, click on the marker name corresponding to the TF.

6. In the Advanced Tab, accept the default values or update the parameter values by clicking the parameter textbox and typing a new value.

7. Click Analyze. If successful, the project window is updated to reflect the MINDY result node which is a child node of the input dataset. Please note that the Dataset History tab captures the analysis parameters.

Viewing MINDY Results

1. Select the MINDY result node in the Project Folder.

2. In the Modulator Tab, indicate the modulators of interest using the checkboxes or click on Select All to display all modulators in the Table, List,and Heat Map views. The Modulators Selected is updated to reflect the number of modulators selected. Only selected Modulators are displayed on the Table, List and Heat Map views. Additional actions include:

· Marker Display: Indicate marker display preferences for the Modulator column ( probe name or symbol).

· Sort: Click on the column headers or use sort options available in the left pane.

· Add to Set: Adds selected modulators to a Marker Set. You can select one or more Targets and/or Modulators, using the selection checkboxes.

· All Markers: This checkbox determines if all the target genes are displayed or only genes in activated marker groups.

3. Select from the various tabs to view the data in alternate formats. See [#_Navigating_Mindy_Visualization Navigating Mindy] for additional information on these data views.

Navigating Mindy Visualization

MINDY includes the following data views: [#mod Modulator], [#table Table], [#list List] and [#heatmap Heat Map].

T Mindy Viewer Modulator.png

Modulator: This table-based view contains one row per modulator gene. Only modulators selected in this tab are included the other data views. The value of the Mode column for a modulator M is either “+”, “-“ or null (0) depending on if M+ is larger, smaller or equal to M-.

T Mindy Viewer Table.png

Table: The rows of the table represent target genes and the columns represent modulators. Additional actions include:

· Marker Display: Indicate marker display preferences for the Modulator column ( probe name or symbol).

· Sorting: Displays columns (modulators) from left to right in descending order by; Aggregate ( M#), Enhancing (M+) or Negative (M-).

· Modulator Limits: Activates the checkbox to limit the columns (modulators) display to a defined value. This selection filters the modulator display based upon the current display order.

Display Options:

· Color View: Enables a heat map display of each cell based on the value of the score. 1 is displayed as absolute blue; +1 is displayed as absolute red; 0:1 is mapped uniformly from white to shades of red; -1:0 is mapped uniformly from shades of blue to white.

· Score View: Displays the discretized score values.

T Mindy Viewer List.png

List: The table has three columns: Modulator, Target and Score. Additional actions include:

Marker Display: Indicate marker display preferences for the Modulator column ( probe name or symbol).

Marker Override: Marker selection preferences. As markers are selected, the number of markers selected is listed next to Enable Selection field. This does not reflect the number of rows.

T Mindy Viewer HeatMap.png

Heat Map: The heat map contains two color mosaic panels. The rows correspond to target genes and arrays are ordered according to the expression of the TF gene (low to high). The left panel correspond to the L- arrays where the modulator is least expressed while the columns on the right panel to the L+ arrays where the modulator is most expressed. Additional actions include:

· Marker Display: Indicate marker display preferences for the Modulator column ( probe name or symbol).

· Transcription Factor: Displays the TF entered in the Mindy Analysis parameters.

· Modulator: Select a modulator from the “Selected Modulators” list to update the heat map display.

· Refresh: Resets the heat map display.

· Image Snapshot: Captures the heat map as an image node in the Project Folder.

References

Margolin, A., Wang, K., Lim, W.K., Kustagi, M., Nemenman, I., and Califano, A. Reverse Engineering Cellular Networks. Nature Protocols, 2006 Vol 1(2). ppgs 662-671.'