PCA

Revision as of 18:00, 26 November 2013 by Smith (talk | contribs)

Principal Component Analysis (PCA)

Principal component analysis is used to find the most important contributors to the variance in a dataset.

geWorkbench can dispatch a PCA job to a GenePattern server and display the returned result.

The analysis can be performed on either experiments (arrays) or genes. The result is the set of features of the experiments or genes that are most important in explaining the data.
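To make the choice between experiments and genes concrete, here is a minimal NumPy sketch. It is not the code geWorkbench or GenePattern runs. The function name `explained_variance_ratio`, the `by` parameter, and the genes-as-rows layout are assumptions made for illustration, as is the reading that analyzing "experiments" means treating each experiment as a variable.

```python
import numpy as np

def explained_variance_ratio(data, by="experiments"):
    """Fraction of total variance carried by each principal component.

    data: 2-D array with genes as rows and experiments (arrays) as columns
          (an assumed layout for this sketch).
    by:   "experiments" treats each experiment as a variable;
          "genes" treats each gene as a variable.
    """
    x = np.asarray(data, dtype=float)
    if by == "genes":
        x = x.T  # rows become experiments, columns become genes
    elif by != "experiments":
        raise ValueError("by must be 'experiments' or 'genes'")
    centered = x - x.mean(axis=0)  # center each variable
    # Squared singular values of the centered matrix are proportional
    # to the variance along each principal component.
    singular_values = np.linalg.svd(centered, compute_uv=False)
    variances = singular_values ** 2
    return variances / variances.sum()

# Made-up matrix: 4 genes (rows) x 3 experiments (columns).
example = np.array([[1.0, 2.0, 3.1],
                    [2.0, 4.1, 6.0],
                    [3.0, 6.0, 9.2],
                    [4.0, 7.9, 12.0]])
ratios = explained_variance_ratio(example, by="experiments")
print(ratios)
```

The returned fractions are ordered from the largest to the smallest component, so the first few entries show how much of the data a small number of components can explain.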


The PCA analysis interface is shown below.


PCA analysis genes.png


After an analysis has run, highlight any number of principal components to graph the weight that each original variable contributes to them. Each principal component is drawn as a line in a different color. The table below the graph lists the individual weights that define each principal component.


PCA Components.png
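The weights in the table can be thought of as the entries of each principal component vector, one entry per original variable. The following NumPy sketch shows one way such weights might be computed. The function `component_weights` and the sample values are hypothetical, and the sign and scaling of the weights reported by GenePattern may differ.

```python
import numpy as np

def component_weights(data, n_components):
    """Return one row of weights per principal component.

    data: 2-D array with observations as rows and variables as columns.
    Each returned row has unit length and holds one weight per variable.
    """
    x = np.asarray(data, dtype=float)
    centered = x - x.mean(axis=0)
    # Rows of vt are the principal components, ordered by variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return vt[:n_components]

# Made-up data: 4 observations of 3 variables.
samples = np.array([[2.0, 0.1, 1.0],
                    [4.0, -0.1, 2.1],
                    [6.0, 0.2, 2.9],
                    [8.0, -0.2, 4.0]])
weights = component_weights(samples, 2)
for i, row in enumerate(weights, start=1):
    print(f"PC{i}: " + "  ".join(f"{w:+.3f}" for w in row))
```

Plotting each row against the variable index would give one line per component, which is the kind of graph described above.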


To produce a plot, select two or three components in the list and click the "Plot" button.

Selecting two components produces a 2-D graph, while selecting three components produces a 3-D graph.

The 3-D graph can be rotated by left-clicking any data point and dragging it with the mouse.


PCA Projection.png
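The coordinates in such a plot are the projections of the centered data onto the selected components. The sketch below illustrates this with NumPy. The function `project`, its zero-based `selected` indices, and the random input are assumptions for illustration and do not describe how geWorkbench builds its plot.

```python
import numpy as np

def project(data, selected):
    """Project data onto two or three selected principal components.

    data:     2-D array with observations as rows and variables as columns.
    selected: zero-based indices of the chosen components.
    Returns an array with one row per observation and one column per
    selected component (2 columns for a 2-D plot, 3 for a 3-D plot).
    """
    if len(selected) not in (2, 3):
        raise ValueError("select two or three components")
    x = np.asarray(data, dtype=float)
    centered = x - x.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    # Each selected row of vt is one axis of the plot.
    return centered @ vt[list(selected)].T

rng = np.random.default_rng(0)
points = rng.normal(size=(10, 5))   # made-up data: 10 observations, 5 variables
coords_2d = project(points, [0, 1])     # shape (10, 2)
coords_3d = project(points, [0, 1, 2])  # shape (10, 3)
```

Each row of the result is one data point in the plot, so the two-column output corresponds to a 2-D graph and the three-column output to a 3-D graph.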


Left-clicking a data point also highlights it in the Markers component.


PCA Selection.png


Below, the PCA result node is shown in the Workspace, along with a highlighted gene.


PCA Result.png