Difference between revisions of "Tutorial - GO Term Enrichment"
(→Preparations) |
|||
Line 27: | Line 27: | ||
[[Image:T_HierarchicalClustering_BCregion.png]] | [[Image:T_HierarchicalClustering_BCregion.png]] | ||
− | By scrolling down a bit, one finds | + | By scrolling down a bit, one finds a large interesting area, showing clear differences between groups of arrays. We will select two clearly differentiated clusters. Check the '''Enable Zoom''' checkbox. Then highlight the first cluster of 12 markers as shown here: |
+ | |||
+ | [[Image:T_HierarchicalClustering_BC12Markers.png]] | ||
− | |||
Then left-click to select this subset of the dendrogram. It will be displayed alone. | Then left-click to select this subset of the dendrogram. It will be displayed alone. | ||
− | Now right-click and select "Add to Set". In the Markers component, the select genes are added as Cluster Tree [ | + | |
− | + | [[Image:T_HierarchicalClustering_BC12MarkersZoom.png]] | |
+ | |||
+ | |||
+ | Now right-click and select "Add to Set". In the Markers component, the select genes are added as Cluster Tree [12], where 12 is the number of markers selected. | ||
− | [[Image: | + | Repeat for the similar region just below, which contains another 44 markers. |
+ | |||
+ | |||
+ | [[Image:T_HierarchicalClustering_BC44Markers.png]] |
Revision as of 19:02, 24 April 2006
Overview
The Gene Ontology component allows the exploration of the Gene Ontology (GO) terms represented within a list of genes. Several different display options are available. The entire GO can be displayed as a tree (TreeView), with the selected genes being shown within the tree. Or, the list of genes can be displayed sorted by their overrepresentation P-values (TableView). This P-value is calculated from the observed vs expected number of hits to a category based on its representation in markers annotated in the microarray type as a whole, e.g. HG_U95Av2.
Example
In this example we will use the file "webmatrix_quantile_log2_dev1_mv0.exp" available in the tutorial data section (coming soon).
Preparations
We must first obtain a list of interesting genes. We will use the results obtained in the tutorial on hierarchical clustering. In brief, this clustering result can be recreated by these steps:
- In the Arrays/Phenotypes component, select the set of arrays labeled "ultrashort designation".
- Activate two classes of arrays to compare, the GC B-cell and non-GC B-cell, by checking the boxes before the names.
- Go to the Analysis component, and select Hierarchical Clustering.
- At the bottom of the Analysis component, uncheck the box that says "All Arrays". This will allow the clustering to be done only on those arrays which were activated in the Arrays/Phenotypes component.
- In Hierarchical Clustering, set the parameters to:
- Clustering Method: Total Linkage
- Clustering Dimension: Both
- Clustering Metric: Euclidean
- Click Analyze.
The results will be displayed in the Dendrogram component.
By scrolling down a bit, one finds a large interesting area, showing clear differences between groups of arrays. We will select two clearly differentiated clusters. Check the Enable Zoom checkbox. Then highlight the first cluster of 12 markers as shown here:
Then left-click to select this subset of the dendrogram. It will be displayed alone.
Now right-click and select "Add to Set". In the Markers component, the select genes are added as Cluster Tree [12], where 12 is the number of markers selected.
Repeat for the similar region just below, which contains another 44 markers.