Hierarchical Clustering
Home | Quick Start | Basics | Menu Bar | Preferences | Component Configuration Manager | Workspace | Information Panel | Local Data Files | File Formats | caArray | Array Sets | Marker Sets | Microarray Dataset Viewers | Filtering | Normalization | Tutorial Data | geWorkbench-web Tutorials |
Analysis Framework | ANOVA | ARACNe | BLAST | Cellular Networks KnowledgeBase | CeRNA/Hermes Query | Classification (KNN, WV) | Color Mosaic | Consensus Clustering | Cytoscape | Cupid | DeMAND | Expression Value Distribution | Fold-Change | Gene Ontology Term Analysis | Gene Ontology Viewer | GenomeSpace | genSpace | Grid Services | GSEA | Hierarchical Clustering | IDEA | Jmol | K-Means Clustering | LINCS Query | Marker Annotations | MarkUs | Master Regulator Analysis | (MRA-FET Method) | (MRA-MARINa Method) | MatrixREDUCE | MINDy | Pattern Discovery | PCA | Promoter Analysis | Pudge | SAM | Sequence Retriever | SkyBase | SkyLine | SOM | SVM | T-Test | Viper Analysis | Volcano Plot |
Contents
[hide]Overview
geWorkbench implements its own code for agglomerative hierarchical clustering.
Note - hierarchical clustering is memory intensive. With the default memory settings (see here to change), clustering more than about 2000 markers is not recommended.
Parameters
Clustering Method
Single Linkage
Average Linkage
Total Linkage
Clustering Dimension
Marker
Microarray
Both
Clustering Metric
Euclidean
Pearson's
Spearman's
All Arrays
All Markers
Example
Running the calculation
This example will take off with the set of markers produced in the ANOVA example. Please follow the steps for that example to produce the starting marker set, or just create/select another set of markers of your own.
1. If following the ANOVA example, activate the set of markers labeled "Significant Genes [1786]" ( which contains 1786 markers).
2. Set the parameters as shown in the following figure.
- Clustering methods: Average Linkage.
- Clustering Dimension: Marker.
- Clustering Metric: Euclidean.
3. Click Analyze.
4. A progress bar will be visible during the calculation.
The results are placed in the Project Folders component and labeled "Hierarchical Clustering", and can be displayed in the Dendrogram component.
Displaying results in the Dendrogram component
Hierarchical clustering results are displayed in the Dendrogram component. The four horizontal bars shown in the diagram below were added just to show the boundaries of the four activated array sets.
The Dendrogram component allows one to select and work with just a portion of the displayed tree. To activate this feature, check the Enable Selection checkbox at lower left in the Dendrogram component.
Now when the cursor is moved over the displayed tree, a blue rectangle will indicate which portion of the tree is currently sub-selected.
Clicking on the selected area will cause only this area to be displayed.
In turn, right-clicking on the displayed sub-tree will show a menu with two entries:
- Image Snapshot - place a static snapshot of the tree as currently displayed into Project Folders component.
- Add to Set - add the markers represented in the currently displayed tree to a new set in the Markers component called "Cluster Tree".
The following figure shows the result of saving the displayed tree.