Difference between revisions of "Consensus Clustering"

(Parameters)
Line 9: Line 9:
  
 
[[File:Consensus_clustering_parameters.png]]
 
[[File:Consensus_clustering_parameters.png]]
 +
 +
 +
 +
* '''kmax''' - Try K=2,3,...,kmax clusters (must be > 1)
 +
* '''resampling iterations''' - Number of resampling iterations
 +
* '''seed value''' - Random number generator seed
 +
* '''clustering algorithm''' - Type of clustering algorithm
 +
* '''cluster by''' - Whether to cluster by rows/genes or columns/experiments
 +
* '''distance measure''' - Distance measure
 +
* '''resample''' - resampling scheme (one of 'subsample[ratio]', 'features[nfeat]',
 +
'nosampling')
 +
* '''merge type''' - Ignored when algorithm other than hierarchical selected
 +
* '''descent iterations''' - Number of SOM/NMF iterations
 +
* '''output stub''' - Stub pre-pended to all the output file names
 +
* '''normalize type''' - row-wise, column-wise, both
 +
* '''normalization iterations''' - number of row/column normalization iterations (supercedes normalize.type)
 +
* '''create heat map''' - Whether to create heatmaps (one for each cluster number)
 +
* '''heat map size''' - point size of a consensus matrix's heat map (between 1 and
 +
20)
 +
 +
 +
 +
==References==
 +
 +
References:
 +
• S. Monti, et al. “Consensus Clustering: A resampling-based method for class
 +
discovery and visualization of gene expression microarray data”, Machine
 +
Learning Journal, 52(1-2):91-118, 2003.

Revision as of 16:49, 8 August 2013

Overview

This component allows geWorkbench to run the Consensus Clustering on a GenePattern server. The user must supply the URL of an available GenePattern server. All results are returned to geWorkbench, and the clustered arrays or markers are available as sets for further analysis.

For information on the Consensus Clustering algorithm, please see the GenePattern documentation at GenePattern Analysis:Modules.


Parameters

Consensus clustering parameters.png


  • kmax - Try K=2,3,...,kmax clusters (must be > 1)
  • resampling iterations - Number of resampling iterations
  • seed value - Random number generator seed
  • clustering algorithm - Type of clustering algorithm
  • cluster by - Whether to cluster by rows/genes or columns/experiments
  • distance measure - Distance measure
  • resample - resampling scheme (one of 'subsample[ratio]', 'features[nfeat]',

'nosampling')

  • merge type - Ignored when algorithm other than hierarchical selected
  • descent iterations - Number of SOM/NMF iterations
  • output stub - Stub pre-pended to all the output file names
  • normalize type - row-wise, column-wise, both
  • normalization iterations - number of row/column normalization iterations (supercedes normalize.type)
  • create heat map - Whether to create heatmaps (one for each cluster number)
  • heat map size - point size of a consensus matrix's heat map (between 1 and

20)


References

References: • S. Monti, et al. “Consensus Clustering: A resampling-based method for class discovery and visualization of gene expression microarray data”, Machine Learning Journal, 52(1-2):91-118, 2003.