Difference between revisions of "Array Sets"

(Classifying a set)
Line 1: Line 1:
 
{{TutorialsTopNav}}
 
{{TutorialsTopNav}}
  
==Preparation==
+
==Background==
  
In this tutorial we will start with the same data files that were used in [[Tutorial - Projects and Data Files]]Load the ten individual MAS5 data files as shown there.
+
geWorkbench makes extensive use of the notion of sets of markers or arrays/phenotypes.  Sets of markers are returned from various analysis routines.  Sets of arrays/phenotypes can be specified to group arrays in a meaningful fashion for statistical analysisFor example, two such phenotypes might be the diseased and normal states of a tissue from which samples have been taken.  
  
 +
geWorkbench supports groupings of sets.  Each such group can contain different sets of markers or arrays, or alternatively just assign different names to the same arrangements of underlying data, for example abbreviations vs descriptive names.
  
==Creating sets==
 
  
When working with microarrays, geWorkBench uses the term '''''marker''''' to refer to a gene probe (in other cases, it can be individual items from other data sets, such as sequences).
+
==Preparation==
  
'''''Phenotype''''' refers to any user-defined grouping of microarrays. These microarrays will typically share some common property. For example, two such phenotypes might be the diseased and normal states of a tissue from which samples have been taken.  
+
In this tutorial we will start with the same data files that were used in [[Tutorial - Projects and Data Files]]. Load the ten individual MAS5 data files as shown there.
  
  
 
==Assigning arrays to sets==
 
==Assigning arrays to sets==
  
geWorkbench supports groupings of sets. We will place the phenotypes in the default group, however you can create new phenotype group by pushing the '''New''' button on '''Phenotype Sets''' at lower left.
+
  We will place the arrays in the default group, however you can create new a new group by pushing the '''New''' button on '''Array/Phenotype Sets''' at lower left.
  
 
First, we will select and label arrays which contain samples from the congestive cardiomyopathy disease state:
 
First, we will select and label arrays which contain samples from the congestive cardiomyopathy disease state:

Revision as of 22:40, 27 February 2006

Home | Quick Start | Basics | Menu Bar | Preferences | Component Configuration Manager | Workspace | Information Panel | Local Data Files | File Formats | caArray | Array Sets | Marker Sets | Microarray Dataset Viewers | Filtering | Normalization | Tutorial Data | geWorkbench-web Tutorials

Analysis Framework | ANOVA | ARACNe | BLAST | Cellular Networks KnowledgeBase | CeRNA/Hermes Query | Classification (KNN, WV) | Color Mosaic | Consensus Clustering | Cytoscape | Cupid | DeMAND | Expression Value Distribution | Fold-Change | Gene Ontology Term Analysis | Gene Ontology Viewer | GenomeSpace | genSpace | Grid Services | GSEA | Hierarchical Clustering | IDEA | Jmol | K-Means Clustering | LINCS Query | Marker Annotations | MarkUs | Master Regulator Analysis | (MRA-FET Method) | (MRA-MARINa Method) | MatrixREDUCE | MINDy | Pattern Discovery | PCA | Promoter Analysis | Pudge | SAM | Sequence Retriever | SkyBase | SkyLine | SOM | SVM | T-Test | Viper Analysis | Volcano Plot


Background

geWorkbench makes extensive use of the notion of sets of markers or arrays/phenotypes. Sets of markers are returned from various analysis routines. Sets of arrays/phenotypes can be specified to group arrays in a meaningful fashion for statistical analysis. For example, two such phenotypes might be the diseased and normal states of a tissue from which samples have been taken.

geWorkbench supports groupings of sets. Each such group can contain different sets of markers or arrays, or alternatively just assign different names to the same arrangements of underlying data, for example abbreviations vs descriptive names.


Preparation

In this tutorial we will start with the same data files that were used in Tutorial - Projects and Data Files. Load the ten individual MAS5 data files as shown there.


Assigning arrays to sets

We will place the arrays in the default group, however you can create new a new group by pushing the New button on Array/Phenotype Sets at lower left.

First, we will select and label arrays which contain samples from the congestive cardiomyopathy disease state:

1. In the Arrays/Phenotypes component, select the following arrays from samples from the congestive cardiomyopathy disease state:

JB-ccmp_0120.txt, JB-ccmp_0218.txt, JB-ccmp0718.txt, JB-ccmp0811.txt, JB-ccmp1003.txt, JB-ccmp_1109.tx

2. Right-click, select Add to Set.

3. Enter "Cardio" in the input box and click OK.


T PanelLabelCardio.png


4. Next, similarly label the follow arrays as "Normal" ( repeat steps 2 & 3 ):

JB-n_0106.txt , JB-n_0821.txt, JB-n_0915.txt, JB-n_1303.txt


Activating sets

5. Select the checked boxes next to the panel name to indicate that these sets of arrays are "Active". Various analysis and visualization components can be set to only use/display activated arrays or markers.


T PhenotypesPriorToCase.png


Classifying a set

For statistical tests such as the t-test, Case and Control groups can be specified.

  1. Left-click on the thumb-tack icon in front of the phenotype name.
  2. Select Case to specify the disease arrays as the "Case". The remaining "Normal" arrays are by default labeled control.

T PhenotypeSettingCase.png


A red thumbtack indicates the arrays have been specified as "Case".


T PhenotypeCaseSet.png