Difference between revisions of "Basics"

Line 9: Line 9:
 
The graphical user interface for geWorkbench is divided into four major sections, for
 
The graphical user interface for geWorkbench is divided into four major sections, for
  
1. Projects - Data management (upper left)
+
1. Data management - Workspace and Projects (upper left)
  
 
2. Marker and Array/Phenotype set selection and management  (lower left)
 
2. Marker and Array/Phenotype set selection and management  (lower left)
Line 21: Line 21:
  
  
The Data Management area can hold one workspace, and a workspace in turn can hold one or more projectsProjects can be used as wished to group different data sets.  Each opened data file or analysis result is stored in a project.  A workspace and all the data it contains can be saved and returned to later.
+
Areas 2, 3 and 4 are defined for convenienceThe actual placement of a given component into any of these three areas is controlled by a configuration file and can be customized as desired.
  
 +
===Data management area===
 +
 +
Working with geWorkbench involves creating a projects within the top-level workspace.  Open data files and the results of data transformation or analysis are stored within a project.  A workspace can contain more than one project at a time, allowing data to be organized as desired.  A workspace and all the projects and data within it can be saved and later reloaded.  These operations will be described in detail in further sections of the tutorials.
 +
 
The GUI provides a menu bar at top with a standard choice of commands.  Many commands that are available in the menu bar are also available by right-clicking on data objects.
 
The GUI provides a menu bar at top with a standard choice of commands.  Many commands that are available in the menu bar are also available by right-clicking on data objects.
 +
 +
 +
===Set selection and management===
 +
 +
A key feature of geWorkbench is the ability to work with defined sets of markers or arrays.  This allows subsets of data to be analyzed, and allows for passing of selected subsets of data between different components. For example, the t-test produces a list of markers showing a significant difference in expression between two states, and this list can then be used to retrieve relevant seqeunces or annotations.
 +
 +
 +
===Visualization  and Analysis tools===
 +
geWorkbench works such that all of the visualization and analysis components relevant to the type of dataset currently selected in the Project Folders (area 1) are available through tabs in their respective areas (3 and 4).  Thus choosing a microarray dataset will result in a different set of tabs being displayed as compared with those seen when a nucleotide sequence file is selected.  When a new data file is loaded, or an analysis produces a new data set, not only is it added to the Project area (1), but an appropriate viewer in the Visualization area (3)is automatically selected.
 +
  
 
==Component Interoperability==
 
==Component Interoperability==
  
The most important design goal of geWorkbench is to allow data produced or altered in one module to be easily transfered to other modules for successive analysis stepsThere are two places that hold shared data - the Project component (1), and the Set Selection component(2).  While the Project component holds files and various types of analysis result sets, the Set Selection component groups markers or arrays/phenotypes into sets.  These sets can then be selected for further analysis of only that particular subset of data.  For example, several analysis components produce lists of markers, and each such new list is placed into the Markers component as a new marker setAn example of using a phenotype set is to group microarrays by their disease state.
+
The features described above underly the most important design goal of geWorkbench, which is to allow the different components to interoperate easily.  The user is freed of the need to write programs to convert data from one format to another for different programsBoth the Project component (1) and the Set Selection component(2) can hold shared dataEach analysis component places either a new dataset into the active project, or a new set of markers or arrays into the Set Selection component.  These are then available to any other appropriate component.
 +
 
 +
 
 +
==Limitations==
  
A key feature of the GUI is that the modules displayed in the Visualization (3) and Analysis (4) areas depend on the type of data currently selected in the Project area (1)Thus you will see a different set of choices (tabs) when a microarray data set is selected as compared to when a DNA or protein sequence file is selected.  When a new data file is loaded, or an analysis produces a new data set, not only is it added to the Project area, but an appropriate viewer in the Visualization area is automatically selected.
+
As currently implemented, operations such as filtering and normalization are performed directly on the selected datasetA copy of the original is not made.

Revision as of 15:34, 1 June 2006

Home | Quick Start | Basics | Menu Bar | Preferences | Component Configuration Manager | Workspace | Information Panel | Local Data Files | File Formats | caArray | Array Sets | Marker Sets | Microarray Dataset Viewers | Filtering | Normalization | Tutorial Data | geWorkbench-web Tutorials

Analysis Framework | ANOVA | ARACNe | BLAST | Cellular Networks KnowledgeBase | CeRNA/Hermes Query | Classification (KNN, WV) | Color Mosaic | Consensus Clustering | Cytoscape | Cupid | DeMAND | Expression Value Distribution | Fold-Change | Gene Ontology Term Analysis | Gene Ontology Viewer | GenomeSpace | genSpace | Grid Services | GSEA | Hierarchical Clustering | IDEA | Jmol | K-Means Clustering | LINCS Query | Marker Annotations | MarkUs | Master Regulator Analysis | (MRA-FET Method) | (MRA-MARINa Method) | MatrixREDUCE | MINDy | Pattern Discovery | PCA | Promoter Analysis | Pudge | SAM | Sequence Retriever | SkyBase | SkyLine | SOM | SVM | T-Test | Viper Analysis | Volcano Plot



Basic Layout of the Graphical User Interface

The graphical user interface for geWorkbench is divided into four major sections, for

1. Data management - Workspace and Projects (upper left)

2. Marker and Array/Phenotype set selection and management (lower left)

3. Visualization tools (upper right)

4. Analytical tools (lower right)


T geWorkbench1.0 all.png


Areas 2, 3 and 4 are defined for convenience. The actual placement of a given component into any of these three areas is controlled by a configuration file and can be customized as desired.

Data management area

Working with geWorkbench involves creating a projects within the top-level workspace. Open data files and the results of data transformation or analysis are stored within a project. A workspace can contain more than one project at a time, allowing data to be organized as desired. A workspace and all the projects and data within it can be saved and later reloaded. These operations will be described in detail in further sections of the tutorials.

The GUI provides a menu bar at top with a standard choice of commands. Many commands that are available in the menu bar are also available by right-clicking on data objects.


Set selection and management

A key feature of geWorkbench is the ability to work with defined sets of markers or arrays. This allows subsets of data to be analyzed, and allows for passing of selected subsets of data between different components. For example, the t-test produces a list of markers showing a significant difference in expression between two states, and this list can then be used to retrieve relevant seqeunces or annotations.


Visualization and Analysis tools

geWorkbench works such that all of the visualization and analysis components relevant to the type of dataset currently selected in the Project Folders (area 1) are available through tabs in their respective areas (3 and 4). Thus choosing a microarray dataset will result in a different set of tabs being displayed as compared with those seen when a nucleotide sequence file is selected. When a new data file is loaded, or an analysis produces a new data set, not only is it added to the Project area (1), but an appropriate viewer in the Visualization area (3)is automatically selected.


Component Interoperability

The features described above underly the most important design goal of geWorkbench, which is to allow the different components to interoperate easily. The user is freed of the need to write programs to convert data from one format to another for different programs. Both the Project component (1) and the Set Selection component(2) can hold shared data. Each analysis component places either a new dataset into the active project, or a new set of markers or arrays into the Set Selection component. These are then available to any other appropriate component.


Limitations

As currently implemented, operations such as filtering and normalization are performed directly on the selected dataset. A copy of the original is not made.