Difference between revisions of "CaArray"

(Searching and viewing available experiments)
(The Remote Open File dialog)
Line 2: Line 2:
  
  
=The Remote Open File dialog=
+
=The Remote Open File dialog (caArray)=
geWorkbench can retrieve data from certain remote data sources; currently only instances of the NCICB's caArray database are supportedThe Open File dialog allows remote sources to be added to the list of those available either manually or through discovery using grid servicesEntries (locations, parameters) for non-grid services can be edited.
+
geWorkbench can retrieve gene expression data from remote instances of the NCICB's caArray database.  These may be copies maintained by the NCI itself, or copies maintained locally at your or another institutionYou can maintain settings for any number of different caArray installations.
  
 
Right-click on '''Project''' which will bring up the '''Open File''' dialog.  Click the '''Remote''' radio button.  The '''Open File''' dialog window will be expanded to include remote sources.
 
Right-click on '''Project''' which will bring up the '''Open File''' dialog.  Click the '''Remote''' radio button.  The '''Open File''' dialog window will be expanded to include remote sources.
Line 13: Line 13:
 
They buttons at the bottom of the remote file dialog are:
 
They buttons at the bottom of the remote file dialog are:
  
'''Source''' button - lists remote resources.
+
'''caArray (Source)''' menu - Shows a list of caArray instances that have been configured.  Entries for the "Production" and "Stage" instances of caArray maintained by NCI are preconfigured.
  
 
'''Go''' button - downloads a list of all available experiments from the remote source.
 
'''Go''' button - downloads a list of all available experiments from the remote source.
  
'''Filtering''' - Allows one to see a list of particular types of experiements, such as by organism or chip type, on the remote source.
+
'''Filtering''' - The list of available experiments can be "filtered" to shown only those matching particular criteria.  The available options are:
 +
* Categories: Experiment
 +
* Field Selection: Array Provider, Organism, Principal Investigator
 +
* Values:
 +
** Array Provider: Affymetrix, Agilent, GenePix, Illumina, ImaGene, Niblegen, ScanArray, UCSF Spot.
 +
** Organism: many entries, including human, mouse, fly etc...
 +
**Principal Investigator: PIs of listed public experiments in the particular instance of caArray.
  
'''Add A New Resource''' button - Opens the Data Source Definition Page used to add a remote data source.
+
'''Add A New Profile''' button - Opens the Data Source Definition Page used to add a new instance of caArray.
 
 
'''Edit''' button - Edits remote source parameters.
 
  
 +
'''Edit Profile''' button - Edit the currently selected profile.
  
 +
'''Delete Profile''' - Remove the currently selected profile.
  
 
=Loading data from an instance of caArray=
 
=Loading data from an instance of caArray=

Revision as of 12:53, 15 June 2010

Home | Quick Start | Basics | Menu Bar | Preferences | Component Configuration Manager | Workspace | Information Panel | Local Data Files | File Formats | caArray | Array Sets | Marker Sets | Microarray Dataset Viewers | Filtering | Normalization | Tutorial Data | geWorkbench-web Tutorials

Analysis Framework | ANOVA | ARACNe | BLAST | Cellular Networks KnowledgeBase | CeRNA/Hermes Query | Classification (KNN, WV) | Color Mosaic | Consensus Clustering | Cytoscape | Cupid | DeMAND | Expression Value Distribution | Fold-Change | Gene Ontology Term Analysis | Gene Ontology Viewer | GenomeSpace | genSpace | Grid Services | GSEA | Hierarchical Clustering | IDEA | Jmol | K-Means Clustering | LINCS Query | Marker Annotations | MarkUs | Master Regulator Analysis | (MRA-FET Method) | (MRA-MARINa Method) | MatrixREDUCE | MINDy | Pattern Discovery | PCA | Promoter Analysis | Pudge | SAM | Sequence Retriever | SkyBase | SkyLine | SOM | SVM | T-Test | Viper Analysis | Volcano Plot



The Remote Open File dialog (caArray)

geWorkbench can retrieve gene expression data from remote instances of the NCICB's caArray database. These may be copies maintained by the NCI itself, or copies maintained locally at your or another institution. You can maintain settings for any number of different caArray installations.

Right-click on Project which will bring up the Open File dialog. Click the Remote radio button. The Open File dialog window will be expanded to include remote sources.


T OpenFile Remote.png


They buttons at the bottom of the remote file dialog are:

caArray (Source) menu - Shows a list of caArray instances that have been configured. Entries for the "Production" and "Stage" instances of caArray maintained by NCI are preconfigured.

Go button - downloads a list of all available experiments from the remote source.

Filtering - The list of available experiments can be "filtered" to shown only those matching particular criteria. The available options are:

  • Categories: Experiment
  • Field Selection: Array Provider, Organism, Principal Investigator
  • Values:
    • Array Provider: Affymetrix, Agilent, GenePix, Illumina, ImaGene, Niblegen, ScanArray, UCSF Spot.
    • Organism: many entries, including human, mouse, fly etc...
    • Principal Investigator: PIs of listed public experiments in the particular instance of caArray.

Add A New Profile button - Opens the Data Source Definition Page used to add a new instance of caArray.

Edit Profile button - Edit the currently selected profile.

Delete Profile - Remove the currently selected profile.

Loading data from an instance of caArray

Setting up the connection

You can Add a New Resource or Edit existing connection settings to set up a connection to an instance of caArray. The configuration for connecting to the production instance of caArray at the NCI is shown here:

T OpenFile Remote Edit Profile.png

Searching and viewing available experiments

If you click on the red Go button next to the caArray data source at the bottom of the dialog, all available caArray experiments at that location will be displayed.

Instead, you can select only particular kinds of experiments by pushing the Filter button. Here we show experiments of type "Human" being selected.


T OpenFile Remote Filtered human plus.png


And here are the resulting entries in the database:


T OpenFile Remote Filtered human.png


Select an experiment and push the Show Arrays button to see the individual array datasets available for download for this experiment.

T OpenFile Remote GliomaExpt.png

Downloading select array datasets

Now we will select four of the arrays of type HG-U133A and push the Open button to begin the download. Dont' forget to click the Merge button first if desired to merge the data into a single dataset.


T OpenFile Remote GliomaExpt choose.png


You will be prompted to select the quantitation type from those available for the experiment. Here we select CHP Signal:

T OpenFile Remote QuantiationSelectionChoices.png


A progress bar will track the download process:

T OpenFile Remote ProgressBar.png


The resulting data set will appear in the Project Folders component:


T OpenFile Remote MergedSet.png