<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
		<id>http://wiki.c2b2.columbia.edu/workbench/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Ginhoven</id>
		<title>geWorkbench - User contributions [en]</title>
		<link rel="self" type="application/atom+xml" href="http://wiki.c2b2.columbia.edu/workbench/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Ginhoven"/>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php/Special:Contributions/Ginhoven"/>
		<updated>2026-04-30T14:32:40Z</updated>
		<subtitle>User contributions</subtitle>
		<generator>MediaWiki 1.27.7</generator>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=Array_Sets&amp;diff=3284</id>
		<title>Array Sets</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=Array_Sets&amp;diff=3284"/>
				<updated>2006-07-06T20:16:48Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: /* Assigning arrays to sets */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TutorialsTopNav}}&lt;br /&gt;
&lt;br /&gt;
==Outline==&lt;br /&gt;
In this tutorial, you will learn&lt;br /&gt;
 &lt;br /&gt;
*How geWorkbench allows sets of markers and arrays/phenotypes to be defined and created.&lt;br /&gt;
*How to create a set of arrays&lt;br /&gt;
*How to mark a set of arrays as &amp;quot;Active&amp;quot;&lt;br /&gt;
*How to classify a set of arrays, e.g. as &amp;quot;case&amp;quot; vs. &amp;quot;control&amp;quot;.&lt;br /&gt;
*How arrays can be grouped in different ways with descriptive tags.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Background==&lt;br /&gt;
&lt;br /&gt;
geWorkbench makes extensive use of the notion of sets of markers or arrays/phenotypes.  Sets of markers are returned from various analysis routines.  Sets of arrays/phenotypes can be specified to group arrays in a meaningful fashion for statistical analysis.  For example, two such phenotypes might be the diseased and normal states of a tissue from which samples have been taken. &lt;br /&gt;
&lt;br /&gt;
geWorkbench supports groupings of sets.  Each such group can contain different sets of markers or arrays.&lt;br /&gt;
&lt;br /&gt;
Sets of markers can also be created by various components of geWorkbench.  For example the t-test returns a list of markers showing signficant differential expression, and after hierarchical clustering, the markers in a subtrees of the resulting dendrogram can be saved to a list.  The examples below will focus on creating sets of arrays/phenotypes.  As mentioned, examples of creating and working with sets of markers can be found in [[Tutorial_-_Differential_Expression]] and [[Tutorial_-_Clustering]].&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Preparation==&lt;br /&gt;
&lt;br /&gt;
In this tutorial we will start with the same data files that were used in [[Tutorial - Projects and Data Files]].  Load the ten individual MAS5 data files as shown there in the section &amp;quot;Creating a new project and loading microarray data files&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
==Assigning arrays to sets==&lt;br /&gt;
&lt;br /&gt;
We will place the arrays in the default group, however you can create a new group by pushing the '''New''' button on '''Array/Phenotype Sets''' at lower left.&lt;br /&gt;
&lt;br /&gt;
First, we will select and label arrays which contain samples from the congestive cardiomyopathy disease state:&lt;br /&gt;
&lt;br /&gt;
1. In the Arrays/Phenotypes component, select the six arrays beginning with '''JB-ccmp''', which represent the samples from the  congestive cardiomyopathy disease state.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Arrays_AddToSet.png]]&lt;br /&gt;
2. Right-click,  select '''Add to Set'''.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
3. Enter &amp;quot;CCMP&amp;quot; in the input box and click OK.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Arrays_SetLabel.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
4. Next, similarly label the arrays beginning with JB-n as &amp;quot;Normal&amp;quot; ('' repeat steps 2 &amp;amp; 3 ''):&lt;br /&gt;
&lt;br /&gt;
The Array/Phenotype Sets component will now show the two sets added:&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Array_ArraySets.png]]&lt;br /&gt;
&lt;br /&gt;
==Activating sets==&lt;br /&gt;
&lt;br /&gt;
The check boxes next to the set name can be checked to indicate that a sets of arrays is &amp;quot;Active&amp;quot;.  Various analysis and visualization components can be set to only use/display activated arrays or markers.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Arrays_ActivateSets.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Classifying a set==&lt;br /&gt;
&lt;br /&gt;
For statistical tests such as the t-test, Case and Control groups can be specified.  &lt;br /&gt;
&lt;br /&gt;
# Left-click on the thumb-tack icon in front of the phenotype name.  &lt;br /&gt;
# Select Case to specify the disease arrays as the &amp;quot;Case&amp;quot;.  The remaining &amp;quot;Normal&amp;quot; arrays are by default labeled control.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Arrays_SetCase.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
A red thumbtack indicates the arrays have been specified as &amp;quot;Case&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Arrays_CaseSet.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Example of using multiple array groups==&lt;br /&gt;
&lt;br /&gt;
There can be different groupings of the same arrays in the Arrays/Phenotypes and Marker components.  Here we show how there are several different groupings defined in the example data file &amp;quot;webmatrix_quantile_log2_dev1_mv0.exp&amp;quot;.  After loading this file into geWorkbench as type &amp;quot;Affymetrix File Matrix&amp;quot;, the following groups can be seen in the Arrays/Phenotypes group pulldown menu:&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Arrays_Groups_choose.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
If we choose the group called &amp;quot;Class&amp;quot;, the following sets of arrays are displayed:&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Arrays_Groups_Class.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
If instead we choose the group &amp;quot;Cell Line&amp;quot;, a different grouping of the same arrays is seen:&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Arrays_Groups_CellLine.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=Workspace&amp;diff=3283</id>
		<title>Workspace</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=Workspace&amp;diff=3283"/>
				<updated>2006-07-06T16:16:22Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: /* Workspaces and Projects */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TutorialsTopNav}}&lt;br /&gt;
&lt;br /&gt;
==Outline==&lt;br /&gt;
In this tutorial, you will learn how to: &lt;br /&gt;
&lt;br /&gt;
*Create a new Project.&lt;br /&gt;
*Load microarray data.&lt;br /&gt;
*Merge data from several loaded microarray experiments.&lt;br /&gt;
*Rename a project and/or project node.&lt;br /&gt;
*Remove a project and/or project node.&lt;br /&gt;
*Save project files that you have created.&lt;br /&gt;
*Load, add, and/or modify remote data.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Supported data formats==&lt;br /&gt;
*Microarray&lt;br /&gt;
**Affymetrix MAS5/GCOS files - produced by the Affymetrix data analysis programs.&lt;br /&gt;
**Affymetrix File Matrix - a spreadsheet-type multi-experiment format; this is the native file type created by geWorkbench from merged datasets.&lt;br /&gt;
**Tab-delimited text (RMAExpress file) - A simple columnar file format, produced by the program RMAExpress.&lt;br /&gt;
**Affy Excel or txt data file - formats for single Affymetrix experiments (not supported).&lt;br /&gt;
**Genepix files - Produced by a popular analysis program for two-color microarrays.&lt;br /&gt;
*Other&lt;br /&gt;
**FASTA files. DNA or amino-acid sequence files in FASTA format.&lt;br /&gt;
**Pattern files - sequence motifs produced using the Pattern Discovery component of geWorkbench.&lt;br /&gt;
**Genotypic data files - (not supported).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Data organization==&lt;br /&gt;
&lt;br /&gt;
===Workspaces and Projects===&lt;br /&gt;
In the '''Project Folders''' component there is a top-level object called a workspace.  The workspace can contain one or more separate projects, and each project can contain opened data files and analysis results.  The workspace as a whole, with all its projects and data nodes, can be saved and restored.  Projects allow data to be grouped, for example by experiment.  A project can contain many different types of data, for example microarray data, fasta sequence files and graphical images.&lt;br /&gt;
&lt;br /&gt;
===Microrray data and merging===&lt;br /&gt;
A file from disk or from the network is be opened within a given project.  Creation of a new project is described below.  When working with microarray data, all data to be analyzed must be present within one data node in a project.  If the data exists as multiple files containing results from single arrays, the data must be merged into a single node before it can be used.  geWorkbench can perform this merging step either at the time data is read in, or later in a separate step.  Once merged, such a dataset can be saved out to disk; it will be saved in the geWorkbench matrix file format.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
Only one data node can be selected at one time.  If you wish to save a data node to a file, in most cases you must specify a file type extension, such as &amp;quot;.exp&amp;quot; for the geWorkbench merged file matrix format, or &amp;quot;.fasta&amp;quot; for a sequence file.  At present, the only type of remote data source which can be opened is NCICB's caArray database.  The remote file open feature is not multi-threaded, so you cannot perform other tasks in geWorkbench while downloading remote files.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Creating a new project and loading microarray data files==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
In this example, we will load 10 individual Affymetrix MAS5 format files, and merge them into a single dataset.  The origin of these file is described in the section [[Tutorial_-_Data]]&lt;br /&gt;
&lt;br /&gt;
All data must belong to a project.  Right-click on the '''Workspace''' entry in the '''Project Folders''' window at upper left to create a new project.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_NewProject.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Next, right-click on the '''New Project''' entry and select '''Open Files'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_OpenFiles.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Here, we will select file type '''Affymetrix GCOS/MAS5''' as shown.&lt;br /&gt;
&lt;br /&gt;
Make sure to check the '''Merge files''' checkbox.  This will created the merged data node as the files are read in.&lt;br /&gt;
&lt;br /&gt;
We will select 10 MAS5 format text files from the directory geworkbench\data\training\cardiogenomics.med.harvard.edu, which is included in the geWorkbench download. &lt;br /&gt;
&lt;br /&gt;
Click '''Open'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_OpenFile_CardioMerge.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
You may see the message &amp;quot;The chip type HG_U95Av2 is recognized...&amp;quot;&lt;br /&gt;
&lt;br /&gt;
[[Image:T_OpenFile_ChipRecog.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The merged dataset is listed in the Project folder. The data is displayed, in single array format, in the '''Microarray Viewer'''.  Note we have increased the intensity slider to maximum here.  You can scroll through the arrays from first to last using the slider.  The display in the '''Microarray Viewer''' is by marker in the linear order the markers appear in the data file.  It does not correspond in any way to a physical picture or representation of the actual 2-D microarray.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_FullApp_MergedData.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Merging microarray datafiles after they have already been loaded.==&lt;br /&gt;
&lt;br /&gt;
If Affymetrix data files are not merged at the time they are read in, they can also be merged later, as long as they are from the same chip type.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''1.''' Select the read-in data files that you want to merge.&lt;br /&gt;
&lt;br /&gt;
'''2.''' Click on '''File''' in the menu bar, and choose '''Merge Datasets'''.&lt;br /&gt;
&lt;br /&gt;
The picture shows the resulting merged dataset created from several individual data files.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectFolder_MergeIndivid.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The result is a new data node containing the merged data.  The original data nodes are still present.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectFolder_IndividMerged.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Renaming a project or a data node==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Renaming a project===&lt;br /&gt;
&lt;br /&gt;
'''1.''' Right-click on '''Project''' folder.&lt;br /&gt;
&lt;br /&gt;
'''2.''' Select '''Rename'''.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectFolder_RenameProject.png]] &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''3.''' In the pop-up screen rename your project.&lt;br /&gt;
&lt;br /&gt;
'''4.''' Click on the '''OK''' button&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Renaming a project data node===&lt;br /&gt;
&lt;br /&gt;
'''1.''' Right-click on a Project Folder data node.&lt;br /&gt;
&lt;br /&gt;
'''2.''' Select '''Rename'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectFolder_RenameDataset.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''3.''' In the pop-up screen rename your data node.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectFolder_RenameDataset2.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''4.''' Click on the '''OK''' button.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Removing a project or a data node==&lt;br /&gt;
&lt;br /&gt;
===Removing a project===&lt;br /&gt;
&lt;br /&gt;
'''1.''' Right-click on '''Project''' folder.&lt;br /&gt;
&lt;br /&gt;
'''2.''' Select '''Remove'''.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Removing a project data node===&lt;br /&gt;
&lt;br /&gt;
'''1.''' Right-click on the data node.&lt;br /&gt;
&lt;br /&gt;
'''2.''' Select '''Remove'''. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Saving a data node to a file==&lt;br /&gt;
&lt;br /&gt;
It is here that, among other things, you can create the matrix multi-experiment file format used by geWorkbench from a merged dataset.&lt;br /&gt;
&lt;br /&gt;
'''1.''' Right-click on data node that you want to save. &lt;br /&gt;
&lt;br /&gt;
'''2.''' Click '''Save'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectFolder_SaveNode.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
A standard file '''Save''' screen will come up.&lt;br /&gt;
&lt;br /&gt;
'''3.''' Choose a location.&lt;br /&gt;
&lt;br /&gt;
'''4.''' Enter a name.  Here you should be careful to enter an appropriate file type extension. as this is not automatic.  For example for the merged multi-experiment matrix file type you should include the extension &amp;quot;.exp&amp;quot; in the filename.&lt;br /&gt;
&lt;br /&gt;
'''5.''' Click on the '''Save''' button.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Working with remote data sources==&lt;br /&gt;
&lt;br /&gt;
===The remote Open File dialog===&lt;br /&gt;
geWorkbench can retrieve data from certain remote data sources; currently only instances of the NCICB's caArray database are supported.  The Open File dialog allows remote sources to be added to the list of those available either manually or through discovery using grid services.  Entries (locations, parameters) for non-grid services can be edited.&lt;br /&gt;
&lt;br /&gt;
As before, right-click on '''Project''' which will bring up the '''Open File''' dialog.   Click the '''Remote''' radio button.   The '''Open File''' dialog window will be expanded to include remote sources.&lt;br /&gt;
&lt;br /&gt;
Note the distinction between the &amp;quot;Open File&amp;quot; button, which opens a local or remote file, and the &amp;quot;Go&amp;quot; button, described below, which connects to a chosen remote resource to allow browsing.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)MEditRemoteData.png]]&lt;br /&gt;
&lt;br /&gt;
Four additional buttons appear.  They are: &lt;br /&gt;
&lt;br /&gt;
'''caArray''' button - lists remote resources.&lt;br /&gt;
&lt;br /&gt;
'''Go''' button - connects to the selected remote source.&lt;br /&gt;
&lt;br /&gt;
'''Add A New Resource''' button - Opens the Data Source Definition Page used to add a remote data source.&lt;br /&gt;
&lt;br /&gt;
'''Edit''' button - Edits remote source parameters.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Loading data from a remote instance of caArray===&lt;br /&gt;
&lt;br /&gt;
Click on the '''Go''' button next to the caArray data source at the bottom of the dialog.  All available caArray experiments at that location will be displayed.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectFolder_caArrayExpts.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the type of experiment data provided here in caArray is of type &amp;quot;derived bioassay&amp;quot;.  This is data that has been processed from raw data, for example using RMA.&lt;br /&gt;
&lt;br /&gt;
Select an experiment that has derived bioassays.  Here we depict the experiment ending in *99049.  The number of derived bioassays, 12, is displayed, along with the experiment information. (A new dataset, &amp;quot;Public Rembrandt&amp;quot; has subsequently been added, which would also be good to use for experimenting with caArray data download.  It has 53 bioassays available).&lt;br /&gt;
&lt;br /&gt;
To retrieve the bioassays themselves, right click on the experiment and press '''Get bioassays'''.  This will download the list of available bioassays into geWorkbench.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectFolder_GetRemoteBioassays.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
To actually retrieve bioassay data, select one or more desired arrays and push the '''Open''' button.  (Although below we show retrieving multiple array datasets, for demonstration purposes you might want to first select just one, as each can take several minutes to download).&lt;br /&gt;
&lt;br /&gt;
You can either select the merge option here, or wait until all data has been successfully download to perform a merge later.&lt;br /&gt;
&lt;br /&gt;
Note that downloading data from a remote resource is not multi-threaded, so you will not be able to perform other actions in geWorkbench while the data is downloaded.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectFolder_OpenRemoteBioassays.png]]&lt;br /&gt;
&lt;br /&gt;
===To add a remote source===&lt;br /&gt;
&lt;br /&gt;
(Note - currently only caArray data sources are supported).&lt;br /&gt;
&lt;br /&gt;
'''1.''' Click on the '''Add A New Resource''' button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)MRemoteData2.png]]&lt;br /&gt;
This is the Data Source Definition Page&lt;br /&gt;
&lt;br /&gt;
'''2.''' Fill in the Data Source definition page. URL and Short Name are required fields.&lt;br /&gt;
&lt;br /&gt;
'''3.''' Click on the OK button.&lt;br /&gt;
&lt;br /&gt;
The configuration is set up to automatically reflect your additional Data Source.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===To modify a remote source===&lt;br /&gt;
&lt;br /&gt;
The specification of the remote resource can be edited.&lt;br /&gt;
&lt;br /&gt;
'''1.''' Click on the '''Edit''' button at the bottom of the '''Open File''' dialog.&lt;br /&gt;
&lt;br /&gt;
'''2.''' Make the changes that you need.&lt;br /&gt;
&lt;br /&gt;
'''3.''' Click on the '''OK''' button&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectPanel_EditRemote.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=Tutorial_-_Reverse_Engineering&amp;diff=3280</id>
		<title>Tutorial - Reverse Engineering</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=Tutorial_-_Reverse_Engineering&amp;diff=3280"/>
				<updated>2006-06-23T17:34:21Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: /* Overview */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TutorialsTopNav}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Overview==&lt;br /&gt;
The Reverse Engineering component included in geWorkbench is being rewritten (as of April 2006), and hence in a future release the details of the interface may change.  However, the functionality will be similar.&lt;br /&gt;
&lt;br /&gt;
The primary use of the Reverse Engineering component is to infer regulatory interactions between genes and gene products.  The Reverse Engineering component uses the information theory concept of mutual information to find these interactions.  Mutual Information is in principle more sensitive and flexible than a simple correlation calculation.  It is also invariant under data transformations, so the details of normalization should not be important.&lt;br /&gt;
&lt;br /&gt;
==Reverse Engineering in the context of geWorkbench==&lt;br /&gt;
The Reverse Engineering component calculates the information that the expression pattern of one gene carries about the expression of another gene, that is, it is a pairwise calculation.  Larger datasets, containing more arrays per marker, will yield greater sensitivity and better statistical support.  Full scale runs of reverse engineering algorithms, comparing all markers against each other, and typically done on datasets containing several hundred microarrays, are typically  performed on large cluster computers and are not feasible on a desktop machine.  During 2006 we hope to provide a remote service that can host such calculations for jobs launched from geWorkbench.  However, at present, smaller scale calculations are supported directly in geWorkbench.&lt;br /&gt;
&lt;br /&gt;
As typically used in geWorkbench, the Reverse Engineering component calculates the Mutual Information score between a single hub gene and all other N markers in the dataset.  In a second step, a subset containing the best M markers is chosen (with a current limit of 100), and a complete pairwise MxM/2 mutual information calculation is performed between them.  The network resulting from this calculation can be displayed as a branched tree of interactions within the Cytoscape component.&lt;br /&gt;
&lt;br /&gt;
==Prerequisites==&lt;br /&gt;
A dataset  containing multiple arrays (the more the better) should be loaded into geWorkbench.  If data is loaded from separate files, it should be merged into a single microarray datset, either at the time of or after being read in.  See the section  [[Tutorial - Projects and Data Files | Projects and Data Files]].  In this tutorial we will load a dataset also used in other tutorial sections, which has been normalized, and filtered to reduce the number of genes.  This file, &amp;quot;webmatrix_quantile_log2_dev1_mv0.exp&amp;quot; will be made available in the tutorial data section.   It can also be obtained by starting with the file &amp;quot;webmatrix.exp&amp;quot; available in the ''downloads'' section and performing the following steps:&lt;br /&gt;
&lt;br /&gt;
1. Load the file webmatrix.exp.&lt;br /&gt;
&lt;br /&gt;
2. Quantile Normalize.&lt;br /&gt;
&lt;br /&gt;
3. Log2 transform (also in the Normalize tab).&lt;br /&gt;
&lt;br /&gt;
4. Filter out values having deviation less than 1.&lt;br /&gt;
&lt;br /&gt;
5. Remove markers with filtered-out values using the missing values filter with a threshold of 0.&lt;br /&gt;
&lt;br /&gt;
==Example - Profiler==&lt;br /&gt;
* Load the data file &amp;quot;webmatrix_quantile_log2_dev1_mv0.exp&amp;quot;.  This contains a set of 100 experiments on Affymetrix HG_U95Av2 chips.  As described above, this file has been quantile normalized, log2 converted, and then filtered to remove markers with a deviation of less than 1, in order to reduce the size of the dataset, leaving 3837 markers.&lt;br /&gt;
&lt;br /&gt;
* In the upper right section of geWorkbench find the Reverse Engineering component.  It should by default be displaying the '''Profiler''' tab.&lt;br /&gt;
&lt;br /&gt;
* In the Markers component search box, on the left side of the geWorkbench interface, enter 1973 and hit enter.  This will find the marker 1973_s_at, which is the c-Myc gene, a well-known transcription factor with many interactions.  Click on this marker in the list.  This will enter the marker into the '''Hub Gene Label''' field of the '''Profiler'''.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Markers_Search1973.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* The default setting in the Profiler is '''Mutual Information (fast)'''.  With this selected, hit '''Analyze(2D)'''.  This will return a list of all markers having a MI score of greater than the cutoff value (the default is 0.2).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_Basic.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
After the Mutual Information algoritnm has been run, an adjacency matrix will be placed in the Projects Folder:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectFolders_AdjacencyMatrix.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* If at this point you hit the Create Network button, a network will be displayed based on the top 100 markers interacting with c-Myc.  As described above, the MI algoritm is run again on these M=100 markers, in order to measure interactions between each pair.  Each marker is then connected via an edge with the marker it most strongly interacts with, with the chosen hub-gene at the center.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_InitialNet.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This is best seen in '''Cytoscape''' by going to the '''Layout''' menu, and chosing '''yFiles-&amp;gt;organic'''.  The layout will now appear as:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_Central1973.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* If a smaller list is desired, a set of markers can be highlighted in the list originally returned.  Only this selected subset, up to 100 markers, will then be used if &amp;quot;Create Network&amp;quot; is pressed.&lt;br /&gt;
&lt;br /&gt;
* By right-clicking and selecting &amp;quot;Add to Set&amp;quot;, this group will be added to the '''Markers''' component as a new set of markers which can be used in other components (sequence retrieval, annotation retriever etc.).&lt;br /&gt;
&lt;br /&gt;
* Within the network created in Cytoscape, one can select the central gene, and then on the '''Cytoscape''' menu chose '''Select-&amp;gt;Nodes-&amp;gt;First Neighbors''' of selected nodes.  &lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_SelectFirstNeighbors.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The first neighbors will be highlighted in the graph, &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_FirstNeighbors.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
and also added as a new set in the '''Markers''' component.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Markers_ReverseEng_Selected.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
We can return to the main Reverse Engineering component by clicking on the original dataset in the Project Folders component.  If we select the first (highest MI score) marker on the list, the graph shown below is drawn in the '''Motif Location Histogram''' display. This shows a plot of the expression values on each array for the selected hub marker vs any other marker selected in the list.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_MotifHistogram.png]]&lt;br /&gt;
&lt;br /&gt;
==Options==&lt;br /&gt;
&lt;br /&gt;
'''Pearson''' - Uses a Pearson correlation function to calculate the interaction scores.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=SOM&amp;diff=3276</id>
		<title>SOM</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=SOM&amp;diff=3276"/>
				<updated>2006-06-16T14:43:17Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: /* Background */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TutorialsTopNav}}&lt;br /&gt;
&lt;br /&gt;
__TOC__&lt;br /&gt;
&lt;br /&gt;
==Note==&lt;br /&gt;
&lt;br /&gt;
The algorithm use for Fast Hierarchical Clustering analysis does not implement the commonly understood versions of Average Linkage and Total Linkage.  This problem is present in the current version (1.0.3) of geWorkbench and all previous versions.  Standard implementations will be available in the next release.  The implementation of single linkage gives results as expected from the standard algorithm.&lt;br /&gt;
&lt;br /&gt;
==Outline==&lt;br /&gt;
&lt;br /&gt;
In this section the following information about clustering methods is covered:&lt;br /&gt;
# What clustering is and how it can be used.&lt;br /&gt;
# Methods implemented in geWorkbench: SOMs and Hierarchical Clustering&lt;br /&gt;
# An example of performing an SOM analysis&lt;br /&gt;
# An example of performing a Hierarchical Clustering analysis.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Background==&lt;br /&gt;
&lt;br /&gt;
Clustering methods can allow identification of groups of markers with similar expression.  A common application is to search for genes that appear to be co-regulated.  A list of such markers, saved to the '''Markers''' component, can be used for further steps, such as retrieving upstream sequences, Gene Ontology analysis, or viewing of annotations.&lt;br /&gt;
&lt;br /&gt;
==Methods supported==&lt;br /&gt;
&lt;br /&gt;
geWorkbench supports two clustering methods:&lt;br /&gt;
# Self-Organizing maps (SOMs)&lt;br /&gt;
# Hierarchical Clustering&lt;br /&gt;
&lt;br /&gt;
Self-organizing maps group the markers into a user-specified number of bins.  In geWorkbench, a SOM visualizer component displays the results graphically.   Hierarchical clustering constructs a tree-like relationship among the expression patterns of all markers present.  Results are viewed in the Dendrogram component.&lt;br /&gt;
&lt;br /&gt;
==SOM Example==&lt;br /&gt;
&lt;br /&gt;
* Load the microarray dataset &amp;quot;webmatrix_quantile_log2_dev1_mv0.exp&amp;quot;, available in the tutorial_data.zip [[Download]]. &lt;br /&gt;
* In the '''Arrays/Phenotypes''' component pulldown menu, select the group labeled &amp;quot;Class&amp;quot;.&lt;br /&gt;
* Activate two sets of arrays to compare, e.g. GC B-cell and non-GC B-cell, by checking the boxes before the names (these are chosen here because they are the smallest groups).&lt;br /&gt;
* Go to the '''Analysis''' component, and select '''SOM Analysis'''.&lt;br /&gt;
&lt;br /&gt;
Parameters:&lt;br /&gt;
Rows, Columns - give the number of bins into which to separate the different marker expression patterns.&lt;br /&gt;
Radius - &lt;br /&gt;
Iterations - &lt;br /&gt;
Alpha - &lt;br /&gt;
Function - Bubble or Gaussian&lt;br /&gt;
&lt;br /&gt;
The default parameters are shown below.  We will accept these parameters.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_SOM_Analysis_Parameters.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The resulting display of nine clusters is shown below.  The user should experiment with different parameters to attempt to discern informative groupings.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_SOM_display.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Any individual graph can be right-clicked on and &amp;quot;Add to Set&amp;quot; chosen.  This will add these markers to a new Set in the '''Markers''' component.  Each will be given a name starting with &amp;quot;Cluster Grid&amp;quot; and the number of markers will be shown.&lt;br /&gt;
&lt;br /&gt;
==Hierarchical Clustering - Example==&lt;br /&gt;
&lt;br /&gt;
* Load the microarray dataset &amp;quot;webmatrix_quantile_log2_dev1_mv0.exp&amp;quot;, available in the tutorial_data.zip [[Download]]. &lt;br /&gt;
* In the Arrays/Phenotypes component, select the set of arrays labeled &amp;quot;Class&amp;quot;.&lt;br /&gt;
* Activate two classes of arrays to compare, e.g. GC B-cell and non-GC B-cell, by checking the boxes before the names.&lt;br /&gt;
* Go to the Analysis component, and select Hierarchical Clustering.&lt;br /&gt;
* At the bottom of the Analysis component, the box that says '''All Arrays''' should be unchecked, so that the array selection above is used.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* In Hierarchical Clustering, set the parameters to:&lt;br /&gt;
** Clustering Method: Total Linkage&lt;br /&gt;
** Clustering Dimension: Both&lt;br /&gt;
** Clustering Metric: Euclidean&lt;br /&gt;
&lt;br /&gt;
*Click '''Analyze'''.&lt;br /&gt;
&lt;br /&gt;
The results will be displayed in the Dendrogram component.&lt;br /&gt;
&lt;br /&gt;
[[Image:T_HierarchicalClustering_BCregion.png]]&lt;br /&gt;
&lt;br /&gt;
By scrolling down a bit, one finds a large interesting area, showing clear differences between groups of arrays.  We will select two clearly differentiated clusters.  Check the '''Enable Zoom''' checkbox.  Then highlight the first cluster of 12 markers as shown here:&lt;br /&gt;
&lt;br /&gt;
[[Image:T_HierarchicalClustering_BC12Markers.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Then left-click to select this subset of the dendrogram.  It will be displayed alone. &lt;br /&gt;
&lt;br /&gt;
[[Image:T_HierarchicalClustering_BC12MarkersZoom.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Now right-click and select &amp;quot;Add to Set&amp;quot;.  In the Markers component, the select genes are added as Cluster Tree [12], where 12 is the number of markers selected.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Repeat for the similar region just below, which contains another 44 markers.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_HierarchicalClustering_BC44Markers.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This will result in two sets of markers having been added to the Markers component, as shown below:&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Markers_ClusterTree12and44.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=Tutorial_-_Reverse_Engineering&amp;diff=3138</id>
		<title>Tutorial - Reverse Engineering</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=Tutorial_-_Reverse_Engineering&amp;diff=3138"/>
				<updated>2006-06-02T20:57:47Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: /* Reverse Engineering in the context of geWorkbench */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TutorialsTopNav}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Overview==&lt;br /&gt;
The Reverse Engineering component included in geWorkbench is being rewritten (as of April 2006), and hence in a future release the details of the interface may change.  However, the functionality will be similar.&lt;br /&gt;
&lt;br /&gt;
The primary use of the Reverse Engineering component is to infer regulatory interactions between genes and gene products.  The Reverse Engineering component uses the information theory concept of mutual information to find these interactions.  Mutual Information is in principle more senstive and flexible than a simple correlation calculation.  It is also invariant under data transformations, so the details of normalization should not be important.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Reverse Engineering in the context of geWorkbench==&lt;br /&gt;
The Reverse Engineering component calculates the information that the expression pattern of one gene carries about the expression of another gene, that is, it is a pairwise calculation.  Larger datasets, containing more arrays per marker, will yield greater sensitivity and better statistical support.  Full scale runs of reverse engineering algorithms, comparing all markers against each other, and typically done on datasets containing several hundred microarrays, are typically  performed on large cluster computers and are not feasible on a desktop machine.  During 2006 we hope to provide a remote service that can host such calculations for jobs launched from geWorkbench.  However, at present, smaller scale calculations are supported directly in geWorkbench.&lt;br /&gt;
&lt;br /&gt;
As typically used in geWorkbench, the Reverse Engineering component calculates the Mutual Information score between a single hub gene and all other N markers in the dataset.  In a second step, a subset containing the best M markers is chosen (with a current limit of 100), and a complete pairwise MxM/2 mutual information calculation is performed between them.  The network resulting from this calculation can be displayed as a branched tree of interactions within the Cytoscape component.&lt;br /&gt;
&lt;br /&gt;
==Prerequisites==&lt;br /&gt;
A dataset  containing multiple arrays (the more the better) should be loaded into geWorkbench.  If data is loaded from separate files, it should be merged into a single microarray datset, either at the time of or after being read in.  See the section  [[Tutorial - Projects and Data Files | Projects and Data Files]].  In this tutorial we will load a dataset also used in other tutorial sections, which has been normalized, and filtered to reduce the number of genes.  This file, &amp;quot;webmatrix_quantile_log2_dev1_mv0.exp&amp;quot; will be made available in the tutorial data section.   It can also be obtained by starting with the file &amp;quot;webmatrix.exp&amp;quot; available in the ''downloads'' section and performing the following steps:&lt;br /&gt;
&lt;br /&gt;
1. Load the file webmatrix.exp.&lt;br /&gt;
&lt;br /&gt;
2. Quantile Normalize.&lt;br /&gt;
&lt;br /&gt;
3. Log2 transform (also in the Normalize tab).&lt;br /&gt;
&lt;br /&gt;
4. Filter out values having deviation less than 1.&lt;br /&gt;
&lt;br /&gt;
5. Remove markers with filtered-out values using the missing values filter with a threshold of 0.&lt;br /&gt;
&lt;br /&gt;
==Example - Profiler==&lt;br /&gt;
* Load the data file &amp;quot;webmatrix_quantile_log2_dev1_mv0.exp&amp;quot;.  This contains a set of 100 experiments on Affymetrix HG_U95Av2 chips.  As described above, this file has been quantile normalized, log2 converted, and then filtered to remove markers with a deviation of less than 1, in order to reduce the size of the dataset, leaving 3837 markers.&lt;br /&gt;
&lt;br /&gt;
* In the upper right section of geWorkbench find the Reverse Engineering component.  It should by default be displaying the '''Profiler''' tab.&lt;br /&gt;
&lt;br /&gt;
* In the Markers component search box, on the left side of the geWorkbench interface, enter 1973 and hit enter.  This will find the marker 1973_s_at, which is the c-Myc gene, a well-known transcription factor with many interactions.  Click on this marker in the list.  This will enter the marker into the '''Hub Gene Label''' field of the '''Profiler'''.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Markers_Search1973.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* The default setting in the Profiler is '''Mutual Information (fast)'''.  With this selected, hit '''Analyze(2D)'''.  This will return a list of all markers having a MI score of greater than the cutoff value (the default is 0.2).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_Basic.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
After the Mutual Information algoritnm has been run, an adjacency matrix will be placed in the Projects Folder:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectFolders_AdjacencyMatrix.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* If at this point you hit the Create Network button, a network will be displayed based on the top 100 markers interacting with c-Myc.  As described above, the MI algoritm is run again on these M=100 markers, in order to measure interactions between each pair.  Each marker is then connected via an edge with the marker it most strongly interacts with, with the chosen hub-gene at the center.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_InitialNet.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This is best seen in '''Cytoscape''' by going to the '''Layout''' menu, and chosing '''yFiles-&amp;gt;organic'''.  The layout will now appear as:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_Central1973.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* If a smaller list is desired, a set of markers can be highlighted in the list originally returned.  Only this selected subset, up to 100 markers, will then be used if &amp;quot;Create Network&amp;quot; is pressed.&lt;br /&gt;
&lt;br /&gt;
* By right-clicking and selecting &amp;quot;Add to Set&amp;quot;, this group will be added to the '''Markers''' component as a new set of markers which can be used in other components (sequence retrieval, annotation retriever etc.).&lt;br /&gt;
&lt;br /&gt;
* Within the network created in Cytoscape, one can select the central gene, and then on the '''Cytoscape''' menu chose '''Select-&amp;gt;Nodes-&amp;gt;First Neighbors''' of selected nodes.  &lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_SelectFirstNeighbors.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The first neighbors will be highlighted in the graph, &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_FirstNeighbors.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
and also added as a new set in the '''Markers''' component.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Markers_ReverseEng_Selected.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
We can return to the main Reverse Engineering component by clicking on the original dataset in the Project Folders component.  If we select the first (highest MI score) marker on the list, the graph shown below is drawn in the '''Motif Location Histogram''' display. This shows a plot of the expression values on each array for the selected hub marker vs any other marker selected in the list.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_MotifHistogram.png]]&lt;br /&gt;
&lt;br /&gt;
==Options==&lt;br /&gt;
&lt;br /&gt;
'''Pearson''' - Uses a Pearson correlation function to calculate the interaction scores.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=Tutorial_-_Reverse_Engineering&amp;diff=3136</id>
		<title>Tutorial - Reverse Engineering</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=Tutorial_-_Reverse_Engineering&amp;diff=3136"/>
				<updated>2006-06-02T18:54:53Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: /* Reverse Engineering in the context of geWorkbench */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TutorialsTopNav}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Overview==&lt;br /&gt;
The Reverse Engineering component included in geWorkbench is being rewritten (as of April 2006), and hence in a future release the details of the interface may change.  However, the functionality will be similar.&lt;br /&gt;
&lt;br /&gt;
The primary use of the Reverse Engineering component is to infer regulatory interactions between genes and gene products.  The Reverse Engineering component uses the information theory concept of mutual information to find these interactions.  Mutual Information is in principle more senstive and flexible than a simple correlation calculation.  It is also invariant under data transformations, so the details of normalization should not be important.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Reverse Engineering in the context of geWorkbench==&lt;br /&gt;
The Reverse Engineering component calculates the information that the expression pattern of one gene carries about the expression of another gene, that is, it is a pairwise calculation.  Larger datasets, containing more arrays per marker, will yield greater sensitivity and better statistical support.  Full scale runs of reverse engineering algorithms, comparing all markers against each other, and typically done on datasets containing several hundred microarrays, are typically  performed on large cluster computers and are not feasible on a desktop machine.  During 2006 we hope to provide a remote service that can host such caluclations for jobs launched from geWorkbench.  However, at present, smaller scale calculations are supported directly in geWorkbench.&lt;br /&gt;
&lt;br /&gt;
As typically used in geWorkbench, the Reverse Engineering component calculates the Mutual Information score between a single hub gene and all other N markers in the dataset.  In a second step, a subset containing the best M markers is chosen (with a current limit of 100), and a complete pairwise MxM/2 mutual information calculation is performed between them.  The network resulting from this calculation can be displayed as a branched tree of interactions within the Cytoscape component.&lt;br /&gt;
&lt;br /&gt;
==Prerequisites==&lt;br /&gt;
A dataset  containing multiple arrays (the more the better) should be loaded into geWorkbench.  If data is loaded from separate files, it should be merged into a single microarray datset, either at the time of or after being read in.  See the section  [[Tutorial - Projects and Data Files | Projects and Data Files]].  In this tutorial we will load a dataset also used in other tutorial sections, which has been normalized, and filtered to reduce the number of genes.  This file, &amp;quot;webmatrix_quantile_log2_dev1_mv0.exp&amp;quot; will be made available in the tutorial data section.   It can also be obtained by starting with the file &amp;quot;webmatrix.exp&amp;quot; available in the ''downloads'' section and performing the following steps:&lt;br /&gt;
&lt;br /&gt;
1. Load the file webmatrix.exp.&lt;br /&gt;
&lt;br /&gt;
2. Quantile Normalize.&lt;br /&gt;
&lt;br /&gt;
3. Log2 transform (also in the Normalize tab).&lt;br /&gt;
&lt;br /&gt;
4. Filter out values having deviation less than 1.&lt;br /&gt;
&lt;br /&gt;
5. Remove markers with filtered-out values using the missing values filter with a threshold of 0.&lt;br /&gt;
&lt;br /&gt;
==Example - Profiler==&lt;br /&gt;
* Load the data file &amp;quot;webmatrix_quantile_log2_dev1_mv0.exp&amp;quot;.  This contains a set of 100 experiments on Affymetrix HG_U95Av2 chips.  As described above, this file has been quantile normalized, log2 converted, and then filtered to remove markers with a deviation of less than 1, in order to reduce the size of the dataset, leaving 3837 markers.&lt;br /&gt;
&lt;br /&gt;
* In the upper right section of geWorkbench find the Reverse Engineering component.  It should by default be displaying the '''Profiler''' tab.&lt;br /&gt;
&lt;br /&gt;
* In the Markers component search box, on the left side of the geWorkbench interface, enter 1973 and hit enter.  This will find the marker 1973_s_at, which is the c-Myc gene, a well-known transcription factor with many interactions.  Click on this marker in the list.  This will enter the marker into the '''Hub Gene Label''' field of the '''Profiler'''.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Markers_Search1973.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* The default setting in the Profiler is '''Mutual Information (fast)'''.  With this selected, hit '''Analyze(2D)'''.  This will return a list of all markers having a MI score of greater than the cutoff value (the default is 0.2).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_Basic.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
After the Mutual Information algoritnm has been run, an adjacency matrix will be placed in the Projects Folder:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectFolders_AdjacencyMatrix.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* If at this point you hit the Create Network button, a network will be displayed based on the top 100 markers interacting with c-Myc.  As described above, the MI algoritm is run again on these M=100 markers, in order to measure interactions between each pair.  Each marker is then connected via an edge with the marker it most strongly interacts with, with the chosen hub-gene at the center.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_InitialNet.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This is best seen in '''Cytoscape''' by going to the '''Layout''' menu, and chosing '''yFiles-&amp;gt;organic'''.  The layout will now appear as:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_Central1973.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* If a smaller list is desired, a set of markers can be highlighted in the list originally returned.  Only this selected subset, up to 100 markers, will then be used if &amp;quot;Create Network&amp;quot; is pressed.&lt;br /&gt;
&lt;br /&gt;
* By right-clicking and selecting &amp;quot;Add to Set&amp;quot;, this group will be added to the '''Markers''' component as a new set of markers which can be used in other components (sequence retrieval, annotation retriever etc.).&lt;br /&gt;
&lt;br /&gt;
* Within the network created in Cytoscape, one can select the central gene, and then on the '''Cytoscape''' menu chose '''Select-&amp;gt;Nodes-&amp;gt;First Neighbors''' of selected nodes.  &lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_SelectFirstNeighbors.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The first neighbors will be highlighted in the graph, &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_FirstNeighbors.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
and also added as a new set in the '''Markers''' component.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Markers_ReverseEng_Selected.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
We can return to the main Reverse Engineering component by clicking on the original dataset in the Project Folders component.  If we select the first (highest MI score) marker on the list, the graph shown below is drawn in the '''Motif Location Histogram''' display. This shows a plot of the expression values on each array for the selected hub marker vs any other marker selected in the list.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_MotifHistogram.png]]&lt;br /&gt;
&lt;br /&gt;
==Options==&lt;br /&gt;
&lt;br /&gt;
'''Pearson''' - Uses a Pearson correlation function to calculate the interaction scores.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=Tutorial_-_Reverse_Engineering&amp;diff=3135</id>
		<title>Tutorial - Reverse Engineering</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=Tutorial_-_Reverse_Engineering&amp;diff=3135"/>
				<updated>2006-06-02T18:53:56Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: /* Prerequisites */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TutorialsTopNav}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Overview==&lt;br /&gt;
The Reverse Engineering component included in geWorkbench is being rewritten (as of April 2006), and hence in a future release the details of the interface may change.  However, the functionality will be similar.&lt;br /&gt;
&lt;br /&gt;
The primary use of the Reverse Engineering component is to infer regulatory interactions between genes and gene products.  The Reverse Engineering component uses the information theory concept of mutual information to find these interactions.  Mutual Information is in principle more senstive and flexible than a simple correlation calculation.  It is also invariant under data transformations, so the details of normalization should not be important.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Reverse Engineering in the context of geWorkbench==&lt;br /&gt;
The Reverse Engineering component calculates the information that the expression pattern of one gene carries about the expression of another gene, that is, it is a pairwise calculation.  Larger datasets, containing more arrays per marker, will yield greater sensitivity and better ststistical support.  Full scale runs of reverse engineering algorithms, comparing all markers against each other, and typically done on datasets containing several hundred microarrays, are typically  performed on large cluster computers and are not feasible on a desktop machine.  During 2006 we hope to provide a remote service that can host such caluclations for jobs launched from geWorkbench.  However, at present, smaller scale calculations are supported directly in geWorkbench.&lt;br /&gt;
&lt;br /&gt;
As typically used in geWorkbench, the Reverse Engineering component calculates the Mutual Information score between a single hub gene and all other N markers in the dataset.  In a second step, a subset containing the best M markers is chosen (with a current limit of 100), and a complete pairwise MxM/2 mutual information calculation is performed between them.  The network resulting from this calculation can be displayed as a branched tree of interactions within the Cytoscape component.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Prerequisites==&lt;br /&gt;
A dataset  containing multiple arrays (the more the better) should be loaded into geWorkbench.  If data is loaded from separate files, it should be merged into a single microarray datset, either at the time of or after being read in.  See the section  [[Tutorial - Projects and Data Files | Projects and Data Files]].  In this tutorial we will load a dataset also used in other tutorial sections, which has been normalized, and filtered to reduce the number of genes.  This file, &amp;quot;webmatrix_quantile_log2_dev1_mv0.exp&amp;quot; will be made available in the tutorial data section.   It can also be obtained by starting with the file &amp;quot;webmatrix.exp&amp;quot; available in the ''downloads'' section and performing the following steps:&lt;br /&gt;
&lt;br /&gt;
1. Load the file webmatrix.exp.&lt;br /&gt;
&lt;br /&gt;
2. Quantile Normalize.&lt;br /&gt;
&lt;br /&gt;
3. Log2 transform (also in the Normalize tab).&lt;br /&gt;
&lt;br /&gt;
4. Filter out values having deviation less than 1.&lt;br /&gt;
&lt;br /&gt;
5. Remove markers with filtered-out values using the missing values filter with a threshold of 0.&lt;br /&gt;
&lt;br /&gt;
==Example - Profiler==&lt;br /&gt;
* Load the data file &amp;quot;webmatrix_quantile_log2_dev1_mv0.exp&amp;quot;.  This contains a set of 100 experiments on Affymetrix HG_U95Av2 chips.  As described above, this file has been quantile normalized, log2 converted, and then filtered to remove markers with a deviation of less than 1, in order to reduce the size of the dataset, leaving 3837 markers.&lt;br /&gt;
&lt;br /&gt;
* In the upper right section of geWorkbench find the Reverse Engineering component.  It should by default be displaying the '''Profiler''' tab.&lt;br /&gt;
&lt;br /&gt;
* In the Markers component search box, on the left side of the geWorkbench interface, enter 1973 and hit enter.  This will find the marker 1973_s_at, which is the c-Myc gene, a well-known transcription factor with many interactions.  Click on this marker in the list.  This will enter the marker into the '''Hub Gene Label''' field of the '''Profiler'''.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Markers_Search1973.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* The default setting in the Profiler is '''Mutual Information (fast)'''.  With this selected, hit '''Analyze(2D)'''.  This will return a list of all markers having a MI score of greater than the cutoff value (the default is 0.2).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_Basic.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
After the Mutual Information algoritnm has been run, an adjacency matrix will be placed in the Projects Folder:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ProjectFolders_AdjacencyMatrix.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* If at this point you hit the Create Network button, a network will be displayed based on the top 100 markers interacting with c-Myc.  As described above, the MI algoritm is run again on these M=100 markers, in order to measure interactions between each pair.  Each marker is then connected via an edge with the marker it most strongly interacts with, with the chosen hub-gene at the center.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_InitialNet.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This is best seen in '''Cytoscape''' by going to the '''Layout''' menu, and chosing '''yFiles-&amp;gt;organic'''.  The layout will now appear as:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_Central1973.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* If a smaller list is desired, a set of markers can be highlighted in the list originally returned.  Only this selected subset, up to 100 markers, will then be used if &amp;quot;Create Network&amp;quot; is pressed.&lt;br /&gt;
&lt;br /&gt;
* By right-clicking and selecting &amp;quot;Add to Set&amp;quot;, this group will be added to the '''Markers''' component as a new set of markers which can be used in other components (sequence retrieval, annotation retriever etc.).&lt;br /&gt;
&lt;br /&gt;
* Within the network created in Cytoscape, one can select the central gene, and then on the '''Cytoscape''' menu chose '''Select-&amp;gt;Nodes-&amp;gt;First Neighbors''' of selected nodes.  &lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_SelectFirstNeighbors.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The first neighbors will be highlighted in the graph, &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_FirstNeighbors.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
and also added as a new set in the '''Markers''' component.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_Markers_ReverseEng_Selected.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
We can return to the main Reverse Engineering component by clicking on the original dataset in the Project Folders component.  If we select the first (highest MI score) marker on the list, the graph shown below is drawn in the '''Motif Location Histogram''' display. This shows a plot of the expression values on each array for the selected hub marker vs any other marker selected in the list.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:T_ReverseEngineering_MotifHistogram.png]]&lt;br /&gt;
&lt;br /&gt;
==Options==&lt;br /&gt;
&lt;br /&gt;
'''Pearson''' - Uses a Pearson correlation function to calculate the interaction scores.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2947</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2947"/>
				<updated>2006-04-12T15:56:32Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
'''Provide a little background info about Wilm's tumor. (It was chosen at random).'''&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers component and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Service tab.  Then you will an idea if the Columbia Machine is processing a lot of queries.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
When the results are returned they are placed in the Project Folders as a child of the sequence they correspond to.  You can mouse over the result set to see how many sequences are in it.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer, you can examine the alignments.  Each different target hit is listed on a line in the results table.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer you can select sequences to add back to the main project by checking the include box and then the Add Selected Sequences To Your Project tab.&lt;br /&gt;
&lt;br /&gt;
You can also add just the aligned parts by clicking on the tab Only Add Aligned Parts.&lt;br /&gt;
&lt;br /&gt;
'''(Should we say anything about e value and bit score in this tutorial?)'''&lt;br /&gt;
'''Do we need more details on reading the data output?'''&lt;br /&gt;
'''We need to explain more about the output.'''&lt;br /&gt;
'''What about the separate page that pops up?'''&lt;br /&gt;
&lt;br /&gt;
The Load button allows you to load an external Blast file in HTML format into the viewer.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial5.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2939</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2939"/>
				<updated>2006-03-29T21:37:57Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
'''Provide a little background info about Wilm's tumor. (It was chosen at random).'''&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers component and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Service tab.  Then you will an idea if the Columbia Machine is processing a lot of queries.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
When the results are returned they are placed in the Project Folders as a child of the sequence they correspond to.  You can mouse over the result set to see how many sequences are in it.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer, you can examine the alignments.  Each different target hit is listed on a line in the results table.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer you can select sequences to add back to the main project by checking the include box and then the Add Selected Sequences To Your Project tab.&lt;br /&gt;
&lt;br /&gt;
You can also add just the aligned parts by clicking on the tab Only Add Aligned Parts.&lt;br /&gt;
&lt;br /&gt;
'''(Should we say anything about e value and bit score in this tutorial?)'''&lt;br /&gt;
'''Do we need more details on reading the data output?'''&lt;br /&gt;
'''We need to explain more about the output.'''&lt;br /&gt;
&lt;br /&gt;
The Load button allows you to load an external Blast file in HTML format into the viewer.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial5.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2938</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2938"/>
				<updated>2006-03-29T21:37:20Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
'''Provide a little background info about Wilm's tumor. (It was chosen at random).'''&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers component and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Service tab.  Then you will an idea if the Columbia Machine is processing a lot of queries.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
When the results are returned they are placed in the Project Folders as a child of the sequence they correspond to.  You can mouse over the result set to see how many sequences are in it.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer, you can examine the alignments.  Each different target hit is listed on a line in the results table.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer you can select sequences to add back to the main project by checking the include box and then the Add Selected Sequences To Your Project tab.&lt;br /&gt;
&lt;br /&gt;
You can also add just the aligned parts by clicking on the tab Only Add Aligned Parts.&lt;br /&gt;
&lt;br /&gt;
'''(Should we say anything about e value and bit score in this tutorial?)'''&lt;br /&gt;
'''Do we need more details on reading the data output?'''&lt;br /&gt;
'''We need to explain more about the output'''&lt;br /&gt;
&lt;br /&gt;
The Load button allows you to load an external Blast file in HTML format into the viewer.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial5.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2937</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2937"/>
				<updated>2006-03-29T20:22:17Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
'''Provide a little background info about Wilm's tumor. (It was chosen at random).'''&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers component and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Service tab.  Then you will an idea if the Columbia Machine is processing a lot of queries.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
When the results are returned they are placed in the Project Folders as a child of the sequence they correspond to.  You can mouse over the result set to see how many sequences are in it.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer, you can examine the alignments.  Each different target hit is listed on a line in the results table.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer you can select sequences to add back to the main project by checking the include box and then the Add Selected Sequences To Your Project tab.&lt;br /&gt;
&lt;br /&gt;
You can also add just the aligned parts by clicking on the tab Only Add Aligned Parts.&lt;br /&gt;
&lt;br /&gt;
'''(Should we say anything about e value and bit score in this tutorial?)'''&lt;br /&gt;
'''Do we need more details on reading the data output?'''&lt;br /&gt;
&lt;br /&gt;
The Load button allows you to load an external Blast file in HTML format into the viewer.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial5.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2936</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2936"/>
				<updated>2006-03-29T20:19:34Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
'''Provide a little background info about Wilm's tumor. (It was chosen at random).'''&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers component and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Service tab.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
When the results are returned they are placed in the Project Folders as a child of the sequence they correspond to.  You can mouse over the result set to see how many sequences are in it.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer, you can examine the alignments.  Each different target hit is listed on a line in the results table.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer you can select sequences to add back to the main project by checking the include box and then the Add Selected Sequences To Your Project tab.&lt;br /&gt;
&lt;br /&gt;
You can also add just the aligned parts by clicking on the tab Only Add Aligned Parts.&lt;br /&gt;
&lt;br /&gt;
'''(Should we say anything about e value and bit score in this tutorial?)'''&lt;br /&gt;
'''Do we need more details on reading the data output?'''&lt;br /&gt;
&lt;br /&gt;
The Load button allows you to load an external Blast file in HTML format into the viewer.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial5.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2935</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2935"/>
				<updated>2006-03-29T20:16:33Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
'''Provide a little background info about Wilm's tumor. (It was chosen at random).'''&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Service tab.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
When the results are returned they are placed in the Project Folders as a child of the sequence they correspond to.  You can mouse over the result set to see how many sequences are in it.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer, you can examine the alignments.  Each different target hit is listed on a line in the results table.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer you can select sequences to add back to the main project by checking the include box and then the Add Selected Sequences To Your Project tab.&lt;br /&gt;
&lt;br /&gt;
You can also add just the aligned parts by clicking on the tab Only Add Aligned Parts.&lt;br /&gt;
&lt;br /&gt;
'''(Should we say anything about e value and bit score in this tutorial?)'''&lt;br /&gt;
'''Do we need more details on reading the data output?'''&lt;br /&gt;
&lt;br /&gt;
The Load button allows you to load an external Blast file in HTML format into the viewer.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial5.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2934</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2934"/>
				<updated>2006-03-29T19:21:03Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Service tab.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
When the results are returned they are placed in the Project Folders as a child of the sequence they correspond to.  You can mouse over the result set to see how many sequences are in it.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer, you can examine the alignments.  Each different target hit is listed on a line in the results table.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer you can select sequences to add back to the main project by checking the include box and then the Add Selected Sequences To Your Project tab.&lt;br /&gt;
&lt;br /&gt;
You can also add just the aligned parts by clicking on the tab Only Add Aligned Parts.&lt;br /&gt;
&lt;br /&gt;
'''(Should we say anything about e value and bit score in this tutorial?)'''&lt;br /&gt;
'''Do we need more details on reading the data output?'''&lt;br /&gt;
&lt;br /&gt;
The Load button allows you to load an external Blast file in HTML format into the viewer.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial5.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2933</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2933"/>
				<updated>2006-03-29T19:14:25Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Service tab.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
When the results are returned they are placed in the Project Folders as a child of the sequence they correspond to.  You can mouse over the result set to see how many sequences are in it.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer, you can examine the alignments.  Each different target hit is listed on a line in the results table.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer you can select sequences to add back to the main project by checking the include box and then the Add Selected Sequences To Your Project tab.&lt;br /&gt;
&lt;br /&gt;
You can also add just the aligned parts by clicking on the tab Only Add Aligned Parts.&lt;br /&gt;
&lt;br /&gt;
The Load button allows you to load an external Blast file in HTML format into the viewer.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial5.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2932</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2932"/>
				<updated>2006-03-29T17:40:46Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Server tab.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
When the results are returned they are placed in the Project Folders as a child of the sequence they correspond to.  You can mouse over the result set to see how many sequences are in it.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer, you can examine the alignments.  Each different target hit is listed on a line in the results table.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer you can select sequences to add back to the main project by checking the include box and then the Add Selected Sequences To Your Project tab.&lt;br /&gt;
&lt;br /&gt;
You can also add just the aligned parts by clicking on the tab Only Add Aligned Parts.&lt;br /&gt;
&lt;br /&gt;
The Load button allows you to load an external Blast file in HTML format into the viewer.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial5.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2931</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2931"/>
				<updated>2006-03-29T17:39:54Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Server tab.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
When the results are returned they are placed in the Project Folders as a child of the sequence they correspond to.  You can mouse over the result set to see how many sequences are in it.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer, you can examine the alignments.  Each different target hit is listed on a line in the results table.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer you can select sequences to add back to the main project by checking the include box and then the Add Selected Sequences To Your Project tab.&lt;br /&gt;
&lt;br /&gt;
You can also add just the aligned parts by clicking on the tab Only Add Aligned Parts.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial5.png]]&lt;br /&gt;
&lt;br /&gt;
The Load button allows you to load an external Blast file in HTML format into the viewer.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2930</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2930"/>
				<updated>2006-03-29T17:37:56Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Server tab.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
When the results are returned they are placed in the Project Folders as a child of the sequence they correspond to.  You can mouse over the result set to see how many sequences are in it.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer, you can examine the alignments.  Each different target hit is listed on a line in the results table.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer you can select sequences to add back to the main project by checking the include box and then the Add Selected Sequences To Your Project tab.&lt;br /&gt;
&lt;br /&gt;
You can also add just the aligned parts by clicking on the tab Only Add Aligned Parts.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial5.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2929</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2929"/>
				<updated>2006-03-29T17:33:52Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Server tab.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
When the results are returned they are placed in the Project Folders as a child of the sequence they correspond to.  You can mouse over the result set to see how many sequences are in it.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer, you can examine the alignments.  Each different target hit is listed on a line in the results table.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer you can select sequences to add back to the main project by checking the include box and then the Add Selected Sequences To Your Project tab.&lt;br /&gt;
&lt;br /&gt;
You can also add just the aligned parts by clicking on the tab Only Add Aligned Parts.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2928</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2928"/>
				<updated>2006-03-29T17:33:21Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Server tab.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
When the results return they are placed in the Project Folders as a child of the sequence they correspond to.  You can mouse over the result set to see how many sequences are in it.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer, you can examine the alignments.  Each different target hit is listed on a line in the results table.&lt;br /&gt;
&lt;br /&gt;
In the Blast results viewer you can select sequences to add back to the main project by checking the include box and then the Add Selected Sequences To Your Project tab.&lt;br /&gt;
&lt;br /&gt;
You can also add just the aligned parts by clicking on the tab Only Add Aligned Parts.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2927</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2927"/>
				<updated>2006-03-29T17:22:40Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
The result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
Note: The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Server tab.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2926</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2926"/>
				<updated>2006-03-29T17:21:18Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
Note: Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
Note: The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Server tab.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2925</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2925"/>
				<updated>2006-03-29T17:20:36Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
Note: The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Server tab.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=File:(T)Blast_Tutorial5.png&amp;diff=2924</id>
		<title>File:(T)Blast Tutorial5.png</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=File:(T)Blast_Tutorial5.png&amp;diff=2924"/>
				<updated>2006-03-29T17:20:00Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2923</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2923"/>
				<updated>2006-03-29T17:14:42Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
Note: The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button, under the Server tab.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2922</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2922"/>
				<updated>2006-03-29T17:13:48Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
Note: The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button.  This will give you an idea of how busy the Columbia Machine is.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial4.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=File:(T)Blast_Tutorial4.png&amp;diff=2921</id>
		<title>File:(T)Blast Tutorial4.png</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=File:(T)Blast_Tutorial4.png&amp;diff=2921"/>
				<updated>2006-03-29T17:12:55Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2920</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2920"/>
				<updated>2006-03-29T17:09:55Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
Note: The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Observe the progress bar, Blast is now runnning.&lt;br /&gt;
&lt;br /&gt;
*You can check the server status  by hitting the Refresh button.  This will give you an idea of how busy the Columbia Machine is.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2919</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2919"/>
				<updated>2006-03-29T17:07:18Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
Note: The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial3.png]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2918</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2918"/>
				<updated>2006-03-29T17:06:23Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Service tab, select Columbia.&lt;br /&gt;
&lt;br /&gt;
Note: The text field at the bottom shows that one sequence has been selected.  If you have a Fasta file that has multiple sequences, you can select the ones you want in the Markers componenet and activate this selection, letting you search on a subset. You may search on all sequences in a file by clicking the All Markers checkbox.&lt;br /&gt;
&lt;br /&gt;
*Press the curved arrow submit button.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=File:(T)Blast_Tutorial3.png&amp;diff=2917</id>
		<title>File:(T)Blast Tutorial3.png</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=File:(T)Blast_Tutorial3.png&amp;diff=2917"/>
				<updated>2006-03-29T17:05:54Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2916</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2916"/>
				<updated>2006-03-29T16:41:44Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial2.png ]]&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2915</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2915"/>
				<updated>2006-03-29T16:40:36Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Click on the Advanced Options Tab&lt;br /&gt;
&lt;br /&gt;
*Make sure &amp;quot;dna mat&amp;quot; is selected for the Matrix.&lt;br /&gt;
&lt;br /&gt;
*Change the Expect Value to 0.01.&lt;br /&gt;
&lt;br /&gt;
*Leave the box checked for PFP filtering for repeated sequence elements (Paracel Filtering Package).&lt;br /&gt;
&lt;br /&gt;
*Leave the Display result in your web browser checked.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=File:(T)Blast_Tutorial2.png&amp;diff=2914</id>
		<title>File:(T)Blast Tutorial2.png</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=File:(T)Blast_Tutorial2.png&amp;diff=2914"/>
				<updated>2006-03-29T16:40:06Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2913</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2913"/>
				<updated>2006-03-29T16:18:12Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  '''ncbi/nt''' - the complete non-redundant nucleotide database.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2912</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2912"/>
				<updated>2006-03-29T16:17:15Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a '''nucelotide query''', we want to select a nucleotide query program '''blastn'''.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  ncbi/nt - the complete non-redundant nucleotide database.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2911</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2911"/>
				<updated>2006-03-29T16:15:33Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a nucelotide query, we want to select a nucleotide query program blastn.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutoria1.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  ncbi/nt - the complete non-redundant nucleotide database.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2910</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2910"/>
				<updated>2006-03-29T16:14:45Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a nucelotide query, we want to select a nucleotide query program blastn.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial1.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  ncbi/nt - the complete non-redundant nucleotide database.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2909</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2909"/>
				<updated>2006-03-29T16:14:19Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a nucelotide query, we want to select a nucleotide query program blastn.&lt;br /&gt;
&lt;br /&gt;
[[Image:Blast Tutorial1.png]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  ncbi/nt - the complete non-redundant nucleotide database.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=File:(T)Blast_Tutoria1.png&amp;diff=2908</id>
		<title>File:(T)Blast Tutoria1.png</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=File:(T)Blast_Tutoria1.png&amp;diff=2908"/>
				<updated>2006-03-29T16:13:28Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2907</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2907"/>
				<updated>2006-03-29T16:04:57Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
[[Image:(T)Blast Tutorial.png]]&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one) due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a nucelotide query, we want to select a nucleotide query program blastn.&lt;br /&gt;
&lt;br /&gt;
'insert screen shot here'&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  ncbi/nt - the complete non-redundant nucleotide database.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=File:(T)Blast_Tutorial.png&amp;diff=2906</id>
		<title>File:(T)Blast Tutorial.png</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=File:(T)Blast_Tutorial.png&amp;diff=2906"/>
				<updated>2006-03-29T15:58:36Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2896</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2896"/>
				<updated>2006-03-24T20:25:18Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
'insert screen shot here'&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one)due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a nucelotide query, we want to select a nucleotide query program blastn.&lt;br /&gt;
&lt;br /&gt;
'insert screen shot here'&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  ncbi/nt - the complete non-redundant nucleotide database.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2895</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2895"/>
				<updated>2006-03-24T20:19:09Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast tab.&lt;br /&gt;
&lt;br /&gt;
'insert screen shot here'&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one)due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a nucelotide query, we want to select a nucleotide query program blastn.&lt;br /&gt;
&lt;br /&gt;
'insert screen shot here'&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select  ncbi/nt - the complete non-redundant nucleotide database.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast button.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2894</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2894"/>
				<updated>2006-03-24T20:17:34Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast button.&lt;br /&gt;
&lt;br /&gt;
'insert screen shot here'&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one)due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a nucelotide query, we want to select a nucleotide query program blastn.&lt;br /&gt;
&lt;br /&gt;
'insert screen shot here'&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here select ncbi/nt - the complete non-redundant nucleotide database.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2893</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2893"/>
				<updated>2006-03-24T20:08:09Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast button.&lt;br /&gt;
&lt;br /&gt;
'insert screen shot here'&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one)due to the sample.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a nucelotide query, we want to select a nucleotide query program blastn.&lt;br /&gt;
&lt;br /&gt;
'insert screen shot here'&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Now that the program has been selected, make sure the appropriate databases are displayed (you need to verify this for all algorithms). Here we will try ncbi/nt - the complete non-redundant nucleotide database.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2892</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2892"/>
				<updated>2006-03-24T19:50:25Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
'insert screen shot here'&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast button.&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one)due to the sample.&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a nucelotide query, we want to select a nucleotide query program such as blastn.&lt;br /&gt;
&lt;br /&gt;
'insert screen shot here'&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;br /&gt;
&lt;br /&gt;
'''blastp'''- Compares an amino acid query sequence against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastn'''- Compares a nucleotide query sequence against a nucleotide sequence database.&lt;br /&gt;
&lt;br /&gt;
'''blastx'''- Compares a nucleotide query sequence translated in all reading frames against a protein sequence database.&lt;br /&gt;
&lt;br /&gt;
'''tblastn'''- Compares a protein query sequence against a nucleotide database dynamically  translated in all reading frames.&lt;br /&gt;
&lt;br /&gt;
'''tblastx'''- Compares the 6 frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database.&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2891</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2891"/>
				<updated>2006-03-24T19:41:00Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;br /&gt;
&lt;br /&gt;
'insert screen shot here'&lt;br /&gt;
&lt;br /&gt;
* In the Visualization Area click on the Sequence Alignment tab.&lt;br /&gt;
&lt;br /&gt;
* Click on the Blast button.&lt;br /&gt;
&lt;br /&gt;
Note that the result displays the length of the longest sequence selected (here there is only one)due to the sample.&lt;br /&gt;
&lt;br /&gt;
Click on the drop down arrow and select a program. Since this is a nucelotide query, we want to select a nucleotide query program such as blastn.&lt;br /&gt;
&lt;br /&gt;
'insert screen shot here'&lt;br /&gt;
&lt;br /&gt;
There are five different types of queries you can run, depending on what data you are using:&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2890</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2890"/>
				<updated>2006-03-24T18:02:47Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence &amp;quot;NP_077744-Wilms.fasta&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Provide a little background info about Wilm's tumor. (It was chosen at random).&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	<entry>
		<id>http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2889</id>
		<title>User:Ginhoven</title>
		<link rel="alternate" type="text/html" href="http://wiki.c2b2.columbia.edu/workbench/index.php?title=User:Ginhoven&amp;diff=2889"/>
				<updated>2006-03-24T16:45:21Z</updated>
		
		<summary type="html">&lt;p&gt;Ginhoven: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==TUTORIAL - BLAST==&lt;br /&gt;
&lt;br /&gt;
In this Tutorial you will learn to:&lt;br /&gt;
&lt;br /&gt;
* Set up and perform a Blast search.&lt;br /&gt;
&lt;br /&gt;
* Decipher the Output.&lt;br /&gt;
&lt;br /&gt;
* Analyze the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
'''OVERVIEW'''&lt;br /&gt;
&lt;br /&gt;
A reason why you may want to do a BLAST Query may be that you have found an interesting marker, so you want to retrieve it's gene, and see what it is related to.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
BLAST searches are divided into categories according to the nature, and size of the input query and the primary goal of the search.&lt;br /&gt;
&lt;br /&gt;
A BLAST search has four components:&lt;br /&gt;
&lt;br /&gt;
* Query&lt;br /&gt;
&lt;br /&gt;
* Data Base Program&lt;br /&gt;
&lt;br /&gt;
* Search Purpose&lt;br /&gt;
&lt;br /&gt;
* Goal&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the purpose of this tutorial use the file &amp;quot;NM _024426-Wilms.Fasta&amp;quot; provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file&lt;/div&gt;</summary>
		<author><name>Ginhoven</name></author>	</entry>

	</feed>