Difference between revisions of "User:Smith"

Line 1: Line 1:
 
  
 
==Tutorial: Promoter Analysis==
 
==Tutorial: Promoter Analysis==
Line 15: Line 14:
  
 
==Tutorial: Sequence Analysis - BLAST==
 
==Tutorial: Sequence Analysis - BLAST==
 +
 +
Use the file "NM_024426-Wilms.fasta" provided in the tutorial data directory.  This is a nucleotide sequence file.  There is a second file which contains the corresponding protein sequence, "NP_077744-Wilms.fasta".
 +
 +
Provide a little background info about Wilm's tumor.  (It was chosen at random).
 +
 +
Go to Sequence Alignment.
 +
 +
Select BLAST.
 +
 +
Note that the subsequence displays the length of the longest sequence selected (here there is only one).  It can be used to select out a portion of the sequence to use for the query ( probably wouldn't make much sense if more than one sequence is selected for the query).
 +
 +
Select a program.  Since this is a nucelotide query, we want to select a nucleotide query program such as blastn.
 +
 +
Provide a one line description of each of the different blast algorithms.  We have this info on the AMDeC website.
 +
 +
Now that the program has been selected, note that the appropriate databases are displayed (need to verify this for all algorithms).  Here we will try ncbi/nt - the complete non-redundant nucleotide database.
 +
 +
Go to the advanced options tab.  Make sure the matrix "dna.mat" is selected.  Change the Expect value to 0.01.  We will leave checked the box to use PFP filtering for repeated sequence elements (Paracel Filtering Package).
 +
 +
In the Service tab, select Columbia.
 +
 +
Note the text field at bottom which shows that one sequence has been selected.  If you have a fasta file with mulitple sequences, you can select the ones you want in the Markers component and activate this selection, letting you search on a subset.  Or, you can search on all the sequences in a file (all markers checkbox, or also by default if no subselection made?  find out).
 +
 +
You can check the server status by hitting the "Refresh" button.  For the columbia machine, this can give you an idea of how busy it is.
 +
 +
Press the curved arrow submit button.  Observe the progress bar "Blast is running".

Revision as of 12:28, 22 March 2006

Tutorial: Promoter Analysis

Tutorial: Regulatory Network Reverse Engineering

Tutorial: Integrated Annotation Information ???

Tutorial: Enrichment Analysis/GO Term component

Tutorial: Sequence Analysis - BLAST

Use the file "NM_024426-Wilms.fasta" provided in the tutorial data directory. This is a nucleotide sequence file. There is a second file which contains the corresponding protein sequence, "NP_077744-Wilms.fasta".

Provide a little background info about Wilm's tumor. (It was chosen at random).

Go to Sequence Alignment.

Select BLAST.

Note that the subsequence displays the length of the longest sequence selected (here there is only one). It can be used to select out a portion of the sequence to use for the query ( probably wouldn't make much sense if more than one sequence is selected for the query).

Select a program. Since this is a nucelotide query, we want to select a nucleotide query program such as blastn.

Provide a one line description of each of the different blast algorithms. We have this info on the AMDeC website.

Now that the program has been selected, note that the appropriate databases are displayed (need to verify this for all algorithms). Here we will try ncbi/nt - the complete non-redundant nucleotide database.

Go to the advanced options tab. Make sure the matrix "dna.mat" is selected. Change the Expect value to 0.01. We will leave checked the box to use PFP filtering for repeated sequence elements (Paracel Filtering Package).

In the Service tab, select Columbia.

Note the text field at bottom which shows that one sequence has been selected. If you have a fasta file with mulitple sequences, you can select the ones you want in the Markers component and activate this selection, letting you search on a subset. Or, you can search on all the sequences in a file (all markers checkbox, or also by default if no subselection made? find out).

You can check the server status by hitting the "Refresh" button. For the columbia machine, this can give you an idea of how busy it is.

Press the curved arrow submit button. Observe the progress bar "Blast is running".