Next: Pairwise Sequence Alignment
Up: 300632 UE Exercises for
Previous: Database Search
It has been shown that the TIM barrel domains of the two
proteins flavocytochrome b2 (PDB code 1fcb)
and phosphoribosylanthranilate isomerase (PDB code 1pii) are
very similar in terms of their three-dimensional structure
[Tang CL et al (2003) J Mol Biol 334(5):1043-1062;
doi:10.1016/j.jmb.2003.10.025]. This similarity may not be detectable by
traditional similarity searches.
- Download yeast flavocytochrome b2 protein sequence in
FASTA format from NCBI Entrez
database (Hint: accession number NP_013658).
- Perform a protein BLAST search
http://blast.ncbi.nlm.nih.gov/Blast.cgi of
flavocytochrome b2 against the PDB database using
default parameters. (Summarize the number of hits, highest and lowest
bit scores in a table).
- Change the
-value to 0.01 and the word size from 3 to 2 and do
the search again. (Do you see any differences in the number of hits?
Can you find 1pii in the search result?)
- Reset the
-value to 10. Change the substitution matrix from
BLOSUM62 to BLOSUM45. (Compare the search
results. What is your conclusion in terms of selectivity and
sensitivity of your search?)
- Reset the substitution matrix to BLOSUM62. Run the search
with and without the low-complexity filter. (Record the number of hits,
highest and lowest bit scores).
- Run the same search as in item 2. using FASTA
http://www.ebi.ac.uk/Tools/fasta33/ (Compare results with those
from BLAST).
- Return to the NCBI Blast homepage and repeat the search
using the algorithms psi-blast and
phi-blast. (Record the number of hits and try to find
1pii in the result page. Download all hits below
-value
0.005 in FASTA format).
- Repeat the search with the hidden Markov model based approach
HHPRED
http://toolkit.tuebingen.mpg.de/hhpred. (Within
the results can you find 1pii? Summarize how well different
programs can find remote homologies.)
- Goto the PDB database http://www.pdb.org/
and search for 1pii. Click on Tab ``3D Similarity" and check if
you find some of the proteins in your previous similarity search results.
Next: Pairwise Sequence Alignment
Up: 300632 UE Exercises for
Previous: Database Search
Christoph Flamm
2009-01-06