Using the Entrez Browser retrieve the protein sequence of th
Using the Entrez Browser, retrieve the protein sequence of the E. coli RecA. Choose the BLASTP program (http://www.ncbi.nlm.nih.gov/BLAST/) and enter the RecA sequence in FASTA format into the input data window. Click on the format results window (note that you must wait in a queue for the results).
a. Report the GeneBank accession number and length of protein sequence of the E. coli RecA protein.
b. What scoring matrix and gap penalties were used?
c. How many database sequences were searched?
d. Report the name and GenBank access number of the highest scoring sequence.
e. Report the name and GenBank access number of the lowest reported score in this search and explain whether the lowest scoring sequence is significant (E-value)
Solution
a. GeneBank accession number - AFU91764.1 , length of protein sequence of the E. coli RecA - 329
b. Bit Score (672 bits) and no gap (0/329 = 0% gap)
c. 100 database sequences
d. RecA [Escherichia coli] and GenBank access number AFU91764.1
e. Recombinase RecA [Citrobacter amalonaticus] and GenBank access number WP_061075067.1
The sequence of the Recombinase RecA [Citrobacter amalonaticus] is also significant at the lowest scoring
sequence at 96%.
