I need to know from which position in the DNA sequence I should start copying, and also the size of the protein and its function.
I know it is a mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase, but I have no idea about its function (I only know it is an enzyme) or its size.
I need the amino acid sequence too.
Identify the open reading frame in the following DNA sequence, the protein that this gene encodes for, its function, and the source. You can consult the bioinformatics exercise “Project 1: Databases for the Storage and ‘Mining’ of Genome Sequences” under student resources for Chapter 3 on the WileyPLUS website. The procedure to identify the gene and the protein that it encodes is as follows:
Look carefully at the DNA sequence provided, and identify the start site for transcription.
Click on the DNA sequence from the start site of transcription and select all of the sequence and copy the sequence.
Go to the National Center for Biotechnology Information website http://www.ncbi.nlm.nih.gov/ and click on BLAST on the right hand side under “Popular Resources”. BLAST is a program that will allow you to find the protein sequence for the DNA sequence (gene) you submit. Next click on blastx on the left hand column under the title “Basic Blast”.
Paste the DNA sequence into the box and click BLAST!. The search may take a few seconds and the page will keep updating until the search is completed.
When the search is complete you will have a figure showing the most homologous results or “sequences producing significant alignments” and following that, a list of what these proteins are. Your protein will be the first one on the list. You can click on the left hand side on the accession number or sequence identifier information which will bring up more information. You should be able to find the name, function, size (number of amino acids) and source (name of the organism) for the protein. Your answer should include the:
Amino acid sequence of the protein
Size of the protein
Identity of the protein
Function of the protein (10 marks)
AGACAGACGCATCGCTTCAAGGAGAAACAACATGATCCCAGTAATCCTTTCCGGTGGCAGCGGCTCGCGA CTCTGGCCTCTTTCCCGCAAGCAGTACCCCAAGCAGTTCCTCGCCCTCACCGGCGACGACACCCTGTTCC AGCAGACCATCAAGCGCCTGGCCTTCGACGGCATGCAGGCACCGCTGCTGGTGTGCAACAAGGAGCACCG CTTCATCGTCCAGGAACAGCTGGAGGCACAGAACCTGGCGAGCCAGGCGATCCTCCTCGAACCCTTCGGC CGCAACACGGCGCCGGCGGTGGCCATCGCCGCGATGAAACTGGTCGCCGAAGGCCGCGACGAACTGCTGC TGATCCTTCCCGCCGACCACGTGATCGAGGACCAGCGCGCCTTCCAGCAGGCCCTGGCGCTGGCCACCAA CGCCGCCGAAAAGGGCGAGATGGTGCTCTTCGGCATTCCCGCCAGCCGCCCCGAGACCGGCTACGGCTAC ATCCGCGCGAGCGCCGATGCGCAACTGCCGGAAGGCGTCAGCCGGGTGCAGAGCTTCGTCGAGAAGCCCG ACGAAGCCCGCGCCCGCGAGTTCGTCGCCGCCGGCGGCTACTACTGGAACAGCGGCATGTTCCTGTTCCG CGCCAGCCGCTACCTGGAAGAACTGAAGAAGCACGACGCCGACATCTACGACACCTGCCTGCTGGCCCTG GAGCGCAGCCAGCACGACGGCGACCTGGTGAACATCGACGCCGCCACCTTCGAATGCTGCCCGGACAACT CCATCGACTACGCGGTGATGGAGAAGACCTCACGCGCCTGCGTGGTGCCGCTGTCCGCCGGCTGGAACGA TGTCGGCAGCTGGTCGTCGATCTGGGACGTGCACGCCAAGGACGCCAACGGCAACGTCACCAAGGGCGAC GTGCTGGTCCACGACAGCCACAACTGCCTGGTGCACGGCAACGGCAAGCTGGTCTCGGTGATCGGCCTGG AGGACATCGTGGTGGTGGAAACCAAGGACGCCATGATGATCGCCCACAAGGACCGGGTGCAGGACGTCAA GCACGTGGTCAAGGACCTCGACGCCCAGGGCCGCAGCGAGACCCAGAACCACTGCGAGGTCTACCGCCCG TGGGGCTCCTACGACTCGGTGGACATGGGCGGCCGCTTCCAGGTCAAGCACATCACCGTGAAGCCCGGCG CGCGCCTCTCGCTGCAGATGCACCACCACCGCGCCGAGCACTGGATCGTGGTTTCCGGGACCGCCCAGGT GACCTGCGACGACAAGACCTTCCTGCTCACCGAGAACCAGTCGACCTACATCCCGATCGCCTCCGTGCAC CGCCTGGCCAACCCCGGCAAGATCCCGCTGGAGATCATCGAGGTGCAGTCCGGCAGCTACCTCGGCGAGG ACGACATCGAGCGCCTGGAAGACGTCTACGGGCGCACCGCAGAACCGGCCCTGCAAGTGGTCGCCGGCAG CCGCTGA
Solution
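Go to NCBI BLAST and choose blastx, which translates the nucleotide query and searches it against protein sequences. The open reading frame begins at the first ATG in the sequence (nucleotides 32-34, in ...GAAACAACATGATCCCA...) and runs to the TGA stop codon at the very end.
The top hit, with the highest score and the greatest similarity, is the alginate biosynthesis protein AlgA [Pseudomonas].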
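To see where the coding region starts, the first line of the DNA sequence from the question can be scanned for the first ATG and translated by hand. The following is only a minimal sketch using the standard genetic code; the function names (`clean`, `first_start_codon`, `translate_orf`) are made up for this illustration and are not part of BLAST or any NCBI tool. It assumes the gene is on the strand as given and that the first ATG is the real start codon, which the blastx result supports.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(dna):
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(dna.split()).upper()


def first_start_codon(dna):
    """Return the 0-based index of the first ATG, or -1 if there is none."""
    return clean(dna).find("ATG")


def translate_orf(dna):
    """Translate from the first ATG until a stop codon or the end of the input."""
    seq = clean(dna)
    start = seq.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")  # X = unrecognised codon
        if amino_acid == "*":  # stop codon reached
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 70 nucleotides of the sequence given in the question.
FIRST_LINE = "AGACAGACGCATCGCTTCAAGGAGAAACAACATGATCCCAGTAATCCTTTCCGGTGGCAGCGGCTCGCGA"

print(first_start_codon(FIRST_LINE) + 1)  # 1-based position of the A in ATG -> 32
print(translate_orf(FIRST_LINE))          # -> MIPVILSGGSGSR
```

The translated fragment matches the beginning of the protein sequence returned by blastx (MIPVILSGGS GSR...). Passing the full DNA sequence to `translate_orf` in the same way would translate the whole reading frame up to the TGA stop codon.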
NCBI accession number: WP_003092113.1
Size of the protein: 481 amino acids
Molecular weight: 52997 Da (about 53 kDa)
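Function: AlgA is a bifunctional enzyme in the alginate biosynthesis pathway. Its mannose-6-phosphate isomerase activity interconverts fructose 6-phosphate and mannose 6-phosphate, and its mannose-1-phosphate guanylyltransferase activity converts mannose 1-phosphate and GTP into GDP-alpha-D-mannose, the activated sugar precursor used to build alginate. Pseudomonas surrounds itself with a layer of alginate, a polysaccharide, which contributes to biofilm formation. The biofilm helps protect the cells from hostile conditions such as host defenses and antibiotics.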
The amino acid sequence is:
MIPVILSGGS GSRLWPLSRK QYPKQFLALT GDDTLFQQTI KRLAFDGMQA PLLVCNKEHR FIVQEQLEAQ NLASQAILLE PFGRNTAPAV AIAAMKLVAE GRDELLLILP ADHVIEDQRA FQQALALATN AAEKGEMVLF GIPASRPETG YGYIRASADA QLPEGVSRVQ SFVEKPDEAR AREFVAAGGY YWNSGMFLFR ASRYLEELKK HDADIYDTCL LALERSQHDG DLVNIDAATF ECCPDNSIDY AVMEKTSRAC VVPLSAGWND VGSWSSIWDV HAKDANGNVT KGDVLVHDSH NCLVHGNGKL VSVIGLEDIV VVETKDAMMI AHKDRVQDVK HVVKDLDAQG RSETQNHCEV YRPWGSYDSV DMGGRFQVKH ITVKPGARLS LQMHHHRAEH WIVVSGTAQV TCDDKTFLLT ENQSTYIPIA SVHRLANPGK IPLEIIEVQS GSYLGEDDIE RLEDVYGRTA EPALQVVAGSR
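As a quick sanity check on the size, the residues in the sequence above can be counted and compared with the length of the reading frame in the DNA. This is a rough sketch only: the helper names are hypothetical, and the mass estimate uses the common rule of thumb of about 110 Da per residue, which is an approximation and not how NCBI computes the molecular weight.

```python
# Protein sequence as listed in the answer, in blocks of ten residues.
PROTEIN = """
MIPVILSGGS GSRLWPLSRK QYPKQFLALT GDDTLFQQTI KRLAFDGMQA PLLVCNKEHR FIVQEQLEAQ NLASQAILLE PFGRNTAPAV AIAAMKLVAE
GRDELLLILP ADHVIEDQRA FQQALALATN AAEKGEMVLF GIPASRPETG YGYIRASADA QLPEGVSRVQ SFVEKPDEAR AREFVAAGGY YWNSGMFLFR
ASRYLEELKK HDADIYDTCL LALERSQHDG DLVNIDAATF ECCPDNSIDY AVMEKTSRAC VVPLSAGWND VGSWSSIWDV HAKDANGNVT KGDVLVHDSH
NCLVHGNGKL VSVIGLEDIV VVETKDAMMI AHKDRVQDVK HVVKDLDAQG RSETQNHCEV YRPWGSYDSV DMGGRFQVKH ITVKPGARLS LQMHHHRAEH
WIVVSGTAQV TCDDKTFLLT ENQSTYIPIA SVHRLANPGK IPLEIIEVQS GSYLGEDDIE RLEDVYGRTA EPALQVVAGSR
"""

AVERAGE_RESIDUE_MASS_DA = 110  # rule-of-thumb average, an assumption for estimation


def residue_count(sequence):
    """Count residues, ignoring the spaces and line breaks used for layout."""
    return len("".join(sequence.split()))


def orf_length_nt(n_residues):
    """Nucleotides needed: one codon per residue plus one stop codon."""
    return (n_residues + 1) * 3


def approx_mass_da(n_residues):
    """Very rough molecular mass estimate in daltons."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA


n = residue_count(PROTEIN)
print(n)                  # -> 481
print(orf_length_nt(n))   # -> 1446
print(approx_mass_da(n))  # -> 52910
```

The count agrees with the 481 amino acids reported, and the rough estimate of about 52.9 kDa is close to the reported molecular weight of 52997. The 1446-nucleotide reading frame plus the 31 nucleotides in front of the ATG accounts for the full DNA sequence given in the question.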

