Program in python Biologists use a sequence of letters A C T
Program in python
Biologists use a sequence of letters A, C, T, and G to model a genome. A gene is a substring of a genome that starts after a triplet ATG and ends before a triplet TAG, TAA, or TGA. Furthermore, the length of a gene string is a multiple of 3 and the gene does not contain any of the triplets ATG, TAG, TAA, and TGA. Write a program that prompts the user to enter a genome and displays all genes in the genome. If no gene is found in the input sequence, the program displays \"No gene is found\". Here are the sample runs: Enter a genome string: TTATGTTTTAAGGATGGGGCGTTAGTT [ENTER] TTT GGGCGT Enter a genome string: TGTGTGTATAT [ENTER] No gene is foundSolution
from collections import Counter
#Enter the Genome
#genome = raw_input(\"Ask user for something.\")
genome = \"TTATGTTTTAAGGATGGGGCGTTAGTT\"
#print(genome)
#count the number of
number = genome.count(\"ATG\")
#i is counter till the number of gene found
i=0
#found_gene index
found_gene=0
#a,b,c are the respective index of TAG , TAA , TGA
a=0
b=0
c=0
#Crop is slicing the processed string
crop=0
while(i < number) :
found_gene = genome.find(\"ATG\",c+ 1)
#slicing the not used substring
genome = genome[found_gene+3:]
a = genome.find(\"TAG\")
b = genome.find(\"TAA\")
c = genome.find(\"TGA\")
foo = [a,b,c]
#j is finding the substring from TAG , TAA , TGA which came first
j= min([element for element in foo if element > 0])
#printing the gene
print (genome[0:j])
genome = genome[j:]
#counter till number of gene found
i = i+1
