You begin to sequence the genome of Tamatoa gathering 5000 r
You begin to sequence the genome of Tamatoa, gathering 5,000 reads that were each 600 base pairs long. You have hypothesized that the Tamatoa genome is about 2 million bases long.
(a) At what coverage have you sequenced the genome thusfar?
(b) If the coverage of the Tamatoa genome were 6X, what is the probability that a base will be unsequenced?
Solution
Genome coverage = LN/G
where L= average read length
N= no. of reads
G = haploid genome length
Therefore the coverage= 600 x5000 / 2000000
Coverage = 1.5x
b) If the coverage of the Tamatoa genome were 6X, the probability that a base will be unsequenced is zero.
