Expected value of finding a sequence (Tim Smith)
@alvaro-j-gonzalez-5813
Last seen 9.6 years ago
Hi Tim, There are 4^4 = 256 possible 4-mers, ATTG in your example being one of them. So what is the probability that you pick any position k in your genome of length L and find ATTG starting there? Assuming the four bases are equally likely and independent, it is 1/256. Now, how many times do you expect to find it across positions 1, 2, ..., k, ..., L? Expectations add up over positions, so if you excuse the boundary conditions (the last 3 positions cannot start a 4-mer), which is perfectly fine for short motifs and long genomes, it is (1/256 + 1/256 + ...) L times, or L/256. I agree, this is an approximation, but it actually works pretty well. Regards, - Al.
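In case it helps to check the arithmetic, here is a rough Python sketch. It assumes uniform, independent bases (real genomes are not like that), and the function names `expected_count`, `count_occurrences` and `simulate_mean_count` are made up for illustration. It compares the boundary-corrected expectation (L - k + 1) / 4^k with the average count over a few simulated random genomes.

```python
import random


def expected_count(genome_length: int, motif_length: int) -> float:
    """Expected occurrences of one fixed motif, assuming i.i.d. uniform bases."""
    # Only genome_length - motif_length + 1 positions can start a full motif.
    positions = max(genome_length - motif_length + 1, 0)
    return positions / 4 ** motif_length


def count_occurrences(genome: str, motif: str) -> int:
    """Count occurrences of motif in genome, allowing overlapping matches."""
    k = len(motif)
    return sum(1 for i in range(len(genome) - k + 1) if genome[i:i + k] == motif)


def simulate_mean_count(motif: str, genome_length: int, trials: int, seed: int = 0) -> float:
    """Average motif count over `trials` random genomes (uniform bases assumed)."""
    rng = random.Random(seed)
    total = 0
    for _ in range(trials):
        genome = "".join(rng.choices("ACGT", k=genome_length))
        total += count_occurrences(genome, motif)
    return total / trials


if __name__ == "__main__":
    L, motif = 100_000, "ATTG"
    print("L / 4^k approximation:", L / 4 ** len(motif))
    print("with boundary correction:", expected_count(L, len(motif)))
    print("simulated mean count:", simulate_mean_count(motif, L, trials=10))
```

For a long genome the boundary correction changes the answer by only 3/256, which is why the L/256 shortcut is usually good enough.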
@alvaro-j-gonzalez-5815
Last seen 9.6 years ago
:-D I caught that one two seconds after hitting "send" ... of course I meant 4^4 = 256. But you get the idea. - Al.
