Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Question 1: Most Likely Mutation Tree This question is inspired by this research article: Spada et al. J Clin Microbiol. 2004 Sep; 42(9): 4230-4236. and

image text in transcribed

Question 1: Most Likely Mutation Tree This question is inspired by this research article: Spada et al. J Clin Microbiol. 2004 Sep; 42(9): 4230-4236. and this episode of the erstwhile popular TV show Numb3rs https://www.hulu.com/watch/315041 (need a hulu subscription). Viruses have RNA which mutate rapidly. Let us assume that the RNA of a viral species can be written as an l letter string made up of A, C, T and G. While replicating, a virus can mutate through random changes in k out of these l positions with probability proportional to 2k2. We collect viral samples starting from a single individual and after sequencing, we observe n mutants A1,,An, but we do not know which individual mutated to another nor what the order of these mutations were. We wish to reconstruct the mutation tree that connects Ai to Aj if Ai mutated into Aj or vice-versa. Assume that l is large enough that the same RNA sequence cannot be obtained through two different sequences of mutations. You are given a weighted undirected graph G whose vertices are the RNA sequences A1,,An and there is an edge between any two nodes (Ai,Aj) with weight 2d(i,j))2, where d(i,j) is the the number of differences between Ai and Aj. A spanning tree T of G then represents a possible history of mutations, the likelihood of which is given by the product of the edge weights of T. Show how to compute the most likely spanning tree T in this graph. Answer 1 (Expected Length: 6 lines) Question 1: Most Likely Mutation Tree This question is inspired by this research article: Spada et al. J Clin Microbiol. 2004 Sep; 42(9): 4230-4236. and this episode of the erstwhile popular TV show Numb3rs https://www.hulu.com/watch/315041 (need a hulu subscription). Viruses have RNA which mutate rapidly. Let us assume that the RNA of a viral species can be written as an l letter string made up of A, C, T and G. While replicating, a virus can mutate through random changes in k out of these l positions with probability proportional to 2k2. We collect viral samples starting from a single individual and after sequencing, we observe n mutants A1,,An, but we do not know which individual mutated to another nor what the order of these mutations were. We wish to reconstruct the mutation tree that connects Ai to Aj if Ai mutated into Aj or vice-versa. Assume that l is large enough that the same RNA sequence cannot be obtained through two different sequences of mutations. You are given a weighted undirected graph G whose vertices are the RNA sequences A1,,An and there is an edge between any two nodes (Ai,Aj) with weight 2d(i,j))2, where d(i,j) is the the number of differences between Ai and Aj. A spanning tree T of G then represents a possible history of mutations, the likelihood of which is given by the product of the edge weights of T. Show how to compute the most likely spanning tree T in this graph. Answer 1 (Expected Length: 6 lines)

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Students also viewed these Accounting questions

Question

Discuss how charts should be used in reports.

Answered: 1 week ago

Question

8. Explain the contact hypothesis.

Answered: 1 week ago

Question

2. Define the grand narrative.

Answered: 1 week ago