Answered step by step
Verified Expert Solution
Question
1 Approved Answer
Algorithms Question, thanks Recall the Edit Distance (Sequence Alignment) problem: given two strings over the same alphabet and mismatch and gap penalties, find an alignment
Algorithms Question, thanks
Recall the Edit Distance (Sequence Alignment) problem: given two strings over the same alphabet and mismatch and gap penalties, find an alignment of minimal cost. One of the most common uses of the minimum edit distance algorithm is in computational biology. DNA sequences are composed of four amino-acids, denoted by the letters A, C, T, G. Mutation over the course of evolution changes the sequences by deleting, inserting, or substituting amino-acids. The smaller the edit distance between some two sequences, the smaller the evolutionary distance between them. Thus, biological sequences are often aligned to minimize the edit distance between them 4-1 (a) Suppose the costs of mismatches and gaps are not the same for all the letters in the DNA sequences (since some mutations are more common than others). The following table states the cost of each operation for each letter A C T G C.2 0 .2 .3.1 G1 .2 .2 0 .1 In the substitution table, the entry in row A, column C, is the penalty OAC for A to C mismatch (but not vise versa) and the entry in row A, column -, is the cost of aligning A atop a gap Find the minimum edit distance AND the optimal alignment between the following sequences (show the matrix of your calculations) Sequence 1 (top): GATTACA Sequence 2 (bottom):A TTA A C Recall the Edit Distance (Sequence Alignment) problem: given two strings over the same alphabet and mismatch and gap penalties, find an alignment of minimal cost. One of the most common uses of the minimum edit distance algorithm is in computational biology. DNA sequences are composed of four amino-acids, denoted by the letters A, C, T, G. Mutation over the course of evolution changes the sequences by deleting, inserting, or substituting amino-acids. The smaller the edit distance between some two sequences, the smaller the evolutionary distance between them. Thus, biological sequences are often aligned to minimize the edit distance between them 4-1 (a) Suppose the costs of mismatches and gaps are not the same for all the letters in the DNA sequences (since some mutations are more common than others). The following table states the cost of each operation for each letter A C T G C.2 0 .2 .3.1 G1 .2 .2 0 .1 In the substitution table, the entry in row A, column C, is the penalty OAC for A to C mismatch (but not vise versa) and the entry in row A, column -, is the cost of aligning A atop a gap Find the minimum edit distance AND the optimal alignment between the following sequences (show the matrix of your calculations) Sequence 1 (top): GATTACA Sequence 2 (bottom):A TTA A CStep by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started