Answered step by step
Verified Expert Solution
Link Copied!

Question

00
1 Approved Answer

In this project, we will look for buried treasure, in the form of a 9-char motif, in the following three sequences: A: CCAGACATTGTCCTAAGTCTAGAGACTAGCAAAAAGTGCGGGGAGGTGGGGGTATTT CCTCCCGACCACAATGAACATACGGAGACGGTCTGGGACCTTA B:

image text in transcribed

In this project, we will look for buried treasure, in the form of a 9-char motif, in the following three sequences: A: CCAGACATTGTCCTAAGTCTAGAGACTAGCAAAAAGTGCGGGGAGGTGGGGGTATTT CCTCCCGACCACAATGAACATACGGAGACGGTCTGGGACCTTA B: GGCGGTGGTGGCAATCTCAGGGACCCCGACATAGTCACCAGGCCTTGTTCCGGATGT CACGCACGACTAGACCAACCTAAATTCACATGTCCTGCCTTCA C: AATTGACGAAGCTGAGCCCAGAGTGCTATGTCCTGAACCGTTAATGCCGACTTTGTC TTGTTAACTTTAACGTCAGGAAAATGACTCCTACGGAGCGGAA Each of the sequences is exactly 100 nucleotides long. Using Python, implement the motif finder described in Lecture 3 to identify the highest scoring motif in this collection. Report the motif and score. "Reporting the motif means the 9-letter extracts from A, B and C. HINT: Consider this a one-time problem - i.e. do not make your program in any way 'general'. Hardwire it to handle exactly three sequences, in fact feel free to hard code the sequences into your program. Importantly, start by writing a function (call it 'score') that computes the profile score for any given starting positions in the three sequences. Since you need to extract the three aligned rows of the motif to compute that score, your function might as well return those too (i.e. return score AND motif). Do you notice anything interesting about the motif you discover? In this project, we will look for buried treasure, in the form of a 9-char motif, in the following three sequences: A: CCAGACATTGTCCTAAGTCTAGAGACTAGCAAAAAGTGCGGGGAGGTGGGGGTATTT CCTCCCGACCACAATGAACATACGGAGACGGTCTGGGACCTTA B: GGCGGTGGTGGCAATCTCAGGGACCCCGACATAGTCACCAGGCCTTGTTCCGGATGT CACGCACGACTAGACCAACCTAAATTCACATGTCCTGCCTTCA C: AATTGACGAAGCTGAGCCCAGAGTGCTATGTCCTGAACCGTTAATGCCGACTTTGTC TTGTTAACTTTAACGTCAGGAAAATGACTCCTACGGAGCGGAA Each of the sequences is exactly 100 nucleotides long. Using Python, implement the motif finder described in Lecture 3 to identify the highest scoring motif in this collection. Report the motif and score. "Reporting the motif means the 9-letter extracts from A, B and C. HINT: Consider this a one-time problem - i.e. do not make your program in any way 'general'. Hardwire it to handle exactly three sequences, in fact feel free to hard code the sequences into your program. Importantly, start by writing a function (call it 'score') that computes the profile score for any given starting positions in the three sequences. Since you need to extract the three aligned rows of the motif to compute that score, your function might as well return those too (i.e. return score AND motif). Do you notice anything interesting about the motif you discover

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access with AI-Powered Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Students also viewed these Databases questions