Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Code should be for PYTHON and explain each step, beginners coding. DNA Sequencing This assignment deals with DNA sequencing. A DNA sequence is a series

image text in transcribed

Code should be for PYTHON and explain each step, beginners coding.

DNA Sequencing This assignment deals with DNA sequencing. A DNA sequence is a series of nucleotides: adenine (A). guanine (G), cytosine (C) and thymine (T). These sequences can be represented as strings of their first e.g., GCACTAG. Within these DNA sequences, researchers are interested in finding specific genic sequences (genes). The input we receive from DNA sequencers usually comes in a continuous stream, so the string may start with some upstream sequence (data from a previous gene), and may ontinue beyond the end of the specific gene that is of interest with a downstream sequence (the start of another gene). Fortunately, all genes start with the sequence ATG, and the sequence ATG cannot appear in the middle of a gne 1. This makes it possible to isolate and analyze a specific gene from a sequence. Your Tasks For this assignment, you will be required to build the following 5 functions: split-input: Takes in a DNA sequence (as described above) and returns a list with three elements. the upstream data, the gene (if any is found, or an empty string if no gene is found), and the downstream data, in that order .get-gene: Takes in a DNA sequence (as described above) and returns a string representation of the gene if one is present, or the string ERROR if no gene is present. validate-gene: Takes in a string representation of a gene, and returns True iff the gene presented is valid. For a gene to be valid it must satisfy the following critera: - It must start with the start codon (3 character sequence) ATG -It must contain at least one codon after the start codon - It must contain only full codons i.e, it cannot end mid-way through a 3 character codon) -It must never contain four consecutive identical nucleotides Note This isn't actually true in the real world, bat it makes our lives a lot eases, so we'll pretend is.palindromic. Takes a string representation of a gene, and returns True iff that gene is palindrome (reads the same forwards as backwards). evaluate.sequence: Takes in a DNA sequence (as described above) and retums one of the follow ing strings as appropriate: No Gene Found, Invalid Gene, Valid Gene Found, Valid Palindromic Gene Found

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Data And Information Quality Dimensions, Principles And Techniques

Authors: Carlo Batini, Monica Scannapieco

1st Edition

3319241060, 9783319241067

More Books

Students also viewed these Databases questions

Question

What is the difference between an annual report and a 10-K report?

Answered: 1 week ago

Question

6. Vanguard

Answered: 1 week ago