Question

Posted on Oct 10, 2024

Problem Statement

The GFF3 format is commonly used in bioinformatics for representing sequence annotation. You can find the specification here:

The genome and annotation for Saccharomyces cerevisiae S288C is on the class server here:

/home/jorvisl/Saccharomyces_cerevisiae_S288C.annotation.gff

Note that this same file has both the annotation feature table and the FASTA sequence for the molecules referenced. (See the '##FASTA' directive in the specification.) Within the feature table, another column of note is the 9th, where we can store any key=value pairs relevant to that row's feature, such as ID, Ontology_term or Note.

Your task is to write a GFF3 feature exporter. A user should be able to run your script like this:

% export_gff3_feature.py --source_gff=/path/to/some.gff3 --type=gene --attribute=ID --value=YAR003W

There are 4 arguments here that correspond to values in the GFF3 columns. In this case, your script should read the GFF3 file at the given path and find any gene (column 3) which has ID=YAR003W (column 9). When it finds one, it should use the coordinates for that feature (columns 4, 5 and 7) and the FASTA sequence at the end of the document to return the feature's FASTA sequence.

Your script should work regardless of the parameter values passed, warning the user if no features were found that matched their query. (It should also check and warn if more than one feature matches the query.)

The output should just be printed on STDOUT (no writing to a file is necessary). It should have a header which matches the query, like this:

>gene:ID:YAR003W
... sequence here ...

As an extra challenge, you can format the sequence portion of the FASTA output as 60 characters per line, which follows the standard.

Provide the complete source code AND the output of the program as it runs. You should do test runs with 3 features which are present in the file and 1 where you intentionally enter a feature NOT present in the file. Your script should handle this gracefully.
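One way to approach the exporter described above is sketched below in Python. It is not the posted expert solution, only a minimal sketch under some assumptions: the helper names (parse_gff3, find_matches, extract, to_fasta) are made up for illustration, warnings go to STDERR so that STDOUT holds only the FASTA record, multi-valued attributes are treated as comma-separated, and when several features match, the first one is exported after a warning. Coordinates are taken as 1-based and inclusive, and features on the '-' strand are reverse-complemented.

```python
#!/usr/bin/env python3
"""Sketch of export_gff3_feature.py: print one GFF3 feature as FASTA."""
import argparse
import sys

COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def parse_gff3(lines):
    """Split GFF3 lines into 9-column feature rows and a {seqid: sequence} dict."""
    features, chunks, in_fasta, seqid = [], {}, False, None
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("##FASTA"):
            in_fasta = True                      # everything after this is FASTA
        elif in_fasta:
            if line.startswith(">"):
                seqid = line[1:].split()[0]      # sequence ID up to first space
                chunks[seqid] = []
            elif seqid is not None:
                chunks[seqid].append(line.strip())
        elif line and not line.startswith("#"):
            cols = line.split("\t")
            if len(cols) == 9:
                features.append(cols)
    return features, {k: "".join(v) for k, v in chunks.items()}


def attributes(col9):
    """Parse column 9 ('ID=x;Name=y') into a dict."""
    pairs = (item.split("=", 1) for item in col9.split(";") if "=" in item)
    return {key.strip(): value.strip() for key, value in pairs}


def find_matches(features, ftype, attribute, value):
    """Return rows whose type (column 3) and attribute (column 9) match the query."""
    matches = []
    for row in features:
        attrs = attributes(row[8])
        # Assumption: multiple values for one key are comma-separated.
        if row[2] == ftype and value in attrs.get(attribute, "").split(","):
            matches.append(row)
    return matches


def extract(seqs, row):
    """Slice the feature by start/end (columns 4, 5) and strand (column 7)."""
    seq = seqs[row[0]][int(row[3]) - 1:int(row[4])]   # 1-based inclusive
    if row[6] == "-":
        seq = seq.translate(COMPLEMENT)[::-1]         # reverse complement
    return seq


def to_fasta(header, seq, width=60):
    """Format a FASTA record with `width` residues per line."""
    body = [seq[i:i + width] for i in range(0, len(seq), width)]
    return "\n".join([">" + header] + body)


def main(argv):
    parser = argparse.ArgumentParser(description="Export a GFF3 feature as FASTA")
    for name in ("source_gff", "type", "attribute", "value"):
        parser.add_argument("--" + name, required=True)
    args = parser.parse_args(argv)
    with open(args.source_gff) as handle:
        features, seqs = parse_gff3(handle)
    matches = find_matches(features, args.type, args.attribute, args.value)
    query = f"{args.type}:{args.attribute}:{args.value}"
    if not matches:
        print(f"WARNING: no feature matched {query}", file=sys.stderr)
        return 1
    if len(matches) > 1:
        print(f"WARNING: {len(matches)} features matched {query}; "
              "exporting the first", file=sys.stderr)
    if matches[0][0] not in seqs:
        print(f"WARNING: no FASTA sequence for {matches[0][0]}", file=sys.stderr)
        return 1
    print(to_fasta(query, extract(seqs, matches[0])))
    return 0


# Only act as a command-line tool when arguments are actually supplied.
if __name__ == "__main__" and len(sys.argv) > 1:
    sys.exit(main(sys.argv[1:]))
```

Run with the command line from the problem statement, a sketch like this would print a record headed ">gene:ID:YAR003W" if exactly one gene carries that ID, and a warning on STDERR with a non-zero exit status if none does. The whole file is read into memory, which should be acceptable for a yeast-sized genome; a larger annotation might call for indexing the FASTA section instead.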
