Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Note: I need help with the last 4 parts: f, g, h, i. Pls show the code you use as well as the output. Thank

Note: I need help with the last 4 parts: f, g, h, i. Pls show the code you use as well as the output. Thank you
image text in transcribed
Write a R code to do each of the following tasks : a. (10 points) search for DNA sequences from the organism "Chlamydia trachomatis in the ACNUC "genbank". b. (10 points) How many sequences retrieved? c. (10 points) How many bases are there in longest sequence among them? d. (10 points) For the first five sequences, print out the accession number and the first 10 nucleotide bases e. (10 points) For the 250 sequence, how many As, Cs, Gs and Ts does it contain? f. (10 points) For the 100th sequence, calculate the GC content for each 500-nucleotide chunks of the sequence. Create a sliding window plot of GC content. g. (10 points) For the 123th sequence, what are the top three most frequent 4-bp words? h. (10 points) Export the data of the first 10 sequences to a FASTA file. i. (20 points) Write a function to find and return all over-represented DNA words that are 2 nucleotides long in a given sequence. Write a R code to do each of the following tasks : a. (10 points) search for DNA sequences from the organism "Chlamydia trachomatis in the ACNUC "genbank". b. (10 points) How many sequences retrieved? c. (10 points) How many bases are there in longest sequence among them? d. (10 points) For the first five sequences, print out the accession number and the first 10 nucleotide bases e. (10 points) For the 250 sequence, how many As, Cs, Gs and Ts does it contain? f. (10 points) For the 100th sequence, calculate the GC content for each 500-nucleotide chunks of the sequence. Create a sliding window plot of GC content. g. (10 points) For the 123th sequence, what are the top three most frequent 4-bp words? h. (10 points) Export the data of the first 10 sequences to a FASTA file. i. (20 points) Write a function to find and return all over-represented DNA words that are 2 nucleotides long in a given sequence

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Professional Microsoft SQL Server 2012 Administration

Authors: Adam Jorgensen, Steven Wort

1st Edition

1118106881, 9781118106884

More Books

Students also viewed these Databases questions

Question

Name three clusters of customers who might shop at Walmart.

Answered: 1 week ago