Question

Posted on Sep 26, 2024

In this exercise we work with next-generation sequencing (NGS) data. Unix is excellent at manipulating the huge FASTA files that are generated in NGS experiments. FASTA files contain sequence data in text format. Each sequence segment is preceded by a single-line description whose first character is a greater-than sign (>). The NGS data set we will be working with was published by Marra and DeWoody (2014), who investigated the immunogenetic repertoire of rodents. You will find the sequence file Marra2014_data.fasta in the directory CSB/unix/data. The file contains sequence segments (contigs) of variable size. The description of each contig provides its length, the number of reads that contributed to the contig, its isogroup (representing the collection of alternative splice products of a possible gene), and its isotig status.

1. Change directory to CSB/unix/sandbox.

2. What is the size of the file Marra2014_data.fasta?

3. Create a copy of Marra2014_data.fasta in the sandbox and name it my_file.fasta.

4. How many contigs are classified as isogroup00036?

5. Replace the original two-space delimiter with a comma.

6. How many unique isogroups are in the file?

7. Which contig has the highest number of reads (numreads)? How many reads does it have?
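The seven steps above can be sketched as a short shell session. This is a minimal sketch, not the posted solution: it runs on a tiny made-up stand-in file in a temporary directory, so it can be tried without the real data. The header layout (key=value fields such as length=, numreads=, gene=, status=, separated by two spaces) is an assumption based on the exercise text, and the contig names, values, and sequences are invented for illustration. If the real headers differ, the field numbers passed to cut need adjusting.

```shell
# Stop on the first failing command or unset variable.
set -eu

# Hypothetical stand-in for the CSB/unix tree, so the sketch is self-contained.
workdir=$(mktemp -d)
mkdir -p "$workdir/data" "$workdir/sandbox"

# Made-up records; the two-space, key=value header layout is an ASSUMPTION.
cat > "$workdir/data/Marra2014_data.fasta" <<'EOF'
>contig00001  length=527  numreads=2  gene=isogroup00001  status=it_thresh
ATCCTAGCTACTCTGGAGACTGAGG
>contig00002  length=551  numreads=8  gene=isogroup00036  status=it_thresh
GTTCATTGATACGGTGCAAGTTCAC
>contig00003  length=320  numreads=5  gene=isogroup00036  status=it_thresh
TTGACCGATAGGCATTAGCCATGCA
>contig00004  length=689  numreads=3  gene=isogroup00002  status=it_thresh
CCGATTAGGCATCGATCGGATCTAG
EOF

# 1. Change directory to the sandbox.
cd "$workdir/sandbox"

# 2. Show the file size in human-readable units.
ls -lh ../data/Marra2014_data.fasta

# 3. Copy the file into the sandbox under a new name.
cp ../data/Marra2014_data.fasta my_file.fasta

# 4. Count the lines (one header per contig) that mention isogroup00036.
n_iso36=$(grep -c 'isogroup00036' my_file.fasta)
echo "contigs in isogroup00036: $n_iso36"

# 5. Replace every run of exactly two spaces with a comma.
#    A temporary file is used because in-place sed flags differ across systems.
sed 's/ \{2\}/,/g' my_file.fasta > my_file.tmp
mv my_file.tmp my_file.fasta

# 6. Count unique isogroups: keep header lines, take field 4, deduplicate.
n_groups=$(grep '^>' my_file.fasta | cut -d ',' -f 4 | sort -u | wc -l)
echo "unique isogroups: $n_groups"

# 7. Find the contig with the most reads: keep name and numreads,
#    then sort numerically (descending) on the value after '='.
top=$(grep '^>' my_file.fasta | cut -d ',' -f 1,3 | sort -t '=' -k 2 -n -r | head -n 1)
echo "highest numreads: $top"
```

On the real data, the heredoc and the temporary directory would be dropped and the commands run from CSB/unix/sandbox. Counting header lines with grep works because each contig has exactly one description line; step 7 relies on numreads being the third comma-separated field after step 5.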
