Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

You have an Illumina read with the following base sequence: GGCGGAACTATTGATCTCCAACAGAATCCAGTTCCACAAACGTCTTCATCAACAACGGACAGCCCACAAGGTAACCTCGCCGCCTTCTCAACGGCCTCAGAATT CGGAAACAAAATCGGATTCTCAAAATCGTTCGTAGAACACGGATATGTCCTAGGCTTCATCAGAGCCCGTGGACAAGTAACCTACCAACAAGGCCTTCACAAAA TGTGGTCACGTCAAACAAGATGGGACTTCTTCTGGCCTAAATTCCAAGAACTCGGAGAACAAGCTATCCTTAACAAAGAAATCTACGCACAAG And corresponding quality sequence: CCCCCGGGGGGGGGFGGGGGGGGGGGGG;GGGGGGGGGGFGFGGGGGFGGGGGGGGGGEGG EGGG F@D0*C7:585G;A19)6GC <267*6)/@)4?** Select the statements below

You have an Illumina read with the following base sequence:

GGCGGAACTATTGATCTCCAACAGAATCCAGTTCCACAAACGTCTTCATCAACAACGGACAGCCCACAAGGTAACCTCGCCGCCTTCTCAACGGCCTCAGAATT

CGGAAACAAAATCGGATTCTCAAAATCGTTCGTAGAACACGGATATGTCCTAGGCTTCATCAGAGCCCGTGGACAAGTAACCTACCAACAAGGCCTTCACAAAA

TGTGGTCACGTCAAACAAGATGGGACTTCTTCTGGCCTAAATTCCAAGAACTCGGAGAACAAGCTATCCTTAACAAAGAAATCTACGCACAAG

And corresponding quality sequence:

CCCCCGGGGGGGGGFGGGGGGGGGGGGG;GGGGGGGGGGFGFGGGGGFGGGGGGGGGGEGGEGGG

GDF:GG7GGGGFGGGFGGGGGEGBGGGFFGGGGFFFDGGGGGGGCGFCFGEGGGGFEGGGGGGGCCCCFGG5G9FGDGF,FGFE=F,CCC*GG6GGC;?FGGEG

F5EFG:CGC8GCGGFGC,7C*G6D9;GFGF=FCCD839GF*CD85755@FG3**C)A)>F@D0*C7:585G;A19)6GC<267*6)/@)4?**

Select the statements below which apply (there may be more than 1 answer that is correct, select ALL that apply).Please be careful using Python: Python uses 0-based indexing, but our sequences are discussed in terms of 1-based indexing (first base at 1 not 0).

If we use quality trimming to trim the read to the first base that is less than Q25, we would trim at base 178, that is, our new sequence would be bases 1 to 177

The length of the sequence indicates that this was mostly likely HiSeq data (as opposed to MiSeq data).

If we filter reads for which the average (mean) quality is less than 30, this read would be filtered out (deleted from the data set)

The probability of an error in the last base is approximately 300 times greater than the probability of error for the first base (the approximately is not a trick, if this bothers you, you can use + 50)

Trimming off the last 50 bases adds 5 to the overall average (mean) quality (adding 5 means that prior to trimming if it was 25, after trimming it is now 30)

If we use quality trimming to trim the read to the first base that is less than Q20, we would trim at base 175, that is, our new sequence would be bases 1 to 174

If there are errors, it is more likely that they are substitutions than indels

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database Concepts

Authors: David M. Kroenke

1st Edition

0130086509, 978-0130086501

More Books

Students also viewed these Databases questions

Question

8. How are deferred loads usually structured?

Answered: 1 week ago

Question

d. What language(s) did they speak?

Answered: 1 week ago