Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

b) Given the following Fasta file format (A sample fasta file is shown in Appendix 1) >gene id, gene name, organism, number of nucleotide base

image text in transcribedimage text in transcribed

b) Given the following Fasta file format (A sample fasta file is shown in Appendix 1) >gene id, gene name, organism, number of nucleotide base DNA sequences: [each line is 60 nucleotide characters in length] Write python code that: i Extract and print the gene id and gene name and organism for a Fasta file called PalGene fasta. (5 marks) ii. Determine and print all 6 reading frame sequences for a DNA sequence in a DNA fasta file with the structure of PalGene fasta (9 marks) iii. Determine and print the amino acid sequence of a DNA fasta, file (assume there is a condon conversion hash table called %codon_conversion) (6 marks) iv. For a given amino acid sequence: display the amino acid and nucleotide start/stop position of its Open Reading Frame (OFR) and the ORF amino acid sequence. (12 marks) Sample Fasta file (only partial DNA sequence is shown) >gi|171361, E. Coli, gamma-lyase(CYS3) gene, 5124bp GCAGCGCACGACAGCTGTGCTATCCCGGCGAGCCCGTGGCAGAGGACCTCGCTTGCGAAA. GCTACAGAGCCAACCCGGTGGACAAACTCGAAGTCATTGTGGACCGAATGAGGCTCAATAA b) Given the following Fasta file format (A sample fasta file is shown in Appendix 1) >gene id, gene name, organism, number of nucleotide base DNA sequences: [each line is 60 nucleotide characters in length] Write python code that: i Extract and print the gene id and gene name and organism for a Fasta file called PalGene fasta. (5 marks) ii. Determine and print all 6 reading frame sequences for a DNA sequence in a DNA fasta file with the structure of PalGene fasta (9 marks) iii. Determine and print the amino acid sequence of a DNA fasta, file (assume there is a condon conversion hash table called %codon_conversion) (6 marks) iv. For a given amino acid sequence: display the amino acid and nucleotide start/stop position of its Open Reading Frame (OFR) and the ORF amino acid sequence. (12 marks) Sample Fasta file (only partial DNA sequence is shown) >gi|171361, E. Coli, gamma-lyase(CYS3) gene, 5124bp GCAGCGCACGACAGCTGTGCTATCCCGGCGAGCCCGTGGCAGAGGACCTCGCTTGCGAAA. GCTACAGAGCCAACCCGGTGGACAAACTCGAAGTCATTGTGGACCGAATGAGGCTCAATAA

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Graph Databases In Action

Authors: Dave Bechberger, Josh Perryman

1st Edition

1617296376, 978-1617296376

More Books

Students also viewed these Databases questions