[Solved] # Constants MIN _ CODONS = 5 MIN _ MASS _

Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Jul 29, 2024

# Constants MIN _ CODONS = 5 MIN _ MASS _ PERCENTAGE _ CG = 3 0 NUM _ NUCLEOTIDES = 4 NUCLEOTIDES _ PER

# Constants

MIN $_$ CODONS $= 5$

MIN $_$ MASS $_$ PERCENTAGE $_$ CG $= 30$

NUM $_$ NUCLEOTIDES $= 4$

NUCLEOTIDES $_$ PER $_$ CODON $= 3$

# Function to convert a nucleotide into an index

def nucleotide $_$ to $_$ index $($ nucleotide $)$ :

nucleotide $=$ nucleotide.upper $()$

nucleotide $_$ index $= {'$ A $'$ : $0,'$ C $'$ : $1,'$ G $'$ : $2,'$ T $'$ : $3}$

return nucleotide $_$ index.get $($ nucleotide $, - 1)$

# Function to calculate nucleotide counts

def calculate $_$ nucleotide $_$ counts $($ sequence $)$ :

counts $= [0] $ NUM $_$ NUCLEOTIDES

for nucleotide in sequence:

index $=$ nucleotide $_$ to $_$ index $($ nucleotide $)$

if index $! = - 1$ :

counts $[$ index $] + = 1$

return counts

# Function to calculate mass percentages

def calculate $_$ mass $_$ percentages $($ sequence $)$ :

mass $_$ values $= {'$ A $'$ : $135.128,'$ C $'$ : $111.103,'$ G $'$ : $151.128,'$ T $'$ : $125.107,' -'$ : $100.000}$

# Filter out dashes $(' -')$ before calculating total mass

total $_$ mass $=$ sum $($ mass $_$ values $[$ nucleotide $]$ for nucleotide in sequence if nucleotide in mass $_$ values $)$

nucleotide $_$ counts $=$ calculate $_$ nucleotide $_$ counts $($ sequence $)$

# Calculate mass percentages based on the total mass of the sequence

mass $_$ percentages $= [$ round $(($ mass $_$ values $[$ nucleotide $] $ count $/$ total $_$ mass $) * 100, 1)$ for nucleotide, count in zip $("$ ACGT $",$ nucleotide $_$ counts $)]$

return mass $_$ percentages, total $_$ mass

# Function to extract codons from a sequence

def extract $_$ codons $($ sequence $)$ :

valid $_$ sequence $= [$ nucleotide for nucleotide in sequence if nucleotide.isupper $()]$

codons $= ['' .$ join $($ valid $_$ sequence $[$ i:i $+$ NUCLEOTIDES $_$ PER $_$ CODON $])$ for i in range $(0,$ len $($ valid $_$ sequence $),$ NUCLEOTIDES $_$ PER $_$ CODON $)]$

return codons

# Function to check if a sequence is a protein

def is $_$ protein $($ sequence $)$ :

start $_$ codon $=$ "ATG"

stop $_$ codons $= ["$ TAA $",$ "TAG", "TGA" $]$

# Check start codon

if not sequence.startswith $($ start $_$ codon $)$ :

return False

# Check stop codon

if not any $($ sequence $.$ endswith $($ stop $)$ for stop in stop $_$ codons $)$ :

return False

# Check minimum codons

if len $($ extract $_$ codons $($ sequence $)) <$ MIN $_$ CODONS:

return False

# Check minimum mass percentage of C and G

cg $_$ mass $_$ percentage $=$ sum $($ calculate $_$ mass $_$ percentages $($ sequence $) [1$ : $3])$

if cg $_$ mass $_$ percentage $<$ MIN $_$ MASS $_$ PERCENTAGE $_$ CG:

return False

return True

# Function to process a nucleotide sequence

def process $_$ sequence $($ region $_$ name, nucleotides $)$ :

nucleotide $_$ counts $=$ calculate $_$ nucleotide $_$ counts $($ nucleotides $)$

mass $_$ percentages, total $_$ mass $=$ calculate $_$ mass $_$ percentages $($ nucleotides $)$

codons $_$ list $=$ extract $_$ codons $($ nucleotides $)$

is $_$ protein $_$ result $=$ is $_$ protein $($ nucleotides $)$

# Print or write to the output file

print $($ f $"$ Region Name: ${$ region $_$ name $} ")$

print $($ f $"$ Nucleotides: ${$ nucleotides $} ")$

print $($ f $"$ Nuc $.$ Counts: ${$ nucleotide $_$ counts $} ")$

print $($ f $"$ Total Mass $%$ : ${$ mass $_$ percentages $}$ of ${$ total $_$ mass: $. 1$ f $} ")$

print $($ f $"$ Codons List: ${$ codons $_$ list $} ")$

print $($ f $"$ Is Protein?: ${'$ YES $'$ if is $_$ protein $_$ result else $'$ NO $'}$

$")$

# Main function

def main $()$ :

print $("$ This program reports information about DNA nucleotide sequences that may encode proteins." $)$

# Input file names

input $_$ file $_$ name $=$ input $("$ Input file name? $")$

output $_$ file $_$ name $=$ input $("$ Output file name? $")$

# Process input file

with open $($ input $_$ file $_$ name, $'$ r $')$ as input $_$ file:

# Assume each pair of lines represents a region name and nucleotide sequence

lines $=$ input $_$ file.readlines $()$

for i in range $(0,$ len $($ lines $), 2)$ :

region $_$ name $=$ lines $[$ i $] .$ strip $()$

nucleotides $=$ lines $[$ i $+ 1] .$ strip $() .$ upper $()$

process $_$ sequence $($ region $_$ name, nucleotides $)$

if $$ name $= = "$ main $"$ :

main $()$ what is the pseudocode and flowchart for this python code?

Step by Step Solution

There are 3 Steps involved in it

Step: 1

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

Step: 3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Microsoft SQL Server 2012 Unleashed

Authors: Ray Rankins, Paul Bertucci

1st Edition

ISBN: 0133408507, 9780133408508

More Books

Students also viewed these Databases questions

Question

★★★★★

Consider a portfolio position of $10 million on which returns are assumed to be normally distributed with a current standard deviation of 20 percent per annum. The average VAR on the previous 60 days...

Answered: 1 week ago

Question

★★★★★

HOW IS THE BREAK-EVEN POINT DETERMINED USING THE FORMULA APPROACH, GRAPH APPROACH, AND INCOME STATEMENT APPROACH? LO.1

Answered: 1 week ago

Question

★★★★★

What information is needed before the sample is selected in order to have a wellplanned and reliable hypothesis test?

Answered: 1 week ago

Question

★★★★★

DMA, Inc., processes corn into corn starch and corn syrup. The companys productivity and cost standards follow: From every bushel of corn processed, 12 pounds of starch and 6 pounds of syrup should...

Answered: 1 week ago

Question

★★★★★

10 seconds Moving to another question will save this response. Question 3 Which of the following statements is True about the capital allocation process? OA Promotes productivity. OB Determining how...

Answered: 1 week ago

Question

★★★★★

Give an example of a change an organization may make when responding to each of Thornburn and Langdale's Drivers of Change.

Answered: 1 week ago

Question

★★★★★

1. A source of yellow light ( = 570 nm) produces interference through two narrow slits separated by a distance of 0.01cm. A screen is placed 3m away. a. How far from the central max is the fifth...

Answered: 1 week ago

Question

★★★★★

You are revising your company's talent acquisition strategy to make it more competitive and appealing to potential candidates. Part of your strategy involves clearly presenting the compensation...

Answered: 1 week ago

Question

★★★★★

Inventory Land Book Value Fair Value $ 630,000 $ 600,000 750,000 990,000 1,700,000 2,000,000 Buildings Customer relationships Accounts payable 0 (80,000) Common stock (2,000,000) Additional paid-in...

Answered: 1 week ago

Question

★★★★★

The standard Treasury Bond futures contract has a face value of $100,000, at least 15 years to maturity and a coupon of 6%, payable semi-annually. The quoted price of the futures contract is based on...

Answered: 1 week ago

Question

★★★★★

15. What option allows the scheduler to go in and look at the information "behind" the scheduled appointments? For example, date "appointment made", wait times, service connection, etc).

Answered: 1 week ago

Question

★★★★★

Th ey have to wait a long time for an appointment?

Answered: 1 week ago

Question

★★★★★

5.3 Using the uniform probability density function shown in Figure 5.7, find the probability that the random variable X is less than 1.4.

Answered: 1 week ago

Question

★★★★★

What specifi c friendliness behaviors can help broaden your customers tolerance zones?

Answered: 1 week ago

Previous Question Next Question