Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

python Q2: Protein Center of Mass Calculation - 35 pts The Protein Data Bank archive serves as a single repository of information about the 3D

python
image text in transcribed
image text in transcribed
Q2: Protein Center of Mass Calculation - 35 pts The Protein Data Bank archive serves as a single repository of information about the 3D structures of proteins, nucleic acids and complex assemblies. Each PDB file contains various kinds of information. The type of information is indicated in the first six characters of each line, such as HEADER, SOURCE, COMPND, AUTHOR, REMARKS, etc. For this homework, you just need to concentrate on lines beginning with the word "ATOM". An example is given below, where the columns represent the atom record atom number, atom identifier, amino acid type, chain identifier, residue sequence number, x-coordinate (in A), y-coordinate (in A), z coordinate (in A), occupancy, B-factor and element symbol respectively. The symbol A denotes Angstrom (1A= 10m) HEADER CHROMOSOMAL PROTEIN 02-JAN-87 TUBO TITLE STRUCTURE OF UBIQUITIN REFINED AT 1.8 ANGSTROMS RESOLUTION COMPND MOL ID: 1; COMPND 2 MOLECULE: UBIQUITIN; COMPNO 3 CHAIN: A; COMPND 4 ENGINEERED: YES SOURCE MOL ID: 1; SOURCE 2 ORGANISM SCIENTIFIC: HOMO SAPIENS, SOURCE 3 ORGANISM COMMON: HUMAN; SOURCE 4 ORGANISM TAXID: 9606 KEYWIDS CHROMOSOMAL PROTEIN EXPDTA X-RAY DIFFRACTION AUTHOR S.VIJAY KUMAR, C.E.BUGG,W.J.COOK REVDAT 5 09-MAR-11 1UBQ REMARK REVDAT 24-FEB-29 IUBQ REVDAT OLAPR.03 BURO JRML 1 4 3 VERSN ORIGX3 SCALEI SCALEZ SCALE3 ATOM ATOM ATOM ATOM ATOM ATOM ATOM ATON ATOM ATOM ATOM ATOM ATOM ATOM ATOM ATOM ATOM ATOM 0.000000 0.000000 1.680000 0.019670 0.000000 0.000000 0.000000 0.023381 0.000000 0.000000 0.000000 0.034542 1 N MET A 1 27.340 24.438 2 1 26.266 25.413 3 C MET A 1 26.913 26.639 40 MET A 1 27.886 26.463 5 CB MET A 1 25.112 24.888 6 CG MET A 1 25.353 24.860 7 SD MET A 1 23.930 23.959 8 CE MET A 24.44723.984 9 N GLNA 2 26.335 27.770 10 CA GLNA 2 26.850 29.021 11 C GLNA 2 26.100 29.253 12 0 GLNA 2 24.865 29.024 13 CB GLNA 2 26.733 30.148 14 CG GUNA 2 26.882 31.546 15 CD GLNA 2 26.786 32.562 16 OE1 GUNA 27.783 33.160 17 NE GLNA 25,562 32.733 18 N ILE A 3 26.849 29.656 19 CA ILEA 26.235 30.058 X Y 0.00080 0.00000 0.00000 0.00000 2.614 1.00 9.67 2.842 1.00 10.38 3.531 1.00 9.62 4.263 1.00 9.62 3.649 1.00 13.77 5.134 1.00 16.29 5.964 1.00 17.17 7.620 1.00 16.11 3.258 1.00 9.27 3.898 1.80 9.07 5.202 1.00 8.72 5.338 1.00 8.22 2.985 1.00 14.46 3.409 1.00 17.01 2.270 1.00 20.10 1.870 1.00 21.89 1.866 1.00 19.49 6.217 1.60 5.87 7.497 1.005.07 0 z ELEMENT Implement the following functions: A. pdb_parser(pdb_filename): Receives one argument, pdb_filename. Opens the PDB file, reads all lines starting with ATOM and stores the X, Y, Z coordinates and the ELEMENT type of the atom in a list, atoms e.g. atoms=([27.340, 24.430, 2.614, 'N'],[26.266, 25.413, 2.842, 'C'),...) atoms list is of length N, number of atoms that make up the protein in the PDB file. The function should retum atoms B. center_of_mass(atoms): This function calculates the center of mass of the protein, as follows. Nimiri 1 Tom is coordinates of the center of mass of the protein, r, is a list containing the IX. y. 2] coordinates of the i. atom), is the mass of the latom and N is the total number of atoms. m, should be obtained from the dictionary mass={"C":12.01, 'O': 16.00, "HT:1.008, ..} by using the element type of the i, atom as key, mass dictionary is already provided in the pdb.py template. C. shift(atoms, vec): This function translates the protein by vec and returns the updated coordinates of the protein, atomsnew. vec is a list of size 3. Eg. vec=[a, b, c), atoms-[[x, y, z, 'C]]-> atomsnew=[[x+a, y+b, z+c, C']] After implementing these functions, demonstrate them as follows: 1) Read the lubq.pdb file and parse the information ii) Calculate the center of mass. MI) Shift the coordinates of the protein such that the molecule's center of mass is at the origin The sample output from the program should be: Center of mass of the protein is 30.317, 28.775, 15.353 New center of mass of the protein is 0.000, 0.000, 0.000

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Databases Illuminated

Authors: Catherine M. Ricardo, Susan D. Urban, Karen C. Davis

4th Edition

1284231585, 978-1284231588

More Books

Students also viewed these Databases questions

Question

Why did the battle over Net neutrality intensify in 2014?

Answered: 1 week ago