Question
MySQL a) Download the clinvar data set in VCF format from the link below b) Clean and do some normalization of this data (get to
MySQL
a) Download the clinvar data set in VCF format from the link below
b) Clean and do some normalization of this data (get to at least 1NF) and put this into a new schema called clinvar inside a table called variants. Infer from reading the VCF specification and the downloaded clinvar data what attributes are needed and how you can determine each row and each column. Pay close attention to: CHROM, POS, ID, REF, ALT and. Because INFO is ; separated, you must expand it out to several columns of which CLNVC,CLNSIG, RS and GENEINFO are needed. You can toss the rest if you wish and may use any programming language or processing tool to complete this section. Take screenshots of the final solution (1NF) and some steps on the way
Download link: ftp://ftp.ncbi.nlm.nih.gov/pub/clinvar/vcf_GRCh37/clinvar_20180401.vcf.gz
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started