Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Your company is looking for candidates for the new Data Scientist role and you will write code that helps the HR department find the right

Your company is looking for candidates for the new Data Scientist role and you will write code that helps the HR department find the right candidate.

You will create the following data sources from scratch:

  • 15 text files, each containing the resume for one candidate. As expected, each resume contains, among many other data, the skills of the candidate. You can create the text files from scratch or copy them from ones found in the web. Use fictitious names. Each person should have their own set of resumes. No two people can have same (nor even similar) resumes submitted.
  • One csv file called Candidates.csv that contains 3 header rows to be skipped, and has a list of 15 candidates, including the name of the candidate and the name of the text file containing his/her resume. This file does NOT contain a list of skills.
  • 1 flat file (you should determine the type of file and/or format) that contains the job description. Your job description should contain a list of skills required.
  • You should make sure your data has a few errors or omissions (missing data) that need to be corrected (later with Python)
  • Your task is to create a Python script that:
    • reads the job description, and gathers the required skills
    • loops through the Candidates.csv file to identify candidates fit for the role (exact fit for skill matching, you are NOT required to use regular expressions, but you can, if you know how to use them)
    • creates a text file called ToInterview.csv that contains a list of the 5 most fit candidates in order of priority (the most fit first) with their most salient characteristics (you have to determine what they are). The candidates are prioritized based on how much their skills match the job description. Hint: this is a good case where the use of Python sets will help you.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Current Trends In Database Technology Edbt 2006 Edbt 2006 Workshops Phd Datax Iidb Iiha Icsnw Qlqp Pim Parma And Reactivity On The Web Munich Germany March 2006 Revised Selected Papers Lncs 4254

Authors: Torsten Grust ,Hagen Hopfner ,Arantza Illarramendi ,Stefan Jablonski ,Marco Mesiti ,Sascha Muller ,Paula-Lavinia Patranjan ,Kai-Uwe Sattler ,Myra Spiliopoulou ,Jef Wijsen

2006th Edition

3540467882, 978-3540467885

More Books

Students also viewed these Databases questions

Question

What are the main objectives of IFRS 13

Answered: 1 week ago

Question

What is a niche market?

Answered: 1 week ago

Question

=+employee to take on the international assignment?

Answered: 1 week ago

Question

=+differences in home- and host-country costs of living?

Answered: 1 week ago