Question

Your submission must be your original work. No more than a combined total of 30% of a submission can be directly quoted or closely paraphrased from sources, even if cited correctly. Use the report provided when submitting your task as a guide.

You must use the rubric to direct the creation of your submission because it provides detailed criteria that will be used to evaluate your work. Each requirement below may be evaluated by more than one rubric aspect. The rubric aspect titles may contain hyperlinks to relevant portions of the course.

Submit one zipped folder that includes the code, input, and output files from the task. Place the responses to the task prompts in one PDF file.

Note: This assessment requires you to submit pictures, graphics, and/or diagrams. Each file must be an attachment no larger than 30 MB in size. Diagrams must be original and may be hand-drawn or drawn using a graphics program. Do not use CAD programs because attachments will be too large.

Develop a web link scraper program in Python that extracts all of the unique web links pointing to other web pages from the HTML code of the "Current Estimates" page, both within the "US Census Bureau" website (see web link below) and outside that domain, and that writes them to a comma-separated values (CSV) file as absolute uniform resource identifiers (URIs).
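As a rough illustration of the extraction and absolute-URI steps this task describes, the sketch below uses only the Python standard library. It is one possible approach, not the expected solution: `LinkCollector`, `extract_links`, the sample HTML, and the `www.example.gov` base address are all hypothetical names chosen for the example, and a real run would feed in the HTML downloaded from the "Current Estimates" page.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Hypothetical helper: collects the raw href of every <a> tag."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the tag.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def extract_links(html, base_url):
    """Return an absolute URI for every anchor in html, in document order."""
    parser = LinkCollector()
    parser.feed(html)
    # urljoin resolves relative references against base_url and
    # leaves links that are already absolute unchanged.
    return [urljoin(base_url, href.strip()) for href in parser.hrefs]


# Assumed sample input; the base URL is a placeholder, not the real page.
sample_html = """
<a href="/data/tables.html">Tables</a>
<a href="about.html">About</a>
<a href="https://www.commerce.gov/">Commerce</a>
"""
links = extract_links(sample_html, "https://www.example.gov/programs/index.html")
for link in links:
    print(link)
```

Resolving every `href` against the page's own address is what turns root-relative and document-relative links into absolute URIs, while links that already carry a scheme and host pass through untouched.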

A. Explain how the Python program extracts the web links from the HTML code of the "Current Estimates" page, found in the web links section.

B. Explain the criteria you used to determine if a link is a locator to another HTML page. Identify the code segment that executes this action as part of your explanation.

C. Explain how the program ensures that relative links are saved as absolute URIs in the output file. Identify the code segment that executes this action as part of your explanation.

D. Explain how the program ensures that there are no duplicated links in the output file. Identify the code that executes this action as part of your explanation.

Note: Please consider web links that point to the same web page as identical (e.g., www.commerce.gov and www.commerce.gov/).

E. Provide the Python code you wrote to extract all the unique web links from the HTML code of the "Current Estimates" page (in the web links section) that point to other HTML pages.

F. Provide the HTML code of the "Current Estimates" web page as scraped at the time when the scraper was run and the CSV file was generated.

G. Provide the CSV file that your script created.

H. Run your script and provide a screenshot of the successfully executed results.

I. Acknowledge sources, using in-text citations and references, for content that is quoted, paraphrased, or summarized.

J. Demonstrate professional communication in the content and presentation of your submission.
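Prompts B and D, together with the note about trailing slashes, can be made concrete with a small standard-library sketch. Everything here is an assumption made for illustration: the criterion in `is_html_page` (an http or https scheme plus a path with no extension or a `.html`/`.htm` extension), the normalization rules in `normalize`, the helper names, and the sample links. The inputs are assumed to be absolute URIs already.

```python
import csv
import io
import posixpath
from urllib.parse import urlsplit, urlunsplit

# Assumed criterion: paths with no extension or an HTML extension are pages.
HTML_EXTENSIONS = {"", ".html", ".htm"}


def is_html_page(uri):
    """Return True if uri looks like a link to another HTML page."""
    parts = urlsplit(uri)
    if parts.scheme not in ("http", "https"):
        return False  # skips mailto:, javascript:, tel:, and similar
    extension = posixpath.splitext(parts.path)[1].lower()
    return extension in HTML_EXTENSIONS


def normalize(uri):
    """Lowercase scheme and host, drop the fragment and any trailing slash."""
    parts = urlsplit(uri)
    path = parts.path.rstrip("/")
    return urlunsplit(
        (parts.scheme.lower(), parts.netloc.lower(), path, parts.query, "")
    )


def unique_html_links(uris):
    """Keep the first occurrence of each normalized HTML link, in order."""
    seen = set()
    result = []
    for uri in uris:
        if not is_html_page(uri):
            continue
        key = normalize(uri)
        if key not in seen:
            seen.add(key)
            result.append(key)
    return result


def write_csv(links, fileobj):
    """Write one link per row under a single header column."""
    writer = csv.writer(fileobj)
    writer.writerow(["link"])
    writer.writerows([link] for link in links)


# Assumed sample data; example.gov is a placeholder domain.
candidates = [
    "https://www.commerce.gov",
    "https://www.commerce.gov/",
    "https://www.example.gov/data/tables.html#top",
    "https://www.example.gov/report.pdf",
    "mailto:info@example.gov",
]
links = unique_html_links(candidates)
buffer = io.StringIO()  # stands in for an open CSV file
write_csv(links, buffer)
print(buffer.getvalue())
```

Storing the normalized form in a set is what makes the two `www.commerce.gov` variants collapse into a single row. An extension check is only a heuristic; a stricter alternative would inspect each link's `Content-Type` response header, at the cost of one request per link.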
