Question

Posted on Mar 13, 2024

I'm writing code to scrape pages from Amazon and match certain information about books. I'm coding in HTML, CSS, and PHP. I got the first two preg_match regexes to work, but I can't get the third one. I have included all the files I am using; the main script is scraper.php. The information I'm looking for in the source of the Amazon pages is the author, title, and publisher. For now I have copied the source of two pages and saved it to files to test my regexes against, since I can't constantly scrape from Amazon.

Here is the main part of my PHP script that matches phrases with preg_match:

 (.+)<\/span> <\/h1>)/",$line,$result)) {
    echo "Title: $result[1] ";
}
if (preg_match("/(.+)<\/a> /",$line,$author)) {
    echo "Author: $author[1] ";
}
if (preg_match("/Publisher.+(.+)<\/span>/",$line,$goal)) {
    echo "Publisher: $goal[1] ";
}
} ?>

And here is the code snippet from the Amazon page that I'm trying to scrape:

Title: A Tutorial Introduction to Occam Programming

Author:

Publisher ‏ : ‎ McGraw-Hill (December 1, 1987)

Language ‏ : ‎
For the publisher regex, I tried to look for the span with class a-text-bold that has Publisher right after it, ignore any other characters until the beginning of the next span element, and then, from that opening span tag, capture all characters up to the next closing span tag and print the result. Unfortunately, no text was printed to my page, and I received no error messages either.
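One possible cause, assuming the script tests each $line separately and the page puts the Publisher label span and the value span on different lines: a per-line preg_match can never see both spans at once, and `.` does not cross newlines unless the `s` modifier is set. The sketch below matches against the whole source instead. The sample markup in `$html` and the helper name `extractPublisher` are hypothetical, since the original HTML was stripped from the question; the saved Amazon files may be laid out differently.

```php
<?php
// Hypothetical sample of the detail-bullet markup (an assumption, not the
// asker's actual file). In practice this would come from something like
// file_get_contents() on the saved page source.
$html = <<<'HTML'
<li><span class="a-list-item">
<span class="a-text-bold">Publisher
&rlm;
:
&lrm;
</span>
<span>McGraw-Hill (December 1, 1987)</span>
</span></li>
<li><span class="a-list-item">
<span class="a-text-bold">Language
&rlm;
:
&lrm;
</span>
<span>English</span>
</span></li>
HTML;

// Hypothetical helper: returns the text of the span that follows the
// bold "Publisher" label, or null when no such pair is found.
function extractPublisher(string $html): ?string
{
    // s  : let "." match newlines, so the label and value may sit on separate lines
    // i  : case-insensitive match on the label
    // .*? and (.*?) are lazy, so each stops at the nearest closing span tag
    $pattern = '/<span class="a-text-bold">\s*Publisher.*?<\/span>\s*<span>(.*?)<\/span>/si';

    if (preg_match($pattern, $html, $m) === 1) {
        // Decode entities such as &amp; and strip surrounding whitespace.
        return trim(html_entity_decode($m[1]));
    }
    return null;
}

// For the sample above this prints "Publisher: McGraw-Hill (December 1, 1987)".
echo "Publisher: " . (extractPublisher($html) ?? "not found") . "\n";
```

The lazy quantifiers matter here: with a greedy `.+` followed by `(.+)`, the first group swallows as much as it can and the capture can end up holding only the last character before a later closing span tag. Because preg_match returns 0 for "no match" rather than raising an error, a pattern that never matches fails silently, which is consistent with seeing neither output nor error messages.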
