Question: I have written the code for extracting the data but I am unable to get the data into excel sheet do I need to add

I have written the code for extracting the data but I am unable to get the data into excel sheet do I need to add any forloop for printing the data please check the code below

import scrapy

import pandas as pd

import time

starttime=time.time()

class ArtisanDataSpider(scrapy.Spider):

name = "artisan_data"

start_urls = ['http://www.handicrafts.nic.in/ArtisanData.aspx?MID=SZmOd%2fCrxTo9CHD2XKF+pA%3d%3d']

def parse(self, response):

# Select the form and fill in the form data

form = response.xpath('//form[@id="form1"]')

form.xpath('.//select[@name="ddlState"]/option[text()="Uttar Pradesh"]/@value').extract_first()

form.xpath('.//select[@name="ddlDistrict"]/option[text()="Sant Ravidas Nagar"]/@value').extract_first()

form.xpath('.//select[@name="ddlDistrict"]/option[text()="Agra"]/@value').extract_first()

form.xpath('.//select[@name="ddlDistrict"]/option[text()="Varanasi"]/@value').extract_first()

yield scrapy.FormRequest.from_response(response, formdata={'ddlState': 'Uttar Pradesh', 'ddlDistrict': ['Sant Ravidas Nagar', 'Agra', 'Varanasi'],'btnSubmit': 'Submit'},

callback=self.parse_result)

def parse_result(self, response):

rows = response.xpath('//table[@id="gvArtisanData"]/tr')

for row in rows:

PEHCHAN_CARD_NO = row.xpath('./td[1]/text()').extract_first()

ARTISIAN_NAME = row.xpath('./td[2]/text()').extract_first()

Father_spouse = row.xpath('./td[3]/text()').extract_first()

Category = row.xpath('./td[4]/text()').extract_first()

AADHARNO = row.xpath('./td[5]/text()').extract_first()

NAME_OF_CRAFT = row.xpath('./td[6]/text()').extract_first()

MOBILENO= row.xpath('./td[7]/text()').extract_first()

VILLAGE= row.xpath('./td[8]/text()').extract_first()

TOWN= row.xpath('./td[9]/text()').extract_first()

CITY = row.xpath('./td[10]/text()').extract_first()

DISTRICT = row.xpath('./td[11]/text()').extract_first()

STATE = row.xpath('./td[12]/text()').extract_first()

yield {'PEHCHAN_CARD_NO':PEHCHAN_CARD_NO, 'ARTISIAN_NAME':ARTISIAN_NAME, 'Father_spouse':Father_spouse, 'Category': Category, 'AADHAR_NO': AADHAR_NO, 'NAME_OF_CRAFT':NAME_OF_CRAFT,'MOBILENO':MOBILENO,'VILLAGE':VILLAGE,'TOWN':TOWN,'CITY':CITY, 'DISTRICT':DISTRICT, 'STATE':STATE}

next_page = response.xpath('//a[text()="Next"]/@href').extract_first()

if next_page:

yield scrapy.Request(response.urljoin(next_page), callback=self.parse_result)

endtime=time.time()

result=endtime-starttime

print("the time taken is:", result)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Below here I have written the code for extracting the data using scrapy for the given url for state=uttarpradesh and districts=[SANT RAVIDAS NAGAR, AGRA, VARANASI] But the data present in all pages...

Here Below I have written the code for extracting the data from the given url but the data is not saving in the excel sheet so please let me know why the data is not saving in the excel sheet below...

below I have written the code for extracting the data from url but the data is not getting in excel sheet and also the time calculation is getting 0 secs import scrapy import pandas as pd import time...

I have written the code for extracting the data but the data is not scrapping into the csv file or excel sheet import scrapy import pandas as pd import time import csv starttime=time.time() class...

please help me answer this question will rate your answer right away! Requirements: A summary of the key issues and your thoughts about important aspects of an analysis or recommendation. A...

(d) We've said that software models the real world. One aspect of the real world that we sometimes want to model is randomness. If we're writing software to play a card game, we don't want the same...

Using the PDF File, the Excel sheet needs to be completed. Master Budget Modeling Project for MyArmor, Inc. - 2017 Planning Year Project is Activated Save your project by following these directions -...

Welcome! Please read this page (in particular) very carefully. Instructions You need to understand how to send your assignments (deliverables) Instructor: to your instructor. The tabs (bottom of each...

Week 5: Quantitative Analysis Assignment Codebook: Quantitative Patient Safety Data Column Variable Name A ID B C AGE GENDER D UNIT Description Unique patient identification number Age of patient in...

Write c++ programming!! districts.txt Barryland,1,5,7 Rabbitville,1,55,12,2,654,0,3,79,711 Jelly Bean Forest,1,11,49,2,337,99,3,764,64091,4,79666,22278,5,116364,56350 Earth,1,0,1,2,45,67 New...

Calculate the purchase price of each of the $1000 face value bonds Issue date Maturity date Purchase date May 15, 2000 May 15, 2020 Nov 15, 2006 Coupon rate (%) Market rate (%) 6.0 4.0

The number of personnel in an information systems department may limit the extent to which segregation of duties is feasible. What is the minimum amount of segregation of duties that will permit...

When doing charge sale invoice what should go by prepared by

The first scenario will be a Verbal Judo scenario in which your scenario follows the standard Verbal Judo interaction: You need to ask somebody to modify their behavior either to do something or to...

Discuss consumer-driven health plans.

Evaluate the importance of a health navigator to healthcare consumers.

Examine the role of allied health professionals in the healthcare industry.