Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

For this project sheet you need a collection of CSV _files available in Project.zip. In this project you will write a suite of functions and

For this project sheet you need a collection of CSV _files available in Project.zip. In this project you will write a suite of functions and classes for analysing air quality data made available by the European

Environment Agency. For the pollutants we will look at, standards can be defined in terms of the following aggregations:

Number of hours per year for which a threshold was exceeded,

Number of days per year for which the daily average exceeded a threshold, and/or

Average concentration for the year.

Standards can combine more the one such criterion.

For example, the current air quality standards are:

Fine particulates (PM2.5) Yearly average of at most 25ug/m3.

Particulates (PM10) Daily average exceeding 50ug/m3 observed on at most 35 days a year, and

yearly average of at most 40ug/m3.

Sulphur dioxide (SO2) Hourly concentration exceeding 350mg/m3 for at most 24 hours per year, and

average daily concentration exceeding 125ug/m3 on at most 3 days per year.

Nitrogen dioxide (NO2) Hourly concentration exceeding 200ug/m3 for at most 18 hours per year, and average yearly concentration of at most 40ug/m3.

Write a suite of functions and classes, which, once the functions and/or classes have been implemented, let you perform thefollowing steps in one line of code.

Reading in the necessary data

Defning an air quality standard for a pollutant in terms of

a limit on the yearly average,

a limit on the number of daily exceedences of a given threshold (in terms of the daily average), and/or

a limit on the number of hourly exceedences of a given threshold (in terms of the hourly data).

Your code should not just implement the existing standards, but should also allow for de_ning custom standards by using the criteria set out above.

Applying the air quality standard to a measuring station for a given year and pollutant which should result in a binary outcome (standards met or not).

The data you are provided with are for all measuring stations in the Czech Republic for 2013 to 2016 for the following pollutants: PM2.5, PM10, SO2 and NO2. Not every pollutant is measured at every measuring station and there are stations with missing values. Calculate the averages and count exceedences based the data available.

The data is provided as CSV filles, with each file corresponding to a combination of measuring station and pollutant. The files are named _.csv, with PM2.5 appearing as PM2_5. Each row in the CSV files corresponds to an hourly measurement. If no measurement was taken for a given hour, the row is simply omitted.

The file Stations.csv contains meta-data on the stations (location etc.)

NB: i am unable to upload the files here. if you can assist please message me and i will send you the files directly.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Machine Learning And Knowledge Discovery In Databases European Conference Ecml Pkdd 2022 Grenoble France September 19 23 2022 Proceedings Part 4 Lnai 13716

Authors: Massih-Reza Amini ,Stephane Canu ,Asja Fischer ,Tias Guns ,Petra Kralj Novak ,Grigorios Tsoumakas

1st Edition

3031264118, 978-3031264115

More Books

Students also viewed these Databases questions