4. Longitude For a further description of this dataset: http : / /crawdad . org/roma/taxi /2 014 O 7 17 X Purpose of this task: Perform a general analysis of this dataset. Learn to work with large datasets. Obtain general information of the behaviour of some taxi drivers. Analyse and interpret results. Questions: ('7 marks) By using the data in taxi . csv, perform the following tasks: a) Plot the location points (2D plot using all of the latitude, longitude value pairs in the dataset). Clearly indicate points that are invalid, outliers or noise points. The plot should be informative! Clearly explain the rationale that you used when identifying invalid points, noise points, and outliers. Remove invalid points, outliers and noise points before answering the subsequent questions. b) Compute the minimum, maximum, and mean location values. c) Obtain the most active, least active, and averagely active taxi drivers (i.e., most time driven, least time driven, and mean time driven). Explain the rationale of your approach and explain your results. d) Look up the le Student_Taxi_Mapping . txt. Use the taxi ID that is listed next to your Student Number to answer the following questions: i. Plot the location points for taxi=ID ii. Compare the mean, min, and max location values of taxi=ID with the global mean, min, and max location values. iii. Compare the total time driven by taxi=ID with the global mean, min, and max driven time values. iv. Compute the distance travelled by taxi=ID. To compute the distance between two points on the surface of the earth use the following method: dlon = longitudeZ Jongitua'el dlat = latitudez -latitude1 a = (sin(dlat/2))"2 + cos(lat1) * cos(.-'at2) * (sin(dlon/2))"2 c = 2 * atan2( sqrt(a), sqrt(l-a) ) distance = R * c (where R is the radius of the Earth) Assume that R=6,371,000 meters. Note that angles need to be in radians to pass to trig functions! More information on this calculation can be found at https:1/www.movabletype.co.uklscripts/1atlong.htm] With each of your answers: Explain what knowledge can be derived from your