Answered step by step
Verified Expert Solution
Question
1 Approved Answer
CSc 1 2 0 : Pokemon data analysisIntroductionThis problem involves some simple data analysis and aims to give you some more practice with combining Python
CSc : Pokemon data analysisIntroductionThis problem involves some simple data analysis and aims to give you some more practice with combining Python data structures in interesting ways: in this case, using twolevel dictionaries ie a dictionary of dictionaries The data, as it happens, is about Pokemon source: wwwkaggle.com You are to write a program to read in Pokemon data from a file and organize it according to Pokemon type we will only consider Type for this assignment then repeatedly read in queries from the user and print out solutions to those queries.RestrictionsYour code should follow the style guidelines for the class. You may not use Python material and concepts not yet covered in class. These concepts include recursion, classes, exceptions, type annotations, list comprehensions, the with statement, and importing libraries do not import the csv module If you don't know what any of these are, don't worry, because you can't use them accidentally.Input FormatThe input file, Pokemon.csv is in CSV format commaseparated values" This is a simple file format typically used for tabular data such as that for spreadsheets, and if you want you can open this file in a program like Excel or libreoffice to view the data in an easiertoread form.Any line in the input file that begins with the character #without quotes is a comment line that should be ignored for data analysis.The first line of the input file, which is a comment line, gives the meaning of the various data columns in this table, the number at the top of each entry gives its position in a row of commaseparated values, eg "Attack" is at position :NoNameType Type Total strengthHPAttackDefenseSpecial AttackSpecial DefenseSpeedGenerationLegendary?Expected BehaviorWrite a program, in a file pokemon.py that behaves as follows.Read in the name of a data file. Use the input function to prompt the user for the file name, but do not supply an argument to inputWe call this a silent prompt. This file will be a CSV file containing data about Pokemon in the format described above. It can be the full Pokemon.csv data file, but you can also specify other input files that contain more or less information eg a smaller file may be useful for testing or debuggingRead the data file specified above. Do not use the Python csv module. Organize the data into a data structure that collects together information about different Pokemon types for this assignment we will consider only the Type field for this, and ignore Type since this is not defined for all PokemonRepeatedly read and process queries from the user see Queries below until the user enters an empty line. Some examples are given here.QueriesYour program will read in queries from the user, and for each query, analyze the Pokemon data based on the query and print out the results see Output Format below The queries and the corresponding analyses are as follows:User queryProgram actionTotalCompute the Pokemon types that have highest average Total strength.HPCompute the Pokemon types that have highest average HPAttackCompute the Pokemon types that have highest average Attack.DefenseCompute the Pokemon types that have highest average Defense.SpecialAttackCompute the Pokemon types that have highest average Special Attack.SpecialDefenseCompute the Pokemon types that have highest average Special Defense.SpeedCompute the Pokemon types that have highest average Speed. empty lineTerminate query processinganything elseIgnore the queryNote that, in each case, there may be more than one type of Pokemon with the highest average value computed. You should print out information about each of them according to the output format given below.Matching the queries entered by the user's with the User query column shown above should be caseinsensitive. For example, the user inputs Attack, attack, ATTACK, and AtTaCk should all be processed the same way.Output FormatFor each Pokemon type identified by your analysis, print out the result as follows:print: formatpokemontype, maxaveragewhere pokemontype is the type of Pokemon, and maxaverage is the average value computed for that Pokemon type for that query eg average total, average HP average Attack, etc. which should be equal to the maximum value for that query across all types. If more than one Pokemon type has the same maximum average for a property, then print each one out, one per line, in alphabetical order of the Pokemon type.Programming RequirementsFollow the style guidelines for this class.Your code should not repeatedly and unnecessarily traverse all the data about all the Pokemon when processing queries. To this end, organize your code and data as follows.A Data organization.Use a twolevel dictionary ie a dictionary of dictionaries to implement your Pokemon database, as explained below:At the top level, information should be grouped by Pokemon type: ie all of the information about Pokemon belonging to a particular type should be grouped together. A data structure that will do this efficiently is the dictionary.For each Pokemon type, we have to store information about all of the different Pokemon that belong to that type. Again, this can be done efficiently using a dictionary that maps the Pokemon's name to its properties Total strength, Attack, Defense, etc.Additionally, for each Pokemon type you must precompute the average values for all of its properties see Code Organization below These average values must also be organized as a dictionary keyed by Pokemon type.B Code organization.Notice that the Pokemon properties you read in do not change during the computation. This means that the average value for any property for any given Pokemon type will remain the same as well. This, in turn, means that the maximum average values will also not change. Thus, these values can all be computed once and saved, with query processing simply looking up the saved values as needed. This approach is closely related to an speedup technique calledmemoization.Your code should be organized as follows:After reading in all the Pokemon data: for each Pokemon type, compute the average value for each of its properties across all of the Pokemon that belong to that type. Save this result into a dictionary indexed by Pokemon type.Next, process the average values obtained in the previous step to compute the maximum average value for each property. Optionally, at this point you can also compute which Pokemon types have the maximum average value for each property.Use these data to help process user queries until there are no queries to process.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started