Question
In the IMDb database, you are asked to analyze the first names of the actors and how they correlate to gender.
As a first step, limit your analysis to first names that do not include a dot "." or parentheses (i.e., ignore "A.", "J.B.", "Alfred (I)", etc.). Also consider only first names that appear at least 50 times in the actors table, counted across both genders (e.g., a name that appears for 35 males and 20 females qualifies as eligible). Generate a temporary table with the results, called "eligible_names". (You will use this table in later questions.)
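A minimal sketch of this first step, run against an in-memory SQLite database so that it is self-contained. The schema actors(first_name, gender) with gender values 'M' and 'F', the total_freq column, and the toy rows are assumptions made for illustration; the question does not spell out the real table layout. The CREATE TEMPORARY TABLE statement itself uses only standard SQL and should carry over to other engines with little or no change.

```python
import sqlite3

# Hypothetical schema: actors(first_name, gender), with gender 'M' or 'F'.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE actors (first_name TEXT, gender TEXT)")

# Toy rows, invented for illustration only.
rows = (
    [("Pat", "M")] * 35 + [("Pat", "F")] * 20   # 55 in total: eligible
    + [("John", "M")] * 60                      # eligible
    + [("A.", "M")] * 70                        # contains a dot: excluded
    + [("Alfred (I)", "M")] * 55                # contains parentheses: excluded
    + [("Zed", "M")] * 10                       # fewer than 50 rows: excluded
)
conn.executemany("INSERT INTO actors VALUES (?, ?)", rows)

# Keep names without "." or parentheses that occur at least 50 times,
# counting both genders together.
conn.execute("""
    CREATE TEMPORARY TABLE eligible_names AS
    SELECT first_name, COUNT(*) AS total_freq
    FROM actors
    WHERE first_name NOT LIKE '%.%'
      AND first_name NOT LIKE '%(%'
      AND first_name NOT LIKE '%)%'
    GROUP BY first_name
    HAVING COUNT(*) >= 50
""")

eligible = conn.execute(
    "SELECT first_name, total_freq FROM eligible_names ORDER BY first_name"
).fetchall()
print(eligible)
```

The filter is applied before grouping, so excluded spellings such as "A." never contribute to any count.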
Find the eligible names that appear only for male actors and for no female actors, and report the corresponding frequencies.
Find the eligible names that appear only for female actors and for no male actors, and report the corresponding frequencies.
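One possible way to answer the two single-gender questions is conditional aggregation: join actors to eligible_names, group by name, and keep the groups in which the count for the other gender is zero. The sketch below again assumes a hypothetical actors(first_name, gender) table with 'M'/'F' values and invented toy rows; names_only_for is a helper introduced here for illustration, not part of the assignment.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schema and toy rows, for illustration only.
conn.execute("CREATE TABLE actors (first_name TEXT, gender TEXT)")
rows = (
    [("John", "M")] * 60                        # male only
    + [("Mary", "F")] * 70                      # female only
    + [("Pat", "M")] * 35 + [("Pat", "F")] * 20 # both genders
    + [("Zed", "M")] * 10                       # not eligible (< 50)
)
conn.executemany("INSERT INTO actors VALUES (?, ?)", rows)

conn.execute("""
    CREATE TEMPORARY TABLE eligible_names AS
    SELECT first_name, COUNT(*) AS total_freq
    FROM actors
    WHERE first_name NOT LIKE '%.%'
      AND first_name NOT LIKE '%(%'
      AND first_name NOT LIKE '%)%'
    GROUP BY first_name
    HAVING COUNT(*) >= 50
""")

def names_only_for(conn, other_gender):
    """Eligible names that never occur with other_gender, with frequencies."""
    return conn.execute("""
        SELECT a.first_name, COUNT(*) AS freq
        FROM actors a
        JOIN eligible_names e ON e.first_name = a.first_name
        GROUP BY a.first_name
        HAVING SUM(CASE WHEN a.gender = ? THEN 1 ELSE 0 END) = 0
        ORDER BY freq DESC, a.first_name
    """, (other_gender,)).fetchall()

male_only = names_only_for(conn, "F")    # no female rows for the name
female_only = names_only_for(conn, "M")  # no male rows for the name
print(male_only)
print(female_only)
```

Because the other gender contributes no rows, COUNT(*) in each surviving group is exactly the frequency for the remaining gender.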
For eligible names that appear for both males and females, report the names and the frequencies for males and females. Rank the most ambiguous names on top. We consider as most ambiguous the names for which log(male_freq / female_freq) is close to 0. Remember that the log can be positive or negative; ranking by absolute value (using the ABS() function) will allow you to rank the values closest to 0 first.
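The ambiguity ranking can be sketched as follows, under the same assumptions as before: a hypothetical actors(first_name, gender) table with 'M'/'F' values and invented toy rows. SQLite builds do not always ship a logarithm function, so the sketch registers its own LN from Python; on an engine with a built-in natural log, that registration step would not be needed. The inner query computes both frequencies per name, and the outer query keeps names seen for both genders and sorts by ABS(LN(male_freq / female_freq)).

```python
import math
import sqlite3

conn = sqlite3.connect(":memory:")
# Register a NULL-safe natural log, since SQLite may lack one (assumption).
conn.create_function(
    "LN", 1, lambda x: None if x is None or x <= 0 else math.log(x)
)

# Hypothetical schema and toy rows, for illustration only.
conn.execute("CREATE TABLE actors (first_name TEXT, gender TEXT)")
rows = (
    [("Pat", "M")] * 35 + [("Pat", "F")] * 20
    + [("Jamie", "M")] * 30 + [("Jamie", "F")] * 30
    + [("Robin", "M")] * 10 + [("Robin", "F")] * 45
    + [("John", "M")] * 60                      # male only: not reported here
)
conn.executemany("INSERT INTO actors VALUES (?, ?)", rows)

conn.execute("""
    CREATE TEMPORARY TABLE eligible_names AS
    SELECT first_name, COUNT(*) AS total_freq
    FROM actors
    WHERE first_name NOT LIKE '%.%'
      AND first_name NOT LIKE '%(%'
      AND first_name NOT LIKE '%)%'
    GROUP BY first_name
    HAVING COUNT(*) >= 50
""")

ambiguous = conn.execute("""
    SELECT first_name, male_freq, female_freq
    FROM (
        SELECT a.first_name AS first_name,
               SUM(CASE WHEN a.gender = 'M' THEN 1 ELSE 0 END) AS male_freq,
               SUM(CASE WHEN a.gender = 'F' THEN 1 ELSE 0 END) AS female_freq
        FROM actors a
        JOIN eligible_names e ON e.first_name = a.first_name
        GROUP BY a.first_name
    ) AS counts
    WHERE male_freq > 0 AND female_freq > 0
    -- 1.0 * forces floating-point division before taking the log
    ORDER BY ABS(LN(1.0 * male_freq / female_freq)), first_name
""").fetchall()
print(ambiguous)
```

A perfectly balanced name has a ratio of 1 and a log of 0, so it sorts first; a 35/20 split and a 20/35 split get the same absolute value and are treated as equally ambiguous.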