Step 4: Explore categorical variables
Just like before, let’s construct a prompt consisting of the output of the code we just ran and let Copilot tell us what should happen next:
[Prompt]
Below is the output:
Unnamed: 0 star_rating helpful_votes total_votes review_year count 100.000000 100.000000 100.000000 100.000000 100.0 mean 49.500000 0.840000 0.540000
0.960000 2015.0 std 29.011492 0.368453 1.298173 2.420118 0.0 min 0.000000 0.000000 0.000000 0.000000 2015.0 25% 24.750000 1.000000 0.000000 0.000000 2015.0 50% 49.500000 1.000000 0.000000 0.000000 2015.0 75% 74.250000 1.000000 1.000000 1.000000 2015.0 max 99.000000 1.000000 9.000000 14.000000 2015.0
[End of prompt]
You should receive a response similar to the response below:
[Prompt response]
Great! Now that we have the summary statistics, let’s move on to step 4: exploring categorical variables. We’ll analyze the unique values and their frequencies for categorical variables like...