17  EDA lesson exercises

Open in Google Colab | Download notebook


Exercise 1

What is split-apply-combine and why is it important that a data set is tidy when doing split-apply-combine operations?

Exercise 2

What is the difference between joining and concatenating data frames?

Exercise 3

Describe the difference between categorical and quantitative variables. How are they fundamentally different in the way we plot them?

Exercise 4

Give pros and cons for using a histogram for display of repeated measurements. Then give pros and cons for using an ECDF.

Exercise 5

Write down any questions or points of confusion that you have.