Data Science Introduction: Numpy, Pandas, & Matplotlib

CIS1902 Python Programming

Reminders

  • HW1 has been released! This one is a bit longer than HW0.
    • Due in 2 weeks, Oct 8

Agenda

  1. Data Science Overview
  2. Notebooks
  3. Numpy Basics
  4. Pandas Basics
  5. Matplotlib

Data Science Overview

Data Science

  • In short, data science is the practice of answering questions about the world using collected data
  • Data scientists use insights about data to build models to automate certain tasks
    • Machine learning = tuning models
  • These models are driven by statistical patterns found in the data

Data Science

  • Machine learning is typically treated like magic, but its foundations lie in statistics
  • At the end of the day, it's an experimental science
    • Pick some parameters, see how "good" they are, repeat

Machine Learning Meme

Data Science

Typically, the flow for training a model is something like:

  1. Data Collection
  2. Data Preprocessing
  3. Model Training
  4. Model Evaluation
  5. Model Deployment

where steps 3-4 could be run many times with different classes of models.

Data Science

  • Arguably the best tools to use for steps 1-4 are the numpy and pandas Python modules
  • We won't be focusing on optimizing steps 3-4, but we will go over a few types of models you can use
    • If you want to learn more about this, take a machine learning course!
  • Step 5 is a problem for a data engineer, someone who productionizes models and data pipelines
    • This could be a good final project idea!

Notebooks

  • Notebooks are another way of running Python code, but provide some properties that are highly useful for data science
  • Code can be logically grouped, outputs can be saved without rerunning code, and plotting data is very quick and easy
  • If you haven't already, install Jupyter with pip install jupyterlab

Numpy

  • Numpy is Python's premier scientific computing package
  • Think complex mathematical functions, matrices, probability, etc.
  • We mentioned that Python is relatively slow, so Numpy leverages underlying C code to ensure fast computation
    • However, everything is abstracted away and the code you have to write is usually very minimal!

Numpy

  • The main data structure in Numpy is an ndarray
  • In scientific computing, you almost always deal with vectorized computation
  • As a result, these types of operations are first-class citizens in Numpy
  # regular python
  c = []
  for i in range(len(a)):
      c.append(a[i] * b[i])

  # using Numpy ndarrays!
  c = a * b
  # and it's super fast!
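
As a rough, machine-dependent illustration of that speed difference (not from the original slides), here is a minimal sketch that times the loop version against the vectorized version using Python's built-in timeit module:

  import timeit

  import numpy as np

  a = np.arange(1_000_000)
  b = np.arange(1_000_000)

  # time the pure-Python loop vs. the vectorized multiply, 10 runs each
  loop_time = timeit.timeit(
      "[a[i] * b[i] for i in range(len(a))]", globals=globals(), number=10
  )
  numpy_time = timeit.timeit("a * b", globals=globals(), number=10)

  print(f"loop:  {loop_time:.3f} s")
  print(f"numpy: {numpy_time:.3f} s")  # typically orders of magnitude faster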

Numpy

Some more examples

  >>> import numpy as np
  >>> a = np.array([20, 30, 40, 50])
  >>> b = np.arange(4)
  >>> b
  array([0, 1, 2, 3])
  >>> c = a - b
  >>> c
  array([20, 29, 38, 47])
  >>> b**2
  array([0, 1, 4, 9])
  >>> 10 * np.sin(a)
  array([ 9.12945251, -9.88031624,  7.4511316 , -2.62374854])
  >>> a < 35
  array([ True,  True, False, False])

Pandas

  • Pandas is basically Excel for Python
  • Pandas provides a streamlined way to manage row and column based datasets (think CSVs!)
  • When used in conjunction with Numpy, it provides a powerful environment where one can easily load and manipulate data for scientific computing

Pandas

  • Labeled data is stored as a DataFrame, i.e. each row has named columns
  • A column within the DataFrame is called a Series, i.e. a list of values
  • Each row can be thought of as a dictionary
    • Similarly, accessing a single column of a DataFrame gives you back a Series

dataframe

Pandas

  import pandas as pd

  df = pd.DataFrame(
      {
          "Name": [
              "Braund, Mr. Owen Harris",
              "Allen, Mr. William Henry",
              "Bonnell, Miss. Elizabeth",
          ],
          "Age": [22, 35, 58],
          "Sex": ["male", "male", "female"],
      }
  )

excel
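
Continuing with the df built above, here is a small sketch of how column and row access behaves (standard pandas accessors, nothing beyond what the slide defines):

  # a column comes back as a Series
  ages = df["Age"]
  print(ages.mean())        # 38.33...

  # a row can be pulled out by position with .iloc and behaves like a dict
  first_row = df.iloc[0]
  print(first_row["Name"])  # Braund, Mr. Owen Harris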

Pandas

  # reading csv data is easy!
  data = pd.read_csv('my_csv.csv')
  # we can get an overview of the data
  data.info()
  # accessing and setting is just like a dictionary
  data['my_col']
  data['new_col'] = pd.Series([1, 2, 3, 4])
  # pandas makes it easy to run various functions on my data
  data['my_col'].max()
  # and calculate derived values!
  data['col'] = data['my_col'] * data['new_col']
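
Two operations that come up constantly when cleaning data (including in the lab below) are boolean filtering and dropping missing values. A minimal sketch, reusing the data DataFrame and column names from above:

  # keep only the rows where 'my_col' is greater than 10 (boolean mask)
  filtered = data[data['my_col'] > 10]

  # drop any rows that have NaN in a particular column
  cleaned = data.dropna(subset=['new_col'])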

Matplotlib

  • A library that provides easy ways to plot data!
  • Can get very complex, so I recommend learning it as you need it
  • Typically, you plot a series of points represented as an array of x-values and an array of y-values (for 2D graphs)

Matplotlib

  import matplotlib.pyplot as plt

  xs = [1, 2, 3, 4]
  ys = [1, 4, 9, 16]
  plt.plot(xs, ys)
  # or scatter
  plt.scatter(xs, ys)
  plt.ylabel('my y-axis')
  plt.xlabel('my x-axis')
  # or to save as a file (call before plt.show(), or the saved figure is empty)
  plt.savefig('myfig.png')
  plt.show()

matplotlib example
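
Pandas also plugs into Matplotlib directly, so you can often plot a Series or DataFrame without building x/y lists by hand. A small sketch, reusing the hypothetical data DataFrame from the Pandas slides:

  import matplotlib.pyplot as plt

  # line plot of a numeric column (Series.plot() uses Matplotlib underneath)
  data['my_col'].plot()
  plt.show()

  # counts of each distinct value in a column, drawn as a bar chart
  data['my_col'].value_counts().plot(kind='bar')
  plt.show()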

Data Analysis Lab

UFO Sightings 👽

Download the dataset posted on Ed. First, let's clean the data:

  • Remove any row whose duration (seconds) is not a number
    • Then convert the column to a float
  • Remove any row with an unspecified country (is NaN)
  • Remove any row with an unspecified shape (is NaN)
  • Replace any timestamp that has 24:00 with 23:59 instead
    • Then convert the column to a timestamp type

You should have 69001 rows after this.
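
If you want a starting point, one possible sketch of the cleaning steps is below. The file name and column names ('duration (seconds)', 'country', 'shape', 'datetime') are assumptions about the dataset, so check them against the actual CSV:

  import pandas as pd

  # file and column names here are assumptions -- adjust to match the real CSV
  ufo = pd.read_csv('ufo_sightings.csv')

  # keep only rows whose duration parses as a number, then convert to float
  numeric_duration = pd.to_numeric(ufo['duration (seconds)'], errors='coerce')
  ufo = ufo[numeric_duration.notna()]
  ufo['duration (seconds)'] = ufo['duration (seconds)'].astype(float)

  # drop rows with an unspecified country or shape
  ufo = ufo.dropna(subset=['country', 'shape'])

  # replace impossible 24:00 times, then convert to a real timestamp type
  ufo['datetime'] = ufo['datetime'].str.replace('24:00', '23:59')
  ufo['datetime'] = pd.to_datetime(ufo['datetime'])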

UFO Sightings 👽

Now, let's answer the following questions

  1. How many sightings were there in the United States?
  2. How many sightings were there in the state of Washington?
  3. How many different types of shapes did people report?
  4. How many sightings were there before the year 2000?
  5. Plot a sighting timeline where the x-axis is years and the y-axis is the total number of sightings that year
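
For the plotting question, one possible sketch (continuing from the cleaned ufo DataFrame and the assumed 'datetime' column above):

  import matplotlib.pyplot as plt

  # total sightings per year, plotted as a timeline
  per_year = ufo['datetime'].dt.year.value_counts().sort_index()
  per_year.plot()
  plt.xlabel('year')
  plt.ylabel('sightings')
  plt.show()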

UFO Sightings 👽

Did you get them all right?

  1. 63561
  2. 3708
  3. 28
  4. 12262

ufo sightings plot