Pandas Tutorial · Cleaning Data
Cleaning Data
Data Cleaning
Data cleaning means fixing bad data in your data set.
Bad data could be:
- Empty cells
- Data in wrong format
- Wrong data
- Duplicates
In this tutorial you will learn how to deal with all of them.
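As a rough illustration of the four categories above, the sketch below flags each kind of problem in a tiny made-up DataFrame. The values, the expected date layout (`YYYY/MM/DD`), and the 120-minute threshold for "Duration" are all assumptions chosen for the example; they are not taken from the tutorial's data set.

```python
import pandas as pd

# Hypothetical stand-in data, NOT the tutorial's data set.
df = pd.DataFrame(
    {
        "Duration": [60, 60, 450, 45, 30, 30],
        "Date": [
            "2020/12/01",
            "2020/12/02",
            "2020/12/03",
            "04-12-2020",   # deliberately in a different layout
            "2020/12/05",
            "2020/12/05",
        ],
        "Calories": [409.1, None, 300.0, 282.4, 195.0, 195.0],
    }
)

# Empty cells: count missing values per column.
empty_cells = df.isna().sum()

# Wrong format: dates that fail to parse with the assumed layout become NaT.
parsed = pd.to_datetime(df["Date"], format="%Y/%m/%d", errors="coerce")
wrong_format = df[parsed.isna() & df["Date"].notna()]

# Wrong data: values outside a plausible range (the threshold is an assumption).
wrong_data = df[df["Duration"] > 120]

# Duplicates: rows identical to an earlier row.
duplicates = df[df.duplicated()]

print(empty_cells)
print(wrong_format)
print(wrong_data)
print(duplicates)
```

Empty cells and duplicates can be found without knowing anything about the data. Wrong format and wrong data both need a rule that says what a valid value looks like, which is why the sketch has to assume a date layout and a threshold.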
Our Data Set
In the next chapters we will use this data set:
The data set contains some empty cells ("Date" in row 22, and "Calories" in rows 18 and 28).
The data set contains a value in the wrong format ("Date" in row 26).
The data set contains wrong data ("Duration" in row 7).
The data set contains duplicates (rows 11 and 12).
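Row numbers like these can be recovered from a DataFrame's index. The sketch below uses a made-up 32-row stand-in, with problems planted at the same positions the text mentions for "Calories" and for the duplicates. It assumes the row numbers correspond to pandas' default 0-based index labels. The values themselves are placeholders, not the tutorial's data.

```python
import pandas as pd

# Hypothetical stand-in: 32 rows of placeholder values, NOT the tutorial's data.
duration = [30 + i for i in range(32)]
calories = [200.0 + i for i in range(32)]

# Plant problems at the row labels mentioned in the text.
duration[12] = duration[11]   # make row 12 a copy of row 11
calories[12] = calories[11]
calories[18] = None           # empty cells
calories[28] = None

df = pd.DataFrame({"Duration": duration, "Calories": calories})

# Index labels of rows with an empty "Calories" cell.
empty_rows = df.index[df["Calories"].isna()].tolist()

# keep=False marks every member of a duplicate group, not only the later copies.
duplicate_rows = df.index[df.duplicated(keep=False)].tolist()

print(empty_rows)
print(duplicate_rows)
```

With the default `keep="first"`, `duplicated()` would flag only row 12. Passing `keep=False` reports both rows of the pair, which matches how the duplicates are listed above.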