Pandas Intro
Learn all about Pandas Intro in this comprehensive tutorial.
- •Pandas is a Python library used for working with data sets.
- •Pandas allows us to analyze big data and make conclusions based on statistical theories.
- •Pandas gives you answers about the data.
- •The source code for Pandas is located at this github repository https://github.
What is Pandas?
Pandas is a Python library used for working with data sets.
It has functions for analyzing, cleaning, exploring, and manipulating data.
The name "Pandas" has a reference to both "Panel Data", and "Python Data Analysis" and was created by Wes McKinney in 2008.
Why Use Pandas?
Pandas allows us to analyze big data and make conclusions based on statistical theories.
Pandas can clean messy data sets, and make them readable and relevant.
Relevant data is very important in data science.
What Can Pandas Do?
Pandas gives you answers about the data. Like:
- Is there a correlation between two or more columns?
- What is average value?
- Max value?
- Min value?
Pandas are also able to delete rows that are not relevant, or contains wrong values, like empty or NULL values. This is called cleaning the data.
Where is the Pandas Codebase?
The source code for Pandas is located at this github repository https://github.com/pandas-dev/pandas
Module quiz
2 questionsWhich of the following is true about Pandas Intro?
What is the most common pitfall when working with Pandas Intro?
Answer all questions to submit.