Course
Data Manipulation with pandas
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Loved by learners at thousands of companies
Training 2 or more people?
Try DataCamp for BusinessCourse Description
Discover Data Manipulation with pandas
With this course, you’ll learn why pandas is the world's most popular Python library, used for everything from data manipulation to data analysis. You’ll explore how to manipulate DataFrames, as you extract, filter, and transform real-world datasets for analysis.With pandas, you’ll explore all the core data science concepts. Using real-world data, including Walmart sales figures and global temperature time series, you’ll learn how to import, clean, calculate statistics, and create visualizations—using pandas to add to the power of Python.
Work with pandas Data to Explore Core Data Science Concepts
You’ll start by mastering the pandas basics, including how to inspect DataFrames and perform some fundamental manipulations. You’ll also learn about aggregating DataFrames, before moving on to slicing and indexing.You’ll wrap up the course by learning how to visualize the contents of your DataFrames, working with a dataset that contains weekly US avocado sales.
Learn to Manipulate DataFrames
By completing this pandas course, you’ll understand how to use this Python library for data manipulation. You’ll have an understanding of DataFrames and how to use them, as well as be able to visualize your data in Python.Feels like what you want to learn?
Start Course for FreeWhat you'll learn
- Identify methods to import, inspect, and subset pandas DataFrames using functions like .head(), .info(), and .loc[].
- Differentiate between aggregation techniques using .groupby() and pivot tables for grouped statistics.
- Recognize how to modify DataFrames by adding new columns, setting indexes, and handling missing values.
- Define common visualization types in pandas, including bar, line, scatter, and histogram plots.
- Assess when to use Boolean masking, sorting, and slicing techniques for efficient data selection.
Prerequisites
Intermediate PythonData Aggregation
Summary Values
You will learn to summarize data by calculating key statistics like totals, averages, and counts, enabling you to extract meaningful insights from raw data.
One Grouping Column
You will learn to break down summary statistics by categories, enabling you to compare metrics across different groups and discover patterns in your data.
Multiple Grouping Columns
You will learn to analyze data across multiple dimensions at once, enabling you to discover nuanced patterns by breaking down summaries along several categories simultaneously.
Data Transformation
Basic Transformations
You will learn to create new columns by combining and calculating values from existing data, enabling you to derive ratios and other metrics not available in the original dataset.
Complex Transformations
You will learn to handle multi-step calculations and compute percentages of totals, enabling you to build complex metrics that depend on intermediate results.
Data Filtering
Basic Filtering
You will learn to extract specific rows from your data based on conditions, enabling you to focus your analysis on relevant subsets and handle missing values and text patterns.
Multiple Conditions
You will learn to filter data using multiple criteria at once, enabling you to extract precisely the rows you need by combining conditions with AND and OR logic.
Complex Filtering
You will learn to simplify complex filtering by breaking conditions into separate columns, and to extract the opposite of a filter result, making your analysis more transparent and verifiable.
Conditional Operations
Conditional Transformation
You will learn to apply different calculations based on specific conditions, enabling you to standardize values, classify data into categories, and handle varied scenarios within your data.
Conditional Aggregation
You will learn to calculate summaries that include only values meeting specific criteria, enabling you to compute nuanced metrics like "average of delayed flights only" within each group.
Complete
Earn Statement of Accomplishment
Add this credential to your LinkedIn profile, resume, or CVShare it on social media and in your performance review
Included withPremium or Teams
Enroll NowFAQs
Join over 19 million learners and start Data Manipulation with pandas today!
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.