
Loved by learners at thousands of companies
Course Description
pandas DataFrames are the most widely used in-memory representation of complex data collections within Python. Whether in finance, a scientific field, or data science, familiarity with pandas is essential. This course teaches you to work with real-world datasets containing both string and numeric data, often structured around time series. You will learn powerful analysis, selection, and visualization techniques in this course.
Training 2 or more people?
Get your team access to the full DataCamp platform, including all the features.- 1
Data ingestion & inspection
FreeIn this chapter, you will be introduced to pandas DataFrames. You will use pandas to import and inspect a variety of datasets, ranging from population data obtained from the World Bank to monthly stock data obtained via Yahoo Finance. You will also practice building DataFrames from scratch and become familiar with the intrinsic data visualization capabilities of pandas.
Review of pandas DataFrames50 xpInspecting your data50 xpDataFrame data types50 xpNumPy and pandas working together100 xpBuilding DataFrames from scratch50 xpZip lists to build a DataFrame100 xpLabeling your data100 xpBuilding DataFrames with broadcasting100 xpImporting & exporting data50 xpReading a flat file100 xpDelimiters, headers, and extensions100 xpPlotting with pandas50 xpPlotting series using pandas100 xpPlotting DataFrames100 xp - 2
Exploratory data analysis
Now that you’ve learned how to ingest and inspect your data, you will next learn how to explore it visually and quantitatively. This process, known as exploratory data analysis (EDA), is a crucial component of any data science project. pandas has powerful methods that help with statistical and visual EDA. In this chapter, you will learn how and when to apply these techniques.
Visual exploratory data analysis50 xppandas line plots100 xppandas scatter plots100 xppandas box plots100 xppandas hist, pdf and cdf100 xpStatistical exploratory data analysis50 xpFuel efficiency50 xpBachelor's degrees awarded to women100 xpMedian vs mean100 xpQuantiles100 xpStandard deviation of temperature100 xpSeparating populations50 xpFiltering and counting50 xpSeparate and summarize100 xpSeparate and plot100 xp - 3
Time series in pandas
In this chapter, you will learn how to manipulate and visualize time series data using pandas. You will become familiar with concepts such as upsampling, downsampling, and interpolation. You will practice using method chaining to efficiently filter your data and perform time series analyses. From stock prices to flight timings, time series data can be found in a wide variety of domains, and being able to effectively work with it is an invaluable skill.
Indexing time series50 xpReading and slicing times50 xpCreating and using a DatetimeIndex100 xpPartial string indexing and slicing100 xpReindexing the Index100 xpResampling time series data50 xpResampling and frequency100 xpSeparating and resampling100 xpRolling mean and frequency100 xpResample and roll with it100 xpManipulating time series data50 xpMethod chaining and filtering100 xpMissing values and interpolation100 xpTime zones and conversion100 xpTime series visualization50 xpPlotting time series, datetime indexing100 xpPlotting date ranges, partial indexing100 xp - 4
Case Study - Sunlight in Austin
Working with real-world weather and climate data, this chapter will allow you to apply all of the skills you have acquired in this course. You will use pandas to manipulate the data into a usable form for analysis and systematically explore it using the techniques you’ve learned.
Reading and cleaning the data50 xpWhat method should we use to read the data?50 xpReading in a data file100 xpRe-assigning column names100 xpCleaning and tidying datetime data100 xpCleaning the numeric columns100 xpStatistical exploratory data analysis50 xpSignal min, max, median100 xpSignal variance100 xpSunny or cloudy100 xpVisual exploratory data analysis50 xpWeekly average temperature and visibility100 xpDaily hours of clear sky100 xpHeat or humidity100 xpProbability of high temperatures100 xpFinal thoughts50 xp
Training 2 or more people?
Get your team access to the full DataCamp platform, including all the features.datasets
1981-2010 NOAA Austin Climate NormalsJuly 2015 Austin airport departures (Southwest Airlines)Automobile miles per gallonLife expectancy at birth (Gapminder)Stock data (messy)Percentage of bachelor's degrees awarded to women in the USATipsTitanic2010 Austin weatherWorld Bank World Development IndicatorsWorld populationprerequisites
Intermediate PythonJoin over 16 million learners and start pandas Foundations today!
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.