Skip to content
Exploratory Data Analysis in Python
Run the hidden code cell below to import the data used in this course.~
Take Notes
Add notes about the concepts you've learned and code cells with code you want to keep.
PMF - Probablity Mass Function will be used in ML as it is normalised.
# Add your code snippets here
Explore Datasets
Use the DataFrames imported in the first cell to explore the data and practice your skills!
- Begin by calculating the number of rows and columns and displaying the names of columns for each DataFrame. Change any column names for better readability.
- Experiment and compute a correlation matrix for variables in
nsfg
. - Compute the simple linear regression of
WTKG3
(weight) andHTM4
(height) inbrfss
(or any other variables you are interested in!). Then, compute the line of best fit and plot it. If the fit doesn't look good, try a non-linear model.