Skip to content
Machine Learning with scikit-learn
Run the hidden code cell below to import the data used in this course.
# Import the course packages
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import sklearn
import scipy.stats
# Import the course datasets
auto = pd.read_csv("datasets/auto.csv")
boston = pd.read_csv("datasets/boston.csv")
diabetes = pd.read_csv("datasets/diabetes.csv")
gapminder = pd.read_csv("datasets/gm_2008_region.csv")
votes = pd.read_csv("datasets/votes.csv")
whitewine = pd.read_csv("datasets/white-wine.csv")
Take Notes
Add notes about the concepts you've learned and code cells with code you want to keep.
Add your notes here
# Add your code snippets here
Explore Datasets
Use the DataFrames imported in the first cell to explore the data and practice your skills! Below are the DataFrames and the target variables you can pratice making predictions on.
auto
:mpg
, the miles per gallon of used cars.boston
:MEDV
, the median value of owner-occupied homes in thousands of dollars.diabetes
:diabetes
,0
indicates that the patient does not have diabetes, while a value of1
indicates that the patient does have diabetes.gapminder
:life
, life expectancyparty
: party affiliation (democrat
orrepublican
)whitewine
:quality