Skip to content

Machine Learning with scikit-learn

Run the hidden code cell below to import the data used in this course.

# Import the course packages
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import sklearn
import scipy.stats

# Import the course datasets
auto = pd.read_csv("datasets/auto.csv")
boston = pd.read_csv("datasets/boston.csv")
diabetes = pd.read_csv("datasets/diabetes.csv")
gapminder = pd.read_csv("datasets/gm_2008_region.csv")
votes = pd.read_csv("datasets/votes.csv")
whitewine = pd.read_csv("datasets/white-wine.csv")

Take Notes

Add notes about the concepts you've learned and code cells with code you want to keep.

Add your notes here

# Add your code snippets here

Explore Datasets

Use the DataFrames imported in the first cell to explore the data and practice your skills! Below are the DataFrames and the target variables you can pratice making predictions on.

  • auto: mpg, the miles per gallon of used cars.
  • boston: MEDV, the median value of owner-occupied homes in thousands of dollars.
  • diabetes: diabetes, 0 indicates that the patient does not have diabetes, while a value of 1 indicates that the patient does have diabetes.
  • gapminder: life, life expectancy
  • party: party affiliation (democrat or republican)
  • whitewine: quality