Subscribe now. Save 33% on DataCamp and make data science your superpower.

Offer ends in  days  hrs  mins  secs

This course is part of these tracks:

Brett Lantz
Brett Lantz

Data Scientist at the University of Michigan

Brett Lantz is a data scientist at the University of Michigan and the author of Machine Learning with R. After training as a sociologist, Brett has applied his endless thirst for data to projects that involve understanding and predicting human behavior.

See More
  • Nick Carchedi

    Nick Carchedi

  • Nick Solomon

    Nick Solomon

Course Description

This beginner-level introduction to machine learning covers four of the most common classification algorithms. You will come away with a basic understanding of how each algorithm approaches a learning task, as well as learn the R functions needed to apply these tools to your own work.

  1. 1

    Chapter 1: k-Nearest Neighbors (kNN)


    As the kNN algorithm literally "learns by example" it is a case in point for starting to understand supervised machine learning. This chapter will introduce classification while working through the application of kNN to self-driving vehicle road sign recognition.

  2. Chapter 2: Naive Bayes

    Naive Bayes uses principles from the field of statistics to make predictions. This chapter will introduce the basics of Bayesian methods while exploring how to apply these techniques to iPhone-like destination suggestions.

  3. Chapter 3: Logistic Regression

    Logistic regression involves fitting a curve to numeric data to make predictions about binary events. Arguably one of the most widely used machine learning methods, this chapter will provide an overview of the technique while illustrating how to apply it to fundraising data.

  4. Chapter 4: Classification Trees

    Classification trees use flowchart-like structures to make decisions. Because humans can readily understand these tree structures, classification trees are useful when transparency is needed, such as in loan approval. We'll use the Lending Club dataset to simulate this scenario.