Introduction to Anomaly Detection in R Course

Name: Introduction to Anomaly Detection in R
Rating: 4.846153846153846 (26 reviews)

Introduction to Anomaly Detection in R

IntermediateSkill Level

4.8+

Updated 09/2024

Learn statistical tests for identifying outliers and how to use sophisticated anomaly scoring algorithms.

Course Description

Are you concerned about inaccurate or suspicious records in your data, but not sure where to start? An anomaly detection algorithm could help! Anomaly detection is a collection of techniques designed to identify unusual data points, and are crucial for detecting fraud and for protecting computer networks from malicious activity. In this course, you'll explore statistical tests for identifying outliers, and learn to use sophisticated anomaly scoring algorithms like the local outlier factor and isolation forest. You'll apply anomaly detection algorithms to identify unusual wines in the UCI Wine quality dataset and also to detect cases of thyroid disease from abnormal hormone measurements.

Prerequisites

Intermediate R

Statistical outlier detection

In this chapter, you'll learn how numerical and graphical summaries can be used to informally assess whether data contain unusual points. You'll use a statistical procedure called Grubbs' test to check whether a point is an outlier, and learn about the Seasonal-Hybrid ESD algorithm, which can help identify outliers when the data are a time series.

What do we mean when we talk about anomalies?

50 XP

Recognizing anomaly types

50 XP

Exploring the river nitrate data

100 XP

Testing the extremes with Grubbs' test

50 XP

Visual check of normality

100 XP

Grubbs' test

100 XP

Hunting multiple outliers using Grubbs' test

100 XP

Anomalies in time series

50 XP

Visual assessment of seasonality

100 XP

Seasonal Hybrid ESD algorithm

100 XP

Interpreting Seasonal-Hybrid ESD output

100 XP

Seasonal-Hybrid ESD versus Grubbs' test

50 XP

Start Chapter

Distance and density based anomaly detection

In this chapter, you'll learn how to calculate the k-nearest neighbors distance and the local outlier factor, which are used to construct continuous anomaly scores for each data point when the data have multiple features. You'll learn the difference between local and global anomalies and how the two algorithms can help in each case.

k-nearest neighbors distance score

50 XP

Exploring wine

100 XP

kNN distance matrix

100 XP

kNN distance score

100 XP

Visualizing kNN distance

50 XP

Standardizing features

100 XP

Appending the kNN score

100 XP

Visualizing kNN distance score

100 XP

Local outlier factor

50 XP

LOF calculation

100 XP

LOF visualization

100 XP

LOF vs kNN

100 XP

Start Chapter

Isolation forest

k-nearest neighbors distance and local outlier factor use the distance or relative density of the nearest neighbors to score each point. In this chapter, you'll explore an alternative tree-based approach called an isolation forest, which is a fast and robust method of detecting anomalies that measures how easily points can be separated by randomly splitting the data into smaller and smaller regions.

Isolation trees

50 XP

Fit and predict with an isolation tree

100 XP

Score interpretation

50 XP

Isolation forest

50 XP

Fit an isolation forest

100 XP

Checking convergence

100 XP

Visualizing the isolation score

50 XP

A grid of points

100 XP

Prediction over a grid

100 XP

Anomaly contours

100 XP

Start Chapter

Comparing performance

You've now been introduced to a few different algorithms for anomaly scoring. In this final chapter, you'll learn to compare the detection performance of the algorithms in instances where labeled anomalies are available. You'll learn to calculate and interpret the precision and recall statistics for an anomaly score, and how to adapt the algorithms so they can accommodate data with categorical features.

Labeled anomalies

50 XP

Thyroid data

100 XP

Visualizing thyroid disease

100 XP

Anomaly score

100 XP

Measuring performance

50 XP

Binarized scores

100 XP

Cross-tabulate binary scores

100 XP

Thyroid precision and recall

100 XP

Working with categorical features

50 XP

Converting character to factor

100 XP

Isolation forest with factors

100 XP

LOF with factors

100 XP

Wrap-up

50 XP

Start Chapter

Introduction to Anomaly Detection in R

Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance reviewEnroll Now

Don’t just take our word for it

*4.8

from 26 reviews

85%

15%

Sort by

Jose Antonio

2 days ago

Shinyeong

2 months ago

abe

2 months ago

Stanislau

4 months ago

Joaquim

5 months ago

Tung

5 months ago

Jose Antonio

Shinyeong

abe

FAQs

Is this course suitable for beginners?

Yes, this course is suitable for beginners. You'll learn all the basics of anomaly detection and apply the algorithms to useful datasets. We recommend first taking the "Intermediate R" course.

Will I receive a certificate at the end of the course?

Yes, you will receive a certificate of completion when you have finished the course.

Who will benefit from this course?

This course would be beneficial for data scientists, fraud investigators, cybersecurity experts, and anyone who works with data that includes anomalies or suspicious records.

What kind of anomaly detection algorithms will be discussed?

In this course, you'll explore statistical tests for identifying outliers, and learn to use sophisticated anomaly scoring algorithms like the local outlier factor and isolation forest.

What type of data will be discussed?

You'll apply anomaly detection algorithms to identify unusual wines in the UCI Wine quality dataset and also to detect cases of thyroid disease from abnormal hormone measurements.

What type of summaries will be used to find outliers?

You'll use numerical and graphical summaries to informally assess whether data contain unusual points. You'll also use a statistical procedure called Grubbs' test to check whether a point is an outlier.

What is the difference between local and global anomalies?

Local anomalies are outliers relative to their neighbors, while global anomalies are outliers relative to all other points in the dataset. You'll learn how the local outlier factor and isolation forest algorithms can help in each case.

How will the algorithms be compared?

In the final chapter, you'll learn to calculate and interpret the precision and recall statistics for an anomaly score, and how to adapt the algorithms so they can accommodate data with categorical features.

Introduction to Anomaly Detection in R

Training a Team?

Course Description

Prerequisites

Statistical outlier detection

Distance and density based anomaly detection

Isolation forest

Comparing performance

Earn Statement of Accomplishment

Don’t just take our word for it

FAQs

Is this course suitable for beginners?

Will I receive a certificate at the end of the course?

Who will benefit from this course?

What kind of anomaly detection algorithms will be discussed?

What type of data will be discussed?

What type of summaries will be used to find outliers?

What is the difference between local and global anomalies?

How will the algorithms be compared?

Join over 19 million learners and start Introduction to Anomaly Detection in R today!

Grow your data skills with DataCamp for Mobile

Course Description

Earn Statement of Accomplishment

Don’t just take our word for it

FAQs

Will I receive a certificate at the end of the course?

Who will benefit from this course?

What kind of anomaly detection algorithms will be discussed?

What type of data will be discussed?

What type of summaries will be used to find outliers?

What is the difference between local and global anomalies?

How will the algorithms be compared?

Join over .css-nklxlk{color:var(--wf-brand--main, #03EF62);}19 million learners and start Introduction to Anomaly Detection in R today!

Create Your Free Account

Grow your data skills with DataCamp for Mobile

Join over 19 million learners and start Introduction to Anomaly Detection in R today!