Interactive Course

Predictive Analytics using Networked Data in R

Learn to predict labels of nodes in networks using network learning and by extracting descriptive features from the network

  • 4 hours
  • 14 Videos
  • 56 Exercises
  • 1,696 Participants
  • 4,300 XP

Loved by learners at thousands of top companies, including Whole Foods, Roche, 3M, Deloitte, AXA and MLS.

Course Description

In this course, you will learn to perform state-of-the-art predictive analytics using networked data in R. The aim of network analytics is to predict the class to which a network node belongs, such as churner or not, fraudster or not, or defaulter or not. To accomplish this, we discuss how to leverage information from the network and its underlying structure in a predictive way. More specifically, we introduce the idea of featurization: network features are added to non-network features, which can boost the performance of the resulting analytical model.

You will use the igraph package to generate and label a network of customers in a churn setting and learn about the foundations of network learning. Then, you will learn about homophily, dyadicity and heterophilicity, and how these can be used to gain key exploratory insights into your network. Next, you will use the igraph package to compute both node-centric and neighbor-based network features. Furthermore, you will use Google's PageRank algorithm to compute network features and empirically validate their predictive power. Finally, we teach you how to generate a flat dataset from the network and analyze it using logistic regression and random forests.
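To make the homophily measures named in the description concrete, here is a minimal sketch. It assumes the commonly used definitions: dyadicity is the observed number of edges between two churners divided by the number expected if labels were placed at random, and heterophilicity is the same ratio for edges joining a churner and a non-churner. The course itself works in R with igraph; this sketch uses plain Python with no libraries, and the toy network, the labels and the function name `homophily_measures` are invented for illustration.

```python
def homophily_measures(edges, labels):
    """Return (dyadicity, heterophilicity) for an undirected network
    whose nodes carry a binary label (1 = churner, 0 = non-churner).

    Assumes at least two churners and at least one non-churner,
    otherwise the expected counts below would be zero.
    """
    n = len(labels)                      # number of nodes
    n1 = sum(labels.values())            # number of churners
    n0 = n - n1                          # number of non-churners
    m = len(edges)                       # number of edges

    # Connectance: share of all possible node pairs that are connected.
    p = 2 * m / (n * (n - 1))

    # Observed edge counts by label combination.
    m11 = sum(1 for a, b in edges if labels[a] == 1 and labels[b] == 1)
    m10 = sum(1 for a, b in edges if labels[a] != labels[b])

    # Expected counts if labels were scattered over the nodes at random.
    expected_m11 = n1 * (n1 - 1) / 2 * p
    expected_m10 = n1 * n0 * p

    return m11 / expected_m11, m10 / expected_m10


# Hypothetical toy network: A, B and C churned and form a triangle.
labels = {"A": 1, "B": 1, "C": 1, "D": 0, "E": 0, "F": 0}
edges = [("A", "B"), ("B", "C"), ("A", "C"),
         ("C", "D"), ("D", "E"), ("E", "F")]

dyadicity, heterophilicity = homophily_measures(edges, labels)
print(round(dyadicity, 3), round(heterophilicity, 3))
```

In this toy network the dyadicity is well above 1 and the heterophilicity well below 1, the pattern expected when churners are mostly connected to other churners.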

  1. Introduction, networks and labelled networks

    Free

    In this chapter you will be introduced to labelled networks, network learning and the challenges that can arise.

  2. Homophily

    In this chapter you will learn about homophily and how to compute the two measures that can be used to characterize it: dyadicity and heterophilicity.

  3. Network Featurization

    In this chapter you will use the igraph package to compute various network features and add them to the network.

  4. Putting it all together

    In this chapter you will use the network from Chapter 3 to create a flat dataset. Using standard data mining techniques, you will build predictive models and measure their performance with AUC and top decile lift.
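The two performance measures named in Chapter 4 can be sketched as follows. The sketch assumes the usual definitions: AUC is the probability that a randomly chosen churner receives a higher score than a randomly chosen non-churner, and top decile lift is the churn rate among the 10% highest-scored customers divided by the overall churn rate. The course itself works in R; this sketch uses plain Python with no libraries. The scores stand in for the output of a model such as logistic regression or a random forest, and both the data and the function names are hypothetical.

```python
def auc(scores, labels):
    """Share of (churner, non-churner) pairs in which the churner
    has the higher score; tied scores count as half a win."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def top_decile_lift(scores, labels):
    """Churn rate in the top 10% of scores divided by the overall churn rate."""
    ranked = sorted(zip(scores, labels), key=lambda t: t[0], reverse=True)
    k = max(1, len(ranked) // 10)        # size of the top decile
    top_rate = sum(y for _, y in ranked[:k]) / k
    base_rate = sum(labels) / len(labels)
    return top_rate / base_rate


# Hypothetical flat dataset: 20 customers, already sorted by score,
# with 4 churners at positions 0, 1, 3 and 10.
scores = [(19 - i) / 20 for i in range(20)]
churn_positions = {0, 1, 3, 10}
labels = [1 if i in churn_positions else 0 for i in range(20)]

print(auc(scores, labels))               # 0.875
print(top_decile_lift(scores, labels))   # 5.0
```

Here the two highest-scored customers are both churners, so the top decile has a churn rate of 100% against an overall rate of 20%, giving a lift of 5.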

What do other learners have to say?

“I've used other sites, but DataCamp's been the one that I've stuck with.”

Devon Edwards Joseph

Lloyds Banking Group

“DataCamp is the top resource I recommend for learning data science.”

Louis Maiden

Harvard Business School

“DataCamp is by far my favorite website to learn from.”

Ronald Bowers

Decision Science Analytics @ USAA

Maria Oskarsdottir

Post-doctoral Researcher

María Óskarsdóttir is a post-doctoral researcher and an active R user. She holds a PhD in Business Economics from KU Leuven (Belgium). Her research focuses on applying social network analytics techniques to predictive modeling in marketing, credit scoring and insurance.

Bart Baesens

Professor in Analytics and Data Science at KU Leuven

Professor Bart Baesens is a professor of Big Data & Analytics at KU Leuven (Belgium) and a lecturer at the University of Southampton (United Kingdom). He has done extensive research on big data and analytics, credit risk modeling, fraud detection, and marketing analytics. He has co-authored more than 250 scientific papers and 10 books, some of which have been translated into Chinese, Kazakh and Korean; together the books have sold more than 20,000 copies worldwide. Bart received the OR Society’s Goodeve medal for best JORS paper in 2016 and the EURO 2014 and EURO 2017 awards for best EJOR paper. His research is summarized at www.dataminingapps.com. He also regularly tutors, advises and provides consulting support to international firms on their analytics and credit risk management strategy.
