- 16 Videos
- 52 Exercises
- 4 hours
- 16,987 Participants
- 4000 XP

**Instructor(s):**

Lore is a data scientist at DataCamp. She obtained her PhD in Business Economics and Statistics at KU Leuven, Belgium. During her PhD, she collaborated with several banks working on advanced methods for the analysis of credit risk data. At DataCamp, she is in charge of building out the Applied Finance curriculum.

This hands-on-course with real-life credit data will teach you how to model credit risk by using logistic regression and decision trees in R.

Modeling credit risk for both personal and company loans is of major importance for banks. The probability that a debtor will default is a key component in getting to a measure for credit risk. While introducing other models in this course as well, we focus on two model types that are often used in the credit scoring context; logistic regression and decision trees. We will teach you how to use them in this particular context, and how these models are evaluated by banks.

If you'd like to continue exploring after the course, the data set used at the start of the course can be downloaded here. The preprocessed data set from chapter 2 onwards can be found here.

This chapter begins with a general introduction to credit risk models. We'll explore a real-life data set, then preprocess the data set such that it's in the appropriate format before applying the credit risk models.

- Introduction 50 xp
- Exploring the credit data 100 xp
- Interpreting a CrossTable() 50 xp
- Histograms and outliers 50 xp
- Histograms 100 xp
- Outliers 100 xp
- Missing data and coarse classification 50 xp
- Deleting missing data 100 xp
- Replacing missing data 100 xp
- Keeping missing data 100 xp
- Data splitting and confusion matrices 50 xp
- Splitting the data set 100 xp
- Creating a confusion matrix 100 xp

Logistic regression is still a widely used method in credit risk modeling. In this chapter, you will learn how to apply logistic regression models on credit data in R.

Classification trees are another popular method in the world of credit risk modeling. In this chapter, you will learn how to build classification trees using credit data in R.

In this chapter, you'll learn how you can evaluate and compare the results obtained through several credit risk models.