Skip to main content

Home PythonCredit Risk Modeling in Python

Credit Risk Modeling in Python

Learn how to prepare credit application data, apply machine learning and business rules to reduce risk and ensure profitability.

Start Course for Free

4 hours15 videos57 exercises20,499 learnersStatement of Accomplishment

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies

Course Description

If you've ever applied for a credit card or loan, you know that financial firms process your information before making a decision. This is because giving you a loan can have a serious financial impact on their business. But how do they make a decision? In this course, you will learn how to prepare credit application data. After that, you will apply machine learning and business rules to reduce risk and ensure profitability. You will use two data sets that emulate real credit applications while focusing on business value. Join me and learn the expected value of credit risk modeling!

For Business

Training 2 or more people?

Get your team access to the full DataCamp library, with centralized reporting, assignments, projects and more

In the following Tracks

Applied Finance in Python

1
Exploring and Preparing Loan Data
Free
In this first chapter, we will discuss the concept of credit risk and define how it is calculated. Using cross tables and plots, we will explore a real-world data set. Before applying machine learning, we will process this data by finding and resolving problems.
Play Chapter Now
Understanding credit risk
50 xp
Explore the credit data
100 xp
Crosstab and pivot tables
100 xp
Outliers in credit data
50 xp
Finding outliers with cross tables
100 xp
Visualizing credit outliers
100 xp
Risk with missing data in loan data
50 xp
Replacing missing credit data
100 xp
Removing missing data
100 xp
Missing data intuition
50 xp
2
Logistic Regression for Defaults
With the loan data fully prepared, we will discuss the logistic regression model which is a standard in risk modeling. We will understand the components of this model as well as how to score its performance. Once we've created predictions, we can explore the financial impact of utilizing this model.
Play Chapter Now
Logistic regression for probability of default
50 xp
Logistic regression basics
100 xp
Multivariate logistic regression
100 xp
Creating training and test sets
100 xp
Predicting the probability of default
50 xp
Changing coefficients
100 xp
One-hot encoding credit data
100 xp
Predicting probability of default
100 xp
Credit model performance
50 xp
Default classification reporting
100 xp
Selecting report metrics
100 xp
Visually scoring credit models
100 xp
Model discrimination and impact
50 xp
Thresholds and confusion matrices
100 xp
How thresholds affect performance
100 xp
Threshold selection
100 xp
3
Gradient Boosted Trees Using XGBoost
Decision trees are another standard credit risk model. We will go beyond decision trees by using the trendy XGBoost package in Python to create gradient boosted trees. After developing sophisticated models, we will stress test their performance and discuss column selection in unbalanced data.
Play Chapter Now
Gradient boosted trees with XGBoost
50 xp
Trees for defaults
100 xp
Gradient boosted portfolio performance
100 xp
Assessing gradient boosted trees
100 xp
Column selection for credit risk
50 xp
Column importance and default prediction
100 xp
Visualizing column importance
100 xp
Column selection and model performance
100 xp
Cross validation for credit models
50 xp
Cross validating credit models
100 xp
Limits to cross-validation testing
100 xp
Cross-validation scoring
100 xp
Class imbalance in loan data
50 xp
Undersampling training data
100 xp
Undersampled tree performance
100 xp
Undersampling intuition
50 xp
4
Model Evaluation and Implementation
After developing and testing two powerful machine learning models, we use key performance metrics to compare them. Using advanced model selection techniques specifically for financial modeling, we will select one model. With that model, we will: develop a business strategy, estimate portfolio value, and minimize expected loss.
Play Chapter Now
Model evaluation and implementation
50 xp
Comparing model reports
100 xp
Comparing with ROCs
100 xp
Calibration curves
100 xp
Credit acceptance rates
50 xp
Acceptance rates
100 xp
Visualizing quantiles of acceptance
100 xp
Bad rates
100 xp
Acceptance rate impact
100 xp
Credit strategy and minimum expected loss
50 xp
Making the strategy table
100 xp
Visualizing the strategy
100 xp
Estimated value profiling
100 xp
Total expected loss
100 xp
Course wrap up
50 xp

For Business

Training 2 or more people?

Get your team access to the full DataCamp library, with centralized reporting, assignments, projects and more

In the following Tracks

Applied Finance in Python

datasets

Raw credit data Clean credit data (outliers and missing data removed)Credit data (ready for modeling)

collaborators

Mona Khalil

Ruanne Van Der Walt

prerequisites

Intermediate Python for Finance

Michael Crabtree

Data Scientist

What do other learners have to say?

Join over 15 million learners and start Credit Risk Modeling in Python today!

Create Your Free Account

Google LinkedIn Facebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.