Back to Projects
Bayesian Networks

Credit Risk Assessment

Bayesian network model for credit risk prediction using Hill-Climbing algorithm and BICScore method.

Course: Introduction to Data Science
Credit Risk Assessment

Objectives

  • 1Identify risk clients when granting loans using Bayesian Networks.
  • 2Find optimal structure of the Bayesian network using Hill-Climbing algorithm and BICScore method.
  • 3Manually improve network structure so that it is consistent with the business context.

Conclusions

  • Historical credit evaluation is the strongest risk predictor: ECH=0 (Very Poor) yields 64.5% high risk probability vs ECH=4 (Excellent) at only 18.8%.
  • The connections found by Hill-climbing with BICScore make sense in the data context, though some causality directions are reversed.
  • Loans with duration 24-72 months have 52.2% high risk probability, significantly higher than shorter terms.
  • Automated Bayesian network construction is a good starting point but must be analyzed under business experience.

Technologies

  • Pgmpy
  • FastAPI
  • Networkx
  • Pandas
  • NumPy
  • Matplotlib