Skip to content

This project aim is to facing challenges in ensuring that customers who can repay their loans are not rejected to increase profitability, while also identifying customers likely to face repayment issues to minimize loans granted to unqualified customers and reduce potential losses.

Notifications You must be signed in to change notification settings

Yogaaprila/Home-Credit-Indonesia-Scorecard-Modeling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

Home Credit Indonesia Scorecard Modeling

Project Overview

Home Credit is currently using various statistical methods and Machine Learning techniques to make credit score predictions. Now, as a Data Scientist Intern, we will unlock the full potential of our data. By doing so, we can ensure that customers who are able to repay are not rejected when applying for loans, and that loans are provided with a principal, maturity, and repayment calendar that will motivate customers to succeed.

Problems

Home Credit Indonesia is facing several challenges:

  1. Ensuring that customers who can repay their loans (class 0) are not rejected to increase profitability.
  2. Identifying customers who are likely to face repayment issues to minimize loans granted to unqualified customers and reduce potential losses.

Dataset

This dataset 307,511 customers who have taken loans, with 122 features such as ID, gender, income, etc.

Goals

To address these challenges, a machine learning model will be developed to predict credit scores, helping determine clients capable of repaying loans and those who may encounter payment issues.

Objectives

  1. Identify the main problems to be solved.
  2. Select appropriate machine learning models and evaluation metrics.
  3. Conduct exploratory data analysis (EDA) to understand the dataset characteristics and uncover valuable insights.
  4. Perform data preprocessing, including data cleansing, feature encoding, handling class imbalance, etc.
  5. Build classification models, such as logistic regression, decision tree, or others.
  6. Determine the best model based on evaluation metric results.
  7. Provide business recommendations based on analysis and findings.

Metric Evaluation

  1. Balanced Accuracy: The average recall for class 0 and class 1, ensuring good performance for both classes.
  2. Recall: Measures how well the model identifies all positive cases out of the actual positives in the data.
  3. Precision: Measures the accuracy of the model in predicting the positive class, indicating how many predicted positives are truly positive.

Tools

  1. Python Programming Language
  2. JupyterLab / Jupyter Notebook

Documentation

Presentation

Python Code

About

This project aim is to facing challenges in ensuring that customers who can repay their loans are not rejected to increase profitability, while also identifying customers likely to face repayment issues to minimize loans granted to unqualified customers and reduce potential losses.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published