Skip to content

Objective of this competition is to use historical loan application data to predict whether or not an applicant will be able to repay a loan.

Notifications You must be signed in to change notification settings

hadimh93/Home-Credit-Default-Risk

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Home-Credit-Default-Risk

Introduction

Many people struggle to get loans due to insufficient or non-existent credit histories. And, unfortunately, this population is often taken advantage of by untrustworthy lenders.

While Home Credit is currently using various statistical and machine learning methods to make these predictions, they're challenging Kagglers to help them unlock the full potential of their data. Doing so will ensure that clients capable of repayment are not rejected and that loans are given with a principal, maturity, and repayment calendar that will empower their clients to be successful.

Dataset

Find out the dataset here :

https://www.kaggle.com/c/home-credit-default-risk/data

Installation

  • Download this repository in a zip file by clicking on this link or execute this from the terminal:

git clone https://github.com/vjgpt/Home-Credit-Default-Risk.git

  • Download Jupyter Notebook
  • Open the home_credit.ipynb file using Jupyter and start playing.

Dependencies

  • NumPy
  • IPython
  • Pandas
  • SciKit-Learn
  • Matplotlib
  • Seaborn
  • Garbage Collector( gc )
  • Light GBM
  • K-Fold

Table of Content

    1. Introduction
    1. Importing requires packages
    1. Retrieving the Data
    1. Exploration of Application Train/Test Data.
    • 4.1 Merge both train and test dataset
    • 4.2 Removing Anamolies / Outlier
    • 4.3 Categorical Variables
    • 4.4 Label Encoder and One Hot Encoding
    • 4.5 Add some feature variables
    1. Exploration of Bureau and Bureau_data
    • 5.1 One Hot Encoding
    • 5.2 Feature Engineering - Bureau Data
    1. Exploration of Previous Application
    • 6.1 Handling Outliers
    • 6.2 One Hot Encoding
    • 6.3 Feature Engineering - Previous Application
    1. Exploration of POS Cash Balance
    • 7.1 One Hot Encoding
    • 7.2 Feature Engineering - POS Cash Balance
    1. Exploration of Installment Payments
    • 8.1 One Hot Encoding
    • 8.2 Adding some new features
    • 8.3 Feature Engineering - Installment Payments
    1. Exploration of Credit Card
    • 9.1 One Hot Encoding
    • 9.2 Feature Engineering - Credit Card
    1. LightGBM
    • 10.1 Prepare Final Train and Test data
    • 10.2 Cross Validation Model
    • 10.3 Fitting the model and Predicting
    • 10.5 Submission

About

Objective of this competition is to use historical loan application data to predict whether or not an applicant will be able to repay a loan.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%