Skip to content

Loan Default Analysis - Multi-file joins, DateTime operations, String handling, DTI calculations

Notifications You must be signed in to change notification settings

Akshdeep1013/loan-default-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 

Repository files navigation

Loan Default Analysis

Multi-file data processing with Pandas - Joins, DateTime, and String operations

πŸ“Š Project Overview

Analysis of loan default risk using customer and loan data from multiple CSV files. Implements data joining, temporal analysis, and string cleaning operations.

🎯 Features Implemented

Multi-File Joins

  • Read multiple CSV files (customers.csv, loans.csv)
  • Join customer + loan data on customer_id
  • Inner, left, and right joins
  • Handle missing values

DateTime Operations

  • Convert string dates to datetime
  • Extract date components (year, month, day)
  • Filter data by date ranges
  • Calculate date differences
  • Find recent loans (last 6 months)

String Operations

  • Clean city names: .str.upper()
  • Remove whitespace: .str.strip()
  • Pattern matching: .str.contains()
  • Split full names
  • Replace values: 'Bombay' β†’ 'Mumbai'

DTI Calculation

  • DTI ratio = (loan_emi / monthly_income) * 100
  • Identify high-risk customers (DTI > 40%)
  • Group by credit score ranges

πŸ› οΈ Technologies

  • Python 3.x
  • Pandas
  • NumPy

πŸ“ˆ Key Concepts

βœ… Multi-file joining (merge operations)
βœ… DateTime handling (.dt accessor)
βœ… String manipulation (.str accessor)
βœ… Missing data handling
βœ… Feature calculation (DTI ratio)

πŸ“‚ Dataset

  • customers.csv: 2000 records
  • loans.csv: 2000 records

About

Loan Default Analysis - Multi-file joins, DateTime operations, String handling, DTI calculations

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages