Dataset:
Link for the dataset: https://www.kaggle.com/datasets/mlg-ulb/creditcardfraud
The data-set used here consists of 284,807 European credit card transaction instances of which 492 (0.172%) are fraudu- lent (denoted by the class label ’1’). 2 of the 30 predictors are the output of PCA transformation applied in part to anonymize the data set. The remaining 2 are the ’Time’ and ’Amount’ features.
Three aproaches are used to solve this problem.
- Local - Local Map Reduce
- Global - Smote_Enn_Global
- Hybrid - Smote_Enn_Cluster_Global