A Hybrid Bi-level Metaheuristic for Credit Scoring


Sen D., DÖNMEZ C. Ç., Yildirim U. M.

INFORMATION SYSTEMS FRONTIERS, cilt.22, sa.5, ss.1009-1019, 2020 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 22 Sayı: 5
  • Basım Tarihi: 2020
  • Doi Numarası: 10.1007/s10796-020-10037-0
  • Dergi Adı: INFORMATION SYSTEMS FRONTIERS
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, IBZ Online, ABI/INFORM, Applied Science & Technology Source, Compendex, Computer & Applied Sciences, INSPEC
  • Sayfa Sayıları: ss.1009-1019
  • Anahtar Kelimeler: Support vector machine, Genetic algorithm, Credit scoring, Classification, Feature selection, FEATURE-SELECTION, GENETIC ALGORITHM, FINANCIAL RATIOS, ROUGH SET, PREDICTION, SEARCH, MODELS, SVM, SYSTEM
  • Marmara Üniversitesi Adresli: Evet

Özet

This research aims to propose a framework for evaluating credit applications by assigning a binary score to the applicant. The score is targeted to determine whether the credit application is 'good' or 'bad' in small business purpose loans. Even tiny performance improvements in small businesses may yield a positive impact on the economy as they generate more than 60% of the value. The method presented in this paper hybridizes the Genetic Algorithm (GA) and the Support Vector Machine (SVM) in a bi-level feeding mechanism for increased prediction accuracy. The first level is to determine the parameters of SVM and the second is to find a feature set that increases classification accuracy. To test the proposed approach, we have investigated three different data sets; UCI Australian data set for preliminary works, Lending Club data set for large training and testing, and UCI German and Australian datasets for benchmarking against some other notable methods that use GA. Our computational results show that our proposed method using a feedback mechanism under the hybrid bi-level GA-SVM structure outperforms other classification algorithms in the literature, namely Decision Tree, Random Forests, Logistic Regression, SVM and Artificial Neural Networks, effectively improves the classification accuracy.