Nearest Centroid Classifier Based on Information Value and Homogeneity


Özçelik M. H., BULKAN S.

12th International Symposium on Intelligent Manufacturing and Service Systems, IMSS 2023, İstanbul, Türkiye, 26-28 May 2023, pp. 36-45

  • Publication Type: Conference Paper / Full-Text Paper
  • DOI Number: 10.1007/978-981-99-6062-0_5
  • City of Publication: İstanbul
  • Country of Publication: Türkiye
  • Page Numbers: pp. 36-45
  • Keywords: Classification, Information Value, Machine Learning, Nearest Centroid, Similarity Classifier
  • Marmara University Affiliated: Yes

Abstract

This paper introduces a novel classification algorithm based on distance to class centroids, measured with a weighted Euclidean distance metric. Features are weighted by their predictive power and their in-class homogeneity. Predictive power is measured with the information value metric; for in-class homogeneity, several different measures are used. The algorithm is memory-based, but only the centroid information needs to be stored. Experiments are carried out on 45 benchmark datasets and 5 randomly generated datasets. The results are compared against the Nearest Centroid, Logistic Regression, K-Nearest Neighbors and Decision Tree algorithms. The parameters of the new algorithm and of these traditional classification algorithms are tuned before comparison. The results are promising and have the potential to trigger further research.
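
The general idea of a nearest centroid classifier with feature weights can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact formulas, so several choices here are assumptions made only for illustration. These are a binary label coded 0/1, information value computed over quantile bins, in-class homogeneity taken as the inverse of the average in-class standard deviation, and the two quantities combined by simple multiplication. The function names `information_value`, `fit` and `predict` are hypothetical.

```python
import numpy as np

def information_value(x, y, bins=5, eps=1e-6):
    """Information value of one numeric feature x against a binary label y (0/1)."""
    # Quantile bin edges; np.unique drops duplicate edges from tied values.
    edges = np.unique(np.quantile(x, np.linspace(0, 1, bins + 1)))
    # Bin index of each value, using only the interior edges.
    idx = np.searchsorted(edges[1:-1], x, side="right")
    n_bins = max(len(edges) - 1, 1)
    pos = np.bincount(idx[y == 1], minlength=n_bins).astype(float)
    neg = np.bincount(idx[y == 0], minlength=n_bins).astype(float)
    # Smoothed per-class bin distributions (eps avoids log(0)).
    p = (pos + eps) / (pos.sum() + eps * n_bins)
    q = (neg + eps) / (neg.sum() + eps * n_bins)
    # IV = sum over bins of (p - q) * WoE, with WoE = ln(p / q).
    return float(np.sum((p - q) * np.log(p / q)))

def fit(X, y, bins=5):
    """Store only class labels, class centroids and one weight per feature."""
    classes = np.unique(y)
    centroids = np.array([X[y == c].mean(axis=0) for c in classes])
    iv = np.array([information_value(X[:, j], y, bins) for j in range(X.shape[1])])
    # ASSUMPTION: homogeneity = 1 / (average in-class standard deviation).
    spread = np.mean([X[y == c].std(axis=0) for c in classes], axis=0)
    homogeneity = 1.0 / (spread + 1e-6)
    # ASSUMPTION: the two quantities are combined by multiplication.
    weights = iv * homogeneity
    return classes, centroids, weights

def predict(model, X):
    """Assign each row of X to the class with the nearest weighted centroid."""
    classes, centroids, weights = model
    diff = X[:, None, :] - centroids[None, :, :]   # (n_samples, n_classes, n_features)
    d2 = np.sum(weights * diff ** 2, axis=2)        # squared weighted Euclidean distance
    return classes[np.argmin(d2, axis=1)]
```

Calling `model = fit(X_train, y_train)` followed by `predict(model, X_test)` is enough to try the sketch. The fitted model holds only the centroids and the feature weights, which matches the abstract's remark that only the centroid information needs to be stored.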