Identification of influential observations based on binary particle swarm optimization in the cox PH model

Sancar N., İNAN D.

COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, cilt.49, sa.3, ss.567-590, 2020 (SCI İndekslerine Giren Dergi) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 49 Konu: 3
  • Basım Tarihi: 2020
  • Doi Numarası: 10.1080/03610918.2019.1682156
  • Sayfa Sayıları: ss.567-590


Proper identification of influential observations should be an integral and significant part of the Cox modeling process. This is because the failure to identify influential observations may have negative effect on the estimates acquired from the process. Furthermore, in survival analysis, influential observations frequently propose that the model has imperfections, including poorly specified factor, non-proportional hazards, loss of important information and/or a variable that has been omitted. Nevertheless, many procedures have been developed for the identification of a single influential observation based on the leave-one-out method. However, the results from leave-one-out diagnostic techniques are often misleading as a result of swamping and masking problems in the presence of multiple influential observations in the dataset. In this paper, identification of the optimal set of influential observations problem has been considered as the combinatorial optimization problem and a new simultaneous approach for identification of the optimal set of influential observations is proposed based on Binary Particle Swarm Optimization (BPSO) approach in the Cox PH model. The performance of the proposed BPSO-based approach and conventional diagnostic techniques have been compared according to various evaluation criteria by simulation studies. The performance of the BPSO-based approach has been also demonstrated by the clinical real dataset.