Запис Детальніше

THE METHODS FOR QUANTITATIVE SOLVING THE CLASS IMBALANCE PROBLEM

Науковий журнал «Радіоелектроніка, інформатика, управління»

Переглянути архів Інформація
 
 
Поле Співвідношення
 
##plugins.schemas.marc.fields.042.name## dc
 
##plugins.schemas.marc.fields.245.name## THE METHODS FOR QUANTITATIVE SOLVING THE CLASS IMBALANCE PROBLEM
 
##plugins.schemas.marc.fields.720.name## Kavrin, D. А.; Zaporizhzhya National Technical University, Zaporizhzhya, Ukraine
Subbotin, S. A.; Zaporizhzhya National Technical University, Zaporizhzhya, Ukraine
 
##plugins.schemas.marc.fields.653.name## sample; example; quality metric; cluster; classificatory; majority class; minority class.
 
##plugins.schemas.marc.fields.520.name## Context. The problem of recovery the classes’ balance in imbalanced samples is solved to increase the efficiency of diagnostic and<br />recognition models.<br />Objective. The purpose of the work is to modify the existing method of recovery classes’ balance and to conduct comparative analysis<br />of performance indicators with some modern methods.<br />Method. The proposed data preprocessing method is based on combining the undersampling and cluster-analysis technologies. The<br />method has allowed restoring the balance and reducing the sample while maintaining important topological properties of the sample, high<br />accuracy and acceptable operating time.<br />Results. The software that implements in proposed method has been developed and used in the computational experiments on the study<br />of method’s properties and comparative analysis with other methods of restoring classes’ balance.<br />Conclusions. The experiments confirmed the efficiency of the proposed method and its implemented software. The method has allowed<br />reducing the majority class to the size of the minority class, thus reducing the training sample (the sample is considered imbalanced if the size of the minority class is less than 10% of the original sample size), while demonstrating the best indicators of model accuracy and comparable sampling speed. It can be recommended for the practical application in solving problems of imbalance data for diagnostic and recognition models.
 
##plugins.schemas.marc.fields.260.name## Zaporizhzhya National Technical University
2018-05-29 13:24:17
 
##plugins.schemas.marc.fields.856.name## application/pdf
http://ric.zntu.edu.ua/article/view/131565
 
##plugins.schemas.marc.fields.786.name## Radio Electronics, Computer Science, Control; No 1 (2018): Radio Electronics, Computer Science, Control
 
##plugins.schemas.marc.fields.546.name## ru
 
##plugins.schemas.marc.fields.540.name## Copyright (c) 2018 D. А. Kavrin, S. A. Subbotin