Please use this identifier to cite or link to this item:
Title: Prototype Selection and Generation with Minority Classes Preservation
Authors: Xouveroudis, Konstantinos
Ougiaroglou, Stefanos
Evangelidis, Georgios
Dervos, Dimitris A.
Type: Conference Paper
Subjects: FRASCATI::Natural sciences::Computer and information sciences
Keywords: k-NN Classification
Imbalanced data
Prototype Selection
Prototype Generation
Rare classes
Issue Date: 8-Oct-2021
First Page: 1
Last Page: 8
Volume Title: 2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)
Abstract: Instance-based classifiers become inefficient when the size of their training dataset or model is large. Therefore, they are usually applied in conjunction with a Data Reduction Technique that collects prototypes from the available training data. The set of prototypes is called the condensing set and has the benefit of low computational cost during classification, while, at the same time, accuracy is not negatively affected. In case of imbalanced training data, the number of prototypes collected for the minority (rare) classes may be insufficient. Even worse, the rare classes may be eliminated. This paper presents three methods that preserve the rare classes when data reduction is applied. Two of the methods apply data reduction only on the instances that belong to common classes and avoid costly under-sampling or over-sampling procedures that deal with class imbalances. The third method utilizes SMOTE over-sampling before data reduction. The three methods were tested by conducting experiments on twelve imbalanced datasets. Experimental results reveal high recall and very good reduction rates.
ISBN: 978-1-6654-0032-9
Other Identifiers: 10.1109/IISA52424.2021.9555514
Appears in Collections:Department of Applied Informatics

Files in This Item:
File Description SizeFormat 
2021_IISA_rare_classes.pdf282,57 kBAdobe PDFThumbnail

This item is licensed under a Creative Commons License Creative Commons