Παρακαλώ χρησιμοποιήστε αυτό το αναγνωριστικό για να παραπέμψετε ή να δημιουργήσετε σύνδεσμο προς αυτό το τεκμήριο: https://ruomo.lib.uom.gr/handle/7000/1582
Πλήρης εγγραφή μεταδεδομένων
Πεδίο DCΤιμήΓλώσσα
dc.contributor.authorOugiaroglou, Stefanos-
dc.contributor.authorEvangelidis, Georgios-
dc.date.accessioned2023-06-23T11:31:04Z-
dc.date.available2023-06-23T11:31:04Z-
dc.date.issued2023-05-
dc.identifier10.1145/3589462.3589493en_US
dc.identifier.isbn9798400707445en_US
dc.identifier.urihttps://doi.org/10.1145/3589462.3589493en_US
dc.identifier.urihttps://ruomo.lib.uom.gr/handle/7000/1582-
dc.description.abstractReduction through Homogeneous Clustering (RHC) and its editing variant (ERHC) are effective data reduction techniques for the k-NN classifier. They are based on an iterative k-means clustering task that discovers homogeneous clusters. The centers of the resulting homogeneous clusters constitute the instances of the reduced training set. Although RHC and ERHC are quite fast compared to several well-known data reduction techniques, the iterative execution of k-means clustering renders both of them inappropriate for data reduction tasks that need to be performed quickly, especially, when run over large training datasets. The present paper proposes simple and very fast variations of the algorithms, which are appropriate for such environments. The variations are called RHC2 and ERHC2 and replace the complete execution of k-means clustering with a fast task that assigns instances to the class centers. The experimental study based on fourteen datasets, and, the corresponding statistical tests, show that the proposed RHC2 and ERHC2 variations are very fast and, at the cost of a small penalty on classification accuracy, they achieve higher reduction rates than their predecessors and other two well-known data reduction techniques. They are good candidates when fast reduction on large datasets is required.en_US
dc.language.isoenen_US
dc.publisherAssociation for Computing Machineryen_US
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectFRASCATI::Natural sciences::Computer and information sciencesen_US
dc.subject.otherdata reductionen_US
dc.subject.otherprototype generationen_US
dc.subject.otherRHCen_US
dc.subject.otherhomogeneous clustersen_US
dc.subject.otherk-NN Classificationen_US
dc.titleVery fast variations of training set size reduction algorithms for instance-based classificationen_US
dc.typeConference Paperen_US
dc.contributor.departmentΤμήμα Εφαρμοσμένης Πληροφορικήςen_US
local.identifier.firstpage64en_US
local.identifier.lastpage70en_US
local.identifier.volumetitleInternational Database Engineered Applications Symposium Conferenceen_US
Εμφανίζεται στις Συλλογές: Τμήμα Εφαρμοσμένης Πληροφορικής

Αρχεία σε αυτό το Τεκμήριο:
Αρχείο Περιγραφή ΜέγεθοςΜορφότυπος 
IDEAS2023_RHC2.pdf168,42 kBAdobe PDFThumbnail
Προβολή/Ανοιγμα


Αυτό το τεκμήριο προστατεύεται από Αδεια Creative Commons Creative Commons