Efficient editing and data abstraction by finding homogeneous clusters

Ougiaroglou, Stefanos; Evangelidis, Georgios

Please use this identifier to cite or link to this item: https://ruomo.lib.uom.gr/handle/7000/344

Title:	Efficient editing and data abstraction by finding homogeneous clusters
Authors:	Ougiaroglou, Stefanos; Evangelidis, Georgios
Subjects:	FRASCATI::Engineering and technology
Issue Date:	Apr-2016
Source:	Annals of Mathematics and Artificial Intelligence
Volume:	76
Issue:	3-4
First Page:	327
Last Page:	349
Abstract:	The efficiency of the k-Nearest Neighbour classifier depends on the size of the training set as well as the level of noise in it. Large datasets with high levels of noise lead to less accurate classifiers with high computational cost and storage requirements. The goal of editing is to improve accuracy by improving the quality of the training datasets. To obtain such datasets, editing removes noise and mislabeled data and smooths the decision boundaries between the discrete classes. Prototype abstraction, on the other hand, aims to reduce the computational cost and the storage requirements of classifiers by condensing the training data. This paper proposes an editing algorithm called Editing through Homogeneous Clusters (EHC). It then extends the idea by introducing a prototype abstraction algorithm that integrates the EHC mechanism and is capable of creating a small, noise-free representative set of the initial training data. This algorithm is called Editing and Reduction through Homogeneous Clusters (ERHC). Both are based on a fast, parameter-free iterative execution of k-means clustering that forms homogeneous clusters. Both treat clusters consisting of a single item as noise and remove them. In addition, ERHC summarizes the items of each remaining cluster by storing its mean item in the representative set. EHC and ERHC are tested on several datasets. The results show that both run very fast and achieve high accuracy. In addition, ERHC achieves high reduction rates.
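The mechanism described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how k-means is seeded or how many clusters each pass uses, so seeding each pass with one class mean per class present is an assumption made here, as is the fallback that splits by class label when k-means cannot separate a cluster. The function names (`homogeneous_clusters`, `ehc`, `erhc`) and the toy data are hypothetical.

```python
from collections import defaultdict

def mean(points):
    """Component-wise mean of a non-empty list of equal-length tuples."""
    n = len(points)
    return tuple(sum(c) / n for c in zip(*points))

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, seeds, max_iter=100):
    """Plain Lloyd's k-means; returns groups of positions into `points`."""
    centroids = list(seeds)
    assign = None
    for _ in range(max_iter):
        new_assign = [min(range(len(centroids)),
                          key=lambda j: sq_dist(p, centroids[j]))
                      for p in points]
        if new_assign == assign:
            break  # assignments are stable
        assign = new_assign
        for j in range(len(centroids)):
            members = [p for p, a in zip(points, assign) if a == j]
            if members:
                centroids[j] = mean(members)
    groups = defaultdict(list)
    for pos, a in enumerate(assign):
        groups[a].append(pos)
    return list(groups.values())

def homogeneous_clusters(X, y):
    """Re-run k-means on mixed clusters until every cluster has one class."""
    pending, done = [list(range(len(X)))], []
    while pending:
        idx = pending.pop()
        labels = sorted({y[i] for i in idx})
        if len(labels) == 1:
            done.append(idx)
            continue
        # Assumption: seed k-means with one class mean per class present.
        seeds = [mean([X[i] for i in idx if y[i] == c]) for c in labels]
        groups = kmeans([X[i] for i in idx], seeds)
        if len(groups) == 1:
            # Assumption: if k-means cannot split, fall back to class labels.
            pending.extend([i for i in idx if y[i] == c] for c in labels)
        else:
            pending.extend([idx[pos] for pos in g] for g in groups)
    return done

def ehc(X, y):
    """Editing: drop items that end up alone in a cluster."""
    keep = sorted(i for c in homogeneous_clusters(X, y) if len(c) > 1 for i in c)
    return [X[i] for i in keep], [y[i] for i in keep]

def erhc(X, y):
    """Editing and reduction: one (mean item, class) pair per kept cluster."""
    return [(mean([X[i] for i in c]), y[c[0]])
            for c in homogeneous_clusters(X, y) if len(c) > 1]

# Toy data: two well-separated classes plus one mislabeled point at (2, 2).
X = [(0, 0), (0, 1), (1, 0), (1, 1),
     (10, 10), (10, 11), (11, 10), (11, 11), (2, 2)]
y = [0, 0, 0, 0, 1, 1, 1, 1, 1]
X_edited, y_edited = ehc(X, y)
prototypes = erhc(X, y)
```

On this toy data, the mislabeled point ends up alone in its own cluster and is discarded, so `X_edited` keeps the other eight items and `prototypes` holds one mean item per class.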
URI:	https://doi.org/10.1007/s10472-015-9472-8 https://ruomo.lib.uom.gr/handle/7000/344
ISSN:	1012-2443 1573-7470
Other Identifiers:	10.1007/s10472-015-9472-8
Appears in Collections:	Department of Applied Informatics

Files in This Item:

File	Description	Size	Format
AMAI.pdf		747.24 kB	Adobe PDF

Items in the Repository are protected by copyright, unless otherwise indicated.

Institutional Repository of Academic Research, University of Macedonia
