Παρακαλώ χρησιμοποιήστε αυτό το αναγνωριστικό για να παραπέμψετε ή να δημιουργήσετε σύνδεσμο προς αυτό το τεκμήριο: https://ruomo.lib.uom.gr/handle/7000/1572
Πλήρης εγγραφή μεταδεδομένων
Πεδίο DCΤιμήΓλώσσα
dc.contributor.authorTsagalidis, Evangelos-
dc.contributor.authorEvangelidis, Georgios-
dc.date.accessioned2023-03-20T11:27:59Z-
dc.date.available2023-03-20T11:27:59Z-
dc.date.issued2022-12-04-
dc.identifier10.3390/app122312402en_US
dc.identifier.issn2076-3417en_US
dc.identifier.urihttps://doi.org/10.3390/app122312402en_US
dc.identifier.urihttps://ruomo.lib.uom.gr/handle/7000/1572-
dc.description.abstractWe deal with the problem of class imbalance in data mining and machine learning classification algorithms. This is the case where some of the class labels are represented by a small number of examples in the training dataset compared to the rest of the class labels. Usually, those minority class labels are the most important ones, implying that classifiers should primarily perform well on predicting those labels. This is a well-studied problem and various strategies that use sampling methods are used to balance the representation of the labels in the training dataset and improve classifier performance. We explore whether expert knowledge in the field of Meteorology can enhance the quality of the training dataset when treated by pre-processing sampling strategies. We propose four new sampling strategies based on our expertise on the data domain and we compare their effectiveness against the established sampling strategies used in the literature. It turns out that our sampling strategies, which take advantage of expert knowledge from the data domain, achieve class balancing that improves the performance of most classifiers.en_US
dc.language.isoenen_US
dc.publisherMultidisciplinary Digital Publishing Instituteen_US
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.sourceApplied Sciencesen_US
dc.subjectFRASCATI::Natural sciences::Computer and information sciencesen_US
dc.subject.othermeteorological data mining and machine learningen_US
dc.subject.otherclass imbalanceen_US
dc.subject.otherclassificationen_US
dc.subject.otherrandomized undersamplingen_US
dc.subject.otherSMOTE oversamplingen_US
dc.subject.otherundersampling using temporal distancesen_US
dc.titleExploiting Domain Knowledge to Address Class Imbalance in Meteorological Data Miningen_US
dc.typeArticleen_US
dc.contributor.departmentΤμήμα Εφαρμοσμένης Πληροφορικήςen_US
local.identifier.volume12en_US
local.identifier.issue23en_US
local.identifier.firstpage12402en_US
Εμφανίζεται στις Συλλογές: Τμήμα Εφαρμοσμένης Πληροφορικής

Αρχεία σε αυτό το Τεκμήριο:
Αρχείο Περιγραφή ΜέγεθοςΜορφότυπος 
2022_applsci-12-12402-v3.pdf308,43 kBAdobe PDFΠροβολή/Ανοιγμα


Αυτό το τεκμήριο προστατεύεται από Αδεια Creative Commons Creative Commons