Please use this identifier to cite or link to this item: https://ruomo.lib.uom.gr/handle/7000/1228
Full metadata record
DC Field | Value | Language
dc.contributor.author | Outsios, Evangelos | -
dc.contributor.author | Evangelidis, Georgios | -
dc.date.accessioned | 2022-08-29T07:51:52Z | -
dc.date.available | 2022-08-29T07:51:52Z | -
dc.date.issued | 2011 | -
dc.identifier | 10.15556/IJIIM.01.01.003 | en_US
dc.identifier.issn | 2241-827X | en_US
dc.identifier.uri | https://doi.org/10.15556/IJIIM.01.01.003 | en_US
dc.identifier.uri | http://ejournals.uniwa.gr/index.php/JIIM/article/view/3050 | en_US
dc.identifier.uri | https://ruomo.lib.uom.gr/handle/7000/1228 | -
dc.description.abstract | Indexing of k-dimensional point data is becoming again a hot research topic because of the need to efficiently index and retrieve high dimensional vectors (points) in data mining applications. The most common query on such vectors is kNN searching, which is a variation of range searching. Most multidimensional indexes for point data follow the paradigm of the ubiquitous B+tree and store data entries at the leaf level of the index (data nodes). Since this level naturally occupies the majority of nodes in a multidimensional index tree, it is crucial that an index structure achieves the best possible average storage utilization regardless of data distribution and order of data insertion. An additional conflicting goal is the minimization of the index term that is posted at the levels above when data nodes are split. In this paper we revisit data node splitting techniques for point access methods like the KDB-tree, hB-tree, and, in general, any index that stores point data at its leaf level nodes and splits them so that no overlapping subspaces are created at the leaf level. We experiment with various splitting techniques that produce the minimum index term for posting but differ in the shape of the resulting nodes and the average storage utilization. We also test our splitting techniques using uniform and skewed data distributions. The comparison is on the average data node storage utilization and the efficiency of range query searches. | en_US
dc.language.iso | en | en_US
dc.rights | Attribution-NonCommercial-ShareAlike 4.0 International | *
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | *
dc.source | International Journal on Integrated Information Management | en_US
dc.subject | FRASCATI::Natural sciences::Computer and information sciences | en_US
dc.subject.other | Indexing of k-dimensional point data | en_US
dc.subject.other | Point Access Methods | en_US
dc.title | Achieving optimal average data node storage utilization in k-dimensional point data indexes | en_US
dc.type | Article | en_US
dc.contributor.department | Department of Applied Informatics | en_US
Appears in Collections: Department of Applied Informatics

Files in This Item:
File | Description | Size | Format
2011_IJIIM_outsios.pdf | | 274.55 kB | Adobe PDF


This item is protected by a Creative Commons License.