Achieving optimal average data node storage utilization in k-dimensional point data indexes

Outsios, Evangelos; Evangelidis, Georgios

Παρακαλώ χρησιμοποιήστε αυτό το αναγνωριστικό για να παραπέμψετε ή να δημιουργήσετε σύνδεσμο προς αυτό το τεκμήριο: https://ruomo.lib.uom.gr/handle/7000/1228

Πλήρης εγγραφή μεταδεδομένων

Πεδίο DC	Τιμή	Γλώσσα
dc.contributor.author	Outsios, Evangelos	-
dc.contributor.author	Evangelidis, Georgios	-
dc.date.accessioned	2022-08-29T07:51:52Z	-
dc.date.available	2022-08-29T07:51:52Z	-
dc.date.issued	2011	-
dc.identifier	10.15556/IJIIM.01.01.003	en_US
dc.identifier.issn	2241-827X	en_US
dc.identifier.uri	https://doi.org/10.15556/IJIIM.01.01.003	en_US
dc.identifier.uri	http://ejournals.uniwa.gr/index.php/JIIM/article/view/3050	en_US
dc.identifier.uri	https://ruomo.lib.uom.gr/handle/7000/1228	-
dc.description.abstract	Indexing of k-dimensional point data is becoming again a hot research topic because of the need to efficiently index and retrieve high dimensional vectors (points) in data mining applications. The most common query on such vectors is kNN searching, which is a variation of range searching. Most multidimensional indexes for point data follow the paradigm of the ubiquitous B+tree and store data entries at the leaf level of the index (data nodes). Since this level naturally occupies the majority of nodes in a multidimensional index tree, it is crucial that an index structure achieves the best possible average storage utilization regardless of data distribution and order of data insertion. An additional conflicting goal is the minimization of the index term that is posted at the levels above when data nodes are split. In this paper we revisit data node splitting techniques for point access methods like the KDB-tree, hB-tree, and, in general, any index that stores point data at its leaf level nodes and splits them so that no overlapping subspaces are created at the leaf level. We experiment with various splitting techniques that produce the minimum index term for posting but differ in the shape of the resulting nodes and the average storage utilization. We also test our splitting techniques using uniform and skewed data distributions. The comparison is on the average data node storage utilization and the efficiency of range query searches.	en_US
dc.language.iso	en	en_US
dc.rights	Attribution-NonCommercial-ShareAlike 4.0 International	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	*
dc.source	International Journal on Integrated Information Management	en_US
dc.subject	FRASCATI::Natural sciences::Computer and information sciences	en_US
dc.subject.other	Indexing of k-dimensional point data	en_US
dc.subject.other	Point Access Methods	en_US
dc.title	Achieving optimal average data node storage utilization in k-dimensional point data indexes	en_US
dc.type	Article	en_US
dc.contributor.department	Τμήμα Εφαρμοσμένης Πληροφορικής	en_US
Εμφανίζεται στις Συλλογές:	Τμήμα Εφαρμοσμένης Πληροφορικής

Αρχεία σε αυτό το Τεκμήριο:

Αρχείο	Περιγραφή	Μέγεθος	Μορφότυπος
2011_IJIIM_outsios.pdf		274,55 kB	Adobe PDF	Προβολή/Ανοιγμα

Εμφανίστε την απλή εγγραφή

Αυτό το τεκμήριο προστατεύεται από Αδεια Creative Commons

Ιδρυματικό Αποθετήριο Ακαδημαϊκής Έρευνας Πανεπιστήμιο Μακεδονίας

Ιδρυματικό Αποθετήριο Ακαδημαϊκής Έρευνας
Πανεπιστήμιο Μακεδονίας