Παρακαλώ χρησιμοποιήστε αυτό το αναγνωριστικό για να παραπέμψετε ή να δημιουργήσετε σύνδεσμο προς αυτό το τεκμήριο: https://ruomo.lib.uom.gr/handle/7000/576
Πλήρης εγγραφή μεταδεδομένων
Πεδίο DCΤιμήΓλώσσα
dc.contributor.authorKoutsonikola, Vassiliki A.-
dc.contributor.authorPetridou, Sophia-
dc.contributor.authorVakali, Athena I.-
dc.contributor.authorPapadimitriou, Georgios I.-
dc.date.accessioned2020-01-09T10:56:03Z-
dc.date.available2020-01-09T10:56:03Z-
dc.date.issued2009-08-
dc.identifier10.1108/17440080910983583en_US
dc.identifier.issn1744-0084en_US
dc.identifier.urihttps://doi.org/10.1108/17440080910983583en_US
dc.identifier.urihttps://ruomo.lib.uom.gr/handle/7000/576-
dc.description.abstractPurpose: Web users' clustering is an important mining task since it contributes in identifying usage patterns, a beneficial task for a wide range of applications that rely on the web. The purpose of this paper is to examine the usage of Kullback‐Leibler (KL) divergence, an information theoretic distance, as an alternative option for measuring distances in web users clustering. Design/methodology/approach: KL‐divergence is compared with other well‐known distance measures and clustering results are evaluated using a criterion function, validity indices, and graphical representations. Furthermore, the impact of noise (i.e. occasional or mistaken page visits) is evaluated, since it is imperative to assess whether a clustering process exhibits tolerance in noisy environments such as the web. Findings: The proposed KL clustering approach is of similar performance when compared with other distance measures under both synthetic and real data workloads. Moreover, imposing extra noise on real data, the approach shows minimum deterioration among most of the other conventional distance measures. Practical implications: The experimental results show that a probabilistic measure such as KL‐divergence has proven to be quite efficient in noisy environments and thus constitute a good alternative, the web users clustering problem. Originality/value: This work is inspired by the usage of divergence in clustering of biological data and it is introduced by the authors in the area of web clustering. According to the experimental results presented in this paper, KL‐divergence can be considered as a good alternative for measuring distances in noisy environments such as the web.en_US
dc.language.isoenen_US
dc.publisherEmeralden_US
dc.sourceInternational Journal of Web Information Systemsen_US
dc.subjectFRASCATI::Natural sciences::Computer and information sciencesen_US
dc.subject.otherCluster analysisen_US
dc.subject.otherInternet Data miningen_US
dc.subject.otherUser studiesen_US
dc.titleA new approach to web users clustering and validation: a divergence‐based schemeen_US
dc.typeArticleen_US
dc.contributor.departmentΤμήμα Εφαρμοσμένης Πληροφορικήςen_US
local.identifier.volume5en_US
local.identifier.issue3en_US
local.identifier.firstpage348en_US
local.identifier.lastpage371en_US
Εμφανίζεται στις Συλλογές: Τμήμα Εφαρμοσμένης Πληροφορικής

Αρχεία σε αυτό το Τεκμήριο:
Αρχείο Περιγραφή ΜέγεθοςΜορφότυπος 
IJWIS-2009-WebUsersClustering.pdf902,8 kBAdobe PDFΠροβολή/Ανοιγμα


Τα τεκμήρια στο Αποθετήριο προστατεύονται από πνευματικά δικαιώματα, εκτός αν αναφέρεται κάτι διαφορετικό.