Παρακαλώ χρησιμοποιήστε αυτό το αναγνωριστικό για να παραπέμψετε ή να δημιουργήσετε σύνδεσμο προς αυτό το τεκμήριο: https://ruomo.lib.uom.gr/handle/7000/1279
Πλήρης εγγραφή μεταδεδομένων
Πεδίο DCΤιμήΓλώσσα
dc.contributor.authorKalatzantonakis, Panagiotis-
dc.contributor.authorSifaleras, Angelo-
dc.contributor.authorSamaras, Nikolaos-
dc.date.accessioned2022-09-19T07:20:32Z-
dc.date.available2022-09-19T07:20:32Z-
dc.date.issued2022-
dc.identifier10.1016/j.eswa.2022.118812en_US
dc.identifier.issn0957-4174en_US
dc.identifier.urihttps://doi.org/10.1016/j.eswa.2022.118812en_US
dc.identifier.urihttps://ruomo.lib.uom.gr/handle/7000/1279-
dc.description.abstractFinding the best sequence of local search operators that yield the optimal performance of Variable Neighborhood Search is an important open research question in the field of metaheuristics. This paper proposes a Reinforcement Learning method to address this question. We introduce a new hyperheuristic scheme, termed Bandit VNS, inspired by the Multi-armed Bandit, a particular type of a single state reinforcement learning problem. In Bandit VNS, we utilize the General Variable Neighborhood Search metaheuristic and enhance it by a hyperheuristic strategy. We examine several variations of the Upper Confidence Bound algorithm to create a reliable strategy for adaptive neighborhood selection. Furthermore, we utilize Adaptive Windowing, a state of the art algorithm to estimate and detect changes in the data stream. Bandit VNS is designed for effective parallelization and encourages cooperation between agents to produce the best solution quality. We demonstrate this concept's advantages in accuracy and speed by extensive experimentation using the Capacitated Vehicle Routing Problem. We compare the novel scheme's performance against the conventional General Variable Neighborhood Search metaheuristic in terms of the CPU time and solution quality. The Bandit VNS method shows excellent results and reaches significantly higher performance metrics when applied to well-known benchmark instances. Our experiments show that, our approach achieves an improvement of more than 25% in solution quality when compared to the General Variable Neighborhood Search method using standard library instances of medium and large size.en_US
dc.language.isoenen_US
dc.publisherElsevieren_US
dc.sourceExpert Systems with Applicationsen_US
dc.subjectFRASCATI::Natural sciences::Mathematics::Applied Mathematicsen_US
dc.subjectFRASCATI::Natural sciences::Computer and information sciencesen_US
dc.subject.otherReinforcement Learningen_US
dc.subject.otherMulti-Armed Banditsen_US
dc.subject.otherIntelligent Optimizationen_US
dc.subject.otherBandit Learningen_US
dc.subject.otherMetaheuristicsen_US
dc.subject.otherVariable Neighborhood Searchen_US
dc.subject.otherVehicle Routing Problemen_US
dc.titleA reinforcement learning—Variable neighborhood search method for the capacitated Vehicle Routing Problemen_US
dc.typeArticleen_US
dc.contributor.departmentΤμήμα Εφαρμοσμένης Πληροφορικήςen_US
local.identifier.firstpage118812en_US
Εμφανίζεται στις Συλλογές: Τμήμα Εφαρμοσμένης Πληροφορικής

Αρχεία σε αυτό το Τεκμήριο:
Αρχείο Περιγραφή ΜέγεθοςΜορφότυπος 
A_Reinforcement_Learning_-_VNS_Method_for_the_CVRP.pdf
  Until 2024-09-16
1,36 MBAdobe PDFΠροβολή/Ανοιγμα    Αίτηση αντιτύπου


Τα τεκμήρια στο Αποθετήριο προστατεύονται από πνευματικά δικαιώματα, εκτός αν αναφέρεται κάτι διαφορετικό.