Please use this identifier to cite or link to this item:
Title: A Clustering Approach Towards Cross-Project Technical Debt Forecasting
Authors: Tsoukalas, Dimitrios
Mathioudaki, Maria
Siavvas, Miltiadis
Kehagias, Dionysios
Chatzigeorgiou, Alexander
Type: Article
Subjects: FRASCATI::Natural sciences::Computer and information sciences
Keywords: technical debt
software maintenance
software quality
Issue Date: 7-Jan-2021
Publisher: SpringerLink
Source: SN Computer Science
Volume: 2
Issue: 1
Abstract: Technical debt (TD) describes quality compromises that can yield short-term bene ts but may negatively affect the quality of software products in the long run. A wide range of tools and techniques have been introduced over the years in order for the developers to be able to determine and manage TD. However, being able to also predict its future evolution is of equal importance in order to avoid its accumulation, and, in turn, the unlikely event of making the project unmaintainable. Although recent research endeavors have showcased the feasibility of building accurate project-specific TD forecasting models, there is a gap in the field regarding cross-project TD forecasting. Cross-project TD forecasting is of practical importance, since it would enable the application of pre-existing forecasting models on previously unknown software projects, especially new projects that do not exhibit sufficient commit history to enable the construction of project-specific models. To this end, in the present paper we focus on cross-project TD forecasting, and we examine whether the consideration of similarities between software projects could be the key for more accurate forecasting. More specifically, we propose an approach based on data clustering. In fact, a relatively large repository of software projects is divided into clusters of similar projects with respect to their TD aspects, and specific TD forecasting models are built for each cluster, using regression algorithms. According to our approach, previously unknown software projects are assigned to one of the de fined clusters and the cluster-specific TD forecasting model is applied to predict future TD values. The approach was evaluated through several experiments based on real-world applications. The results of the analysis suggest that the proposed approach comprises a promising solution for accurate cross-project TD forecasting.
ISSN: 2662-995X
Other Identifiers: 10.1007/s42979-020-00408-4
Appears in Collections:Department of Applied Informatics

Files in This Item:
File Description SizeFormat 
tsoukalas_sn_preprint.pdf1,2 MBAdobe PDFView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.