Research output: Contribution to journal › Article › peer-review
TraceSim : An Alignment Method for Computing Stack Trace Similarity. / Rodrigues, Irving Muller; Khvorov, Aleksandr; Aloise, Daniel; Vasiliev, Roman; Koznov, Dmitrij; Fernandes, Eraldo Rezende; Chernishev, George; Luciv, Dmitry; Povarov, Nikita.
In: Empirical Software Engineering, Vol. 27, No. 2, 53, 03.2022.Research output: Contribution to journal › Article › peer-review
}
TY - JOUR
T1 - TraceSim
T2 - An Alignment Method for Computing Stack Trace Similarity
AU - Rodrigues, Irving Muller
AU - Khvorov, Aleksandr
AU - Aloise, Daniel
AU - Vasiliev, Roman
AU - Koznov, Dmitrij
AU - Fernandes, Eraldo Rezende
AU - Chernishev, George
AU - Luciv, Dmitry
AU - Povarov, Nikita
N1 - Publisher Copyright: © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2022/3
Y1 - 2022/3
N2 - Software systems can automatically submit crash reports to a repository for investigation when program failures occur. A significant portion of these crash reports are duplicate, i.e., they are caused by the same software issue. Therefore, if the volume of submitted reports is very large, automatic grouping of duplicate crash reports can significantly ease and speed up analysis of software failures. This task is known as crash report deduplication. Given a huge volume of incoming reports, increasing quality of deduplication is an important task. The majority of studies address it via information retrieval or sequence matching methods based on the similarity of stack traces from two crash reports. While information retrieval methods disregard the position of a frame in a stack trace, the existing works based on sequence matching algorithms do not fully consider subroutine global frequency and unmatched frames. Besides, due to data distribution differences among software projects, parameters that are learned using machine learning algorithms are necessary to provide more flexibility to the methods. In this paper, we propose TraceSim – an approach for crash report deduplication which combines TF-IDF, optimum global alignment, and machine learning (ML) in a novel way. Moreover, we propose a new evaluation methodology for this task that is more comprehensive and robust than previously used evaluation approaches. TraceSim significantly outperforms seven baselines and state-of-the-art methods in the majority of the scenarios. It is the only approach that achieves competitive results on all datasets regarding all considered metrics. Moreover, we conduct an extensive ablation study that demonstrates the importance of each TraceSim’s element to its final performance and robustness. Finally, we provide the source code for all considered methods and evaluation methodology as well as the created datasets.
AB - Software systems can automatically submit crash reports to a repository for investigation when program failures occur. A significant portion of these crash reports are duplicate, i.e., they are caused by the same software issue. Therefore, if the volume of submitted reports is very large, automatic grouping of duplicate crash reports can significantly ease and speed up analysis of software failures. This task is known as crash report deduplication. Given a huge volume of incoming reports, increasing quality of deduplication is an important task. The majority of studies address it via information retrieval or sequence matching methods based on the similarity of stack traces from two crash reports. While information retrieval methods disregard the position of a frame in a stack trace, the existing works based on sequence matching algorithms do not fully consider subroutine global frequency and unmatched frames. Besides, due to data distribution differences among software projects, parameters that are learned using machine learning algorithms are necessary to provide more flexibility to the methods. In this paper, we propose TraceSim – an approach for crash report deduplication which combines TF-IDF, optimum global alignment, and machine learning (ML) in a novel way. Moreover, we propose a new evaluation methodology for this task that is more comprehensive and robust than previously used evaluation approaches. TraceSim significantly outperforms seven baselines and state-of-the-art methods in the majority of the scenarios. It is the only approach that achieves competitive results on all datasets regarding all considered metrics. Moreover, we conduct an extensive ablation study that demonstrates the importance of each TraceSim’s element to its final performance and robustness. Finally, we provide the source code for all considered methods and evaluation methodology as well as the created datasets.
KW - Automatic crash reporting
KW - Crash report deduplication
KW - Duplicate crash report
KW - Duplicate crash report detection
KW - Stack trace
UR - http://www.scopus.com/inward/record.url?scp=85125623765&partnerID=8YFLogxK
UR - https://www.mendeley.com/catalogue/fec2bb19-97d3-3284-92d4-c1a34b8447fe/
U2 - 10.1007/s10664-021-10070-w
DO - 10.1007/s10664-021-10070-w
M3 - Article
AN - SCOPUS:85125623765
VL - 27
JO - Empirical Software Engineering
JF - Empirical Software Engineering
SN - 1382-3256
IS - 2
M1 - 53
ER -
ID: 95165822