Transformer Models for Automated Bug Triaging and Duplicate Bug Detection
In the software engineering field, developer teams must handle bug reports of varying sources and formats to maintain and optimize software applications as issues arise. A team's workflow for handling bugs involves multiple stages to review, assess, assign, and resolve bugs. In teams for large-scale applications, streamlining such processes is vital for efficient operations as they are exposed to greater volumes and varieties of bug reports. This thesis focuses on bug triaging and duplicate bug detection as preliminary processing options for the automated bucketing and assignment of bugs. In the bug triaging task, Transformer-based models are found to outperform in mean Rank-5, Rank-10, and Mean Reciprocal Rank across several open-source datasets for various software projects. In the duplicate bug detection task, similarity learning is employed and Transformer-based siamese models with domain adaptation are shown to improve similarity learning capabilities with improvements in mean Area under the Curve, Recall-rate @ k, and Mean Reciprocal Rank performance.
History
Language
EnglishDegree
- Master of Science
Program
- Data Science and Analytics
Granting Institution
Toronto Metropolitan UniversityLAC Thesis Type
- Thesis