Toronto Metropolitan University
Browse

Transformer Models for Automated Bug Triaging and Duplicate Bug Detection

Download (4.67 MB)
thesis
posted on 2024-08-30, 20:28 authored by Patrick de Guzman

In the software engineering field, developer teams must handle bug reports of varying sources and formats to maintain and optimize software applications as issues arise. A team's workflow for handling bugs involves multiple stages to review, assess, assign, and resolve bugs. In teams for large-scale applications, streamlining such processes is vital for efficient operations as they are exposed to greater volumes and varieties of bug reports. This thesis focuses on bug triaging and duplicate bug detection as preliminary processing options for the automated bucketing and assignment of bugs. In the bug triaging task, Transformer-based models are found to outperform in mean Rank-5, Rank-10, and Mean Reciprocal Rank across several open-source datasets for various software projects. In the duplicate bug detection task, similarity learning is employed and Transformer-based siamese models with domain adaptation are shown to improve similarity learning capabilities with improvements in mean Area under the Curve, Recall-rate @ k, and Mean Reciprocal Rank performance.

History

Language

English

Degree

  • Master of Science

Program

  • Data Science and Analytics

Granting Institution

Toronto Metropolitan University

LAC Thesis Type

  • Thesis

Thesis Advisor

Dr. Mucahit Cevik

Year

2023

Usage metrics

    Computer Science (Theses)

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC