Toronto Metropolitan University
Browse

Exploration and Mitigation of Stereotypical Gender Biases in Information Retrieval Systems

Download (1.54 MB)
thesis
posted on 2024-03-18, 19:28 authored by Amin Bigdeli
<p>Recent studies in information retrieval have shown that gender biases have found their way into representational and algorithmic aspects of retrieval methods. In this thesis, we focus on the exploration and mitigation of gender biases in information retrieval gold standard datasets, often referred to as relevance judgements, and di▯erent retrieval methods including ad hoc retrieval and neural rankers. We investigate the presence of stereotypical gender biases in relevance judgment datasets and show that stereotypical gender biases are prevalent in relevance judgement datasets. The presence of gender biases in relevance judgements would immediately find its way into how neural ranking models are trained and evaluated. Therefore, we propose a systematic method to de-bias the relevance judgement datasets with a set of balanced and well-matched query pairs from different gender identities to reduce the level of bias transferred into neural ranking models that are trained based on them. The main premise of our work is that such a debiasing process will expose neural rankers to comparable queries from across gender identities that have associated relevant documents with compatible degrees of gender bias. Our experiments show that our proposed approach is able to systematically reduce stereotypical gender biases associated with different gender identities, and at the same time maintain the same level of retrieval effectiveness. In addition to relevance judgment datasets, we propose methods for reducing biases in different retrieval methods. To this end, we propose a bias-aware pseudo-relevance feedback method for reducing the level of bias in the ad hoc retrieval method and show that this decrease in bias is not accompanied with the cost of reduction in the utility. We also propose a light-weight strategy that considers the degrees of gender bias when sampling documents to be used for training neural ranking models. Our results show that the proposed light-weight strategy is able to show competitive (or even better) performance compared to the state-of-the-art neural architectures specifically designed to reduce gender biases.</p>

History

Language

English

Degree

  • Master of Science in Management

Program

  • Master of Science in Management

Granting Institution

Ryerson University

LAC Thesis Type

  • Thesis

Thesis Advisor

Morteza Zihayat/ Ebrahim Bagheri

Year

2022

Usage metrics

    Management (TRSM) (Theses)

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC