Toronto Metropolitan University
Browse

Hand Gesture Recognition Using CNN & Publication of World’s Largest ASL Database

Download (1.79 MB)
thesis
posted on 2023-08-29, 16:08 authored by Ashwin Kannoth

Sign language is used throughout the world by the hearing impaired to communicate. Recent advancements in Computer Vision and Deep Learning has given rise to many machine learning based translators. In this research report, a solution to recognize the English alphabet presented as static signs in the American Sign Language (ASL) is proposed. The classifications are achieved by a four layer CNN. The model is trained and tested on a dataset created for this project. This dataset will be published as a contribution to the community and is currently the world’s largest ASL database consisting of 624,000 images. Split into two sections, the database contains images in both the IR and RGB spectrum. Classifications on both sets of data achieve state-of-the-art when compared to similar research. An accuracy of 99.89% and 99.91% are achieved when classifying the IR and RGB datasets respectively. 

History

Language

English

Degree

  • Master of Engineering

Program

  • Electrical and Computer Engineering

Granting Institution

Ryerson University

LAC Thesis Type

  • MRP

Thesis Advisor

Dr. Cungang Yang

Year

2021

Usage metrics

    Electrical and Computer Engineering (Theses)

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC