Toronto Metropolitan University
Browse
Chen, Cui.pdf (616.97 kB)

An Application Of Machine Learning Methods On Complicated Large Scale Dataset

Download (616.97 kB)
thesis
posted on 2023-06-09, 17:52 authored by Cui Chen

 Machine learning research has been an upcoming trend over the last few years. With more computational power and increasing volume of data available thanks to the development of the Internet, machine learning methods could be applied to real life problems and produce fascinating outcomes. Furthermore, with the rise of deep learning methodologies, machine learning practitioners can work on unstructured datasets and achieve human level accuracy. The present thesis focuses on a structured dataset with unstructured fields and information, aiming to apply multiple machine learning methods from a supervised learning perspective. Firstly, linear regression models and extreme gradient boosting machine, as conventional machine learning methods, are applied on certain selected features of the dataset, achieving remarkable results. They serve as base models. Next, an ensembled neural network model with four modules, i.e., fully connected module, embedding module, convolutional network module, and recurrent network module, is created and implemented with the transfer learning technique, which yields better outcomes. This work leads to a typical supervised learning structure and can be considered as a stepping stone for similar practical implementations. 

History

Language

English

Degree

  • Master of Science

Program

  • Applied Mathematics

Granting Institution

Ryerson University

LAC Thesis Type

  • Thesis

Thesis Advisor

Dr. Niushan Gao

Year

2020

Usage metrics

    Applied Mathematics (Theses)

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC