Date of Award
Spring 5-21-2026
Document Type
Honors Project
University Scholars Director
Dr. Joshua Tom
First Advisor/Committee Member
Dr. Carlos R. Arias
Keywords
machine learning, natural language processing, gated recurrent neural networks, empirical evidence, classification
Abstract
Gated recurrent neural networks such as Long Short-Term Memory (LSTM) and Gated Recurrent Units (GRU) help fix instability present in normal recurrent neural networks. This allows them to be used for various real-world tasks, and due to their architecture, they are uniquely qualified to handle variable sized input such as text. However, even before training can begin on a machine learning model, various hyperparameters must be chosen to decide how the model will be architectured. Choosing good hyperparameters is vital for creating a model that performs well but is not larger and more computationally expensive to run than it needs to be. This paper identifies various hyperparameters in both LSTM and GRU models and empirically tests a wide range of hyperparameter values across various different natural language processing (NLP) tasks and finds the resulting accuracy. From there, various graphs are presented and patterns are identified which can highlight various properties of each hyperparameter. Using these results, the developer is able to create more informed decisions when deciding what hyperparameter values to choose when creating a gated recurrent neural network from scratch.
Recommended Citation
Fechete, Joshua Paul, "Optimizing Gated RNNs" (2026). Honors Projects. 258.
https://digitalcommons.spu.edu/honorsprojects/258
Copyright Status
http://rightsstatements.org/vocab/InC/1.0/
Additional Rights Information
Copyright held by author.

Comments
A project submitted in partial fulfillment of the requirements of the University Scholars Honors Program.