Applications of Stochastic Gradient Descent to Nonnegative Matrix Factorization

Slavin, Matthew

Applications of Stochastic Gradient Descent to Nonnegative Matrix Factorization

Files

Slavin_Matthew.pdf (2.24 MB)

Date

2019-07-15

Authors

Slavin, Matthew

Advisor

Vavasis, Stephen

Publisher

University of Waterloo

Abstract

We consider the application of stochastic gradient descent (SGD) to the nonnegative matrix factorization (NMF) problem and the unconstrained low-rank matrix factorization problem. While the literature on the SGD algorithm is rich, the application of this specific algorithm to the field of matrix factorization problems is an unexplored area. We develop a series of results for the unconstrained problem, beginning with an analysis of standard gradient descent with a known zero-loss solution, and culminating with results for SGD in the general case where no zero-loss solution is assumed. We show that, with initialization close to a minimizer, there exist linear rate convergence guarantees. We explore these results further with numerical experiments, and examine how the matrix factorization solutions found by SGD can be used as machine learning classifiers in two specific applications. In the first application, handwritten digit recognition, we show that our approach produces classification performance competitive with existing matrix factorization algorithms. In the second application, document topic classification, we examine how well SGD can recover an unknown words-to-topics matrix when the topics-to-document matrix is generated using the Latent Dirichlet Allocation model. This approach allows us to simulate two regimes for SGD: a fixed-sample regime where a large set of data is iterated over to train the model, and a generated-sample regime where a new data point is generated at each training iteration. In both regimes, we show that SGD can be an effective tool for recovering the hidden words-to-topic matrix. We conclude with some suggestions for further expansion of this work.

Keywords

Optimization, Data Science, Machine Learning

URI

http://hdl.handle.net/10012/14798

Collections

Theses
Combinatorics and Optimization

Full item page

Applications of Stochastic Gradient Descent to Nonnegative Matrix Factorization

Files

Date

Authors

Advisor

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

LC Subject Headings

Citation

URI

Collections