A Hamiltonian Systems Approach to Neural Network Optimization

George, Joshua Joseph

dc.contributor.author	George, Joshua Joseph
dc.date.accessioned	2026-05-21T18:44:03Z
dc.date.available	2026-05-21T18:44:03Z
dc.date.issued	2026-05-21
dc.date.submitted	2026-05-19
dc.description.abstract	We propose and analyze structure-preserving methods for first-order optimization of Lipschitz-smooth objectives by interpreting the dynamics as a dissipative Hamiltonian system, in which the model parameters evolve jointly with an auxiliary momentum variable. This formulation induces a natural energy dissipation mechanism that motivates the design of optimization algorithms that inherit a discrete energy decay property. We develop discrete gradient (DG) methods that preserve an exact discrete-time energy decay property, ensuring monotone dissipation independent of step size. Building on this framework, we introduce variants that empirically reduce oscillations, improve runtime, and improve robustness to ill-conditioned problems. To address the computational cost of the implicit DG methods, we propose semi-implicit discrete gradient (SIDG) schemes obtained by linearizing the DG updates and incorporating curvature through L-BFGS Hessian approximations, which are used to efficiently solve the resulting linear systems. These schemes retain key structure-preserving properties while significantly reducing computational cost, yielding a practical balance between stability and efficiency. We establish monotone energy decay, boundedness of iterates, and sublinear convergence to first-order stationary points. Numerical experiments on ill-conditioned least-squares problems, regularized logistic regression, physics-informed neural networks, and CIFAR-10 image classification demonstrate good performance despite ill-conditioning and performance competitive with widely used optimizers such as Adam, stochastic gradient descent, and L-BFGS.
dc.identifier.uri	https://hdl.handle.net/10012/23378
dc.language.iso	en
dc.pending	false
dc.publisher	University of Waterloo	en
dc.title	A Hamiltonian Systems Approach to Neural Network Optimization
dc.type	Master Thesis
uws-etd.degree	Master of Mathematics
uws-etd.degree.department	Applied Mathematics
uws-etd.degree.discipline	Applied Mathematics
uws-etd.degree.grantor	University of Waterloo	en
uws-etd.embargo.terms	0
uws.contributor.advisor	Morris, Kirsten
uws.contributor.advisor	C. Del Rey Fernández, David
uws.contributor.affiliation1	Faculty of Mathematics
uws.peerReviewStatus	Unreviewed	en
uws.published.city	Waterloo	en
uws.published.country	Canada	en
uws.published.province	Ontario	en
uws.scholarLevel	Graduate	en
uws.typeOfResource	Text	en

Files

Original bundle

Name: George_Joshua.pdf
Size: 2.43 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 6.4 KB
Format: Item-specific license agreed upon to submission

Collections

Theses