Kinetic Langevin Monte Carlo methods

Whalley, Peter Archibald

Kinetic Langevin Monte Carlo methods

Simple item page

dc.contributor.advisor

Leimkuhler, Benedict

dc.contributor.advisor

Paulin, Daniel

dc.contributor.author

Whalley, Peter Archibald

dc.date.accessioned

2024-09-17T11:28:36Z

dc.date.available

2024-09-17T11:28:36Z

dc.date.issued

2024-09-17

dc.description.abstract

In this thesis, we study discretizations of kinetic Langevin dynamics within the context of Markov chain Monte Carlo. We compare the convergence properties for different choices of integrators, we provide asymptotic bias estimates and numerics to compare them. We also present an alternative to Metropolis-Hastings which corrects for the bias. This thesis consists of two parts. In the first part, we provide a framework to analyze the convergence of discretized kinetic Langevin dynamics for M-∇ Lipschitz, m-convex potentials with and without stochastic gradients. Our approach gives convergence rates of O(m/M), with explicit stepsize restrictions, which are of the same order as the stability threshold for Gaussian targets and are valid for a large interval of the friction parameter. We apply this methodology to various integration schemes which are popular in the molecular dynamics and machine learning communities. Further, we introduce the property ``γ-limit convergence" (GLC) to characterize underdamped Langevin schemes that converge to overdamped dynamics in the high-friction limit and which have stepsize restrictions that are independent of the friction parameter; we show that this property is not generic by exhibiting methods from both the class and its complement. We present numerical experiments for a Bayesian logistic regression example, where BAOAB is shown to perform the best. Finally, we provide asymptotic bias estimates for the BAOAB scheme, which remain accurate in the high-friction limit by comparison to a modified stochastic dynamics which preserves the invariant measure. In the second part, we present an unbiased method for Bayesian posterior means based on kinetic Langevin dynamics that combines advanced splitting methods with enhanced gradient approximations. Our approach avoids Metropolis correction by coupling Markov chains at different discretization levels in a multilevel Monte Carlo approach. Theoretical analysis demonstrates that our proposed estimator is unbiased, attains finite variance, and satisfies a central limit theorem. It can achieve accuracy ɛ > 0 for estimating expectations of Lipschitz functions in $d$ dimensions with O(d¹/⁴ɛ⁻²) expected gradient evaluations, without assuming warm start. We exhibit similar bounds using both approximate and stochastic gradients, and our method's computational cost is shown to scale independently of the size of the dataset. The proposed method is tested using a multinomial regression problem on the MNIST dataset and a Poisson regression model for soccer scores. Experiments indicate that the number of gradient evaluations per effective sample is independent of dimension, even when using inexact gradients. For product distributions, we give dimension-independent variance bounds. Our results demonstrate that the unbiased algorithm we present can be much more efficient than the ``gold-standard" randomized Hamiltonian Monte Carlo.

en

dc.identifier.uri

https://hdl.handle.net/1842/42177

dc.identifier.uri

http://dx.doi.org/10.7488/era/4898

dc.language.iso

en

dc.publisher

The University of Edinburgh

en

dc.relation.hasversion

Neil K. Chada, Benedict Leimkuhler, Daniel Paulin, and Peter A. Whalley. Unbiased Kinetic Langevin Monte Carlo with Inexact Gradients. arXiv preprint arXiv:2311.05025, 2023

en

dc.relation.hasversion

Benedict J. Leimkuhler, Daniel Paulin, and Peter A. Whalley. Contraction and Convergence Rates for Discretized Kinetic Langevin Dynamics. SIAM Journal on Numerical Analysis, 62(3):1226–1258, 2024.

en

dc.relation.hasversion

Daniel Paulin and Peter A. Whalley. Correction to “Wasserstein distance estimates for the distributions of numerical approximations to ergodic stochastic differential equations”. arXiv preprint arXiv:2402.08711, 2024.

en

dc.relation.hasversion

Katharina Schuh and Peter A. Whalley. Convergence of kinetic Langevin samplers for non-convex potentials. arXiv preprint arXiv:2405.09992, 2024.

en

dc.subject

probability distribution

en

dc.subject

computational cost

en

dc.subject

kinetic Langevin dynamics

en

dc.subject

Markov chain Monte Carlo

en

dc.subject

Metropolis-Hastings

en

dc.subject

Poisson regression model

en

dc.subject

efficiency

en

dc.title

Kinetic Langevin Monte Carlo methods

en

dc.type

Thesis or Dissertation

en

dc.type.qualificationlevel

Doctoral

en

dc.type.qualificationname

PhD Doctor of Philosophy

en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Whalley2024.pdf
Size:: 3.19 MB
Format:: Adobe Portable Document Format
Description:: Signature redacted

Download

This item appears in the following Collection(s)

Mathematics thesis and dissertation collection