Kinetic Langevin Monte Carlo methods
dc.contributor.advisor
Leimkuhler, Benedict
dc.contributor.advisor
Paulin, Daniel
dc.contributor.author
Whalley, Peter Archibald
dc.date.accessioned
2024-09-17T11:28:36Z
dc.date.available
2024-09-17T11:28:36Z
dc.date.issued
2024-09-17
dc.description.abstract
In this thesis, we study discretizations of kinetic Langevin dynamics within the context of Markov chain Monte Carlo. We compare the convergence properties for different choices of integrators, we provide asymptotic bias estimates and numerics to compare them. We also present an alternative to Metropolis-Hastings which corrects for the bias. This thesis consists of two parts.
In the first part, we provide a framework to analyze the convergence of discretized kinetic Langevin dynamics for M-∇ Lipschitz, m-convex potentials with and without stochastic gradients. Our approach gives convergence rates of O(m/M), with explicit stepsize restrictions, which are of the same order as the stability threshold for Gaussian targets and are valid for a large interval of the friction parameter. We apply this methodology to various integration schemes which are popular in the molecular dynamics and machine learning communities. Further, we introduce the property ``γ-limit convergence" (GLC) to characterize underdamped Langevin schemes that converge to overdamped dynamics in the high-friction limit and which have stepsize restrictions that are independent of the friction parameter; we show that this property is not generic by exhibiting methods from both the class and its complement. We present numerical experiments for a Bayesian logistic regression example, where BAOAB is shown to perform the best. Finally, we provide asymptotic bias estimates for the BAOAB scheme, which remain accurate in the high-friction limit by comparison to a modified stochastic dynamics which preserves the invariant measure.
In the second part, we present an unbiased method for Bayesian posterior means based on kinetic Langevin dynamics that combines advanced splitting methods with enhanced gradient approximations. Our approach avoids Metropolis correction by coupling Markov chains at different discretization levels in a multilevel Monte Carlo approach. Theoretical analysis demonstrates that our proposed estimator is unbiased, attains finite variance, and satisfies a central limit theorem. It can achieve accuracy ɛ > 0 for estimating expectations of Lipschitz functions in $d$ dimensions with O(d¹/⁴ɛ⁻²) expected gradient evaluations, without assuming warm start. We exhibit similar bounds using both approximate and stochastic gradients, and our method's computational cost is shown to scale independently of the size of the dataset. The proposed method is tested using a multinomial regression problem on the MNIST dataset and a Poisson regression model for soccer scores. Experiments indicate that the number of gradient evaluations per effective sample is independent of dimension, even when using inexact gradients. For product distributions, we give dimension-independent variance bounds. Our results demonstrate that the unbiased algorithm we present can be much more efficient than the ``gold-standard" randomized Hamiltonian Monte Carlo.
en
dc.identifier.uri
https://hdl.handle.net/1842/42177
dc.identifier.uri
http://dx.doi.org/10.7488/era/4898
dc.language.iso
en
en
dc.publisher
The University of Edinburgh
en
dc.relation.hasversion
Neil K. Chada, Benedict Leimkuhler, Daniel Paulin, and Peter A. Whalley. Unbiased Kinetic Langevin Monte Carlo with Inexact Gradients. arXiv preprint arXiv:2311.05025, 2023
en
dc.relation.hasversion
Benedict J. Leimkuhler, Daniel Paulin, and Peter A. Whalley. Contraction and Convergence Rates for Discretized Kinetic Langevin Dynamics. SIAM Journal on Numerical Analysis, 62(3):1226–1258, 2024.
en
dc.relation.hasversion
Daniel Paulin and Peter A. Whalley. Correction to “Wasserstein distance estimates for the distributions of numerical approximations to ergodic stochastic differential equations”. arXiv preprint arXiv:2402.08711, 2024.
en
dc.relation.hasversion
Katharina Schuh and Peter A. Whalley. Convergence of kinetic Langevin samplers for non-convex potentials. arXiv preprint arXiv:2405.09992, 2024.
en
dc.subject
probability distribution
en
dc.subject
computational cost
en
dc.subject
kinetic Langevin dynamics
en
dc.subject
Markov chain Monte Carlo
en
dc.subject
Metropolis-Hastings
en
dc.subject
Poisson regression model
en
dc.subject
efficiency
en
dc.title
Kinetic Langevin Monte Carlo methods
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
Doctoral
en
dc.type.qualificationname
PhD Doctor of Philosophy
en
Files
Original bundle
1 - 1 of 1
- Name:
- Whalley2024.pdf
- Size:
- 3.19 MB
- Format:
- Adobe Portable Document Format
- Description:
- Signature redacted
This item appears in the following Collection(s)

