Edinburgh Research Archive

Bayesian inference for challenging scientific models

dc.contributor.advisor
Murray, Iain
dc.contributor.advisor
Gutmann, Michael
dc.contributor.advisor
Zuntz, Joe
dc.contributor.author
Ritchie, James A.
dc.date.accessioned
2024-01-31T12:03:25Z
dc.date.available
2024-01-31T12:03:25Z
dc.date.issued
2024-01-31
dc.description.abstract
Advances in technology and computation have led to ever more complicated scientific models of phenomena across a wide variety of fields. Many of these models present challenges for Bayesian inference, as a result of computationally intensive likelihoods, high-dimensional parameter spaces or large dataset sizes. In this thesis we show how we can apply developments in probabilistic machine learning and statistics to do inference with examples of these types of models. As a demonstration of an applied inference problem involving a non-trivial likelihood computation, we show how a combination of optimisation and MCMC methods along with careful consideration of priors can be used to infer the parameters of an ODE model of the cardiac action potential. We then consider the problem of pileup, a phenomenon that occurs in astronomy when using CCD detectors to observe bright sources. It complicates the fitting of even simple spectral models by introducing an observation model with a large number of continuous and discrete latent variables that scales with the size of the dataset. We develop an MCMC-based method that can work in the presence of pileup by explicitly marginalising out discrete variables and using adaptive HMC on the remaining continuous variables. We show with synthetic experiments that it allows us to fit spectral models in the presence of pileup without biasing the results. We also compare it to neural Simulation- Based Inference approaches, and find that they perform comparably to the MCMC-based approach whilst being able to scale to larger datasets. As an example of a problem where we wish to do inference with extremely large datasets, we consider the Extreme Deconvolution method. The method fits a probability density to a dataset where each observation has Gaussian noise added with a known sample-specific covariance, originally intended for use with astronomical datasets. The existing fitting method is batch EM, which would not normally be applied to large datasets such as the Gaia catalog containing noisy observations of a billion stars. In this thesis we propose two minibatch variants of extreme deconvolution, based on an online variation of the EM algorithm, and direct gradient-based optimisation of the log-likelihood, both of which can run on GPUs. We demonstrate that these methods provide faster fitting, whilst being able to scale to much larger models for use with larger datasets. We then extend the extreme deconvolution approach to work with non- Gaussian noise, and to use more flexible density estimators such as normalizing flows. Since both adjustments lead to an intractable likelihood, we resort to amortized variational inference in order to fit them. We show that for some datasets that flows can outperform Gaussian mixtures for extreme deconvolution, and that fitting with non-Gaussian noise is now possible.
en
dc.identifier.uri
https://hdl.handle.net/1842/41392
dc.identifier.uri
http://dx.doi.org/10.7488/era/4126
dc.language.iso
en
en
dc.publisher
The University of Edinburgh
en
dc.relation.hasversion
Dockhorn, Tim, James A. Ritchie, Yaoliang Yu, and Iain Murray (July 13, 2020). “Density Deconvolution with Normalizing Flows”. arXiv: 2006.09396 [cs, stat] (cit. on pp. 5, 128)
en
dc.relation.hasversion
Ritchie, James A. and Iain Murray (Nov. 26, 2019). “Scalable Extreme Deconvolution”. arXiv: 1911.11663 [cs, stat] (cit. on p. 4)
en
dc.subject
Bayesian inference
en
dc.subject
probabilistic machine learning
en
dc.subject
MCMC methods
en
dc.subject
cardiac action potential
en
dc.subject
pileup
en
dc.subject
MCMC-based method
en
dc.subject
adaptive HMC
en
dc.subject
neural Simulation- Based Inference approaches
en
dc.subject
Simulation- Based Inference approaches
en
dc.subject
Extreme Deconvolution method
en
dc.title
Bayesian inference for challenging scientific models
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
Doctoral
en
dc.type.qualificationname
PhD Doctor of Philosophy
en

Files

Original bundle

Now showing 1 - 1 of 1
Name:
RitchieJA_2023.pdf
Size:
5.17 MB
Format:
Adobe Portable Document Format
Description:

This item appears in the following Collection(s)