Edinburgh Research Archive

Metric learning on high-dimensional data with optimal transport distances

dc.contributor.advisor
Danos, Vincent
dc.contributor.advisor
Sanguinetti, Guido
dc.contributor.advisor
Swain, Peter
dc.contributor.author
Fulop, Patric-Manuel
dc.contributor.sponsor
Microsoft Research
en
dc.date.accessioned
2025-01-23T15:52:02Z
dc.date.available
2025-01-23T15:52:02Z
dc.date.issued
2025-01-23
dc.description.abstract
Optimal transport distances (OT), also known as Kantorovich or Wasserstein distances and its approximate variants such as Sinkhorn Divergences have been widely used in recent years in the field of Machine Learning and its applications. From being used as loss functions in generative model setups such as GANs & unsupervised domain adaptation, to the more recent cluster assignment in self-supervised large state-of-art models such as SWAV, they offer a principled way to compare probability distributions. It is an automatic machinery that takes as an input a ground metric on the data features and lifts this to distances between probabilities on that data space. One of the pitfalls of other often-used methods such as Kullback-Leibler from the family of f-Divergences is the breaking of euclidean metrics in high dimensional spaces, as well as infite solutions when the support of the distributions doesn’t match. In the first part of the thesis, we provide an introduction to Optimal Transport theory, followed by the relevant literature in metric learning & generative modelling that covers OT, including a few advancements in approximations of OT distances, i.e. Sinkhorn divergences, that make training generative models with the Wasserstein distance, faster and scalable. Two of the main challenges with using OT distances in practice is the cost of computation when the data lives in high dimension, and the choice of a suitable ground metric. Firstly, we look at the recent work by Paty and Cuturi (2019), which aims specifically at reducing the computational cost by computing OT using low-rank projections of the data, seen as discrete measures. We extend this approach and show that one can approximate OT distances by using more general families of maps provided they are 1-Lipschitz. The best estimate is obtained by maximising OT over the given family. As OT calculations are done after mapping data to a lower dimensional space, our method scales well with the original data dimension and is robust against noise. We demonstrate the idea with neural networks and provide some insights into using these methods for training generative models. Secondly, we look at learning the ground metric for OT distances in a supervised manner and compare this to traditional metric learning methods such as learning the parameters of Mahalanobis distances on MNIST. Finally, we consider potential avenues for future research in this area.
en
dc.identifier.uri
https://hdl.handle.net/1842/43017
dc.identifier.uri
http://dx.doi.org/10.7488/era/5565
dc.language.iso
en
en
dc.publisher
The University of Edinburgh
en
dc.relation.references
Fulop, P., Manataki, A., Agachi, A., Capital, E., & Pop, P. (n.d.). Predicting survival after surgery for brain
en
dc.relation.references
Fulop, P. M., & Danos, V. (2021). Efficient estimates of optimal transport via low-dimensional embeddings. arXiv preprint arXiv:2111.04838
en
dc.relation.references
Pop, R., & Fulop, P. (2018). Deep ensemble bayesian active learning: Addressing the mode collapse issue in monte carlo dropout via ensembles. arXiv preprint arXiv:1811.03897
en
dc.subject
Metric learning
en
dc.subject
high-dimensional data
en
dc.subject
optimal transport distances
en
dc.subject
Kantorovich
en
dc.subject
Wasserstein
en
dc.subject
Sinkhorn Divergences
en
dc.subject
Machine Learning
en
dc.subject
Kullback-Leibler
en
dc.subject
f-Divergences
en
dc.subject
Optimal Transport theory
en
dc.title
Metric learning on high-dimensional data with optimal transport distances
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
Doctoral
en
dc.type.qualificationname
MPhil Master of Philosophy
en

Files

Original bundle

Now showing 1 - 1 of 1
Name:
FulopP_2025.pdf
Size:
4 MB
Format:
Adobe Portable Document Format
Description:

This item appears in the following Collection(s)