Statistical models for natural scene data
dc.contributor.advisor
Williams, Chris
en
dc.contributor.advisor
Mckenna, Stephen
en
dc.contributor.author
Kivinen, Jyri Juhani
en
dc.contributor.sponsor
Scottish Informatics and Computer Science Alliance (SICSA)
en
dc.contributor.sponsor
Engineering and Physical Sciences Research Council (EPSRC)
en
dc.date.accessioned
2014-06-03T09:44:19Z
dc.date.available
2014-06-03T09:44:19Z
dc.date.issued
2014-06-27
dc.description.abstract
This thesis considers statistical modelling of natural image data. Obtaining advances in this field can have significant impact for both engineering applications, and for the understanding of the human visual system. Several recent advances in natural image modelling have been obtained with the use of unsupervised feature learning. We consider a class of such models, restricted Boltzmann machines (RBMs), used in many recent state-of-the-art image models. We develop extensions of these stochastic artificial neural networks, and use them as a basis for building more effective image models, and tools for computational vision. We first develop a novel framework for obtaining Boltzmann machines, in which the hidden unit activations co-transform with transformed input stimuli in a stable and predictable way throughout the network. We define such models to be transformation equivariant. Such properties have been shown useful for computer vision systems, and have been motivational for example in the development of steerable filters, a widely used classical feature extraction technique. Translation equivariant feature sharing has been the standard method for scaling image models beyond patch-sized data to large images. In our framework we extend shallow and deep models to account for other kinds of transformations as well, focusing on in-plane rotations. Motivated by the unsatisfactory results of current generative natural image models, we take a step back, and evaluate whether they are able to model a subclass of the data, natural image textures. This is a necessary subcomponent of any credible model for visual scenes. We assess the performance of a state- of-the-art model of natural images for texture generation, using a dataset and evaluation techniques from in prior work. We also perform a dissection of the model architecture, uncovering the properties important for good performance. Building on this, we develop structured extensions for more complicated data comprised of textures from multiple classes, using the single-texture model architecture as a basis. These models are shown to be able to produce state-of-the-art texture synthesis results quantitatively, and are also effective qualitatively. It is demonstrated empirically that the developed multiple-texture framework provides a means to generate images of differently textured regions, more generic globally varying textures, and can also be used for texture interpolation, where the approach is radically dfferent from the others in the area. Finally we consider visual boundary prediction from natural images. The work aims to improve understanding of Boltzmann machines in the generation of image segment boundaries, and to investigate deep neural network architectures for learning the boundary detection problem. The developed networks (which avoid several hand-crafted model and feature designs commonly used for the problem), produce the fastest reported inference times in the literature, combined with state-of-the-art performance.
en
dc.identifier.uri
http://hdl.handle.net/1842/8879
dc.language.iso
en
dc.publisher
The University of Edinburgh
en
dc.relation.hasversion
J. J. Kivinen and C. K. I. Williams. Transformation equivariant Boltzmann ma- chines. In Proceedings, International Conference on Arti cial Neural Networks (ICANN), pages 1{8, 2011.
en
dc.relation.hasversion
J. J. Kivinen and C. K. I. Williams. Multiple texture Boltzmann machines. In Proceedings, International Conference on Arti cial Intelligence and Statistics (AISTATS), 2012.
en
dc.relation.hasversion
J. J. Kivinen, E. B. Sudderth, and M. I. Jordan. Image denoising with non- parametric hidden Markov trees. In Proceedings, International Conference on Image Processing (ICIP), volume 3, pages 121{124, 2007a.
en
dc.relation.hasversion
J. J. Kivinen, E. B. Sudderth, and M. I. Jordan. Learning multiscale representa- tions of natural scenes using Dirichlet processes. In Proceedings, International Conference on Computer Vision (ICCV), pages 1{8, 2007b.
en
dc.subject
Image understanding
en
dc.subject
Statistical image modelling
en
dc.subject
Texture modelling
en
dc.subject
Visual boundary prediction
en
dc.subject
Boltzmann machines
en
dc.subject
Neural networks
en
dc.title
Statistical models for natural scene data
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
Doctoral
en
dc.type.qualificationname
PhD Doctor of Philosophy
en
Files
Original bundle
1 - 1 of 1
- Name:
- Kivinen2014.pdf
- Size:
- 19.65 MB
- Format:
- Adobe Portable Document Format
This item appears in the following Collection(s)

