Statistical models for natural scene data

Kivinen, Jyri Juhani

dc.contributor.advisor

Williams, Chris

en

dc.contributor.advisor

McKenna, Stephen

en

dc.contributor.author

Kivinen, Jyri Juhani

en

dc.contributor.sponsor

Scottish Informatics and Computer Science Alliance (SICSA)

en

dc.contributor.sponsor

Engineering and Physical Sciences Research Council (EPSRC)

en

dc.date.accessioned

2014-06-03T09:44:19Z

dc.date.available

2014-06-03T09:44:19Z

dc.date.issued

2014-06-27

dc.description.abstract

This thesis considers statistical modelling of natural image data. Advances in this field can have a significant impact both on engineering applications and on the understanding of the human visual system. Several recent advances in natural image modelling have been obtained with the use of unsupervised feature learning. We consider a class of such models, restricted Boltzmann machines (RBMs), used in many recent state-of-the-art image models. We develop extensions of these stochastic artificial neural networks, and use them as a basis for building more effective image models and tools for computational vision. We first develop a novel framework for obtaining Boltzmann machines in which the hidden unit activations co-transform with transformed input stimuli in a stable and predictable way throughout the network. We define such models to be transformation equivariant. Such properties have been shown to be useful for computer vision systems, and have motivated, for example, the development of steerable filters, a widely used classical feature extraction technique. Translation equivariant feature sharing has been the standard method for scaling image models beyond patch-sized data to large images. In our framework we extend shallow and deep models to account for other kinds of transformations as well, focusing on in-plane rotations. Motivated by the unsatisfactory results of current generative natural image models, we take a step back and evaluate whether they are able to model a subclass of the data, natural image textures. This is a necessary subcomponent of any credible model for visual scenes. We assess the performance of a state-of-the-art model of natural images for texture generation, using a dataset and evaluation techniques from prior work. We also perform a dissection of the model architecture, uncovering the properties important for good performance.
Building on this, we develop structured extensions for more complicated data composed of textures from multiple classes, using the single-texture model architecture as a basis. These models are shown to produce state-of-the-art texture synthesis results quantitatively, and are also effective qualitatively. It is demonstrated empirically that the developed multiple-texture framework provides a means to generate images of differently textured regions and more generic globally varying textures, and can also be used for texture interpolation, where the approach is radically different from the others in the area. Finally, we consider visual boundary prediction from natural images. The work aims to improve understanding of Boltzmann machines in the generation of image segment boundaries, and to investigate deep neural network architectures for learning the boundary detection problem. The developed networks, which avoid several hand-crafted model and feature designs commonly used for the problem, produce the fastest reported inference times in the literature, combined with state-of-the-art performance.

en

dc.identifier.uri

http://hdl.handle.net/1842/8879

dc.language.iso

en

dc.publisher

The University of Edinburgh

en

dc.relation.hasversion

J. J. Kivinen and C. K. I. Williams. Transformation equivariant Boltzmann machines. In Proceedings, International Conference on Artificial Neural Networks (ICANN), pages 1–8, 2011.

en

dc.relation.hasversion

J. J. Kivinen and C. K. I. Williams. Multiple texture Boltzmann machines. In Proceedings, International Conference on Artificial Intelligence and Statistics (AISTATS), 2012.

en

dc.relation.hasversion

J. J. Kivinen, E. B. Sudderth, and M. I. Jordan. Image denoising with nonparametric hidden Markov trees. In Proceedings, International Conference on Image Processing (ICIP), volume 3, pages 121–124, 2007a.

en

dc.relation.hasversion

J. J. Kivinen, E. B. Sudderth, and M. I. Jordan. Learning multiscale representations of natural scenes using Dirichlet processes. In Proceedings, International Conference on Computer Vision (ICCV), pages 1–8, 2007b.

en

dc.subject

Image understanding

en

dc.subject

Statistical image modelling

en

dc.subject

Texture modelling

en

dc.subject

Visual boundary prediction

en

dc.subject

Boltzmann machines

en

dc.subject

Neural networks

en

dc.title

Statistical models for natural scene data

en

dc.type

Thesis or Dissertation

en

dc.type.qualificationlevel

Doctoral

en

dc.type.qualificationname

PhD Doctor of Philosophy

en

Files

Original bundle

Name: Kivinen2014.pdf
Size: 19.65 MB
Format: Adobe Portable Document Format

This item appears in the following Collection(s)

Informatics thesis and dissertation collection