Edinburgh Research Archive

Image context for object detection, object context for part detection

dc.contributor.advisor
Ferrari, Vittorio
en
dc.contributor.advisor
Komura, Taku
en
dc.contributor.author
Gonzalez-Garcia, Abel
en
dc.contributor.sponsor
other
en
dc.date.accessioned
2018-03-16T12:29:19Z
dc.date.available
2018-03-16T12:29:19Z
dc.date.issued
2018-07-02
dc.description.abstract
Objects and parts are crucial elements for achieving automatic image understanding. The goal of the object detection task is to recognize and localize all the objects in an image. Similarly, semantic part detection attempts to recognize and localize the object parts. This thesis proposes four contributions. The first two make object detection more efficient by using active search strategies guided by image context. The last two involve parts. One of them explores the emergence of parts in neural networks trained for object detection, whereas the other improves on part detection by adding object context. First, we present an active search strategy for efficient object class detection. Modern object detectors evaluate a large set of windows using a window classifier. Instead, our search sequentially chooses what window to evaluate next based on all the information gathered before. This results in a significant reduction on the number of necessary window evaluations to detect the objects in the image. We guide our search strategy using image context and the score of the classifier. In our second contribution, we extend this active search to jointly detect pairs of object classes that appear close in the image, exploiting the valuable information that one class can provide about the location of the other. This leads to an even further reduction on the number of necessary evaluations for the smaller, more challenging classes. In the third contribution of this thesis, we study whether semantic parts emerge in Convolutional Neural Networks trained for different visual recognition tasks, especially object detection. We perform two quantitative analyses that provide a deeper understanding of their internal representation by investigating the responses of the network filters. Moreover, we explore several connections between discriminative power and semantics, which provides further insights on the role of semantic parts in the network. Finally, the last contribution is a part detection approach that exploits object context. We complement part appearance with the object appearance, its class, and the expected relative location of the parts inside it. We significantly outperform approaches that use part appearance alone in this challenging task.
en
dc.identifier.uri
http://hdl.handle.net/1842/28842
dc.language.iso
en
dc.publisher
The University of Edinburgh
en
dc.relation.hasversion
Gonzalez-Garcia, A., Modolo, D., and Ferrari, V. (2017). Do semantic parts emerge in convolutional neural networks? International Journal of Computer Vision.
en
dc.relation.hasversion
Gonzalez-Garcia, A., Modolo, D., and Ferrari, V. (2017). Objects as context for part detection. arXiv preprint arXiv:1703.09529.
en
dc.relation.hasversion
Gonzalez-Garcia, A., Vezhnevets, A., and Ferrari, V. (2015). An active search strategy for efficient object class detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
en
dc.subject
object detection
en
dc.subject
automatic image
en
dc.subject
object class detection
en
dc.subject
window classifiers
en
dc.subject
convolutional neural networks
en
dc.title
Image context for object detection, object context for part detection
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
Doctoral
en
dc.type.qualificationname
PhD Doctor of Philosophy
en

Files

Original bundle

Now showing 1 - 1 of 1
Name:
Gonzalez-Garcia2018.pdf
Size:
59.2 MB
Format:
Adobe Portable Document Format

This item appears in the following Collection(s)