Learning Structured Prediction Models in Computer Vision

Learning Structured Prediction Models in Computer Vision
Author :
Publisher :
Total Pages : 119
Release :
ISBN-10 : OCLC:950740988
ISBN-13 :
Rating : 4/5 (88 Downloads)

Book Synopsis Learning Structured Prediction Models in Computer Vision by : Fayao Liu

Download or read book Learning Structured Prediction Models in Computer Vision written by Fayao Liu and published by . This book was released on 2015 with total page 119 pages. Available in PDF, EPUB and Kindle. Book excerpt: Most of the real world applications can be formulated as structured learning problems, in which the output domain can be arbitrary, e.g., a sequence or a graph. By modelling the structures (constraints and correlations) of the output variables, structured learning provides a more general learning scheme than simple binary classification or regression models. This thesis is dedicated to learning such structured prediction models, i.e., conditional random fields (CRFs) and their applications in computer vision. CRFs are popular probabilistic graphical models, which model the conditional distribution of the output variables given the observations. They play an essential role in the computer vision community and have found wide applications in various vision tasks-semantic labelling, object detection, pose estimation, to name a few. Specifically, we here focus on two challenging tasks in this thesis: image segmentation (also referred as semantic labelling) and depth estimation from single monocular images, which represent two types of CRFs models-discrete and continuous. In summary, we made three contributions in this thesis. First, we present a new approach to exploit tree potentials in CRFs for the task of image segmentation. This method combines the advantages of both CRFs and decision trees. Different from traditional methods, in which the potential functions of CRFs are defined as a linear combination of some pre-defined parametric models, we formulate the unary and the pairwise potentials as nonparametric forests-ensembles of decision trees, and learn the ensemble parameters and the trees in a unified optimization problem within the large-margin framework. In this fashion, we easily achieve nonlinear learning of potential functions on both unary and pairwise terms in CRFs. Moreover, we learn class-wise decision trees for each object that appears in the image. We further show that this challenging optimization can be efficiently solved by combining a modified column generation and cutting-planes techniques. Experimental results on both binary and multi-class segmentation datasets demonstrate the power of the learned nonlinear nonparametric potentials. Second, we propose to model the unary potentials of the CRFs using a convolutional neural network (CNN). The deep CNN is trained on the large-scale ImageNet dataset and transferred to image segmentation here for constructing unary potentials of super-pixels. The CRFs parameters are then learned within the max-margin framework using structured support vector machines (SSVM). To fully exploit context information in inference, we construct spatially related co-occurrence pairwise potentials and incorporate them into the energy function. This prefers labellings of object pairs that frequently co-occur in a certain spatial layout and at the same time avoids implausible labellings during the inference. Extensive experiments on binary and multi-class segmentation benchmarks demonstrate the potentials of the proposed method. Third, different from the previous two works, we address the problem of continuous CRFs learning, applied to the task of depth estimation from single images. Specifically, we formulate and learn the unary and pairwise potentials of a continuous CRFs model with CNN networks in a unified framework. We term this new method as deep convolutional neural fields, abbreviated as DCNF. It jointly explores the capacity of deep CNN and continuous CRFs. The proposed method can be used for depth estimation of general scenes with no geometric priors nor any extra information injected. Specifically, in our case, the integral of the partition function can be calculated in a closed form such that we can exactly solve the log-likelihood maximization. Moreover, solving the inference problem for predicting depths of a test image is highly efficient as closed-form solutions exist. We then further propose an equally effective model based on fully convolutional networks and a novel superpixel pooling method, which is ~ 10 times faster, to speedup the patch-wise convolutions in the deep model. With this more efficient model, we are able to design very deep networks to pursue further performance gain. Experiments on both indoor and outdoor scene datasets demonstrate that the proposed method significantly outperforms state-of-the-art depth estimation approaches. We also show experimentally that the proposed method generalizes well to depth estimations of images unrelated to the training data. This indicates the potential of our method for benefiting other vision tasks.


Learning Structured Prediction Models in Computer Vision Related Books

Learning Structured Prediction Models in Computer Vision
Language: en
Pages: 119
Authors: Fayao Liu
Categories: Computer vision
Type: BOOK - Published: 2015 - Publisher:

DOWNLOAD EBOOK

Most of the real world applications can be formulated as structured learning problems, in which the output domain can be arbitrary, e.g., a sequence or a graph.
Advanced Structured Prediction
Language: en
Pages: 430
Authors: Sebastian Nowozin
Categories: Computers
Type: BOOK - Published: 2014-12-05 - Publisher: MIT Press

DOWNLOAD EBOOK

An overview of recent work in the field of structured prediction, the building of predictive machine learning models for interrelated and dependent outputs. The
Structured Learning and Prediction in Computer Vision
Language: en
Pages: 195
Authors: Sebastian Nowozin
Categories: Computers
Type: BOOK - Published: 2011 - Publisher: Now Publishers Inc

DOWNLOAD EBOOK

Structured Learning and Prediction in Computer Vision introduces the reader to the most popular classes of structured models in computer vision.
Advanced Structured Prediction
Language: en
Pages: 430
Authors: Sebastian Nowozin
Categories: Computers
Type: BOOK - Published: 2014-11-21 - Publisher: MIT Press

DOWNLOAD EBOOK

An overview of recent work in the field of structured prediction, the building of predictive machine learning models for interrelated and dependent outputs. The
Building More Expressive Structured Models
Language: en
Pages:
Authors: Yujia Li
Categories:
Type: BOOK - Published: 2017 - Publisher:

DOWNLOAD EBOOK

Structured data and structured problems are common in machine learning, and they appear in many applications from computer vision, natural language understandin