Convolutional Neural Networks for Visual Tracking 2015 02
























- Slides: 24
Convolutional Neural Networks for Visual Tracking 2015. 02. 17 Computer Vision Lab. 남현섭
Contents • H. Li, Y. Li, and F. Robust Online Visual Tracking with an Single Convolutional Neural Network, ACCV 2014 • N. Wang, S. Li, A. Gupta and D. Y. Yeung, Transferring Rich Feature Hierarchies for Robust Visual Tracking, ar. Xiv 2015
H. Li, Y. Li, and F. Robust Online Visual Tracking with an Single Convolutional Neural Network ACCV 2014
Contributions • Online learning of a single CNN • Robust, structural, truncated loss function • Iterative SGD method with a temporal sampling
CNN Architecture
Structural Loss Function • Increase the number of available training samples by introducing the structural importance • Traditional loss function • Structural loss function Structural importance CNN loss overlapping ratio
Structural Loss With a Robust Term • Regularize the ordinary structural loss using a set of positive instances rather than only one positive instance.
Structural Loss With a Robust Term • Predicted state • Positive instance set • Robust term • Structural loss function with a robust term
Structural Loss With a Robust Term and the Truncated Norm •
Online Learning of CNN • Temporal Sampling – Learn a discriminative model on a long-term positive set and a shortterm negative set. • Iterative Stochastic Gradient Descent – Iteratively update the convolutional layers, and jointly update the fully connected layers.
Online Learning of CNN
Online Learning of CNN
Results
Results (CNN tracker variations)
N. Wang, S. Li, A. Gupta and D. Y. Yeung Transferring Rich Feature Hierarchies for Robust Visual Tracking ar. Xiv 2015
Contributions • Transferring a pretrained CNN into online tracking • Structured output corresponding to a probability map
Objectness Pretraining • Pretrain a CNN to learn generic features for distinguishing objects from non-objects
Online Tracking – State Determination – Sample Collection – Differentially-paced Fine-tuning
Online Tracking • State Determination average
Online Tracking •
Results
Results (non-rigid object)
Discussion • Transferring a pretrained CNN • Employing structural information • Modifying loss function • Efficient online learning • CNN update policy
On-going research Finetuned fully connected layers Pretrained convolutional layers conv+pool 3 96 conv+pool 256 conv+pool conv 384 256 64 2 Probability map 256 64 2