Lecture 10 Causal Estimation of 3 D Structure

VISION as a SENSOR machine to INTERACT with the environment NEED to estimate relative

TRADEOFFS SFM LARGE BASELINE MASKS © 2004 EASY Invitation to 3 D vision CORRESPONDENCE

TRADEOFFS SFM LARGE BASELINE SMALL BASELINE MASKS © 2004 EASY HARD/IMPOSSIBLE Invitation to 3

WHAT DOES IT TAKE ? SFM LARGE BASELINE SMALL BASELINE MASKS © 2004 EASY

Structure and motion as a filtering problem • • • Given measurements of the

Difficulties … • • Model is non-linear (output map = projection) State-space is non-linear!

Observability • Equivalent class of state-space trajectories generate the same measurements • Fix, e.

Local coordinatization of the state space MASKS © 2004 Invitation to 3 D vision

Minimal realization • Now it looks very much like: • And we are looking

EFK vs. particle filter? • • For single rigid body/static scene, expect unimodal posterior

In practice … • • • Adding/removing features (subfilters) Multiple motions/outliers (M-filter, innovation tests)

Slides: 15

Download presentation

VISION as a SENSOR machine to INTERACT with the environment NEED to estimate relative 3 D MOTION 3 D SHAPE (TASK) REAL-TIME CAUSAL processing representation of SHAPE (only supportive of representation of motion) POINT-FEATURES MASKS © 2004 Invitation to 3 D vision

WHAT DOES IT TAKE ? SFM LARGE BASELINE SMALL BASELINE MASKS © 2004 EASY HARD/IMPOSSIBLE INTEGRATE visual information OVER TIME Invitation to 3 D vision CORRESPONDENCE HARD/IMPOSSIBLE EASY OCCLUSIONS! GLOBAL LOCAL

Setup and notation Temporal evolution:

Structure and motion as a filtering problem • • • Given measurements of the “output” (feature point positions) Given modeling assumptions about the “input” (acceleration = noise) Estimate the “state” (3 D structure, pose, velocity) MASKS © 2004 Invitation to 3 D vision

Difficulties … • • Model is non-linear (output map = projection) State-space is non-linear! (SE(3)) Noise: need to specify what we mean by “estimate” Even without noise: model is not observable! MASKS © 2004 Invitation to 3 D vision

Observability • Equivalent class of state-space trajectories generate the same measurements • Fix, e. g. , the direction of 3 points, and the depth of one point (Gauge transformation) MASKS © 2004 Invitation to 3 D vision

EFK vs. particle filter? • • For single rigid body/static scene, expect unimodal posterior No need to estimate entire density; point estimate suffices Robust (M-) version of EKF works well in practice … … and in real time for a few hundred feature points MASKS © 2004 Invitation to 3 D vision

In practice … • • • Adding/removing features (subfilters) Multiple motions/outliers (M-filter, innovation tests) Tracking drift (reset with wide-baseline matching) Switching the reference features (hard! Causes unavoidable global drift) Global registration (maintain DB of lost features) MASKS © 2004 Invitation to 3 D vision