Computer vision models learning and inference Chapter 14

  • Slides: 60
Download presentation
Computer vision: models, learning and inference Chapter 14 The pinhole camera Please send errata

Computer vision: models, learning and inference Chapter 14 The pinhole camera Please send errata to s. prince@cs. ucl. ac. uk

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems – Exterior orientation problem – Camera calibration – 3 D reconstruction • Applications Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 2

Motivation Sparse stereo reconstruction Compute the depth at a set of sparse matching points

Motivation Sparse stereo reconstruction Compute the depth at a set of sparse matching points Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 3

Pinhole camera Real camera image is inverted Instead model impossible but more convenient virtual

Pinhole camera Real camera image is inverted Instead model impossible but more convenient virtual image Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 4

Pinhole camera terminology Computer vision: models, learning and inference. © 2011 Simon J. D.

Pinhole camera terminology Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 5

Normalized Camera By similar triangles: Computer vision: models, learning and inference. © 2011 Simon

Normalized Camera By similar triangles: Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 6

Are real cameras normalized? • Unfortunately real cameras are not normalized • They have

Are real cameras normalized? • Unfortunately real cameras are not normalized • They have different sizes, shapes and configurations that lead differences between cameras • This means that we don’t know the world ray that corresponds to a particular pixel • The calibration of a camera precisely defines these differences 7

Pixel Size Field of View Focal length combines two issues Computer vision: models, learning

Pixel Size Field of View Focal length combines two issues Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 8

Focal length parameters Can model both • the effect of the distance to the

Focal length parameters Can model both • the effect of the distance to the focal plane • the density of the receptors with a single focal length parameter f In practice, the receptors may not be square: So use different focal length parameter for x and y dims Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 9

Offset parameters • Current model assumes that pixel (0, 0) is where the principal

Offset parameters • Current model assumes that pixel (0, 0) is where the principal ray strikes the image plane (i. e. the center) • Model offset to center Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 10

Skew parameter • Finally, add skew parameter • Accounts for image plane being not

Skew parameter • Finally, add skew parameter • Accounts for image plane being not exactly perpendicular to the principal ray Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 11

Position and orientation of camera • Position w=(u, v, w)T of point in the

Position and orientation of camera • Position w=(u, v, w)T of point in the world is generally not expressed in the frame of reference of the camera. • Transform using 3 D transformation or Point in frame of reference of camera Point in frame of reference of world Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 12

Complete pinhole camera model • Intrinsic parameters (stored as intrinsic matrix) • Extrinsic parameters

Complete pinhole camera model • Intrinsic parameters (stored as intrinsic matrix) • Extrinsic parameters Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 13

Complete pinhole camera model For short: Add noise – uncertainty in localizing feature in

Complete pinhole camera model For short: Add noise – uncertainty in localizing feature in image Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 14

http: //en. wikipedia. org/wiki/Multivariate_normal_distribution 15

http: //en. wikipedia. org/wiki/Multivariate_normal_distribution 15

One additional complication… radial distortion Computer vision: models, learning and inference. © 2011 Simon

One additional complication… radial distortion Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 16

Other types of projection • The standard camera we study maps an array of

Other types of projection • The standard camera we study maps an array of pixels to a family of rays using the pinhole projection • But, there are lots of intriguing variants, I’ll just mention a few.

360 degree field of view… • Basic approach – Take a photo of a

360 degree field of view… • Basic approach – Take a photo of a parabolic mirror with an orthographic lens (Nayar) – Or buy one a lens from a variety of omnicam manufacturers… • See http: //www. cis. upenn. edu/~kostas/omni. html

Tilt-shift http: //www. northlight-images. co. uk/article_pages/tilt_and_shift_ts-e. html http: //timelapseblog. com/2011/01/15/1 1 -best-tilt-shift-videos/ Titlt-shift images

Tilt-shift http: //www. northlight-images. co. uk/article_pages/tilt_and_shift_ts-e. html http: //timelapseblog. com/2011/01/15/1 1 -best-tilt-shift-videos/ Titlt-shift images from Olivo Barbieri and Photoshop imitations

wikipedia

wikipedia

Rotating sensor (or object) Rollout Photographs © Justin Kerr http: //research. mayavase. com Also

Rotating sensor (or object) Rollout Photographs © Justin Kerr http: //research. mayavase. com Also known as “cyclographs”, “peripheral images”

Photofinish

Photofinish

Random Lens Fergus

Random Lens Fergus

Grossman

Grossman

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems – Exterior orientation problem – Camera calibration – 3 D reconstruction • Applications Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 25

Problem 1: Learning extrinsic parameters (exterior orientation) Use maximum likelihood: Computer vision: models, learning

Problem 1: Learning extrinsic parameters (exterior orientation) Use maximum likelihood: Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 26

Problem 2 – Learning intrinsic parameters (calibration) Use maximum likelihood: Computer vision: models, learning

Problem 2 – Learning intrinsic parameters (calibration) Use maximum likelihood: Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 27

Calibration • Use 3 D target with known 3 D points Computer vision: models,

Calibration • Use 3 D target with known 3 D points Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 28

Problem 3 – Inferring 3 D points (triangulation / reconstruction) Use maximum likelihood: Computer

Problem 3 – Inferring 3 D points (triangulation / reconstruction) Use maximum likelihood: Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 29

Solving the problems • None of these problems can be solved in closed form

Solving the problems • None of these problems can be solved in closed form • Can apply non-linear optimization to find best solution but slow and prone to local minima • Solution – convert to a new representation (homogeoneous coordinates) where we can solve in closed form. • Caution! We are not solving the true problem – finding global minimum of wrong problem. But can use as starting point for non-linear optimization of true problem Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 30

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems – Exterior orientation problem – Camera calibration – 3 D reconstruction • Applications Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 31

Homogeneous coordinates Convert 2 D coordinate to 3 D To convert back Computer vision:

Homogeneous coordinates Convert 2 D coordinate to 3 D To convert back Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 32

Geometric interpretation of homogeneous coordinates Computer vision: models, learning and inference. © 2011 Simon

Geometric interpretation of homogeneous coordinates Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 33

Pinhole camera in homogeneous coordinates Camera model: In homogeneous coordinates: (linear!) Computer vision: models,

Pinhole camera in homogeneous coordinates Camera model: In homogeneous coordinates: (linear!) Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 34

Pinhole camera in homogeneous coordinates Writing out these three equations Eliminate l to retrieve

Pinhole camera in homogeneous coordinates Writing out these three equations Eliminate l to retrieve original equations Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 35

Adding in extrinsic parameters Or for short: Or even shorter: Computer vision: models, learning

Adding in extrinsic parameters Or for short: Or even shorter: Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 36

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems – Exterior orientation problem – Camera calibration – 3 D reconstruction • Applications Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 37

Problem 1: Learning extrinsic parameters (exterior orientation) Use maximum likelihood: Non-convex 38

Problem 1: Learning extrinsic parameters (exterior orientation) Use maximum likelihood: Non-convex 38

Exterior orientation Start with camera equation in homogeneous coordinates Pre-multiply both sides by inverse

Exterior orientation Start with camera equation in homogeneous coordinates Pre-multiply both sides by inverse of camera calibration matrix Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 39

Exterior orientation The third equation gives us an expression for l Substitute back into

Exterior orientation The third equation gives us an expression for l Substitute back into first two lines Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 40

Exterior orientation Linear equation – two equations per point – form system of equations

Exterior orientation Linear equation – two equations per point – form system of equations Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 41

Exterior orientation Minimum direction problem of the form Find minimum of subject to To

Exterior orientation Minimum direction problem of the form Find minimum of subject to To solve, compute the SVD set to the last column of . . , and then Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 42

Converting parameters into proper form Now we extract the values of and from .

Converting parameters into proper form Now we extract the values of and from . Problem: the scale is arbitrary and the rows and columns of the rotation matrix may not be orthogonal. Solution: compute SVD choose. and then Use the ratio between the rotation matrix before and after to rescale the translation Use these estimates for start of non-linear optimisation. Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 43

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems – Exterior orientation problem – Camera calibration – 3 D reconstruction • Applications Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 44

Problem 2 – Learning intrinsic parameters (calibration) Use maximum likelihood: Computer vision: models, learning

Problem 2 – Learning intrinsic parameters (calibration) Use maximum likelihood: Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 45

Calibration One approach (not very efficient) is to alternately • Optimize extrinsic parameters for

Calibration One approach (not very efficient) is to alternately • Optimize extrinsic parameters for fixed intrinsic • Optimize intrinsic parameters for fixed extrinsic Then use non-linear optimization. Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 46

Intrinsic parameters Maximum likelihood approach This is a least squares problem. Computer vision: models,

Intrinsic parameters Maximum likelihood approach This is a least squares problem. Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 47

Intrinsic parameters The function is linear w. r. t. intrinsic parameters. Can be written

Intrinsic parameters The function is linear w. r. t. intrinsic parameters. Can be written in form Now solve least squares problem Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 48

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems – Exterior orientation problem – Camera calibration – 3 D reconstruction • Applications Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 49

Problem 3 – Inferring 3 D points (triangulation / reconstruction) Use maximum likelihood: Computer

Problem 3 – Inferring 3 D points (triangulation / reconstruction) Use maximum likelihood: Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 50

Reconstruction Write jth pinhole camera in homogeneous coordinates: Pre-multiply with inverse of intrinsic matrix

Reconstruction Write jth pinhole camera in homogeneous coordinates: Pre-multiply with inverse of intrinsic matrix Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 51

Reconstruction Last equations gives Substitute back into first two equations Re-arranging get two linear

Reconstruction Last equations gives Substitute back into first two equations Re-arranging get two linear equations for [u, v, w] Solve using >1 cameras and then use non-linear optimization Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 52

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems

Structure • • Pinhole camera model Three geometric problems Homogeneous coordinates Solving the problems – Exterior orientation problem – Camera calibration – 3 D reconstruction • Applications Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 53

Depth from structured light Computer vision: models, learning and inference. © 2011 Simon J.

Depth from structured light Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 54

Depth from structured light Computer vision: models, learning and inference. © 2011 Simon J.

Depth from structured light Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 55

Depth from structured light Computer vision: models, learning and inference. © 2011 Simon J.

Depth from structured light Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 56

Shape from silhouette Computer vision: models, learning and inference. © 2011 Simon J. D.

Shape from silhouette Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 57

Shape from silhouette Computer vision: models, learning and inference. © 2011 Simon J. D.

Shape from silhouette Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 58

Shape from silhouette Computer vision: models, learning and inference. © 2011 Simon J. D.

Shape from silhouette Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 59

Conclusion • Pinhole camera model is a non-linear function who takes points in 3

Conclusion • Pinhole camera model is a non-linear function who takes points in 3 D world and finds where they map to in image • Parameterized by intrinsic and extrinsic matrices • Difficult to estimate intrinsic/extrinsic/depth because non-linear • Use homogeneous coordinates where we can get closed form solutions (initial solns only) Computer vision: models, learning and inference. © 2011 Simon J. D. Prince 60