PCA and eigenfaces T11 Computer Vision University of

Face detection and recognition Detection Recognition “Sally”

Consumer application: i. Photo 2009 http: //www. apple. com/ilife/iphoto/

Consumer application: i. Photo 2009 It can be trained to recognize pets! http: //www.

Consumer application: i. Photo 2009 i. Photo decides that this is a face

Outline • Face recognition – Eigenfaces • Face detection – The Viola and Jones

The space of all face images • When viewed as vectors of pixel values,

The space of all face images • Insight into principal component analysis (PCA). •

Principal Component Analysis • Given: N data points x 1, … , x. N

Principal Component Analysis • The variance of the projected data: Projection of data point

Principal Component Analysis • We now estimate vector u maximizing the variance: subject to:

Principal Component Analysis • The direction that captures the maximum covariance of the data

Principal Component Analysis • By using all of the d eigenvectors, we transform the

Principal Component Analysis The above is true because the eigenvectors of a symmetric positive

Principal Component Analysis • Property: the covariance matrix of the transformed data is diagonal

Eigenfaces: Key idea • Assume that most face images lie on a low-dimensional subspace

Eigenfaces example Top eigenvectors Mean: μ

Eigenfaces example • Face x in “face space” coordinates: • Reconstruction: = +

Eigenfaces example • Any face of the training set may be expressed as a

Eigenfaces example Intra-personal subspace (variations in expression) Extra-personal subspace (variations between people)

Face Recognition with eigenfaces Process the training images: • Find mean µ and covariance

Application: MRF features for texture representation

Application: view-based modeling • PCA on various views of a 3 D object. H.

Application: view-based modeling • Subspace of the first three PC (varying view angle). H.

Application: view-based modeling • View angle recognition. H. Murase and S. Nayar. Visual Learning

Application: view-based modeling • Extension to more 3 D objects with varying pose and

Limitations • Global appearance method – not robust to misalignment, background variation.

Limitations • Projection may suppress important detail – The smallest variance directions may not

Limitations • PCA does not take the discriminative task into account – Typically, we

Canonical Variates • Also called “Linear Discriminant Analysis” • A labeled training set is

Canonical Variates • We have a set of vectorized images xi (i=1, 2…, N).

Canonical Variates • Between class covariance: with Ni being the number of images in

Canonical Variates Using the same argument as in PCA (and graph-based segmentation) we seek

Canonical Variates • 71 views of 10 objects at a variety of poses at

Canonical Variates • The first two canonical variates. The clusters are tight and well

Slides: 39

Download presentation

PCA and eigenfaces T-11 Computer Vision University of Ioannina Christophoros Nikou Images and slides from: James Hayes, Brown University, Computer Vision course D. Forsyth and J. Ponce. Computer Vision: A Modern Approach, Prentice Hall, 2003. Computer Vision course by Svetlana Lazebnik, University of North Carolina at Chapel Hill. Computer Vision course by Michael Black, Brown University. Research page of Antonio Torralba, MIT.

Face detection and recognition

Face detection and recognition Detection Recognition “Sally”

Consumer application: i. Photo 2009 http: //www. apple. com/ilife/iphoto/

Consumer application: i. Photo 2009 It can be trained to recognize pets! http: //www. maclife. com/article/news/iphotos_faces_recognizes_cats

Consumer application: i. Photo 2009 i. Photo decides that this is a face

Outline • Face recognition – Eigenfaces • Face detection – The Viola and Jones algorithm. M. Turk and A. Pentland. "Eigenfaces for recognition". Journal of Cognitive Neuroscience, 3(1): 71– 86, 1991. Also in CVPR 1991. P. Viola and M. Jones. Robust real-time face detection. International Journal of Computer Vision, 57(2), 2004. Also in CVPR 2001.

The space of all face images • When viewed as vectors of pixel values, face images are extremely high-dimensional – 100 x 100 image = 10, 000 dimensions • However, relatively few 10, 000 dimensional vectors correspond to valid face images. • Is there a compact and effective representation of the subspace of face images?

The space of all face images • Insight into principal component analysis (PCA). • We want to construct a low-dimensional linear subspace that best explains the variation in the set of face images.

Principal Component Analysis • Given: N data points x 1, … , x. N in Rd • We want to find a new set of features that are linear combinations of the original ones: w(xi) = u. T(xi – µ) (µ: mean of data points) • What unit vector u in Rd captures the most variance of the data?

Principal Component Analysis • The variance of the projected data: Projection of data point Covariance matrix of data

Principal Component Analysis • We now estimate vector u maximizing the variance: subject to: because any multiple of u maximizes the objective function. • The Lagrangian is leading to the solution: which is the eigenvector corresponding to the largest eigenvalue of Σ.

Principal Component Analysis • The direction that captures the maximum covariance of the data is the eigenvector corresponding to the largest eigenvalue of the data covariance matrix. • The top k orthogonal directions that capture the most variance of the data are the k eigenvectors corresponding to the k largest eigenvalues.

Principal Component Analysis • By using all of the d eigenvectors, we transform the original data into a new orthogonal basis: • Most important: we may recover the original coordinates as a linear combination of the eigenvectors:

Principal Component Analysis The above is true because the eigenvectors of a symmetric positive semi-definite matrix are orthonormal: and

Principal Component Analysis • Property: the covariance matrix of the transformed data is diagonal – The transformed data are decorrelated:

Eigenfaces: Key idea • Assume that most face images lie on a low-dimensional subspace determined by the first k (k<d) directions of maximum variance. • Use PCA to determine the vectors or “eigenfaces” u 1, …uk that span that subspace. • Represent all face images in the dataset as linear combinations of eigenfaces.

Eigenfaces example Training images

Eigenfaces example Top eigenvectors Mean: μ

Eigenfaces example • Face x in “face space” coordinates: • Reconstruction: = +

Eigenfaces example • Any face of the training set may be expressed as a linear combination of a number of eigenfaces. • In matrix-vector form: Approximation error: = +

Eigenfaces example

Eigenfaces example Intra-personal subspace (variations in expression) Extra-personal subspace (variations between people)

Face Recognition with eigenfaces Process the training images: • Find mean µ and covariance matrix Σ. • Find k principal components (eigenvectors of Σ) u 1, …uk. • Project each training image xi onto subspace spanned by principal components: (wi 1, …, wik) = (u 1 T(xi – µ), … , uk. T(xi – µ)) Given a new image x: • Project onto subspace: (w 1, …, wk) = (u 1 T(x – µ), … , uk. T(x – µ)). • Optional: check the reconstruction error x – x^ to determine whether the new image is really a face. • Classify as closest training face in k-dimensional subspace. M. Turk and A. Pentland, Face Recognition using Eigenfaces, CVPR 1991

Application: MRF features for texture representation

Application: view-based modeling • PCA on various views of a 3 D object. H. Murase and S. Nayar. Visual Learning and Recognition of 3 D objects from appearance, International Journal of Computer Vision, Vol. 14, pp. 5 -24, 1995.

Application: view-based modeling • Subspace of the first three PC (varying view angle). H. Murase and S. Nayar. Visual Learning and Recognition of 3 D objects from appearance, International Journal of Computer Vision, Vol. 14, pp. 5 -24, 1995.

Application: view-based modeling • View angle recognition. H. Murase and S. Nayar. Visual Learning and Recognition of 3 D objects from appearance, International Journal of Computer Vision, Vol. 14, pp. 5 -24, 1995.

Application: view-based modeling • Extension to more 3 D objects with varying pose and illumination. H. Murase and S. Nayar. Visual Learning and Recognition of 3 D objects from appearance, International Journal of Computer Vision, Vol. 14, pp. 5 -24, 1995.

Limitations • Global appearance method – not robust to misalignment, background variation.

Limitations • Projection may suppress important detail – The smallest variance directions may not be unimportant. PCA assumes Gaussian data. The shape of this dataset is not well described by its principal components

Limitations • PCA does not take the discriminative task into account – Typically, we wish to compute features that allow good discrimination. • Projection onto the major axis can not separate the green from the red data. • The second principal component captures what is required for classification.

Canonical Variates • Also called “Linear Discriminant Analysis” • A labeled training set is necessary. • We wish to choose linear functions of the features that allow good discrimination. – Assume class-conditional covariances are the same. – We seek for linear feature maximizing the spread of class means for a fixed within-class variance.

Canonical Variates • We have a set of vectorized images xi (i=1, 2…, N). • The images belong to C categories each having a mean μj, ( j=1, 2, …, C ). • The mean of the class means:

Canonical Variates • Between class covariance: with Ni being the number of images in class i. • Within class covariance:

Canonical Variates Using the same argument as in PCA (and graph-based segmentation) we seek vector u maximizing which is equivalent to s. t. The solution is the top eigenvector of the generalized eigenvalue problem:

Canonical Variates • 71 views of 10 objects at a variety of poses at a black background. • 60 images were used to determine a set of canonical variates. • 11 images were used for testing.

Canonical Variates • The first two canonical variates. The clusters are tight and well separated (a different symbol per object is used). • We could probably get a quite good classification.