Machine Learning for Signal Processing
Fundamentals of Linear Algebra
Class 2, 22 Jan 2015
Instructor: Bhiksha Raj (11-755/18-797)

Overview
• Vectors and matrices
• Basic vector/matrix operations
• Various matrix types
• Projections

Book
• Fundamentals of Linear Algebra, Gilbert Strang
• Important to be very comfortable with linear algebra
– Appears repeatedly in the form of eigen analysis, SVD, factor analysis
– Appears through various properties of matrices that are used in machine learning
– Often used in the processing of data of various kinds
– Will use sound and images as examples
• Today's lecture: definitions
– Very small subset of all that's used
– Important subset, intended to help you recollect

Incentive to use linear algebra
• Simplified notation!
• Easier intuition
– Really convenient geometric interpretations
• Easy code translation!

    % loop form: accumulate the scalar x*A*y term by term
    c = 0;
    for i = 1:n
      for j = 1:m
        c = c + x(i) * a(i,j) * y(j);
      end
    end

    % matrix form: x is 1 x n, A is n x m, y is m x 1
    C = x * A * y;

And other things you can do
[Figure: a spectrogram (frequency vs. time) from Bach's Fugue in G minor; an image transformed by rotation + projection + scaling + perspective; a decomposition (NMF)]
• Manipulate data
• Extract information from data
• Represent data
• Etc.

Scalars, vectors, matrices, …
• A scalar a is a number
– a = 2, a = 3.14, a = -1000, etc.
• A vector a is a linear arrangement of a collection of scalars
• A matrix A is a rectangular arrangement of a collection of scalars

Vectors in the abstract
• Ordered collection of numbers
– Examples: [3 4 5], [a b c d], …
– [3 4 5] != [4 3 5]: order is important
• Typically viewed as identifying (the path from origin to) a location in an N-dimensional space
[Figure: the points (3, 4, 5) and (4, 3, 5) plotted on x, y, z axes]

Vectors in reality
• Vectors usually hold sets of numerical attributes
– X, Y, Z coordinates: [1, 2, 0]
– [height(cm) weight(kg)]: [175 72]
– A location in Manhattan: [3 av 33 st]
• A series of daily temperatures
• Samples in an audio signal
• Etc.
[Figure: Manhattan map with locations marked as [avenue, street] vectors, e.g. [-2.5 av 6 st], [1 av 8 st], [2 av 4 st]]

Matrices
• Matrices can be square or rectangular
– Can hold data: images, collections of sounds, etc.
– Or represent operations, as we shall see
• A matrix can be a vertical stacking of row vectors
• Or a horizontal arrangement of column vectors

Dimensions of a matrix
• The matrix size is specified by the number of rows and columns
– c = 3 x 1 matrix: 3 rows and 1 column
– r = 1 x 3 matrix: 1 row and 3 columns
– S = 2 x 2 matrix
– R = 2 x 3 matrix
– Pacman = 321 x 399 matrix

Representing an image as a matrix
• 3 pacmen
• A 321 x 399 matrix
– Row and column = position
• A 3 x 128079 matrix
– Triples of x, y and value
• A 1 x 128079 vector
– "Unraveling" the matrix: values only, X and Y are implicit
• Note: all of these can be recast as the matrix that forms the image
– Representations 2 and 4 are equivalent
• The position is not represented
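A minimal MATLAB/Octave sketch of these representations; the random values here are only a stand-in for the actual image:

    I = rand(321, 399);            % 321 x 399 matrix: row/column = position
    [r, c] = ndgrid(1:321, 1:399);
    T = [c(:)'; r(:)'; I(:)'];     % 3 x 128079: (x, y, value) triples
    v = I(:)';                     % 1 x 128079: values only, position implicit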

Basic arithmetic operations
• Addition and subtraction
– Element-wise operations

Vector Operations
[Figure: the vectors (3, 4, 5) and (3, -2, -3) added head to tail, giving (6, 2, 2)]
• Operations tell us how to get from the origin to the result of the vector operations
– (3, 4, 5) + (3, -2, -3) = (6, 2, 2)

Vector norm
• Measure of how long a vector is
– For (3, 4, 5): length = sqrt(3^2 + 4^2 + 5^2)
– Represented as ||x||
• Geometrically, the shortest distance to travel from the origin to the destination
– As the crow flies
– Assuming Euclidean geometry
[Figure: Manhattan locations [-2 av 17 st] and [-6 av 10 st]]
• MATLAB syntax: norm(x)
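For example, in MATLAB/Octave:

    x = [3 4 5];
    norm(x)            % sqrt(3^2 + 4^2 + 5^2), approx. 7.07
    sqrt(sum(x.^2))    % the same thing, written out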

Transposition
• A transposed row vector becomes a column (and vice versa)
• A transposed matrix gets all its row (or column) vectors transposed in order
• MATLAB syntax: a'

Vector multiplication
• Multiplication by a scalar
• Dot product, or inner product
– Vectors must have the same number of elements
– Row vector times column vector = scalar
• Outer product, or vector direct product
– Column vector times row vector = matrix
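A quick sketch of the two products in MATLAB/Octave:

    a = [1 2 3];       % 1 x 3 row vector
    b = [4; 5; 6];     % 3 x 1 column vector
    a * b              % inner product: scalar (32)
    b * a              % outer product: 3 x 3 matrix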

Vector dot product
• Example, with coordinates in yards rather than ave/st:
– a = [200 1600], norm(a) ≈ 1612
– b = [770 300], norm(b) ≈ 826
• The dot product of the two vectors relates to the length of a projection
– How much of the first vector have we covered by following the second one?
– Must normalize by the length of the "target" vector: a·b / norm(a) ≈ 393 yd
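Checking the slide's numbers in MATLAB/Octave:

    a = [200 1600];        % yards
    b = [770 300];         % yards
    norm(a)                % approx. 1612
    norm(b)                % approx. 826
    dot(a, b) / norm(a)    % approx. 393: length of b's projection onto a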

Vector dot product
[Figure: spectra of the notes E, C and C2; x axis is frequency, y axis is sqrt(energy)]
• Vectors are spectra
– Energy at a discrete set of frequencies
– Actually 1 x 4096
– X axis is the index of the number in the vector: represents frequency
– Y axis is the value of the number in the vector: represents magnitude

Vector dot product
• How much of C is also in E?
– How much can you fake a C by playing an E?
– C·E / (|C| |E|) = 0.1
– Not very much
• How much of C is in C2?
– C·C2 / (|C| |C2|) = 0.5
– Not bad; you can fake it
• To do this, C, E, and C2 must be the same size
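These comparisons are normalized dot products (cosine similarities). A sketch, with random vectors standing in for the real 1 x 4096 spectra:

    C = rand(1, 4096);  E = rand(1, 4096);  C2 = rand(1, 4096);  % stand-ins
    sim = @(u, v) dot(u, v) / (norm(u) * norm(v));
    sim(C, E)     % about 0.1 for the real spectra: E explains little of C
    sim(C, C2)    % about 0.5 for the real spectra: C2 shares much of C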

Vector outer product
• The column vector is the spectrum
• The row vector is an amplitude modulation
• The outer product is a spectrogram
– Shows how the energy in each frequency varies with time
– The pattern in each column is a scaled version of the spectrum
– Each row is a scaled version of the modulation
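A sketch of the outer-product spectrogram, with made-up sizes:

    s = rand(4096, 1);              % column: the spectrum
    m = abs(sin(0.1 * (1:100)));    % row: the amplitude modulation
    S = s * m;                      % 4096 x 100 spectrogram
    % each column of S is a scaled copy of s; each row, a scaled copy of m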

Multiplying a vector by a matrix
• Generalization of vector scaling
– Left multiplication by the matrix (A x): a dot product with each row of the matrix
– Dimensions must match!!
• No. of columns of matrix = size of vector
• Result inherits the number of rows from the matrix

Multiplying a vector by a matrix
• Generalization of vector multiplication
– Right multiplication by the matrix (y A): a dot product with each column of the matrix
– Dimensions must match!!
• No. of rows of matrix = size of vector
• Result inherits the number of columns from the matrix
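The dimension rules from the last two slides, in MATLAB/Octave:

    A = rand(3, 4);
    x = rand(4, 1);    % length = no. of columns of A
    A * x              % 3 x 1: inherits A's rows
    y = rand(1, 3);    % length = no. of rows of A
    y * A              % 1 x 4: inherits A's columns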

Multiplication of vector space by matrix
• The matrix rotates and scales the space
– Including its own vectors

Multiplication of vector space by matrix
• The normals to the row vectors in the matrix become the new axes
– X axis = normal to the second row vector
• Scaled by the inverse of the length of the first row vector

Matrix Multiplication
• The k-th axis corresponds to the normal to the hyperplane represented by the 1..k-1, k+1..N-th row vectors in the matrix
– Any set of K-1 vectors represents a hyperplane of dimension K-1 or less
• The distance along the new axis equals the inner product with the k-th row vector
– Expressed in inverse lengths of the vector

Matrix Multiplication: Column space
• So much for spaces... what does multiplying a matrix by a vector really do?
• It mixes the column vectors of the matrix using the numbers in the vector
• The column space of the matrix is the complete set of all vectors that can be formed by mixing its columns
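A small check that A*y mixes A's columns:

    A = [1 0; 0 1; 1 1];
    y = [2; 3];
    A * y                        % equals...
    2 * A(:, 1) + 3 * A(:, 2)    % ...this column mix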

Matrix Multiplication: Row space
• Left multiplication (a row vector times the matrix) mixes the row vectors of the matrix
• The row space of the matrix is the complete set of all vectors that can be formed by mixing its rows

Matrix multiplication: Mixing vectors
• A physical example of the product X Y
– The three column vectors of the matrix X are the spectra of three notes
– The multiplying column vector Y is just a mixing vector
– The result is a sound that is the mixture of the three notes

Matrix multiplication: Mixing vectors
• (40000 x 2) times (2 x 1) gives (40000 x 1): two 200 x 200 images mixed into one
• Mixing two images
– The images are arranged as columns (position is not included)
– The result of the multiplication is rearranged as an image
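A sketch of the image mix, with random stand-in images and a made-up mixing vector:

    I1 = rand(200, 200);  I2 = rand(200, 200);   % stand-in images
    X = [I1(:) I2(:)];                 % 40000 x 2: each image is a column
    y = [0.7; 0.3];                    % 2 x 1 mixing vector
    mixed = reshape(X * y, 200, 200);  % 40000 x 1 result, back to an image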

Multiplying matrices
• Simple vector multiplication: vector outer product

Multiplying matrices
• Generalization of vector multiplication
– Each entry of the product is the dot product of a row of the first matrix and a column of the second
– Dimensions must match!!
• Columns of first matrix = rows of second
• Result inherits the number of rows from the first matrix and the number of columns from the second matrix

Multiplying matrices: Another view
• Simple vector multiplication: vector inner product

Matrix multiplication: another view
• AB = (outer product of the first column of A and the first row of B) + (outer product of the second column of A and the second row of B) + …
• I.e. a sum of outer products
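Checking the sum-of-outer-products view on a small example:

    A = rand(3, 2);  B = rand(2, 4);
    C1 = A * B;
    C2 = A(:,1) * B(1,:) + A(:,2) * B(2,:);
    norm(C1 - C2)     % zero, up to rounding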

Why is that useful?
[Figure: a matrix X of note spectra times a matrix Y of modulations]
• Sounds: three notes modulated independently

Matrix multiplication: Mixing modulated spectra
• Sounds: three notes modulated independently
– The columns of X hold the note spectra; the rows of Y hold each note's amplitude modulation
– X Y is the sum of outer products: one spectrogram layer per note, added up
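A sketch with made-up sizes, following the sum-of-outer-products view above:

    X = rand(4096, 3);    % columns: spectra of three notes (stand-ins)
    Y = rand(3, 200);     % rows: each note's amplitude modulation
    S = X * Y;            % 4096 x 200 spectrogram of the mixture
    % equivalently, the sum over notes k of X(:,k) * Y(k,:)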


Matrix multiplication: Image transition
• Image 1 fades out linearly
• Image 2 fades in linearly

Matrix multiplication: Image transition
• Each column is one image
– The columns represent a sequence of images of decreasing intensity
• Image 1 fades out linearly

Matrix multiplication: Image transition
• Image 2 fades in linearly

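The whole transition as one matrix product; a sketch with stand-in images:

    I1 = rand(200, 200);  I2 = rand(200, 200);   % stand-in images
    X = [I1(:) I2(:)];            % 40000 x 2
    w = linspace(1, 0, 10);       % image 1 fades out...
    W = [w; 1 - w];               % ...while image 2 fades in
    F = X * W;                    % 40000 x 10: each column is one frame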

The Identity Matrix
• An identity matrix is a square matrix where
– All diagonal elements are 1.0
– All off-diagonal elements are 0.0
• Multiplication by an identity matrix does not change vectors

Diagonal Matrix
• All off-diagonal elements are zero
• Diagonal elements are non-zero
• Scales the axes
– May flip axes
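For example:

    D = diag([2 -1]);    % stretch the x axis by 2, flip the y axis
    D * [1; 1]           % gives [2; -1]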

Diagonal matrix to transform images
• How?

Stretching
• Location-based representation
• The scaling matrix only scales the X axis
– The Y axis and pixel value are scaled by identity
• Not a good way of scaling

Stretching
• Better way: interpolate
– D = an interpolation matrix, where N is the width of the original image

Modifying color
• Scale only green

Permutation Matrix
[Figure: the point (3, 4, 5) before and after the axes are relabeled: new X = old Y, new Y = old Z, new Z = old X]
• A permutation matrix simply rearranges the axes
– The row entries are axis vectors in a different order
– The result is a combination of rotations and reflections
• The permutation matrix effectively permutes the arrangement of the elements in a vector
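One permutation matrix consistent with the figure's relabeling:

    P = [0 1 0;
         0 0 1;
         1 0 0];         % rows are the old axis vectors in a new order
    P * [3; 4; 5]        % gives [4; 5; 3]: the elements are permuted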

Permutation Matrix
• Reflections and 90-degree rotations of images and objects

Permutation Matrix
• Reflections and 90-degree rotations of images and objects
– Object represented as a matrix of 3-dimensional "position" vectors
– Positions identify each point on the surface

Rotation Matrix
[Figure: the point (x, y) rotated by angle θ to (x', y'); equivalently, the axes rotated by θ]
• A rotation matrix rotates the vector by some angle θ
• Alternately viewed, it rotates the axes
– The new axes are at an angle θ to the old ones
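The standard 2-D rotation matrix, as a sketch:

    theta = pi / 4;                     % rotate by 45 degrees
    R = [cos(theta) -sin(theta);
         sin(theta)  cos(theta)];
    R * [1; 0]                          % gives [cos(theta); sin(theta)]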

Rotating a picture
• Note the representation: a 3-row matrix
– Rotation only applies to the "coordinate" rows
– The value does not change
– Why is pacman grainy?

3-D Rotation
• 2 degrees of freedom: 2 separate angles (θ and α)
• What will the rotation matrix be?

Orthogonal/Orthonormal vectors
• Two vectors are orthogonal if they are perpendicular to one another
– A·B = 0
– A vector that is perpendicular to a plane is orthogonal to every vector on the plane
• Two vectors are orthonormal if
– They are orthogonal
– The length of each vector is 1.0
– Orthogonal vectors can be made orthonormal by normalizing their lengths to 1.0

Orthogonal matrices
• Orthogonal matrix: A A^T = A^T A = I
– The matrix is square
– All row vectors are orthonormal to one another (in 3-D, all three at 90 degrees to one another)
• Every vector is perpendicular to the hyperplane formed by all the other vectors
– All column vectors are also orthonormal to one another
– Observation: in an orthogonal matrix, if the length of the row vectors is 1.0, the length of the column vectors is also 1.0
– Observation: in an orthogonal matrix, no more than one row can have all entries with the same polarity (+ve or -ve)
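A quick check with a permutation matrix, which is orthogonal by the definition above:

    A = [0 1 0; 0 0 1; 1 0 0];
    A * A'     % identity
    A' * A     % identity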

Orthogonal and Orthonormal Matrices
• Orthogonal matrices retain the lengths of, and relative angles between, transformed vectors
– Essentially, they are combinations of rotations, reflections and permutations
– Rotation matrices and permutation matrices are all orthonormal

Orthogonal and Orthonormal Matrices
• If the vectors in the matrix are not unit length, it cannot be orthogonal
– A A^T != I and A^T A != I
– A A^T = diagonal or A^T A = diagonal, but not both
– If all the entries are the same length, we can get A A^T = A^T A = diagonal, though
• A non-square matrix cannot be orthogonal
– A A^T = I or A^T A = I, but not both

Matrix Operations: Properties
• A + B = B + A
• AB != BA

Matrix Inversion
• A matrix transforms an N-dimensional object to a different N-dimensional object
• What transforms the new object back to the original?
– The inverse transformation
• The inverse transformation is called the matrix inverse

Matrix Inversion
• T transforms; T^-1 transforms back
• The product of a matrix and its inverse is the identity matrix
– Transforming an object and then inverse-transforming it gives us back the original object

Matrix inversion (division)
• The inverse of matrix multiplication
– Not element-wise division!!
• Provides a way to "undo" a linear transformation
– The inverse of the identity matrix is itself
– The inverse of a diagonal matrix is diagonal
– The inverse of a rotation is a (counter-)rotation (its transpose!)
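Two of the special cases, checked numerically:

    inv(diag([2 5]))          % diagonal: diag([0.5 0.2])
    theta = pi / 6;
    R = [cos(theta) -sin(theta); sin(theta) cos(theta)];
    norm(R * R' - eye(2))     % zero: the transpose undoes the rotation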

Projections
• What would we see if the cone to the left were transparent and we looked at it from above, along the normal to the plane shown by the grid?
– Answer: the figure to the right
• How do we get this? Projection

Projection Matrix
[Figure: a vector dropped at 90 degrees onto the plane spanned by W1 and W2; the foot of the drop is its projection]
• Consider any plane specified by a set of vectors W1, W2, ...
– Or the matrix [W1 W2 ...]
– Any vector can be projected onto this plane
– The matrix that rotates and scales the vector so that it becomes its projection is a projection matrix

Projection Matrix
• Given a set of vectors W1, W2, ..., which form a matrix W = [W1 W2 ...]
• The projection matrix that transforms any vector X to its projection on the plane is
– P = W (W^T W)^-1 W^T
• We will visit matrix inversion shortly
• Magic: any set of vectors from the same plane, expressed as a matrix V, will give you the same projection matrix
– P = V (V^T V)^-1 V^T
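A sketch of the formula, with two made-up plane vectors:

    W = [1 0; 0 1; 1 1];          % columns W1, W2 span a plane in 3-D
    P = W * inv(W' * W) * W';     % P = W (W^T W)^-1 W^T
    x = [3; 4; 5];
    px = P * x;                   % projection of x onto the plane
    norm(P * px - px)             % zero: projecting again changes nothing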

Projections
• How?

Projections
• Draw any two vectors W1 and W2 that lie on the plane
– ANY two, so long as they have different angles
• Compose a matrix W = [W1 W2]
• Compose the projection matrix P = W (W^T W)^-1 W^T
• Multiply every point on the cone by P to get its projection
• View it
– I'm missing a step here: what is it?

Projections
• The projection actually projects the cone onto the plane, but you're still seeing the plane in 3-D
– The result of the projection is a 3-D vector
– P = W (W^T W)^-1 W^T is 3 x 3, so P * vector is 3 x 1
– The image must be rotated until the plane is the plane of the paper
• The Z axis in this case will always be zero and can be ignored
• How will you rotate it? (Remember, you know W1 and W2)

Projection matrix properties
• The projection of any vector that is already on the plane is the vector itself
– Px = x if x is on the plane
– If the object is already on the plane, there is no further projection to be performed
• The projection of a projection is the projection
– P(Px) = Px
– That is because Px is already on the plane
• Projection matrices are idempotent: P^2 = P
– Follows from the above

Projections: A more physical meaning
• Let W1, W2, ..., Wk be "bases"
• We want to explain our data in terms of these "bases"
– We often cannot do so exactly
– But we can explain a significant portion of it
• The portion of the data that can be expressed in terms of our vectors W1, ..., Wk is the projection of the data on the W1...Wk (hyper)plane
– In our previous example, the "data" were all the points on a cone, and the bases were vectors on the plane

Projection: an example with sounds
• The spectrogram (matrix) of a piece of music
• How much of the above music was composed of the above notes?
– I.e. how much can it be explained by the notes?

Projection: one note
• M = the spectrogram (matrix) of a piece of music; W = the note
• P = W (W^T W)^-1 W^T
• Projected spectrogram = P * M

Projection: one note, cleaned up
• M = spectrogram; W = note
• All matrix values below a threshold floored to zero

Projection: multiple notes
• M = spectrogram; W = the notes
• P = W (W^T W)^-1 W^T
• Projected spectrogram = P * M

Projection: multiple notes, cleaned up
• M = spectrogram; W = the notes
• P = W (W^T W)^-1 W^T
• Projected spectrogram = P * M, with values below a threshold floored to zero

Projection and Least Squares
• Projection actually computes a least squared error estimate
• For each vector V in the music spectrogram matrix (bases: note1, note2, note3):
– Approximation: V_approx = a*note1 + b*note2 + c*note3 + ...
– Error vector: E = V - V_approx
– Squared error energy for V: e(V) = norm(E)^2
– Total error = Σ_V e(V)
• Projection computes V_approx for all vectors such that the total error is minimized
– It does not give you "a", "b", "c", though
• That needs a different operation: the inverse / pseudo-inverse

Perspective
• The picture is the equivalent of "painting" the viewed scenery on a glass window
• Feature: the lines connecting any point in the scenery and its projection on the window merge at a common point
– The eye
– As a result, parallel lines in the scene apparently merge to a point

An aside on perspective...
• Perspective is the result of convergence of the image to a point
• Convergence can be to multiple points
– Top left: one-point perspective
– Top right: two-point perspective
– Right: three-point perspective

Representing Perspective
• Perspective was not always understood
• Carefully represented perspective can create illusions...

Central Projection
[Figure: a point (x', y', z') projected through the origin (the eye) onto a window, landing at (x, y); projection follows a line through the origin]
• The positions on the "window" are scaled along the line
• To compute the (x, y) position on the window, we need z (the distance of the window from the eye) and (x', y', z') (the location being projected)

Homogeneous Coordinates
• Represent points by a triplet
– Using the yellow window as reference: (x, y) becomes (x, y, 1)
– (x', y') becomes (x, y, c'), with c' = a'/a
– Locations on the line are generally represented as (x, y, c)
• x' = x/c', y' = y/c'

Homogeneous Coordinates in 3-D
• Points are represented using FOUR coordinates
– (X, Y, Z, c)
– "c" is the "scaling" factor that represents the distance of the actual scene
• Actual Cartesian coordinates:
– X_actual = X/c, Y_actual = Y/c, Z_actual = Z/c
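Recovering Cartesian coordinates from homogeneous ones, as a sketch:

    h = [6; 9; 12; 3];        % (X, Y, Z, c)
    h(1:3) / h(4)             % actual coordinates: [2; 3; 4]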

Homogeneous Coordinates
• In both cases, the constant "c" represents distance along the line with respect to a reference window
– In 2-D, the plane in which all points have values (x, y, 1)
• Changing the reference plane changes the representation
• I.e. there may be multiple homogeneous representations (x, y, c) that represent the same Cartesian point (x', y')

Matrix Rank and Rank-Deficient Matrices
• P * cone = a flattened cone
• Some matrices will eliminate one or more dimensions during transformation
– These are rank-deficient matrices
– The rank of the matrix is the dimensionality of the transformed version of a full-dimensional object

Matrix Rank and Rank-Deficient Matrices
[Figure: two flattening transforms of the same object, one of rank 2 and one of rank 1]

Non-square Matrices
• X = 2-D data; P = transform; PX = 3-D, rank 2
• Non-square matrices add or subtract axes
– More rows than columns add axes
• But do not increase the dimensionality of the data

Non-square Matrices
• X = 3-D data, rank 3; P = transform; PX = 2-D, rank 2
• Non-square matrices add or subtract axes
– More rows than columns add axes
• But do not increase the dimensionality of the data
– Fewer rows than columns remove axes
• And may reduce the dimensionality of the data
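A sketch of the first case: a tall matrix adds an axis without adding dimensionality:

    X = rand(2, 100);          % 100 points of 2-D data
    P = [1 0; 0 1; 1 1];       % 3 x 2 transform
    rank(P * X)                % 2: the 3-D output still lies on a plane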

The Rank of a Matrix
• The matrix rank is the dimensionality of the transformation of a full-dimensioned object in the original space
• The matrix can never increase dimensions
– Cannot convert a circle to a sphere, or a line to a circle
• The rank of a matrix can never be greater than the smaller of its two dimensions

Are Projections Full-Rank?
• M = spectrogram; W = notes; P = W (W^T W)^-1 W^T; projected spectrogram = P * M
• The original spectrogram can never be recovered
– P is rank deficient
• P explains all vectors in the new spectrogram as a mixture of only the 4 vectors in W
– There is a maximum of 4 linearly independent bases
– The rank of P is 4

The Rank of a Matrix
• Projected spectrogram = P * M
– Every vector in it is a combination of only 4 bases
• The rank of the matrix is the smallest number of bases required to describe the output
– E.g. if note no. 4 in P could be expressed as a combination of notes 1, 2 and 3, it provides no additional information
– Eliminating note no. 4 would give us the same projection
– The rank of P would be 3!

Matrix rank is unchanged by transposition
• If an N-dimensional object is compressed to a K-dimensional object by a matrix, it will also be compressed to a K-dimensional object by the transpose of the matrix

Inverting rank-deficient matrices
• Rank-deficient matrices "flatten" objects
– In the process, multiple points in the original object get mapped to the same point in the transformed object
• It is not possible to go "back" from the flattened object to the original object
– Because of the many-to-one forward mapping
• Rank-deficient matrices have no inverse

Rank-Deficient Matrices
• The projection matrix is rank deficient
• You cannot recover the original spectrogram from the projected one...
• Rank-deficient matrices have no inverse

Matrix Determinant
[Figure: the parallelepiped formed from row vectors r1 and r2, with diagonal r1 + r2]
• The determinant is the "volume" of a matrix
– Actually the volume of a parallelepiped formed from its row vectors
– Also the volume of the parallelepiped formed from its column vectors
• Standard formula for the determinant: in the textbook

Matrix Determinant: Another Perspective
• The determinant is the ratio of N-volumes
– If V1 is the volume of an N-dimensional sphere "O" in N-dimensional space
• O is the complete set of points or vertices that specify the object
– And V2 is the volume of the N-dimensional ellipsoid specified by A*O, where A is a matrix that transforms the space
– Then |A| = V2 / V1
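In 2-D the "volume" ratio is an area ratio:

    A = [2 1; 0 3];
    det(A)     % 6: the unit square (area 1) maps to a parallelogram of area 6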

Matrix Determinants
• Matrix determinants are only defined for square matrices
– They characterize volumes in a linearly transformed space of the same dimensionality as the vectors
• Rank-deficient matrices have determinant 0
– Since they compress full-volumed N-dimensional objects into zero-volume N-dimensional objects
• E.g. a 3-D sphere into a 2-D ellipse: the ellipse has 0 volume (although it does have area)
• Conversely, all matrices of determinant 0 are rank deficient
– Since they compress full-volumed N-dimensional objects into zero-volume objects

Multiplication properties
• Properties of vector/matrix products
– Associative: A(BC) = (AB)C
– Distributive: A(B + C) = AB + AC
– NOT commutative!!! AB != BA
• Left multiplications ≠ right multiplications
– Transposition: (AB)^T = B^T A^T

Determinant properties
• Associative for square matrices: |ABC| = |A| |B| |C|
– Scaling volume sequentially by several matrices is equal to scaling once by the product of the matrices
• Volume of a sum != sum of volumes: |A + B| != |A| + |B|
• Commutative: |AB| = |BA| = |A| |B|
– The order in which you scale the volume of an object is irrelevant

Revisiting Projections and Least Squares
• Projection computes a least squared error estimate
• For each vector V in the music spectrogram matrix (bases: note1, note2, note3):
– Approximation: V_approx = a*note1 + b*note2 + c*note3
– Error vector: E = V - V_approx
– Squared error energy for V: e(V) = norm(E)^2
• Projection computes V_approx for all vectors such that the total error is minimized
• But WHAT ARE "a", "b" and "c"?

The Pseudo-Inverse (PINV)
• We are approximating spectral vectors V as the transformation of the vector [a b c]^T
– Note: we're viewing the collection of bases in T as a transformation
• The solution is obtained using the pseudo-inverse: [a b c]^T = pinv(T) * V
– This gives us a LEAST SQUARES solution
• If T were square and invertible, pinv(T) = T^-1, and V = V_approx
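A sketch of the least-squares weights via pinv, with random stand-ins for the note spectra:

    T = rand(4096, 3);     % columns: three note spectra (stand-ins)
    V = rand(4096, 1);     % a spectral vector to explain
    abc = pinv(T) * V;     % least-squares [a; b; c]
    Vapprox = T * abc;     % the projection of V onto T's column space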

Explaining music with one note
• M = spectrogram; W = note; X = pinv(W)*M
• Recap: P = W (W^T W)^-1 W^T; projected spectrogram = P*M
• Approximation: M ≈ W*X
• The amount of W in each vector: X = pinv(W)*M, where pinv(W) = (W^T W)^-1 W^T
• W*pinv(W)*M = projected spectrogram
– W*pinv(W) = projection matrix!!

Explanation with multiple notes
• M = spectrogram; W = notes
• X = pinv(W) * M; projected matrix = W*X = W*pinv(W)*M

How about the other way?
• Given M = the spectrogram and V = the modulation rows: W = ? U = ?
• W = M * pinv(V); U = W * V

Pseudo-inverse (PINV)
• pinv() applies to non-square matrices
• pinv(pinv(A)) = A
• A * pinv(A) = a projection matrix!
– Projection onto the columns of A
• If A is a K x N matrix with K > N, A projects N-D vectors into a higher-dimensional K-D space
– pinv(A) is an N x K matrix
– pinv(A) * A = I in this case
• Otherwise (K < N), A * pinv(A) = I
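The K > N case, checked on a small random (full-column-rank) matrix:

    A = rand(5, 3);     % K = 5 > N = 3
    pinv(A) * A         % 3 x 3 identity
    A * pinv(A)         % 5 x 5 projection onto A's columns, not identity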

Matrix inversion (division)
• The inverse of matrix multiplication
– Not element-wise division!!
• Provides a way to "undo" a linear transformation
– The inverse of the identity matrix is itself
– The inverse of a diagonal matrix is diagonal
– The inverse of a rotation is a (counter-)rotation (its transpose!)
– The inverse of a rank-deficient matrix does not exist!
• But the pseudo-inverse exists
• For square matrices: pay attention to the side of multiplication! A^-1 A = A A^-1 = I, but in general A^-1 B != B A^-1
• If the matrix is not square, use the matrix pseudo-inverse: pinv(A)