The Diagonalized Newton Algorithm for Nonnegative Matrix Factorization

  • Slides: 18
Download presentation
The Diagonalized Newton Algorithm for Non-negative Matrix Factorization Hugo Van Hamme Reporter: Yi-Ting Wang

The Diagonalized Newton Algorithm for Non-negative Matrix Factorization Hugo Van Hamme Reporter: Yi-Ting Wang 2013/3/26

Outline • Introduction • NMF formulation • The Diagonalized Newton Algorithm for KLDNMF •

Outline • Introduction • NMF formulation • The Diagonalized Newton Algorithm for KLDNMF • Experiments • Conclusions

Introduction •

Introduction •

Introduction •

Introduction •

Introduction • The resulting Diagonalized Newton Algorithm (DNA) is proposed showing faster convergence while

Introduction • The resulting Diagonalized Newton Algorithm (DNA) is proposed showing faster convergence while the implementation remains simple and suitable for high-rank problems. • http: //zh. wikipedia. org/wiki/File: Newton. Iterati on_Ani. gif

NMF formulation •

NMF formulation •

NMF formulation •

NMF formulation •

NMF formulation •

NMF formulation •

NMF formulation •

NMF formulation •

Multiplicative updates •

Multiplicative updates •

Newton updates •

Newton updates •

Newton updates •

Newton updates •

Step size limitation •

Step size limitation •

Non-increase of the cost • Despite section 2. 3 (size limitation), the divergence can

Non-increase of the cost • Despite section 2. 3 (size limitation), the divergence can still increase. • A very safe option is to compute the EM update. • If the EM update is be better, the Newton update is rejected and the EM update is taken instead. • This will guarantee non-increase of the cost function. • The computational cost of this operation is dominated by evaluating the KL-divergence, not in computing the update itself.

The Diagonalized Newton Algorithm for KLD-NMF

The Diagonalized Newton Algorithm for KLD-NMF

Experiments-Dense data matrices

Experiments-Dense data matrices

Experiments-Sparse data matrices

Experiments-Sparse data matrices

Conclusions • Depending on the case and matrix sizes, DNA iterations are 2 to

Conclusions • Depending on the case and matrix sizes, DNA iterations are 2 to 3 times slower than MU iterations. • In most cases, the diagonal approximation is good enough such that faster convergence is observed and a net gain results. • Since Newton updates can in general not ensure monotonic decrease of the cost function, the step size was controlled with a brute force strategy of falling back to MU in case the cost is increased. • More refined step damping methods could speed up DNA by avoiding evaluations of the cost function, which is next on the research agenda.