LEARNING MIXTURES OF LINEAR REGRESSIONS IN SUBEXPONENTIAL TIME VIA FOURIER MOMENTS
Sitan Chen (MIT), Jerry Li (MSR), Zhao Song (Princeton & IAS)
SOLVING MANY GAUSSIAN LINEAR SYSTEMS •
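Concretely, learning a mixture of linear regressions amounts to solving many Gaussian linear systems: each sample pairs a Gaussian covariate with a response generated by one of several hidden regressors. A minimal sketch of this generative model, assuming standard Gaussian covariates and Gaussian regression noise (the function and parameter names here are illustrative, not from the slides):

```python
import numpy as np

def sample_mlr(n, regressors, mixing_weights, noise_std=0.0, seed=None):
    """Draw n samples (x, y) from a mixture of linear regressions:
    pick hidden component i with probability mixing_weights[i],
    then emit x ~ N(0, I_d) and y = <w_i, x> + N(0, noise_std^2)."""
    rng = np.random.default_rng(seed)
    k, d = regressors.shape
    X = rng.standard_normal((n, d))                # Gaussian covariates
    z = rng.choice(k, size=n, p=mixing_weights)    # hidden component labels
    y = np.einsum("nd,nd->n", X, regressors[z])    # <w_{z_i}, x_i>
    y += noise_std * rng.standard_normal(n)        # regression noise
    return X, y

# Example: two hidden regressors in R^3 with equal mixing weights
W = np.array([[1.0, 0.0, 2.0], [-1.0, 3.0, 0.5]])
X, y = sample_mlr(10_000, W, [0.5, 0.5], noise_std=0.1, seed=0)
```

The learner sees only (X, y); the component labels z are latent, which is what makes the problem hard.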
MIXTURE-OF-EXPERTS
• Popular model for ensemble learning; a basic unit in various successful NN architectures (GRU, Attention)
• Challenge: develop a theory of learnability for mixtures of experts
[Figure: gating network]
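For concreteness, one standard mixture-of-experts formulation with linear experts (an illustrative sketch; the gating vectors v_i and expert regressors w_i are notation chosen here, not taken from the slides):

```latex
% Gating network softmax-weights k linear experts:
\[
  p(y \mid x) \;=\; \sum_{i=1}^{k}
    \frac{e^{\langle v_i, x\rangle}}{\sum_{j=1}^{k} e^{\langle v_j, x\rangle}}
    \cdot \mathcal{N}\bigl(y;\ \langle w_i, x\rangle,\ \sigma^2\bigr).
\]
```

When the gating vectors v_i are all zero, the gate ignores x and the model reduces to a mixture of linear regressions, the case studied in this talk.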
MIXTURE MODELS
• A mixture model is a convex combination of structured distributions
  • e.g. Gaussians, exponentials, rankings, HMMs, topic models, product measures
• Popular class of latent variable models for representing data coming from heterogeneous sources
• Powerful testbed for understanding:
  • clustering
  • distribution learning, robust statistics
  • alternating minimization
  • information-computation gaps
• Goal: given iid samples from the mixture, recover the parameters of its components
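In symbols, the standard definition (notation chosen here for illustration):

```latex
% A k-component mixture: a convex combination of component densities p_i
\[
  p(x) \;=\; \sum_{i=1}^{k} \pi_i \, p_i(x),
  \qquad \pi_i \ge 0, \quad \sum_{i=1}^{k} \pi_i = 1,
\]
```

where each p_i is a structured component density (e.g. a Gaussian) and the goal is to recover the mixing weights pi_i and the parameters of each p_i from iid samples drawn from p.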
PREVIOUS RESULTS •
OUR RESULTS
LEARNING ONE COMPONENT •
MIN VARIANCE OF A GAUSSIAN MIXTURE •
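The bullet content of this slide is not preserved, but the Fourier-moment idea in the talk's title can be sketched for a zero-mean univariate Gaussian mixture (a simplification for illustration; the paper's actual subroutine is more refined):

```latex
% Characteristic function of y ~ sum_i pi_i N(0, sigma_i^2):
\[
  \mathbb{E}\bigl[e^{\mathrm{i} t y}\bigr]
  \;=\; \sum_{i=1}^{k} \pi_i \, e^{-t^2 \sigma_i^2 / 2},
\]
% so the smallest-variance term dominates for large t, and
\[
  -\frac{2}{t^2}\,\ln \mathbb{E}\bigl[e^{\mathrm{i} t y}\bigr]
  \;\longrightarrow\; \sigma_{\min}^2
  \qquad (t \to \infty).
\]
```

Roughly, this matters because the residuals of a candidate regressor against the mixture form a univariate Gaussian mixture whose minimum variance measures the distance from the candidate to the nearest component.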
LEARNING ALL COMPONENTS
• In the noiseless case:
  1. Use moment descent to get a warm start to one component
  2. Boost to arbitrary accuracy with existing local convergence guarantees
  3. “Peel off” the component by removing points with low residual (a sketch follows this slide)
• In the noisy case, peeling will not work. Instead:
  1. Tracking: if the initial guess is noticeably closer to one component than to the others, it stays closest to that component throughout moment descent with decent probability
  2. Initialization: by choosing the initial guess randomly in a careful way, we can ensure that we initialize close to any given component with reasonable probability
  3. Repeat many times
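A minimal sketch of the noiseless “peel off” step, assuming we already hold an accurate estimate w_hat of one regressor (the names w_hat and tau are illustrative, not from the slides):

```python
import numpy as np

def peel_off(X, y, w_hat, tau):
    """Drop samples explained by the recovered regressor w_hat:
    a point (x_i, y_i) with residual |y_i - <w_hat, x_i>| <= tau is
    attributed to that component and removed, so the remaining data
    looks like a mixture of linear regressions with one fewer component."""
    residuals = np.abs(y - X @ w_hat)
    keep = residuals > tau        # retain only poorly explained points
    return X[keep], y[keep]
```

In the noiseless case the recovered component's residuals are essentially zero, so any small threshold tau separates it cleanly; with regression noise, points from other components can also have small residuals, which is exactly why the slide switches to the tracking-and-initialization strategy.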
OPEN QUESTIONS •
Thanks!