Undirected Graphical Models Yuan Yao Peking University Whats

  • Slides: 60
Download presentation
Undirected Graphical Models Yuan Yao Peking University

Undirected Graphical Models Yuan Yao Peking University

What’s a graphical model?

What’s a graphical model?

Markov property: Conditional Independence

Markov property: Conditional Independence

Hammersley-Clifford Theorem A clique is a complete subgraph A maximal clique is a clique

Hammersley-Clifford Theorem A clique is a complete subgraph A maximal clique is a clique where no other clique contains it A joint probability admits the following factorization with cliques where Z is the partition function

Clique Factorization is not unique

Clique Factorization is not unique

Example I: 西游记 west. Rdata 408 -by-303 data matrix The first column contains chapter

Example I: 西游记 west. Rdata 408 -by-303 data matrix The first column contains chapter ID (1, …, 100) 302 characters appeared {1, 0} in 408 scenes (samples) 16 main characters who appeared no less than 40 samples

An Ising model Green edges:positive interactions Red edges: negative interactions

An Ising model Green edges:positive interactions Red edges: negative interactions

Example II: 红楼梦 dream. Rdata 475 -by-375 data matrix 374 characters appeared {1, 0}

Example II: 红楼梦 dream. Rdata 475 -by-375 data matrix 374 characters appeared {1, 0} in 475 scenes (samples) The first column is an indicator if the scene is in the first 80 chapters (by Xueqin Cao) or later (by E Gao) 18 main characters who appeared no less than 30 scenes in the first 80 chapters

Main Content Gaussian Graphical Models for real random variables Semiparametric Gaussian Copula Graphical Models

Main Content Gaussian Graphical Models for real random variables Semiparametric Gaussian Copula Graphical Models Ising Models (Boltzman Machine) for discrete random variables

Gaussian Graphical Model

Gaussian Graphical Model

Precision Matrix

Precision Matrix

Sparsity in High Dimensional Statistics

Sparsity in High Dimensional Statistics

Gaussian Graphical Models

Gaussian Graphical Models

Proof: Linear regression Y ~ Z whose coefficient:

Proof: Linear regression Y ~ Z whose coefficient:

Sparse precision matrix estimation

Sparse precision matrix estimation

Neighborhood Selection

Neighborhood Selection

Recall:

Recall:

Parallel LASSO

Parallel LASSO

Estimator and Symmetrization

Estimator and Symmetrization

L 1 -penalized Maximum Likelihood Estimator (MLE)

L 1 -penalized Maximum Likelihood Estimator (MLE)

Graphical LASSO, also known as

Graphical LASSO, also known as

CLIME: motivation

CLIME: motivation

CLIME: Dantzig Selector

CLIME: Dantzig Selector

CLIME as Linear Programming

CLIME as Linear Programming

Symmetrization

Symmetrization

Nonconvex Penalized MLE

Nonconvex Penalized MLE

SCAD Penalty

SCAD Penalty

Locally Linear Approximation: Adaptive LASSO

Locally Linear Approximation: Adaptive LASSO

Reference

Reference

Normality?

Normality?

Semiparametric Gaussian Copula Model

Semiparametric Gaussian Copula Model

Nonparanormal Gaussian Model

Nonparanormal Gaussian Model

Semiparametric Gaussian Copula Model

Semiparametric Gaussian Copula Model

Conditional Independence

Conditional Independence

Nonparametric Part: Estimate of the marginal monotone transform

Nonparametric Part: Estimate of the marginal monotone transform

Rank Correlation

Rank Correlation

Semiparametric Graphical LASSO R package: huge

Semiparametric Graphical LASSO R package: huge

Ising Model

Ising Model

A Brief History

A Brief History

Ising Model

Ising Model

Sparsity

Sparsity

Boltzman Distribution

Boltzman Distribution

Penalized MLE

Penalized MLE

Sparsity Enforced Estimates

Sparsity Enforced Estimates

Partition function is intractable

Partition function is intractable

Conditional Likelihood

Conditional Likelihood

Neighborhood Selection: L 1 -regularized Logistic Regression

Neighborhood Selection: L 1 -regularized Logistic Regression

Composite Conditional Likelihod

Composite Conditional Likelihod

Penalized Composite Conditional Likelihood

Penalized Composite Conditional Likelihood

Penalties

Penalties

Sparse. Ising: LASSO-penalization R package, by Xue-Zou-Cai’ 2012

Sparse. Ising: LASSO-penalization R package, by Xue-Zou-Cai’ 2012

Algorithm

Algorithm

Alternative approach: Linearized Bregman Iteration R package: Libra Version 1. 4

Alternative approach: Linearized Bregman Iteration R package: Libra Version 1. 4

west. R library(“Libra”) data(“west”) s 0<-col. Sums(as. matrix(west)) data<-west[, s 0>=20] #Important characters appeared

west. R library(“Libra”) data(“west”) s 0<-col. Sums(as. matrix(west)) data<-west[, s 0>=20] #Important characters appeared more than 20 X<-as. matrix(2*data[, 1: 10]-1); obj 1 = ising(X, 10, 0. 1, nt=1000, trate=100) #for version 1. 4 or above … library('huge') obj 2<- huge(as. matrix(data), method = "glasso") …

Which model is better, Ising or Gaussian?

Which model is better, Ising or Gaussian?

Reference

Reference