Dimension reduction methods for multiple 2 datasets aka

  • Slides: 11
Download presentation
Dimension reduction methods for multiple >2 datasets (aka tensor decomposition) Meng C, Zeleznik OA,

Dimension reduction methods for multiple >2 datasets (aka tensor decomposition) Meng C, Zeleznik OA, Thallinger GG, , Amin Gholami A, Kuster B, Culhane AC. Dimension reduction techniques for the integrative analysis of multi-omics data Brief Bioinform (2016) 17 (4): 628 -641 Bioc 2017

MCIA of 5 data sets (NCI 60) B C Bioc 2017 A

MCIA of 5 data sets (NCI 60) B C Bioc 2017 A

mogsa (Bioconductor package) • moa() performs multiple factor analysis or STATIS mbpca() performs either

mogsa (Bioconductor package) • moa() performs multiple factor analysis or STATIS mbpca() performs either consensus PCA, generalized CCA and multiple co-inertia analysis • method= “global. Score” : consensus PCA • method= “block. Score” : generalized canonical correlation analysis • method= “block. Loading” : multiple co-inertia analysis se. c. PCA<-mbpca(lapply(se, exprs), ncomp=10, method="global. Score") # c. PCA Meng C, Kuster B, Culhane AC* and Gholami AM* (2014) A multivariate approach to the integration of multi-omics datasets. BMC bioinformatics 15(1), 162 Bioc 2017

Projection of GO terms Variables Sample with variables GO Terms Fagan A, Culhane AC,

Projection of GO terms Variables Sample with variables GO Terms Fagan A, Culhane AC, Higgins DG. (2007) A Multivariate Analysis approach to the Integration of Proteomic and Gene Expression Data. Proteomics. 7(13): 2162 -71. Bioc 2017

MOGSA Multiple omics data Binary gene set annotation matrices observation Output gene sets Input

MOGSA Multiple omics data Binary gene set annotation matrices observation Output gene sets Input Gene space Observation space/ Latent variables Step 2 Gene set space Matrix multiplication featu re observati on Multivariate analysis Step 1 Matrix multiplication gene set axis Gene set gen e observation Step 3 Meng C, Kuster B , Peters B, Gholami AM, Culhane AC. mo. GSA: a multivariate approach for integrative gene-set analysis of multiple omics data. BMC Bioinformatics. In Review. Bioc 2017

MOGSA outperform other ss. GSA approaches when applied to Synthetic data Bioc 2017

MOGSA outperform other ss. GSA approaches when applied to Synthetic data Bioc 2017

mo. GSA, GSEA, GSVA of TCGA BRCA GSVA, ss. GSEA: RNAseq mo. GSA =

mo. GSA, GSEA, GSVA of TCGA BRCA GSVA, ss. GSEA: RNAseq mo. GSA = RNA+GISTIC ss. GSEA GSVA Bioc 2017 GSVA mo. GSA 0. 014 0. 001 0. 167

Bioc 2017 p 53 pathway DNA Repair Data-wise decomposed gene set score MTORC 1

Bioc 2017 p 53 pathway DNA Repair Data-wise decomposed gene set score MTORC 1 Signaling Apoptosis RNA Seq + Gistict + TFReg RNA Seq

Remove a component 58 NCI 60 samples and 18 Hallmark Genesets 4 datasets -

Remove a component 58 NCI 60 samples and 18 Hallmark Genesets 4 datasets - hgu 195, hgu 133 p 2, agilent Bioc 2017

mo. GSA applied to TCGA pan. Immune Cluster discovery Bioc 2017

mo. GSA applied to TCGA pan. Immune Cluster discovery Bioc 2017

Using MFA to impute NA A) Missing (at random) B) Missing row/cols MFA (blue),

Using MFA to impute NA A) Missing (at random) B) Missing row/cols MFA (blue), low rank SVD (red), fast alternatively least square (ALS, green)