CMU SCS Mining graphs and time series patterns
- Slides: 110
CMU SCS Mining graphs and time series: patterns, anomalies, and fraud detection Part 1: Graphs Node importance & community detection Christos Faloutsos CMU SCS https: //www. cs. cmu. edu/~christos/TALKS/19 -Go. I
CMU SCS Roadmap • • • Introduction Part#1: Graphs Part#2: Time series Part#3: extras (visualization, etc) Conclusions Gov. of India Copyright (C) 2019 C. Faloutsos 2
CMU SCS Roadmap • Introduction – Motivation • Part#1: Graphs – P 1. 1: properties/patterns in graphs – P 1. 2: node importance – P 1. 3: community detection – P 1. 4: fraud/anomaly detection – P 1. 5: belief propagation Gov. of India Copyright (C) 2019 C. Faloutsos ? 3
CMU SCS Roadmap • Introduction – Motivation • Part#1: Graphs – P 1. 1: properties/patterns in graphs – P 1. 2: node importance • • Gov. of India ? Page. Rank and Personalized PR HITS (SVD) SALSA Copyright (C) 2019 C. Faloutsos 4
CMU SCS ‘Recipe’ Structure: • Problem definition • Short answer/solution • LONG answer – details • Conclusion/short-answer Gov. of India Copyright (C) 2019 C. Faloutsos 5
CMU SCS Node importance - Motivation: • Given a graph (eg. , web pages containing the desirable query word) • Q 1: Which node is the most important? • Q 2: How close is node ‘A’ to node ‘B’? Gov. of India Copyright (C) 2019 C. Faloutsos 6
CMU SCS Node importance - Motivation: • Given a graph (eg. , web pages containing the desirable query word) • Q 1: Which node is the most important? – Page. Rank (PR = RWR), HITS, SALSA • Q 2: How close is node ‘A’ to node ‘B’? – Personalized P. R. (/SALSA) Gov. of India Copyright (C) 2019 C. Faloutsos 7
CMU SCS SVD properties ü Hidden/latent variable detection ü Compute node importance (HITS) ü Block detection ü Dimensionality reduction ü Embedding (linear) – SVD is a special case of ’deep neural net’ v 0 u 0 Gov. of India v 1 u 1 Copyright (C) 2019 C. Faloutsos 8
CMU SCS Roadmap • Introduction – Motivation • Part#1: Graphs – P 1. 1: properties/patterns in graphs – P 1. 2: node importance ? • Page. Rank and Personalized PR • HITS • SALSA Gov. of India Copyright (C) 2019 C. Faloutsos 9
CMU SCS Page. Rank (google) Larry Page Gov. of India Sergey Brin • Brin, Sergey and Lawrence Page (1998). Anatomy of a Large. Scale Hypertextual Web Search Engine. 7 th Intl World Wide Web Conf. • Page, Brin, Motwani, and Winograd (1999). The Page. Rank citation ranking: Bringing order to the web. Technical Report Copyright (C) 2019 C. Faloutsos 10
CMU SCS Problem: Page. Rank Given a directed graph, find its most interesting/central node A node is important, if its parents are important (recursive, but OK!) Gov. of India Copyright (C) 2019 C. Faloutsos 11
CMU SCS Problem: Page. Rank - solution Given a directed graph, find its most interesting/central node Proposed solution: Random walk; spot most ‘popular’ node (-> steady state prob. (ssp)) A node high ssp, if its parents have high ssp (recursive, but OK!) Gov. of India Copyright (C) 2019 C. Faloutsos 12
CMU SCS DET AILS (Simplified) Page. Rank algorithm • Let A be the adjacency matrix; • let B be the transition matrix: transpose, column-normalized - then From To 2 1 4 Gov. of India B 3 = 5 Copyright (C) 2019 C. Faloutsos 13
CMU SCS DET AILS (Simplified) Page. Rank algorithm • Bp=p B 2 1 4 Gov. of India 3 p = 5 Copyright (C) 2019 C. Faloutsos 14
CMU SCS Definitions A D B DET AILS Adjacency matrix (from-to) Degree matrix = (diag ( d 1, d 2, …, dn) ) Transition matrix: to-from, column normalized B = AT D-1 Gov. of India Copyright (C) 2019 C. Faloutsos 15
CMU SCS DET AILS (Simplified) Page. Rank algorithm • Bp=1*p • thus, p is the eigenvector that corresponds to the highest eigenvalue (=1, since the matrix is column-normalized) • Why does such a p exist? – p exists if B is nxn, nonnegative, irreducible [Perron–Frobenius theorem] Gov. of India Copyright (C) 2019 C. Faloutsos 16
CMU SCS (Simplified) Page. Rank algorithm • In short: imagine a particle randomly moving along the edges • compute its steady-state probabilities (ssp) Full version of algo: with occasional random jumps Why? To make the matrix irreducible Gov. of India Copyright (C) 2019 C. Faloutsos 17
CMU SCS (Simplified) Page. Rank algorithm • In short: imagine a particle randomly moving along the edges • compute its steady-state probabilities (ssp) Full version of algo: with occasional random jumps Why? To make the matrix irreducible Gov. of India Copyright (C) 2019 C. Faloutsos 18
CMU SCS (Simplified) Page. Rank algorithm • In short: imagine a particle randomly moving along the edges • compute its steady-state probabilities (ssp) Full version of algo: with occasional random jumps Why? To make the matrix irreducible Gov. of India Copyright (C) 2019 C. Faloutsos 19
CMU SCS (Simplified) Page. Rank algorithm • In short: imagine a particle randomly moving along the edges • compute its steady-state probabilities (ssp) Full version of algo: with occasional random jumps Why? To make the matrix irreducible Gov. of India Copyright (C) 2019 C. Faloutsos 20
CMU SCS (Simplified) Page. Rank algorithm • In short: imagine a particle randomly moving along the edges • compute its steady-state probabilities (ssp) Full version of algo: with occasional random jumps Why? To make the matrix irreducible Gov. of India Copyright (C) 2019 C. Faloutsos 21
CMU SCS (Simplified) Page. Rank algorithm • In short: imagine a particle randomly moving along the edges • compute its steady-state probabilities (ssp) Page. Rank = PR = Random Walk with Restarts = RWR = Random surfer Gov. of India Copyright (C) 2019 C. Faloutsos 22
CMU SCS Full Algorithm DET AILS • With probability 1 -c, fly-out to a random node • Then, we have p = c B p + (1 -c)/n 1 => p = (1 -c)/n [I - c B] -1 1 Gov. of India Copyright (C) 2019 C. Faloutsos 23
CMU SCS Full Algorithm DET AILS • With probability 1 -c, fly-out to a random node • Then, we have p = c B p + (1 -c)/n 1 => p = (1 -c)/n [I - c B] -1 1 Gov. of India Copyright (C) 2019 C. Faloutsos 24
CMU SCS Notice: • page. Rank ~ in-degree • (and HITS, also: ~ in-degree) Gov. of India Copyright (C) 2019 C. Faloutsos 25
CMU SCS Roadmap • Introduction – Motivation • Part#1: Graphs – P 1. 1: properties/patterns in graphs – P 1. 2: node importance ? • Page. Rank and Personalized PR • HITS Gov. of India Copyright (C) 2019 C. Faloutsos 26
CMU SCS Node importance - Motivation: • Given a graph (eg. , web pages containing the desirable query word) • Q 1: Which node is the most important? • Q 2: How close is node ‘A’ to node ‘B’? A B Gov. of India Copyright (C) 2019 C. Faloutsos 27
CMU SCS Personalized P. R. Taher H. Haveliwala. 2002. Topic-sensitive Page. Rank. (WWW '02). 517 -526. http: //dx. doi. org/10. 1145/511446. 511513 Page L. , Brin S. , Motwani R. , and Winograd T. (1999). The Page. Rank citation ranking: Bringing order to the web. Technical Report Gov. of India Copyright (C) 2019 C. Faloutsos 29
CMU SCS Extension: Personalized P. R. • How close is ‘ 4’ to ‘ 2’? • (or: if I like page/node ‘ 2’, what else would you recommend? ) 2 1 4 Gov. of India 3 5 Copyright (C) 2019 C. Faloutsos 30
CMU SCS Extension: Personalized P. R. • How close is ‘ 4’ to ‘ 2’? • (or: if I like page/node ‘ 2’, what else would you recommend? ) 2 1 4 Gov. of India 3 5 Copyright (C) 2019 C. Faloutsos 31
CMU SCS Extension: Personalized P. R. • How close is ‘ 4’ to ‘ 2’? • (or: if I like page/node ‘ 2’, what else would you recommend? ) 2 1 4 Gov. of India 3 5 Copyright (C) 2019 C. Faloutsos 32
CMU SCS Extension: Personalized P. R. • How close is ‘ 4’ to ‘ 2’? • (or: if I like page/node ‘ 2’, what else would you recommend? ) 2 1 4 Gov. of India 5 3 High score (A -> B) if • Many • Short • Heavy paths A->B Copyright (C) 2019 C. Faloutsos 33
CMU SCS Extension: Personalized P. R. e t i r o v a your f • With probability 1 -c, fly-out to a random node(s) • Then, we have p = c B p + (1 -c)/n 1 => p = (1 -c)/n [I - c B] -1 1 Gov. of India Copyright (C) 2019 C. Faloutsos 34
CMU SCS Extension: Personalized P. R. • How close is ‘ 4’ to ‘ 2’? • A: compute Personalized P. R. of ‘ 4’, restarting from ‘ 2’ 2 1 4 Gov. of India 3 5 Copyright (C) 2019 C. Faloutsos 35
CMU SCS Extension: Personalized P. R. • How close is ‘ 4’ to ‘ 2’? • A: compute Personalized P. R. of ‘ 4’, restarting from ‘ 2’ – Related to – ‘escape’ probability – ‘round trip’ probability –… Gov. of India Copyright (C) 2019 C. Faloutsos 36
CMU SCS Roadmap • Introduction – Motivation • Part#1: Graphs – P 1. 1: properties/patterns in graphs – P 1. 2: node importance ? • Page. Rank and Personalized PR – Fast computation - ‘Pixie’ • HITS Gov. of India Copyright (C) 2019 C. Faloutsos 37
CMU SCS DET Extension: Personalized P. R. AILS • Q: Faster computation than: p = (1 -c)/n [I - c B] -1 1 Gov. of India Copyright (C) 2019 C. Faloutsos 38
CMU SCS Pixie algorithm Chantat Eksombatchai, Pranav Jindal, Jerry Zitao Liu, Yuchen Liu, Rahul Sharma, Charles Sugnet, Mark Ulrich, Jure Leskovec: Pixie: A System for Recommending 3+ Billion Items to 200+ Million Users in Real-Time. WWW 2018: 1775 -1784 https: //dl. acm. org/citation. cfm? doid=3178876. 3186183 Gov. of India Copyright (C) 2019 C. Faloutsos 39
CMU SCS Pixie algorithm • Q: Faster computation than: p = (1 -c)/n [I - c B] -1 1 • A: simulate a few R. W. – keep visit counts Ci – fast and nimble Gov. of India Copyright (C) 2019 C. Faloutsos 40
CMU SCS Personalized Page. Rank algorithm Gov. of India Copyright (C) 2019 C. Faloutsos 41
CMU SCS Personalized Page. Rank algorithm Gov. of India Copyright (C) 2019 C. Faloutsos 42
CMU SCS Personalized Page. Rank algorithm Gov. of India Copyright (C) 2019 C. Faloutsos 43
CMU SCS Personalized Page. Rank algorithm Gov. of India Copyright (C) 2019 C. Faloutsos 44
CMU SCS Personalized Page. Rank algorithm Gov. of India Copyright (C) 2019 C. Faloutsos 45
CMU SCS Personalized Page. Rank algorithm Gov. of India Copyright (C) 2019 C. Faloutsos 46
CMU SCS Personalized Page. Rank algorithm . . Gov. of India Copyright (C) 2019 C. Faloutsos … … 47
CMU SCS Roadmap • Introduction – Motivation • Part#1: Graphs – P 1. 1: properties/patterns in graphs – P 1. 2: node importance ? • Page. Rank and Personalized PR – Fast computation - ‘Pixie’ – Other applications • HITS Gov. of India Copyright (C) 2019 C. Faloutsos 48
CMU SCS Applications of node proximity • • Recommendation Link prediction ‘Center Piece Subgraphs’ … … … Gov. of India Copyright (C) 2019 C. Faloutsos ? 49
CMU SCS Applications of node proximity • • Recommendation Link prediction ‘Center Piece Subgraphs’ … … … Gov. of India Copyright (C) 2019 C. Faloutsos ? 50
CMU SCS Applications of node proximity • • Recommendation Link prediction ‘Center Piece Subgraphs’ … Gov. of India Copyright (C) 2019 C. Faloutsos 51
CMU SCS Applications of node proximity • • Recommendation Link prediction ‘Center Piece Subgraphs’ … Gov. of India Copyright (C) 2019 C. Faloutsos 52
CMU SCS Applications of node proximity • • Recommendation Link prediction ‘Center Piece Subgraphs’ … Gov. of India Copyright (C) 2019 C. Faloutsos 53
CMU SCS Applications of node proximity • • Recommendation Link prediction ‘Center Piece Subgraphs’ … Fast Algorithms for Querying and Mining Large Graphs Hanghang Tong, Ph. D dissertation, CMU, 2009. TR: CMU-ML Gov. of India Copyright (C) 2019 C. Faloutsos 54 -09 -112.
CMU SCS Roadmap • Introduction – Motivation • Part#1: Graphs – P 1. 1: properties/patterns in graphs – P 1. 2: node importance ? • Page. Rank and Personalized PR • HITS • SVD Gov. of India Copyright (C) 2019 C. Faloutsos 55
CMU SCS Kleinberg’s algo (HITS) Kleinberg, Jon (1998). Authoritative sources in a hyperlinked environment. Proc. 9 th ACM-SIAM Symposium on Discrete Algorithms. Gov. of India Copyright (C) 2019 C. Faloutsos 56
CMU SCS Recall: problem dfn • Given a graph (eg. , web pages containing the desirable query word) • Q 1: Which node is the most important? Gov. of India Copyright (C) 2019 C. Faloutsos 57
CMU SCS Why not just Page. Rank? 1. HITS (and its derivative, SALSA), differentiate between “hubs” and “authorities” 2. HITS can help to find the largest community 3. (SVD: powerful tool) idols fans Gov. of India Copyright (C) 2019 C. Faloutsos 58
CMU SCS Kleinberg’s algorithm • Problem dfn: given the web and a query • find the most ‘authoritative’ web pages for this query Gov. of India Copyright (C) 2019 C. Faloutsos 59
CMU SCS Problem: Page. Rank Given a directed graph, find its most interesting/central node A node is important, if its parents are important (recursive, but OK!) Gov. of India Copyright (C) 2019 C. Faloutsos 60
CMU SCS HITS Problem: Page. Rank Given a directed graph, find its most interesting/central node Gov. of India ``wise’’ A node is important, if its parents are important (recursive, but OK!) AND: A node is ``wise’’ Copyright (C)if 2019 C. Faloutsos 61 its children are important
CMU SCS Kleinberg’s algorithm • Step 0: find nodes with query word(s) • Step 1: expand by one move forward and backward Gov. of India Copyright (C) 2019 C. Faloutsos 62
CMU SCS Kleinberg’s algorithm • on the resulting graph, give high score (= ‘authorities’) to nodes that many ``wise’’ nodes point to • give high wisdom score (‘hubs’) to nodes that point to good ‘authorities’ hubs Gov. of India authorities Copyright (C) 2019 C. Faloutsos 63
CMU SCS Kleinberg’s algorithm Then: ai = hk + hl + hm k l m i that is ai = Sum (hj) edge exists or a = AT h over all j that (j, i) = Gov. of India Copyright (C) 2019 C. Faloutsos 64
CMU SCS Kleinberg’s algorithm Then: ai = hk + hl + hm k l m i that is ai = Sum (hj) edge exists or a = AT h over all j that (j, i) = Gov. of India Copyright (C) 2019 C. Faloutsos 65
CMU SCS Kleinberg’s algorithm symmetrically, for the ‘hubness’: n hi = an + ap + aq that is p hi = Sum (qj) over all j that (i, j) q edge exists or h=Aa i = Gov. of India Copyright (C) 2019 C. Faloutsos 66
CMU SCS Kleinberg’s algorithm symmetrically, for the ‘hubness’: n hi = an + ap + aq that is p hi = Sum (qj) over all j that (i, j) q edge exists or h=Aa i = Gov. of India Copyright (C) 2019 C. Faloutsos 67
CMU SCS Kleinberg’s algorithm In conclusion, we want vectors h and a such that: = h=Aa a = AT h Gov. of India Copyright (C) 2019 C. Faloutsos 68
CMU SCS Kleinberg’s algorithm In conclusion, we want vectors h and a such that: = h=Aa a = AT h Gov. of India Copyright (C) 2019 C. Faloutsos 69
CMU SCS Kleinberg’s algorithm In conclusion, we want vectors h and a such that: = h=Aa a = AT h Gov. of India Copyright (C) 2019 C. Faloutsos 70
CMU SCS Kleinberg’s algorithm In conclusion, we want vectors h and a such that: = h=Aa a = AT h Gov. of India Copyright (C) 2019 C. Faloutsos 71
CMU SCS Kleinberg’s algorithm In short, the solutions to Dfn: in h=Aa +2 T a=A h are the left- and right- singular-vectors of the adjacency matrix A. Starting from random a’ and iterating, we’ll eventually converge … to the vector of strongest singular value. Gov. of India Copyright (C) 2019 C. Faloutsos 72
CMU SCS Kleinberg’s algorithm - results Eg. , for the query ‘java’: 0. 328 www. gamelan. com 0. 251 java. sun. com 0. 190 www. digitalfocus. com (“the java developer”) Gov. of India Copyright (C) 2019 C. Faloutsos 73
CMU SCS Roadmap • Introduction – Motivation • Part#1: Graphs – P 1. 1: properties/patterns in graphs – P 1. 2: node importance ? • Page. Rank and Personalized PR • HITS • (SVD) Gov. of India Copyright (C) 2019 C. Faloutsos 74
CMU SCS SVD properties • • • Hidden/latent variable detection Compute node importance (HITS) Block detection Dimensionality reduction Embedding Gov. of India Copyright (C) 2019 C. Faloutsos 75
CMU SCS Crush intro to SVD • (SVD) matrix factorization: finds blocks M idols N fans Gov. of India ‘music lovers’ ‘sports lovers’ ‘citizens’ ‘singers’ ‘athletes’ ‘politicians’ ~ + Copyright (C) 2019 C. Faloutsos + 76
CMU SCS Crush intro to SVD • (SVD) matrix factorization: finds blocks M idols N fans Gov. of India ‘music lovers’ ‘sports lovers’ ‘citizens’ ‘singers’ ‘athletes’ ‘politicians’ ~ + Copyright (C) 2019 C. Faloutsos + 77
CMU SCS Crush intro to SVD • (SVD) matrix factorization: finds blocks M idols N fans Gov. of India ‘music lovers’ ‘sports lovers’ ‘citizens’ ‘singers’ ‘athletes’ ‘politicians’ ~ + Copyright (C) 2019 C. Faloutsos + 78
CMU SCS Crush intro to SVD • (SVD) matrix factorization: finds blocks HITS: first singular vector, ie, fixates on largest group M idols N fans Gov. of India ‘music lovers’ ‘sports lovers’ ‘citizens’ Authority ‘singers’ scores ‘athletes’ ‘politicians’ ~ + Copyright (C) 2019 C. Faloutsos Hub scores + 79
CMU SCS Crush intro to SVD • Basis for anomaly detection – P 1. 4 • Basis for tensor/PARAFAC – P 2. 5 M idols N fans Gov. of India ‘music lovers’ ‘sports lovers’ ‘citizens’ ‘singers’ ‘athletes’ ‘politicians’ ~ + Copyright (C) 2019 C. Faloutsos + 80
CMU SCS SVD properties ü Hidden/latent variable detection ü Compute node importance (HITS) ü Block detection • Dimensionality reduction • Embedding v 0 u 0 Gov. of India v 1 u 1 Copyright (C) 2019 C. Faloutsos 81
CMU SCS #retweets for … SVD - intuition #retweets for Byonce v 1 u 1 Gov. of India Copyright (C) 2019 C. Faloutsos v 2 u 2 82
CMU SCS SVD properties ü Hidden/latent variable detection ü Compute node importance (HITS) ü Block detection ü Dimensionality reduction • Embedding v 0 u 0 Gov. of India v 1 u 1 Copyright (C) 2019 C. Faloutsos 83
CMU SCS Crush intro to SVD • SVD compression is a linear autoencoder M idols N fans … scores … Gov. of India Copyright (C) 2019 C. Faloutsos 84
CMU SCS Crush intro to SVD • SVD compression is a linear autoencoder M idols N fans … scores … Gov. of India Copyright (C) 2019 C. Faloutsos 85
CMU SCS SVD properties ü Hidden/latent variable detection ü Compute node importance (HITS) ü Block detection ü Dimensionality reduction ü Embedding (linear) – SVD is a special case of ’deep neural net’ v 0 u 0 Gov. of India v 1 u 1 Copyright (C) 2019 C. Faloutsos 86
CMU SCS Node importance - Motivation: • Given a graph (eg. , web pages containing the desirable query word) • Q 1: Which node is the most important? – Page. Rank (PR = RWR), HITS, SALSA • Q 2: How close is node ‘A’ to node ‘B’? – Personalized P. R. (/SALSA) Gov. of India Copyright (C) 2019 C. Faloutsos 87
CMU SCS SVD properties ü Hidden/latent variable detection ü Compute node importance (HITS) ü Block detection ü Dimensionality reduction ü Embedding (linear) – SVD is a special case of ’deep neural net’ v 0 u 0 Gov. of India v 1 u 1 Copyright (C) 2019 C. Faloutsos 88
CMU SCS SVD properties ü Hidden/latent variable detection ü Compute node importance (HITS) ! D ü Block detection SV ü Dimensionality reduction ? x i ü Embedding atr (linear) M – SVD is a special case of ’deep neural net’ v 0 u 0 Gov. of India v 1 u 1 Copyright (C) 2019 C. Faloutsos 89
CMU SCS Roadmap • Introduction – Motivation • Part#1: Graphs – P 1. 1: properties/patterns in graphs – P 1. 2: node importance – P 1. 3: community detection – P 1. 4: fraud/anomaly detection – P 1. 5: belief propagation Gov. of India Copyright (C) 2019 C. Faloutsos 90
CMU SCS Roadmap • Introduction – Motivation • Part#1: Graphs – P 1. 1: properties/patterns in graphs – P 1. 2: node importance – P 1. 3: community detection – P 1. 4: fraud/anomaly detection – P 1. 5: belief propagation Gov. of India Copyright (C) 2019 C. Faloutsos ? 91
CMU SCS Roadmap • Introduction – Motivation • Part#1: Graphs – P 1. 1: properties/patterns in graphs – P 1. 2: node importance – P 1. 3: community detection ? • Algorithm • Warning: ‘no good cuts’ – P 1. 4: fraud/anomaly detection Gov. of India Copyright (C) 2019 C. Faloutsos 92
CMU SCS Problem • Given a graph, and k • Break it into k (disjoint) communities Gov. of India Copyright (C) 2019 C. Faloutsos P 2 -93
CMU SCS Short answer • METIS [Karypis, Kumar] Gov. of India Copyright (C) 2019 C. Faloutsos P 2 -94
CMU SCS Solution#1: METIS • Arguably, the best algorithm • Open source, at – http: //glaros. dtc. umn. edu/gkhome/fetch/sw/metis-5. 1. 0. tar. gz • and *many* related papers, at same url • Main idea: – coarsen the graph; – partition; – un-coarsen Gov. of India Copyright (C) 2019 C. Faloutsos P 2 -95
CMU SCS Solution #1: METIS • G. Karypis and V. Kumar. METIS 4. 0: Unstructured graph partitioning and sparse matrix ordering system. TR, Dept. of CS, Univ. of Minnesota, 1998. • <and many extensions> Gov. of India Copyright (C) 2019 C. Faloutsos P 2 -96
CMU SCS Solutions #2, 3… • Fiedler vector (2 nd singular vector of Laplacian). • Modularity: Community structure in social and biological networks M. Girvan and M. E. J. Newman, PNAS June 11, 2002. 99 (12) 7821 -7826; https: //doi. org/10. 1073/pnas. 122653799 • Co-clustering: [Dhillon+, KDD’ 03] • Clustering on the A 2 (square of adjacency matrix) [Zhou, Woodruff, PODS’ 04] • Minimum cut / maximum flow [Flake+, KDD’ 00] • …. Gov. of India Copyright (C) 2019 C. Faloutsos P 2 -97
CMU SCS Roadmap • Introduction – Motivation • Part#1: Graphs – P 1. 1: properties/patterns in graphs – P 1. 2: node importance – P 1. 3: community detection ? • Algorithm • Warning: ‘no good cuts’ – P 1. 4: fraud/anomaly detection Gov. of India Copyright (C) 2019 C. Faloutsos 98
CMU SCS A word of caution • BUT: often, there are no good cuts: Gov. of India Copyright (C) 2019 C. Faloutsos P 2 -99
CMU SCS A word of caution • BUT: often, there are no good cuts: Gov. of India Copyright (C) 2019 C. Faloutsos P 2 -100
CMU SCS A word of caution • Maybe there are no good cuts: ``jellyfish’’ shape [Tauro+’ 01], [Siganos+, ’ 06], strange behavior of cuts [Chakrabarti+’ 04], [Leskovec+, ’ 08] Gov. of India Copyright (C) 2019 C. Faloutsos P 2 -101
CMU SCS A word of caution • Maybe there are no good cuts: ``jellyfish’’ shape [Tauro+’ 01], [Siganos+, ’ 06], strange behavior of cuts [Chakrabarti+, ’ 04], [Leskovec+, ’ 08] ? Gov. of India ? Copyright (C) 2019 C. Faloutsos P 2 -102
CMU SCS R 1: Jellyfish model [Tauro+] … A Simple Conceptual Model for the Internet Topology, L. Tauro, C. Palmer, G. Siganos, M. Faloutsos, Global Internet, November 25 -29, 2001 Jellyfish: A Conceptual Model for the AS Internet Topology G. Siganos, Sudhir L Tauro, Faloutsos, J. of Communications and Networks, Vol. 8, No. 3, pp Gov. M. of India Copyright (C) 2019 C. Faloutsos P 2 -103 339 -350, Sept. 2006.
CMU SCS R 1: Jellyfish model [Tauro+] … A Simple Conceptual Model for the Internet Topology, L. Tauro, C. Palmer, G. Siganos, M. Faloutsos, Global Internet, November 25 -29, 2001 Jellyfish: A Conceptual Model for the AS Internet Topology G. Siganos, Sudhir L Tauro, Faloutsos, J. of Communications and Networks, Vol. 8, No. 3, pp Gov. M. of India Copyright (C) 2019 C. Faloutsos P 2 -104 339 -350, Sept. 2006.
CMU SCS R 1: Jellyfish model [Tauro+] … A Simple Conceptual Model for the Internet Topology, L. Tauro, C. Palmer, G. Siganos, M. Faloutsos, Global Internet, November 25 -29, 2001 Jellyfish: A Conceptual Model for the AS Internet Topology G. Siganos, Sudhir L Tauro, Faloutsos, J. of Communications and Networks, Vol. 8, No. 3, pp Gov. M. of India Copyright (C) 2019 C. Faloutsos P 2 -105 339 -350, Sept. 2006.
CMU SCS R 2: 'Familiar strangers’ • Bipartite graph (‘heterophily’) ‘lawyers’ 'eng. ’ . g n e ? rs e y law ? Gov. of India Copyright (C) 2019 C. Faloutsos 106
CMU SCS R 3: ``Core-periphery’’ • Bipartite graph + clique Main rs e b m me es t i l l sate ? ? Gov. of India Copyright (C) 2019 C. Faloutsos 107
CMU SCS Strange behavior of min cuts log (mincut-size / #edges) • ‘negative dimensionality’ (!) Slope~ -0. 45 1 -1/d log (# edges) Clickstream graph Net. Mine: New Mining Tools for Large Graphs, by D. Chakrabarti, Y. Zhan, D. Blandford, C. Faloutsos and G. Blelloch, in the SDM 2004 Workshop on Link Analysis, Counter-terrorism and Privacy Statistical Properties of Community Structure in Large Social and Information Networks, J. Leskovec, K. C. Lang, A. Dasgupta, M. P 2 -108 Mahoney. Gov. of India Copyright (C) 2019 Faloutsos WWW 2008.
CMU SCS Strange behavior of min cuts log (mincut-size / #edges) • ‘negative dimensionality’ (!) Slope~ -0. 45 1 -1/d log (# edges) Clickstream graph Net. Mine: New Mining Tools for Large Graphs, by D. Chakrabarti, Y. Zhan, D. Blandford, C. Faloutsos and G. Blelloch, in the SDM 2004 Workshop on Link Analysis, Counter-terrorism and Privacy Statistical Properties of Community Structure in Large Social and Information Networks, J. Leskovec, K. C. Lang, A. Dasgupta, M. P 2 -109 Mahoney. Gov. of India Copyright (C) 2019 Faloutsos WWW 2008.
CMU SCS Short answer • METIS [Karypis, Kumar] • (but: maybe NO good cuts exist!) Gov. of India Copyright (C) 2019 C. Faloutsos P 2 -110
CMU SCS Roadmap • Introduction – Motivation • Part#1: Graphs – P 1. 1: properties/patterns in graphs – P 1. 2: node importance – P 1. 3: community detection – P 1. 4: fraud/anomaly detection – P 1. 5: belief propagation Gov. of India Copyright (C) 2019 C. Faloutsos 111
- Graphs that compare distance and time are called
- Data mining cmu
- Cmu data mining
- Difference between strip mining and open pit mining
- Text and web mining
- Transition bugs in software testing
- Graphs that enlighten and graphs that deceive
- Strip mining vs open pit mining
- Strip mining vs open pit mining
- Multimedia data mining
- Mining complex types of data in data mining
- Mining social network graphs
- Polynomial end behavior chart
- Mining frequent patterns associations and correlations
- Associations and correlations in data mining
- Mining frequent patterns associations and correlations
- Patterns equations and graphs
- Mining frequent patterns without candidate generation
- Mining frequent patterns without candidate generation
- Travin hazelrigg
- Applied hydrology
- Lluvia neta
- Apa itu lengkung peralihan
- Infiltration index
- Dioda diac
- Scs curve number
- Tirstor
- Color 9132005
- Scs.ryerson.ca harley
- Simbol scs
- Scs reasonable person principle
- Scs thyristor
- Scs carleton
- Scs archiver
- Jenis lengkung
- Scs elogs
- Scs lulu
- Scs methode
- Doc scs
- Carotenoid score
- Elapsed time
- Series aiding and series opposing
- Constant speed on distance time graph
- Dating patterns since the 1960s are
- Maclaurin series vs taylor series
- Heisenberg 1925 paper
- Taylor vs maclaurin
- Maclaurin series vs taylor series
- Ibm p series models
- Series-shunt feedback amplifier examples
- Arithmetic sum formula
- Interpreting distance time graphs
- Position vs time graphs
- What is the average velocity of the object
- Interpreting distance time graphs
- Depth time graphs
- Creating and interpreting distance time graph
- Non linear distance time graphs
- Irregular variation in time series example
- Objectives of time
- Unit root time series
- Unit root test lecture notes
- Time series analysis
- Analisis time series
- Time series cross validation
- Time series binning
- Pengertian time series
- Aggregating time series data
- Time series further maths
- Utility of time series
- Discrete time fourier series
- Discrete fourier transform formula
- Magnitude and phase response
- Centered moving average example
- Covariance of two time series
- Time series collection
- Time series for dummies
- Additive model time series
- Time series definition
- Interrupted time series vs regression discontinuity
- Pooled time series
- Time series analysis using stata
- Importance of time series analysis
- Components of time series
- Time series motifs
- Sax time series
- Time series forecasting
- Equation fourier
- Time series shapelets
- The time series consists of
- Teknik analisis data time series
- Time series gcse
- Scala time series
- Networks and graphs circuits paths and graph structures
- How to understand graphs and charts
- Setup time and hold time in digital electronics
- How to find principal formula
- Calculating infusion time and completion time
- Seektime
- Definition of work study
- Time study procedure
- 16-385 computer vision
- Greg kesden cmu
- Hci major
- Tw cmu xxx
- Cmu 15-418
- Lorrie cranor cmu
- Mitchell 1997 machine learning
- 14848 cmu
- Cmu 14848
- Hui zhang cmu