Distributed Databases: Review
Yangjun Chen, May 2003

Distributed Databases
- Distributed Transaction Management
- Distributed Query Processing
- Semantic Data Control
- Distributed Database Design
- System Architecture

System Architecture
- ANSI/SPARC Architecture
- DBMS Implementation Alternatives
- Datalogical Architecture
- Client/Server Architecture
- Components of a Multi-DBMS

Distributed DBMS Design
- Relation Fragmentation
  - Horizontal, Vertical, Hybrid
- Correctness of Fragmentation
  - Completeness, Reconstruction, Disjointness
- Algorithms
  - COM_MIN Algorithm, Bond Energy Algorithm
- Fragment Allocation

Algorithms for horizontal fragmentation:
- COM_MIN Algorithm
  Given a relation and a set of predicates, it finds a complete and minimal set of predicates.
- Horizontal Algorithm
  Given a complete and minimal set of predicates, it produces a horizontal fragmentation.
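
The second step can be sketched in a few lines. This is a minimal, data-driven illustration rather than the Horizontal Algorithm itself: it assumes the predicates passed in are already complete and minimal, and it groups tuples by which predicates they satisfy, so that each group corresponds to one minterm predicate. The textbook algorithm eliminates contradictory minterms by reasoning over implications; here a contradictory minterm simply never receives a tuple. The relation `PROJ`, its attributes, and the two predicates are hypothetical.

```python
from collections import defaultdict
from typing import Any, Callable, Dict, List, Tuple

Row = Dict[str, Any]
Predicate = Callable[[Row], bool]


def horizontal_fragments(
    relation: List[Row], predicates: List[Predicate]
) -> Dict[Tuple[bool, ...], List[Row]]:
    """Group tuples by the truth value of every simple predicate.

    Each key is a tuple of booleans, one per predicate, and stands for the
    minterm predicate that takes predicate k in natural form where the key
    holds True and in negated form where it holds False.
    """
    fragments: Dict[Tuple[bool, ...], List[Row]] = defaultdict(list)
    for row in relation:
        signature = tuple(p(row) for p in predicates)
        fragments[signature].append(row)
    return dict(fragments)


# Hypothetical relation and simple predicates, for illustration only.
PROJ: List[Row] = [
    {"pno": "P1", "budget": 150000, "loc": "Montreal"},
    {"pno": "P2", "budget": 135000, "loc": "New York"},
    {"pno": "P3", "budget": 250000, "loc": "New York"},
    {"pno": "P4", "budget": 310000, "loc": "Paris"},
]
PREDICATES: List[Predicate] = [
    lambda r: r["loc"] == "Montreal",
    lambda r: r["budget"] <= 200000,
]

FRAGMENTS = horizontal_fragments(PROJ, PREDICATES)
for signature, rows in FRAGMENTS.items():
    print(signature, [r["pno"] for r in rows])
```

Because every tuple is assigned to exactly one signature, the result is complete and disjoint by construction, and the union of the fragments reconstructs the relation.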
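
Global affinity measure:

\[
GAM = \sum_{i=1}^{n} \sum_{j=1}^{n} \mathrm{aff}(A_i, A_j) \, \bigl[ \mathrm{aff}(A_i, A_{j-1}) + \mathrm{aff}(A_i, A_{j+1}) + \mathrm{aff}(A_{i-1}, A_j) + \mathrm{aff}(A_{i+1}, A_j) \bigr]
\]

Since the affinity matrix is symmetric, we have GAM = 2 * AM, where

\[
AM = \sum_{i=1}^{n} \sum_{j=1}^{n} \mathrm{aff}(A_i, A_j) \, \bigl[ \mathrm{aff}(A_i, A_{j-1}) + \mathrm{aff}(A_i, A_{j+1}) \bigr]
\]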
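
Global affinity measure:

\[
\begin{aligned}
AM &= \sum_{i=1}^{n} \sum_{j=1}^{n} \mathrm{aff}(A_i, A_j) \, \bigl[ \mathrm{aff}(A_i, A_{j-1}) + \mathrm{aff}(A_i, A_{j+1}) \bigr] \\
   &= \sum_{j=1}^{n} \sum_{i=1}^{n} \bigl[ \mathrm{aff}(A_i, A_j) \, \mathrm{aff}(A_i, A_{j-1}) + \mathrm{aff}(A_i, A_j) \, \mathrm{aff}(A_i, A_{j+1}) \bigr] \\
   &= \sum_{j=1}^{n} \bigl[ \mathrm{bond}(A_j, A_{j-1}) + \mathrm{bond}(A_j, A_{j+1}) \bigr],
\end{aligned}
\]

where

\[
\mathrm{bond}(A_x, A_y) = \sum_{z=1}^{n} \mathrm{aff}(A_z, A_x) \, \mathrm{aff}(A_z, A_y).
\]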
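
The equivalence of the affinity form and the bond form of AM can be checked numerically. The sketch below computes AM both ways for a small symmetric affinity matrix. It assumes the usual boundary convention that affinities to the non-existent columns A_0 and A_{n+1} are zero; the matrix values are illustrative, and attributes are indexed from 0 in the code.

```python
from typing import List

Matrix = List[List[int]]

# Illustrative symmetric affinity matrix for four attributes A1..A4
# (rows and columns 0..3 in the code).
AFF: Matrix = [
    [45, 0, 45, 0],
    [0, 80, 5, 75],
    [45, 5, 53, 3],
    [0, 75, 3, 78],
]


def bond(aff: Matrix, x: int, y: int) -> int:
    """bond(Ax, Ay) = sum over z of aff(Az, Ax) * aff(Az, Ay)."""
    return sum(aff[z][x] * aff[z][y] for z in range(len(aff)))


def am_from_affinities(aff: Matrix) -> int:
    """AM as the double sum over aff(Ai, Aj) * [aff(Ai, Aj-1) + aff(Ai, Aj+1)]."""
    n = len(aff)

    def a(i: int, j: int) -> int:
        # Assumed boundary convention: columns outside the matrix count as 0.
        return aff[i][j] if 0 <= j < n else 0

    return sum(
        aff[i][j] * (a(i, j - 1) + a(i, j + 1))
        for i in range(n)
        for j in range(n)
    )


def am_from_bonds(aff: Matrix) -> int:
    """AM as the sum over j of bond(Aj, Aj-1) + bond(Aj, Aj+1)."""
    n = len(aff)

    def b(x: int, y: int) -> int:
        # Bonds to a non-existent neighbour column count as 0.
        return bond(aff, x, y) if 0 <= y < n else 0

    return sum(b(j, j - 1) + b(j, j + 1) for j in range(n))


print(bond(AFF, 0, 2))  # bond(A1, A3) -> 4410
print(am_from_affinities(AFF) == am_from_bonds(AFF))  # -> True
```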

Relation schema: R[A_1, …, A_i, …, A_j, …, A_{k-1}, A_k, …, A_n]

Affinity matrix (column order): A_1, …, A_i, …, A_j, …, A_{k-1}, A_k, …, A_n

Clustered affinity matrix (column order):
old: A_{i1}, …, A_{il}, A_i, A_j, A_{i(l+3)}, …, A_{i(k-1)}
new: A_{i1}, …, A_{il}, A_i, A_k, A_j, A_{i(l+3)}, …
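
\[
\begin{aligned}
AM_{old} ={}& \sum_{s \le l} \bigl[ \mathrm{bond}(A_{is}, A_{i(s-1)}) + \mathrm{bond}(A_{is}, A_{i(s+1)}) \bigr] \\
 &+ \mathrm{bond}(A_i, A_{il}) + \mathrm{bond}(A_i, A_j) + \mathrm{bond}(A_j, A_i) + \mathrm{bond}(A_j, A_{i(l+3)}) \\
 &+ \sum_{s \ge l+3} \bigl[ \mathrm{bond}(A_{is}, A_{i(s-1)}) + \mathrm{bond}(A_{is}, A_{i(s+1)}) \bigr]
\end{aligned}
\]

\[
\begin{aligned}
AM_{new} ={}& \sum_{s \le l} \bigl[ \mathrm{bond}(A_{is}, A_{i(s-1)}) + \mathrm{bond}(A_{is}, A_{i(s+1)}) \bigr] \\
 &+ \mathrm{bond}(A_i, A_{il}) + \mathrm{bond}(A_i, A_k) + \mathrm{bond}(A_k, A_i) + \mathrm{bond}(A_k, A_j) + \mathrm{bond}(A_j, A_k) + \mathrm{bond}(A_j, A_{i(l+3)}) \\
 &+ \sum_{s \ge l+3} \bigl[ \mathrm{bond}(A_{is}, A_{i(s-1)}) + \mathrm{bond}(A_{is}, A_{i(s+1)}) \bigr]
\end{aligned}
\]

The two sums are the same in both expressions and cancel. Since bond is symmetric:

\[
AM_{new} - AM_{old} = 2 \, \mathrm{bond}(A_i, A_k) + 2 \, \mathrm{bond}(A_j, A_k) - 2 \, \mathrm{bond}(A_i, A_j)
\]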
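
The difference AM_new - AM_old can be checked on a concrete ordering. The sketch below evaluates AM for a column ordering as a sum of neighbour bonds, inserts one attribute between two adjacent columns, and compares the change against the closed form. The function name `cont`, the matrix values, and the chosen orderings are assumptions made for illustration; attributes are indexed from 0.

```python
from typing import List

Matrix = List[List[int]]

# Illustrative symmetric affinity matrix for attributes A1..A4 (indices 0..3).
AFF: Matrix = [
    [45, 0, 45, 0],
    [0, 80, 5, 75],
    [45, 5, 53, 3],
    [0, 75, 3, 78],
]


def bond(aff: Matrix, x: int, y: int) -> int:
    """bond(Ax, Ay) = sum over z of aff(Az, Ax) * aff(Az, Ay)."""
    return sum(aff[z][x] * aff[z][y] for z in range(len(aff)))


def am(aff: Matrix, order: List[int]) -> int:
    """AM of a column ordering: each column contributes its bond with its
    left and right neighbours; a missing neighbour contributes 0."""
    total = 0
    for pos, attr in enumerate(order):
        if pos > 0:
            total += bond(aff, attr, order[pos - 1])
        if pos < len(order) - 1:
            total += bond(aff, attr, order[pos + 1])
    return total


def cont(aff: Matrix, i: int, k: int, j: int) -> int:
    """Net change in AM when Ak is placed between adjacent columns Ai and Aj."""
    return 2 * bond(aff, i, k) + 2 * bond(aff, k, j) - 2 * bond(aff, i, j)


OLD_ORDER = [1, 0, 2]      # A2, A1, A3
NEW_ORDER = [1, 0, 3, 2]   # A2, A1, A4, A3: A4 inserted between A1 and A3

print(am(AFF, NEW_ORDER) - am(AFF, OLD_ORDER) == cont(AFF, 0, 3, 2))  # -> True
```

Only the bonds touching the insertion point change, which is why the difference needs three bond evaluations instead of a full recomputation of AM.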

Total cost
- Cost for query processing
  - Cost for data processing
    - Cost for data access
    - Cost for integrity checking
    - Cost for concurrency control
  - Cost for transmission
    - Update cost
    - Retrieval cost
- Cost for storage

Semantic Data Control
- View Management
- Data Security
- Semantic Integrity Control
  - Constraint Specification Language
  - Integrity Enforcement

Distributed Query Processing
Query Processing Methodology
- Query Decomposition
- Data Localization
- Global Optimization
- Local Optimization

- R* Algorithm
- SDD-1 Algorithm

Distributed Transaction Management
- Transaction Concepts and Models
- Distributed Concurrency Control
  - Distributed 2PL
  - Timestamp Ordering
  - Optimistic Concurrency Control
- Deadlock

Phenomenon   Description
P1           dirty read (a transaction can read data that is not committed)
P2           nonrepeatable read (a transaction can read the same row twice, and it could be different)
P3           phantom

Phenomenon occurs?

Isolation level     P1     P2     P3
serializable        no     no     no
repeatable read     no     no     yes
read committed      no     yes    yes
read uncommitted    yes    yes    yes
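
The table can be used as a lookup when choosing an isolation level. The sketch below encodes it as a dictionary ordered from weakest to strongest level and returns the weakest level under which none of a given set of phenomena can occur. The P3 entries for read committed and read uncommitted are taken from the SQL standard's definition of these levels; the function name is hypothetical.

```python
from typing import Dict, Iterable

# Isolation levels ordered from weakest to strongest.
# True means the phenomenon can occur at that level.
ISOLATION_LEVELS: Dict[str, Dict[str, bool]] = {
    "read uncommitted": {"P1": True, "P2": True, "P3": True},
    "read committed": {"P1": False, "P2": True, "P3": True},
    "repeatable read": {"P1": False, "P2": False, "P3": True},
    "serializable": {"P1": False, "P2": False, "P3": False},
}


def weakest_level_preventing(unwanted: Iterable[str]) -> str:
    """Return the weakest isolation level at which none of the unwanted
    phenomena (P1 dirty read, P2 nonrepeatable read, P3 phantom) can occur."""
    unwanted = list(unwanted)
    for level, occurs in ISOLATION_LEVELS.items():  # dicts keep insertion order
        if not any(occurs[p] for p in unwanted):
            return level
    raise ValueError("no isolation level prevents: %r" % unwanted)


print(weakest_level_preventing(["P1"]))        # -> read committed
print(weakest_level_preventing(["P1", "P2"]))  # -> repeatable read
print(weakest_level_preventing(["P3"]))        # -> serializable
```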