Range Queries on Uncertain Data Jian Li Tsinghua
- Slides: 30
Range Queries on Uncertain Data Jian Li, Tsinghua University Haitao Wang, Utah State University ISAAC 2014
One dimensional range queries § A trivial solution: balanced binary search tree
An uncertain point p § 0. 1 0. 3 0. 2 0. 4
An uncertain point p: A general case § 0. 25 0. 2 0. 15 0. 22
The cumulative distribution function (CDF) § 1 0
Range query problems on uncertain points §
An application on deterministic data
An application on deterministic data (cont. ) §
Previous work: only on threshold queries § A heuristic solution using R-trees, Cheng et al. VLDB 04’ § fast in practice, but O(n) time in the worst case § Theoretical results: Agarwal et al. PODS 09’ § preprocessing: O(n log 2 n) space and O(n log 3 n) expected time § query: O(m+log 3 n) time, where m is the output size § A special case: t is fixed for all queries, preprocessing: O(n) space and O(n log n) time v query: O(m + log n) time, where m is the output size v § Heuristic solutions in 2 -D or higher-D, Tao et al. 2005 § O(n) time in the worst case
An application on deterministic data (cont. ) §
Variations § four variations
Our results: uniform unbounded preprocessing time space query time top-1 O(n log n) O(log n) top-k O(n log n) O(k + log n) threshold O(n log n) O(m + log n)
Our results: histogram unbounded preprocessing time space query time top-1 O(n log n) O(log n) top-k O(n log n) O(n) T threshold O(n log n) O(m + log n) T=O(k) if k = Ω(log n loglog n) and O(log n + k log k) otherwise
Our results: uniform bounded preprocessing time space query time top-1 O(n log n) O(log n) top-k O(n log 2 n) O(n log n) T threshold O(n log 2 n) O(n log n) O(m + log n) T=O(k) if k = Ω(log n loglog n) and O(log n + k log k) otherwise
Future work: histogram bounded § No new results § Previous work only on threshold queries, § P. K. Agarwal, S. -W. Cheng, Y. Tao, and K. Yi, PODS 2009 § preprocessing: O(n log 2 n) space and O(n log 3 n) expected time § query: O(m+log 3 n) time, where m is the output size
The arrangement of CDFs § t
Top-1: unbounded §
Difficulty for top-k queries § Arrangements of segments: difficult! § Arrangements of lines: much easier! § Uniform case: change each CDF to a line 1 0
Uniform unbounded § t
A half-plane range reporting data structure § Problem: Given a line arrangement, for any query point q, return the lines above q § Data structure: Partition lines into layers: each layer consists of lines in the upper envelop after removing the previous layers
Threshold query: uniform unbounded § query time: O(log n + m) t
Top-k query: uniform unbounded § Use a heap: O(log n + k log k) query time § Observation: largest k elements in O(k) sorted arrays § a selection algorithm on sorted matrices, Frederickson and Johnson, 82’ ----> O(log n + k) time
Our results: uniform unbounded preprocessing time space query time top-1 O(n log n) O(log n) top-k O(n log n) O(k + log n) threshold O(n log n) O(m + log n)
Uniform bounded §
Uniform bounded (cont. ) §
Uniform bounded (cont. ) § Top-1 queries: § L-type and R-type: use a persistent data structure to maintain O(n) upper envelops in the preprocessing § M-type: transform to segment dragging queries in 2 D § Top-k queries: § L-type and R-type: use a binary tree T, and on each node, build a data structure as in the unbounded case v build a fractional cascading structure on T v § M-type: transform to a range query in 3 D § Threshold queries: § Similar as for top-k queries
Histogram unbounded § A segment query problem § Given a set of n segments, for any point q, return all segments vertically above q § P. K. Agarwal, S. -W. Cheng, Y. Tao, and K. Yi, PODS 2009 § preprocessing: O(n) space and O(n log n) time § query: O(log n + m) time q
Thank you for your attention!
- Jian yun nie
- Jian-jiun ding
- Jian-jiun ding
- Activities in jian
- Jian sheng wang
- Jian sun
- Web learning tsinghua
- Tsinghua university school of economics and management
- Tsinghua university subsidiaries
- Lib.tsinghua.edu.n
- Tsinghua university
- Helloß
- Tsinghua
- National tsinghua university
- Tsinghua university
- Peng cui tsinghua
- As compared to long-range forecasts, short-range forecasts
- Using subqueries to solve queries
- Standing queries are
- Action queries in access
- Any queries slide
- Multirelation queries
- Wildcard query in information retrieval
- Any queries
- Complex sql join queries
- Basic retrieval queries in sql
- Hotel.hotelno=room.hotelno(hotel room)
- Sql insert update delete query
- Answering my queries
- Wide world importers
- Sql queries for banking database