Human Genome Project DNA Sliding Technique Index Building


Background: Human Genome Project, DNA research, future developments



Sliding Technique

Index Building Algorithm
1. Extracting 4-subsequences
2. Building the back-end index
3. Extracting 2-grams
4. Building the front-end index

Building the back-end index

Building the front-end index
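The four index-building steps can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the presenters' implementation: the three sample documents, the function names, and the choice to cut each document into 4-subsequences that overlap by one character (n - 1) are all assumptions made for this example. Subsequence ids are assigned in sorted order purely for readability.

```python
from collections import defaultdict

def split_subsequences(doc, m=4, n=2):
    """Step 1: cut doc into length-m pieces overlapping by n-1 characters (assumed)."""
    step = m - (n - 1)
    # A trailing piece may be shorter than m; it is kept as-is in this sketch.
    return [(off, doc[off:off + m]) for off in range(0, len(doc) - (n - 1), step)]

def build_two_level_index(docs, m=4, n=2):
    # Step 1: collect every (subsequence, document id, offset) occurrence.
    occurrences = []
    for doc_id, doc in enumerate(docs):
        for offset, piece in split_subsequences(doc, m, n):
            occurrences.append((piece, doc_id, offset))
    # Give each distinct subsequence an id (sorted order, an arbitrary choice).
    distinct = sorted({piece for piece, _, _ in occurrences})
    subseq_ids = {piece: sid for sid, piece in enumerate(distinct)}

    # Step 2: back-end index, subsequence id -> [(document id, offset)].
    back_end = defaultdict(list)
    for piece, doc_id, offset in occurrences:
        back_end[subseq_ids[piece]].append((doc_id, offset))

    # Steps 3 and 4: extract n-grams from each subsequence with a
    # one-character sliding window and build the front-end index,
    # n-gram -> [(subsequence id, offset inside the subsequence)].
    front_end = defaultdict(list)
    for piece, sid in subseq_ids.items():
        for i in range(len(piece) - n + 1):
            front_end[piece[i:i + n]].append((sid, i))
    return subseq_ids, dict(back_end), dict(front_end)

# Hypothetical sample documents over the alphabet {A, B, C, D}.
docs = ["ABCDDABBCD", "DABCDABCDA", "CDABBCDDAB"]
subseq_ids, back_end, front_end = build_two_level_index(docs)
```

With these sample documents the sketch produces six distinct 4-subsequences. The back-end index records where each subsequence occurs in the documents, and the front-end index records which subsequences contain each 2-gram.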

Query Processing
Query: ABBCD
2-grams: AB, BC, CD
Subsequence 0 matches ABCD


Query Processing
Query: ABBCD
2-grams: AB, BC, CD
Subsequence 1 matches BBCD
Sub X, Sub 1: ABBCD

Query Processing
Query: ABBCD
2-grams: AB, BC, CD
Subsequence 2 matches BCD
Sub X, Sub 2: ABBCD

Query Processing
Query: ABBCD
2-grams: AB, BC, CD
Subsequence 3 matches CDAB
Sub X, Sub 3: ABBCDAB

Query Processing
Query: ABBCD
2-grams: AB, BC, CD
Subsequence 4 matches ABC

Query Processing
Query: ABBCD
2-grams: AB, BC, CD
Subsequence 5 matches AB
Sub 5, Sub X: ABBCD

Query Processing
0. ABCD
1. BBCD
2. BCD
3. CDAB
4. ABC
5. AB
Sub 1: BBCD
Sub 2: BCDA
Sub 3: CDAB
Sub 5: DDAB

Query Processing
Sub 1: BBCD, Sub 2: BCDA, Sub 3: CDAB, Sub 5: DDAB
Document 0

Query Processing
Sub 1: BBCD, Sub 2: BCDA, Sub 5: DDAB
Document 1

Query Processing
Sub 1: BBCD, Sub 2: BCDA, Sub 3: CDAB, Sub 5: DDAB
Document 2
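The query-processing walk-through above can be approximated with the following Python sketch. It is an illustration under several assumptions rather than the presenters' code: the sample documents, the function names, the one-character overlap between 4-subsequences, and the `covers` rule (a subsequence is kept if it is contained in the query, contains the query, or overlaps the start or end of the query by at least one 2-gram) are all inferred for this example. The final step checks candidate documents with a plain substring test, a simplification that stands in for any offset-based check. The sketch extracts 2-grams with a one-character sliding window, so for the query ABBCD it also produces BB.

```python
from collections import defaultdict

M, N = 4, 2  # subsequence length and n-gram length used on the slides

def build_index(docs):
    """Rebuild a small two-level index (overlap of N-1 characters is assumed)."""
    step = M - (N - 1)
    postings = defaultdict(set)              # subsequence text -> {document id}
    for doc_id, doc in enumerate(docs):
        for off in range(0, len(doc) - (N - 1), step):
            postings[doc[off:off + M]].add(doc_id)
    subs = sorted(postings)                  # list position = subsequence id
    back_end = {sid: postings[s] for sid, s in enumerate(subs)}
    front_end = defaultdict(set)             # 2-gram -> {subsequence id}
    for sid, s in enumerate(subs):
        for i in range(len(s) - N + 1):
            front_end[s[i:i + N]].add(sid)
    return subs, back_end, front_end

def covers(sub, query):
    """Assumed rule: sub can lie on an occurrence of query."""
    if sub in query or query in sub:
        return True
    for k in range(N, min(len(sub), len(query))):
        # Suffix of sub meets prefix of query, or prefix of sub meets suffix of query.
        if sub[-k:] == query[:k] or sub[:k] == query[-k:]:
            return True
    return False

def search(query, docs, subs, back_end, front_end):
    # Split the query into 2-grams and look each one up in the front-end index.
    grams = [query[i:i + N] for i in range(len(query) - N + 1)]
    matched = set()
    for gram in grams:
        matched |= front_end.get(gram, set())
    # Keep only the subsequences that can cover part of the query.
    selected = sorted(sid for sid in matched if covers(subs[sid], query))
    # Look the selected subsequences up in the back-end index.
    doc_ids = set()
    for sid in selected:
        doc_ids |= back_end[sid]
    # Verify candidates directly (simplification of the final check).
    hits = sorted(d for d in doc_ids if query in docs[d])
    return sorted(matched), selected, hits

# Hypothetical sample documents over the alphabet {A, B, C, D}.
docs = ["ABCDDABBCD", "DABCDABCDA", "CDABBCDDAB"]
subs, back_end, front_end = build_index(docs)
matched, selected, hits = search("ABBCD", docs, subs, back_end, front_end)
```

For these sample documents, all six subsequences share at least one 2-gram with the query, the `covers` filter keeps subsequences 1, 2, 3 and 5, and the substring check leaves documents 0 and 2 as the final hits.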

Development Environment
Hardware:
CPU: AMD Athlon 64 3000+
RAM: DDR 400, 2 GB
Software:
Windows XP Service Pack 2
JDK 1.5.0_09
MySQL 4.1.12a-nt
NetBeans IDE 5.5


DEMO