Deep Web Technologies Show and Tell Presentation to
03 December 2012
By Abe Lederman, CEO
© 2012 Deep Web Technologies, Inc.
Abe Lederman
Deep Web Technologies was founded by Abe Lederman in 2002.
– BS & MS degrees in Computer Science from MIT
– A co-founder of Verity, acquired by Autonomy (now HP)
– Developed SciSearch@LANL (part of “Library without Walls”)
– 25 years experience in Information Retrieval
About Deep Web Technologies...
• 20-person company based in Santa Fe, New Mexico
• Over $5M in DOE SBIR Grants (2003-2011)
• Pioneer/trailblazer in federated search
• 100+ solutions in production
Customers Include...
Government:
• Defense Technical Info Center (DTIC)
• Office of Sci. & Tech. Info (DOE-OSTI)
• UN Economic Comm. for Africa (UNECA)
• European Space Agency
Academic:
• Stanford University
• George Mason University
• Texas Medical Center
• University College of Cork
Corporate:
• Boeing
• BASF
• Intel
• HP
• P&G
Public Portals:
• WorldWideScience.org
• Science.gov
• Biznar
• Mednar
• ScienceResearch.com
History of Partnership
• 2007: Develop 3 POCs (Top 10 DB, 5 Catalogs, Digital Repositories)
• 2010: Launch xSearch for Science & Engineering (28 sources)
• 2011: Expand xSearch to include Social Sciences & Humanities. Also expanded later in the year for GSB sources (170 sources). In November 2011 the Charleston Advisor Review was published
• 2012: Upgrade and expand xSearch to 200 sources in December 2012
2008 PR
What Is Federated Search?
Federated Search allows users to submit a real-time search in parallel to multiple information sources and retrieve aggregated, ranked and de-duplicated results.
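The definition above can be illustrated with a minimal sketch. It assumes two hypothetical in-memory sources (`source_a`, `source_b`) with made-up records and scores; it is not Explorit's implementation, and a real connector would query a remote search engine instead of a dictionary. The sketch fans the query out in parallel, aggregates the results, de-duplicates on a normalized title, and ranks by score.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory "sources" with invented records and relevance scores.
SOURCES = {
    "source_a": [
        {"title": "Solar Cell Efficiency", "score": 0.9},
        {"title": "Wind Turbine Design", "score": 0.4},
    ],
    "source_b": [
        {"title": "solar cell efficiency", "score": 0.7},
        {"title": "Solar Thermal Storage", "score": 0.6},
    ],
}


def search_source(name, query):
    """Return records from one source whose title contains the query (case-insensitive)."""
    q = query.lower()
    return [dict(r, source=name) for r in SOURCES[name] if q in r["title"].lower()]


def federated_search(query):
    # Fan the query out to every source in parallel.
    with ThreadPoolExecutor(max_workers=len(SOURCES)) as pool:
        batches = list(pool.map(lambda name: search_source(name, query), SOURCES))
    # Aggregate, then de-duplicate on a normalized title, keeping the best-scoring copy.
    best = {}
    for record in (r for batch in batches for r in batch):
        key = record["title"].strip().lower()
        if key not in best or record["score"] > best[key]["score"]:
            best[key] = record
    # Rank the merged results by score, highest first.
    return sorted(best.values(), key=lambda r: r["score"], reverse=True)


results = federated_search("solar")
for r in results:
    print(r["score"], r["title"], "from", r["source"])
```

De-duplicating on a lowercased title is only a placeholder; production systems typically compare richer metadata such as identifiers, authors, and dates.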
In Other Words… One Search, Many Sources
Begin Search
• Subscription Sources
• News & Social Media
• Reports
• Public Web Sources
• Internal Sources
• Blogs & Wikis
xSearch Status
• Upgraded early fall to v3.2.2
• GSB linked to xSearch
• 200 collections in application
• 30 new connectors in acceptance testing (roll-out imminent)
User Queries by Month/Year
[Chart: user queries per month, January to December, with series for 2010, 2011 and 2012; vertical axis 0 to 7,000]
Source Queries
[Chart: source queries per month, January to December, with series for 2010, 2011 and 2012; vertical axis 0 to 700,000]
Top click-throughs: Jan 1, 2012 – November 30, 2012
[Bar chart, vertical axis 0 to 6,000. Sources shown: Project Muse, GeoRef, MLA International Bibliography, ACS Publications, CRCnetBASE, Periodicals Archive Online, ScienceDirect, Web of Science, Access World News, ABI/Inform Global, PsycINFO, PubMed, Academic Search Premier, Environmental Sciences and Pollution Management, ebrary, Engineering Village, ERIC, Sociological Abstracts, Highwire Press, Business Source Complete, PAIS International, Scopus, LexisNexis Academic: News, JSTOR]
Explorit Release 3.2.3
Starting customer upgrades
• Visual clusters
• Full-text filters
• Content type/Media type
• Integration with Zotero, Mendeley
Explorit Release 4.0
Coming mid-2013
• Dynamic tab searching
• Thesaurus-based searching (Do you also want to search for?)
• Personal library
• Big Data mashups (Enhanced content)
• Faceted navigation
Big Data Mashups
Related Content
• Major Science portals
• Science News
• Patent Databases
• Scholar Networks
• Subscription Sources
• Public Databases
• Open Access Journals
Article Grouping
Non-Invasive imaging
Linking Open Data Cloud Diagram by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
Nature has 297 million triples.
Journey to 10,000 sources
Scalability Challenges
• Source selection
• Ranking and organizing of results
• Traffic management
• System load management
• Finding, building, and maintaining connectors
Scalability - Divide and Conquer
• ScienceResearch.com
• Other Federated Search Engines
• WorldWideScience.org
• Science.gov
• ScienceAccelerator
Multilingual WorldWideScience.org
How Multilingual Federated Search Works
• Query in user’s language is submitted to EXPLORIT
• Query to be translated for each source (Microsoft Translator)
• Query in source’s language is sent to foreign language search engines (German, Chinese, Russian)
• Results in source’s language are returned
• Ranking
• Ranked results translated by Microsoft to user’s language
• Ranked results in user’s language: results returned to user
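The flow on this slide can be sketched roughly as follows. Everything here is an assumption for illustration: `translate` is a hypothetical offline lookup table standing in for a machine translation service (it does not call Microsoft Translator), the sources and titles are invented, and ranking by title length is only a placeholder for a real relevance score.

```python
# Hypothetical stand-in for a machine translation service: a tiny lookup table
# so the sketch runs offline. Unknown text is passed through unchanged.
PHRASES = {
    ("en", "de", "solar energy"): "Solarenergie",
    ("de", "en", "Solarenergie heute"): "Solar energy today",
    ("de", "en", "Geschichte der Solarenergie"): "History of solar energy",
}


def translate(text, src, dst):
    if src == dst:
        return text
    return PHRASES.get((src, dst, text), text)


# Hypothetical sources, each with its own language and a few document titles.
SOURCES = [
    {"lang": "de", "docs": ["Solarenergie heute", "Geschichte der Solarenergie", "Windkraft"]},
    {"lang": "en", "docs": ["Solar energy storage"]},
]


def search(source, query):
    """Naive search: return titles containing the query, case-insensitively."""
    return [d for d in source["docs"] if query.lower() in d.lower()]


def multilingual_search(query, user_lang):
    hits = []
    for source in SOURCES:
        # Translate the user's query into the source's language, then search.
        local_query = translate(query, user_lang, source["lang"])
        for title in search(source, local_query):
            hits.append({"title": title, "lang": source["lang"]})
    # Rank the combined hits (placeholder: shorter titles first).
    hits.sort(key=lambda h: len(h["title"]))
    # Translate the ranked results into the user's language.
    return [translate(h["title"], h["lang"], user_lang) for h in hits]


results = multilingual_search("solar energy", "en")
print(results)
```

The order of operations mirrors the slide: the query is translated per source, results come back in each source's language, and the ranked list is translated into the user's language last.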
ESN – x 2
UNECA ASKIA Portal (United Nations – Access Scientific Knowledge in Africa)
BASF
Find It! TMC’s Link Resolver
Abe’s Stanford Projects WISHLIST
• Assist SULAIR in integrating Explorit preview via Web Services into new library portal.
• Integration with SearchWorks
• Develop a stand-alone portal focused on a Stanford core competency (Energy, Environment, …)
Abe’s Stanford Projects WISHLIST (cont.)
• Develop Chinese Explorit in collaboration with Library of Congress and/or other library
• xSearch / Explorit for Stanford Medical School
• Mash up Big Data (Linked Data, Mendeley, citations) and articles
• Data Portal (expansion of Data searching in WorldWideScience.org)
• Integration with Sakai or other CMS
Explore our Applications
• xSearch
• WorldWideScience.org
• Science.gov
• Ciencia.Science.gov
• DTIC Multisearch
Thank you!
Abe Lederman
abe@deepwebtech.com