Searching for Knowledge and Data on the Semantic Web

Searching for Knowledge and Data on the Semantic Web
Tim Finin
University of Maryland, Baltimore County
http://ebiquity.umbc.edu/resource/html/id/179/
Joint work with Li Ding, Anupam Joshi, Yun Peng, Cynthia Parr, Pranam Kolari, Pavan Reddivari, Sandor Dornbush, Rong Pan, Akshay Java, Joel Sachs, Scott Cost and Vishal Doshi
http://creativecommons.org/licenses/by-nc-sa/2.0/
This work was partially supported by DARPA contract F30602-97-1-0215, NSF grants CCR007080 and IIS9875433 and grants from IBM, Fujitsu and HP.
UMBC an Honors University in Maryland

Google has made us smarter

But what about our agents?
Agents still have a very minimal understanding of text and images.
[Diagram labels: tell, register]

XML helps
“XML is Lisp's bastard nephew, with uglier syntax and no semantics. Yet XML is poised to enable the creation of a Web of data that dwarfs anything since the Library at Alexandria.”
-- Philip Wadler, Et tu XML? The fall of the relational empire, VLDB, Rome, September 2001.

Semantic Web adds semantics
“The Semantic Web will globalize KR*, just as the WWW globalized hypertext”
-- Tim Berners-Lee
* Knowledge Representation

But what about our agents?
A Google for knowledge on the Semantic Web is needed by software agents and programs.
[Diagram labels: Swoogle, tell, register]

• http://swoogle.umbc.edu/
• Running since summer 2004
• 1.5M RDF documents, 300M RDF triples, 10K ontologies

Swoogle Architecture
[Architecture diagram. Labeled blocks: Analysis, SWD classifier, Ranking, Index, Search Services, IR Indexer, SWD Indexer, Semantic Web metadata, Web Server, Web Service, Discovery, document cache, Candidate URLs, SwoogleBot, Bounded Web Crawler, Google Crawler, the Web, Semantic Web. Outputs: html (human), rdf/xml (machine). Legend: information flow, Swoogle's web interface.]

Applications and use cases
• Supporting Semantic Web developers
  – Ontology designers, vocabulary discovery, who’s using my ontologies or data?, use analysis, errors, statistics, etc.
• Searching specialized collections
  – Spire: aggregating observations and data from biologists
  – InferenceWeb: searching over and enhancing proofs
  – SemNews: Text Meaning of news stories
• Supporting SW tools
  – Triple shop: finding data for SPARQL queries

80 ontologies were found that had these three terms.
By default, ontologies are ordered by their ‘popularity’, but they can also be ordered by recency or size.
Let’s look at this one.

Basic Metadata
hasDateDiscovered: 2005-01-17
hasDatePing: 2006-03-21
hasPingState: PingModified
type: SemanticWebDocument
isEmbedded: false
hasGrammar: RDFXML
hasParseState: ParseSuccess
hasDateLastmodified: 2005-04-29
hasDateCache: 2006-03-21
hasEncoding: ISO-8859-1
hasLength: 18K
hasCntTriple: 311.00
hasOntoRatio: 0.98
hasCntSwt: 94.00
hasCntSwtDef: 72.00
hasCntInstance: 8.00
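An agent consuming this metadata would likely read it into a typed record. The sketch below is illustrative only: the field names and values are the ones on the slide, but the helper `parse_metadata` and its type-conversion rules are assumptions, not Swoogle's own code.

```python
from datetime import date

# A subset of the metadata values shown on the slide.
RAW = """\
hasDateDiscovered: 2005-01-17
hasPingState: PingModified
isEmbedded: false
hasGrammar: RDFXML
hasCntTriple: 311.00
hasOntoRatio: 0.98
hasCntSwtDef: 72.00
hasCntInstance: 8.00
"""


def parse_value(text):
    """Assumed rules: try bool, then float, then ISO date, else keep the string."""
    if text in ("true", "false"):
        return text == "true"
    try:
        return float(text)
    except ValueError:
        pass
    try:
        return date.fromisoformat(text)
    except ValueError:
        return text


def parse_metadata(raw):
    """Hypothetical helper: turn 'key: value' lines into a dict."""
    record = {}
    for line in raw.splitlines():
        if not line.strip():
            continue
        key, _, value = line.partition(":")
        record[key.strip()] = parse_value(value.strip())
    return record


record = parse_metadata(RAW)
print(record["hasDateDiscovered"], record["hasCntTriple"], record["isEmbedded"])
```

Keeping the counts as floats mirrors how the slide prints them (311.00, 94.00); a consumer could cast them to integers if it prefers.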

These are the namespaces this ontology uses. Clicking on one shows all of the documents using the namespace. All of this is available in RDF form for the agents among us.
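Listing every document that uses a namespace amounts to a lookup in an inverted index from namespace to documents. A minimal sketch, assuming made-up document URLs under example.org and a hypothetical `build_namespace_index` helper; how Swoogle actually stores this index is not described in the slides.

```python
from collections import defaultdict

FOAF_NS = "http://xmlns.com/foaf/0.1/"
RDFS_NS = "http://www.w3.org/2000/01/rdf-schema#"
OWL_NS = "http://www.w3.org/2002/07/owl#"

# Hypothetical documents and the namespaces each one uses.
DOCS = {
    "http://example.org/a.rdf": [FOAF_NS, RDFS_NS],
    "http://example.org/b.owl": [OWL_NS, RDFS_NS],
    "http://example.org/c.rdf": [FOAF_NS],
}


def build_namespace_index(docs):
    """Map each namespace to the set of documents that use it."""
    index = defaultdict(set)
    for url, namespaces in docs.items():
        for ns in namespaces:
            index[ns].add(url)
    return index


index = build_namespace_index(DOCS)
foaf_docs = sorted(index[FOAF_NS])
print(foaf_docs)
```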

Here’s what the agent sees. Note the swoogle and wob (web of belief) ontologies.

We can also search for terms (classes, properties), such as terms for “person”.

10K terms associated with “person”! Ordered by use. Let’s look at foaf:Person’s metadata.

UMBC Triple Shop
• http://sparql.cs.umbc.edu/
• Online SPARQL RDF query processing based on HP’s Jena and Joseki with several interesting features
• Selectable level of inference over model
• Automatically finds SWDs for given queries using Swoogle backend database
  – Provide dataset creation wizard
  – Dataset can be stored on our server or downloaded
  – Tag, share and search over saved datasets
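One plausible way to find candidate documents for a query is to collect the vocabulary terms the query mentions and look each one up in a term-to-document index. The sketch below assumes that approach; the slides do not say how Triple Shop does it. `TERM_INDEX`, the example.org URLs, and the functions `query_terms` and `candidate_documents` are all hypothetical.

```python
import re
from collections import Counter

QUERY = """
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?name WHERE { ?p foaf:name ?name . ?p foaf:knows ?q . }
"""

# Hypothetical index: term URI -> documents that use the term.
TERM_INDEX = {
    "http://xmlns.com/foaf/0.1/name": {
        "http://example.org/a.rdf",
        "http://example.org/b.rdf",
    },
    "http://xmlns.com/foaf/0.1/knows": {
        "http://example.org/b.rdf",
        "http://example.org/c.rdf",
    },
}


def query_terms(query):
    """Expand prefixed names such as foaf:name into full URIs."""
    prefixes = dict(re.findall(r"PREFIX\s+(\w+):\s*<([^>]+)>", query))
    terms = set()
    for prefix, local in re.findall(r"\b(\w+):(\w+)", query):
        if prefix in prefixes:
            terms.add(prefixes[prefix] + local)
    return terms


def candidate_documents(query, term_index):
    """Rank documents by how many of the query's terms they use."""
    counts = Counter()
    for term in query_terms(query):
        for url in term_index.get(term, ()):
            counts[url] += 1
    return sorted(counts, key=lambda url: (-counts[url], url))


docs = candidate_documents(QUERY, TERM_INDEX)
print(docs)
```

Ranking by matched-term count is a design choice for this sketch: a document that uses only some of the terms may still contribute triples, so it stays in the candidate set rather than being filtered out.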

Who knows Anupam Joshi? Show me their names, email addresses and pictures.

The UMBC ebiquity site publishes lots of RDF data, including FOAF profiles.

No FROM clause! Constraints on where the data comes from

PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT DISTINCT ?p2name ?p2mbox ?p2pix
WHERE {
  ?p1 foaf:name "Anupam Joshi" .
  ?p1 foaf:mbox ?p1mbox .
  ?p2 foaf:knows ?p3 .
  ?p3 foaf:mbox ?p1mbox .
  ?p2 foaf:name ?p2name .
  ?p2 foaf:mbox ?p2mbox .
  OPTIONAL { ?p2 foaf:depiction ?p2pix } .
}
ORDER BY ?p2name
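To show what the graph pattern asks for, the sketch below evaluates the same joins by hand over a small in-memory list of triples. The data is invented (example.org mailboxes and made-up acquaintances), the helper names are hypothetical, and a real deployment would use a SPARQL engine rather than nested loops. Note how the person is matched by mailbox, so a node from another document that carries only the same foaf:mbox still counts.

```python
FOAF = "http://xmlns.com/foaf/0.1/"

# Invented data for illustration only.
TRIPLES = [
    ("_:a", FOAF + "name", "Anupam Joshi"),
    ("_:a", FOAF + "mbox", "mailto:joshi@example.org"),
    # Another document describes the same person by mailbox only.
    ("_:x", FOAF + "mbox", "mailto:joshi@example.org"),
    ("_:b", FOAF + "name", "Bob"),
    ("_:b", FOAF + "mbox", "mailto:bob@example.org"),
    ("_:b", FOAF + "knows", "_:x"),
    ("_:b", FOAF + "depiction", "http://example.org/bob.jpg"),
    ("_:c", FOAF + "name", "Alice"),
    ("_:c", FOAF + "mbox", "mailto:alice@example.org"),
    ("_:c", FOAF + "knows", "_:a"),
    ("_:d", FOAF + "name", "Dan"),
    ("_:d", FOAF + "mbox", "mailto:dan@example.org"),
    ("_:d", FOAF + "knows", "_:b"),
]


def objects(subject, predicate):
    """All o such that (subject, foaf:predicate, o) is in the data."""
    return [o for s, p, o in TRIPLES if s == subject and p == FOAF + predicate]


def subjects(predicate, obj):
    """All s such that (s, foaf:predicate, obj) is in the data."""
    return [s for s, p, o in TRIPLES if p == FOAF + predicate and o == obj]


def who_knows(name):
    rows = set()  # a set gives the effect of SELECT DISTINCT
    for p1 in subjects("name", name):
        for p1mbox in objects(p1, "mbox"):
            for p3 in subjects("mbox", p1mbox):
                for p2 in subjects("knows", p3):
                    for p2name in objects(p2, "name"):
                        for p2mbox in objects(p2, "mbox"):
                            # OPTIONAL: keep the row even without a picture.
                            for p2pix in objects(p2, "depiction") or [None]:
                                rows.add((p2name, p2mbox, p2pix))
    return sorted(rows, key=lambda row: row[0])  # ORDER BY ?p2name


results = who_knows("Anupam Joshi")
for row in results:
    print(row)
```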

Swoogle found 292 RDF data files that appear relevant to answering our query.

Let’s save the dataset before we use it.

And tag it so we and others can find it more easily.

Here we are using it to get an answer to “Who knows Anupam Joshi?”

He has many friends!

Conclusion
• The web will contain the world’s knowledge in forms accessible to people and computers
  – We need better ways to discover, index, search and reason over SW knowledge
• SW search engines address different tasks than HTML search engines
  – So they require different techniques and APIs
• Swoogle-like systems can help create consensus ontologies and foster best practices
  – Swoogle is for Semantic Web 1.0
  – Semantic Web 2.0 will make different demands

For more information
http://ebiquity.umbc.edu/
Annotated in OWL