
KAnOE: Research Centre for Knowledge Analytics and Ontological Engineering
1 Managing Semantic Data
NACLIN-2014, 10 Dec 2014
Dr. Kavi Mahesh, Dean of Research, PES University
(c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021

2 Managing Semantic Data in Research Data Services
  • Trend: Publish research data along with paper
  • Digital library of research data
  • How do we manage this data?
  • E.g., our research requires:
      • Several terabytes of data
      • 5 billion data elements so far

3 Inverting the Publication Model
  • Past:
      • Description of research results in English
      • Show samples of data
      • “Results, Discussion, Conclusion” framework
  • Present:
      • Publish article and entire dataset
      • No links between article and data

4 The Inverted Publication Model
  • Future: Inverted model:
      • Publish self-contained data
      • Publish data analytics
      • Annotate the data with English descriptions where needed
      • Rich linkage between datasets
      • Web of linked data…

5 Illustration of Publishing Data

6 [Chart. Series: ASMEG, BIHAR, CHHAT, COAPR, COKNT, EMPRA, ERJST, EUPRA, GNWBL, GUJRT, HARYA, JHNKD, KERLA, KNGOA, MADMH, MARAT, NIKNT, NMAMT, ORISS, PUNJB, RLSMA, SAUKU, SHWBL, SIKNT, TELNG, TLNAD, VDABH, WMPRA, WRJST; vertical axis: 0 to 8000; horizontal axis: 10 to 72]

8 Self-Contained Dataset
Requirements:
  • Have a proper and consistent structure;
  • Define each element both syntactically and semantically;
  • Specify all the semantic constraints on permissible data values, their types and cardinalities; and
  • Specify data provenance, etc.
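
The requirements on slide 8 can be made concrete with a small sketch. Everything named here is hypothetical and chosen only for illustration: the `FieldSpec` class, the `validate` helper, and the fields `specimen_id`, `mass_g`, and `collected_at`. The slides do not prescribe a schema language, and provenance is left out to keep the sketch short.

```python
from dataclasses import dataclass
from typing import Any, Callable, Optional


@dataclass(frozen=True)
class FieldSpec:
    """Hypothetical description of one data element in a dataset."""
    name: str
    definition: str                  # semantic definition in plain English
    py_type: type                    # syntactic type of each value
    min_count: int = 1               # cardinality lower bound
    max_count: Optional[int] = 1     # cardinality upper bound (None = unbounded)
    constraint: Optional[Callable[[Any], bool]] = None  # extra value constraint


def validate(record: dict, schema: list) -> list:
    """Return a list of human-readable violations (empty if the record is valid)."""
    errors = []
    for spec in schema:
        values = record.get(spec.name, [])
        if len(values) < spec.min_count:
            errors.append(f"{spec.name}: expected at least {spec.min_count} value(s)")
        if spec.max_count is not None and len(values) > spec.max_count:
            errors.append(f"{spec.name}: expected at most {spec.max_count} value(s)")
        for value in values:
            if not isinstance(value, spec.py_type):
                errors.append(f"{spec.name}: {value!r} is not {spec.py_type.__name__}")
            elif spec.constraint is not None and not spec.constraint(value):
                errors.append(f"{spec.name}: {value!r} violates its constraint")
    return errors


# Assumed example schema; each field maps to a list of values.
SCHEMA = [
    FieldSpec("specimen_id", "Unique identifier of the specimen", str),
    FieldSpec("mass_g", "Mass of the specimen in grams", float,
              constraint=lambda m: m > 0),
    FieldSpec("collected_at", "Place(s) where the specimen was collected", str,
              min_count=1, max_count=None),
]

good = {"specimen_id": ["S-001"], "mass_g": [12.5], "collected_at": ["Bangalore"]}
bad = {"specimen_id": ["S-002", "S-003"], "mass_g": [-1.0]}

print(validate(good, SCHEMA))
for problem in validate(bad, SCHEMA):
    print(problem)
```

Keeping the definition, type, cardinality, and value constraint together in one spec is one way to let a dataset carry its own description alongside the values.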

9 Ontology of Research Data
  • In other words, an ontology of research data
  • Where is the “Dublin Core” of research data?
  • E.g., CERIF

10 Why Semantic Data Management?
  • Epistemology of science:
      • Verifying research results
      • Making sense of someone else’s data
      • Documenting the usage scenario of data

13 Data on the Web: 5-Star Rating System
  • * Data on the Web: E.g., data published as a set of scanned images
  • ** Machine-Readable Data: E.g., data published as a spreadsheet
  • *** Non-Proprietary Format: E.g., data published as a CSV file
  • **** RDF Data: E.g., a drug database published in RDF
  • ***** Linked RDF Data: Links to other people’s data are included. E.g., the DBpedia dataset extracted from Wikipedia
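
A minimal sketch of how the rating on slide 13 could be computed, assuming the levels are cumulative (each star requires all the earlier ones), which is the usual reading of the scheme. The `Publication` class and its flag names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Publication:
    """Hypothetical flags describing how a dataset is published."""
    on_web: bool
    machine_readable: bool = False
    non_proprietary: bool = False
    rdf: bool = False
    linked: bool = False


def stars(p: Publication) -> int:
    """Count stars cumulatively: stop at the first level that is not met."""
    levels = [p.on_web, p.machine_readable, p.non_proprietary, p.rdf, p.linked]
    count = 0
    for satisfied in levels:
        if not satisfied:
            break
        count += 1
    return count


scanned_images = Publication(on_web=True)
spreadsheet = Publication(on_web=True, machine_readable=True)
csv_file = Publication(on_web=True, machine_readable=True, non_proprietary=True)

for name, pub in [("scanned images", scanned_images),
                  ("spreadsheet", spreadsheet),
                  ("CSV file", csv_file)]:
    print(name, "*" * stars(pub))
```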

14 Linked Open Data: Principles
  • Use URIs as names of things: E.g., mention author by URI, not just name.
  • Use HTTP URIs so that people can look up those names.
  • When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL).
  • Include links to other URIs, so people can discover more things.
Sir Tim Berners-Lee
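
The principles on slide 14 can be sketched by writing a few statements in N-Triples, a line-based RDF syntax, using only the Python standard library. The `example.org` URIs for the dataset and the author are hypothetical placeholders, and the DBpedia resource is included only to illustrate an outbound link. The `iri`, `literal`, and `triple` helpers are names invented for this sketch, not part of any library.

```python
def iri(value: str) -> str:
    """Wrap an HTTP(S) URI in N-Triples angle brackets (principles 1 and 2)."""
    if not value.startswith(("http://", "https://")):
        raise ValueError(f"not an HTTP URI: {value!r}")
    return f"<{value}>"


def literal(text: str) -> str:
    """Quote a plain string literal, escaping the characters N-Triples requires."""
    escaped = (text.replace("\\", "\\\\")
                   .replace('"', '\\"')
                   .replace("\n", "\\n")
                   .replace("\r", "\\r"))
    return f'"{escaped}"'


def triple(subject: str, predicate: str, obj: str) -> str:
    """Serialize one statement as a single N-Triples line."""
    return f"{subject} {predicate} {obj} ."


DATASET = iri("http://example.org/dataset/42")          # hypothetical
AUTHOR = iri("http://example.org/person/kavi-mahesh")   # hypothetical

lines = [
    # Name the author by URI, not just by a string.
    triple(DATASET, iri("http://purl.org/dc/terms/creator"), AUTHOR),
    triple(DATASET, iri("http://purl.org/dc/terms/title"),
           literal("Managing Semantic Data")),
    # Link out to someone else's data so readers can discover more (principle 4).
    triple(DATASET, iri("http://purl.org/dc/terms/spatial"),
           iri("http://dbpedia.org/resource/Bangalore")),
]

print("\n".join(lines))
```

Rejecting non-HTTP names in `iri` is a deliberate choice in this sketch: it makes the second principle a hard check rather than a convention.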

15 Linked Open Research Data Services
Requirements:
  • Uniquely identify all entities used in datasets such as experiments, specimens, locations, organizations, etc.;
  • Interlink parts of datasets with precise parts of an article in both directions;
  • Classify datasets using a suitable universal classification scheme;
  • Cite other datasets, i.e., refer to them through links;
  • Manage multiple versions and revisions of datasets; and
  • Incorporate a suitable controlled vocabulary or ontology.

16 Architecture of Digital Library of Data

17 An Ontology for Research Data

18 Concluding Remarks
  • Publishing and citing research data will be a common practice
  • Digital libraries need to manage research data
  • Data needs to be self-contained, therefore semantic
  • Linked open data is promising
  • We need a proper ontology of research data
  • Keyword search may be good enough for documents, but not for datasets

19 Questions? Thank you!
http://www.kanoe.org
http://ontology.org.in
ontology@pes.edu

20 http://www.kanoe.org

21 How?
By applying Natural Language Generation techniques on structure and semantics of Linked Open Datasets and underlying Ontologies.

22 Input Triples
Subject                Predicate              Object
<Mrinalini_Sarabhai>   <isMarried_to>         <Vikram_sarabhai>
                       <hasAcademicAdvisor>   <C_V_Raman>
<Vikram_sarabhai>      <wasBornIn>            <Ahmedabad>
<Vikram_sarabhai>      <isMarriedto>          <MriinaliniSarabhai>
<Vikram_sarabhai>      <isCitizenOf>          <India>
<Vikram_sarabhai>      <hasWonPrize>          <Padm_Vibhushan>
<Vikram_sarabhai>                             <Padma_Bhushan>
<Vikram_sarabhai>      <diedOnDate>           <1971-12-31>
<Vikram_sarabhai>      <graduatedFrom>        <University_of_cambridge>
<Vikram_sarabhai>      <hasWikipediaUrl>      <http://www.wiki..>
<Vikram_sarabhai>      <created>              <Nehru_Foundation_for_development>
<Vikram_sarabhai>      <hasGender>            <Male>
<Vikram_sarabhai>      <livesIn>              <India>
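
As a rough illustration of turning such triples into English, the sketch below uses a plain template lookup per predicate. This is a much simpler stand-in for the ontology-driven approach the slides describe, not the authors' method. The `TEMPLATES` table and the `label` and `verbalize` helpers are hypothetical, and only a few of the complete triples from the slide are used.

```python
# A few complete triples copied from the slide (identifiers kept as given).
TRIPLES = [
    ("Vikram_sarabhai", "wasBornIn", "Ahmedabad"),
    ("Vikram_sarabhai", "isCitizenOf", "India"),
    ("Vikram_sarabhai", "graduatedFrom", "University_of_cambridge"),
    ("Vikram_sarabhai", "diedOnDate", "1971-12-31"),
    ("Vikram_sarabhai", "hasGender", "Male"),
]

# Hypothetical sentence templates, one per predicate.
TEMPLATES = {
    "wasBornIn": "{s} was born in {o}.",
    "isCitizenOf": "{s} is a citizen of {o}.",
    "graduatedFrom": "{s} graduated from {o}.",
    "diedOnDate": "{s} died on {o}.",
}


def label(identifier: str) -> str:
    """Turn an identifier into a readable label by replacing underscores."""
    return identifier.replace("_", " ")


def verbalize(triples, templates):
    """Render each triple whose predicate has a template; skip the rest."""
    sentences = []
    for subject, predicate, obj in triples:
        template = templates.get(predicate)
        if template is None:
            continue  # no template for this predicate, so nothing is generated
        sentences.append(template.format(s=label(subject), o=label(obj)))
    return sentences


for sentence in verbalize(TRIPLES, TEMPLATES):
    print(sentence)
```

A fixed template per predicate gives no control over sentence order or aggregation; deciding those is presumably what a discourse-structuring ontology would add.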

23 Ontology for Discourse Structuring

24 Classes

25 Subclasses

26 Individuals

27 Ontology as a Chart

34 Subclasses and their Descriptions

35 Object properties

36 Data properties added

37 Linked Open Data Tools
Pallavi Karanth
©KAnOE, PES Institute of Technology

38 Data

39 Web for Data Discovery

40 Web for Data Discovery

41 Machine Understandable Data

42 Machine Understandable Data
Ram
  • Nickname: Ram
  • DOB: 19-04-78
  • Location: Bangalore

43 Open Data and Linked Data
  • Open Data - open access
  • Linked Data
  • Semantic
  • Machine Readable

45 LOD-IT (Kappa)
  • For Software Developers
  • Technical Helpdesk
  • LOD-IT Video
  • LOD-IT Demo

46 LODScape
  • Ontology based Multiple LOD Object Browser
  • DBpedia and Freebase datasets used
  • LODScape Demo

47 Semantic Smart-Aleck
  • Automatic Fact Generator
  • Based on Interestingness Algorithm
  • Uses DBpedia and Yago datasets
  • SemanticSmartAleck Demo

48 Acknowledgments

49 Suggestions? Thank you!
http://www.kanoe.org
http://ontology.org.in
ontology@pes.edu