KAn OE Research Centre for Knowledge Analytics and










































- Slides: 42
KAn. OE: Research Centre for Knowledge Analytics and Ontological Engineering Managing Semantic Data 1 NACLIN-2014, 10 Dec 2014 Dr. Kavi Mahesh Dean of Research, PES University (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
2 Managing Semantic Data in Research Data Services Trend: Publish research data along with paper Digital library of research data How do we manage this data? E. g. , our research requires: Several Tera Bytes of data 5 billion data elements so far (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
3 Inverting the Publication Model Past: Description of research results in English Show samples of data “Results, Discussion, Conclusion” framework Present: Publish article and entire dataset No links between article and data (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
4 The Inverted Publication Model Future: Inverted model: Publish self-contained data Publish data analytics Annotate the data with English descriptions where needed Rich linkage between datasets Web of linked data… (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
5 Illustration of Publishing Data (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
ASMEG BIHAR 6 8000 CHHAT COAPR COKNT 7000 EMPRA ERJST EUPRA 6000 GNWBL GUJRT HARYA 5000 JHNKD KERLA KNGOA 4000 MADMH MARAT NIKNT 3000 NMAMT ORISS PUNJB 2000 RLSMA SAUKU SHWBL 1000 SIKNT TELNG TLNAD 0(c) Dr. Kavi Mahesh; Do not copy or distribute VDABH 2/23/2021 101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172 WMPRA WRJST
7 (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
8 Self-Contained Dataset Requirements: Have a proper and consistent structure; Define each element both syntactically and semantically; Specify all the semantic constraints on permissible data values, their types and cardinalities; and Specify data provenance, etc. (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
9 Ontology of Research Data In other words, an ontology of research data Where is the “Dublin Core” of research data? E. g. , CERIF (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
10 Why Semantic Data Management? Epistemology of science: Verifying research results Making sense of someone else’s data Documenting the usage scenario of data (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
11 (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
13 Data on the Web: 5 -Star Rating System * Data on the Web: ** Machine-Readable Data: E. g. , data published as a spreadsheet *** Non-Proprietary Format: E. g. , data published as a CSV file E. g. , data published as a set of scanned images **** RDF Data: E. g. , a drug database published in RDF ***** Linked RDF Data: Links to other people’s data are included. E. g. , the Dbpedia dataset extracted from wikipedia (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
14 Linked Open Data: Principles Use URIs as names of things: E. g, mention author by URI, not just name. Use HTTP URIs so that people can look up those names. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL). Include links to other URIs, so people can discover more things. Sir Tim Berners-Lee (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
15 Linked Open Research Data Services Requirements: Uniquely identify all entities used in datasets such as experiments, specimens, locations, organizations, etc. ; Interlink parts of datasets with precise parts of an article in both directions; Classify datasets using a suitable universal classification scheme; Cite other datasets, i. e. , refer to them through links; Manage multiple versions and revisions of datasets; and Incorporate a suitable controlled vocabulary or ontology. (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
16 Architecture of Digital Library of Data (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
17 An Ontology for Research Data (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
18 Concluding Remarks Publishing and citing research data will be a common practice Digital libraries need to manage research data Data needs to be self-contained, therefore semantic Linked open data is promising We need a proper ontology of research data Keyword search may be good enough for documents, but not for datasets (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
19 Questions? Thank you! http: //www. kanoe. org http: //ontology. org. in ontology@pes. edu (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
http: //www. kanoe. org 20 (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
21 How? By applying Natural Language Generation Techniques on structure and semantics of Linked Open Datasets and underlying Ontologies. (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
Input Triples 22 Subject Predicate object <Mrinalini_Sarabhai> <is. Married_to> <Vikram_sarabhai> <has. Academic. Advisor> <C_V_Raman> <Vikram_sarabhai> <was. Born. In> <Ahmedabad> <Vikram_sarabhai> <is. Marriedto> <Mriinalini. Sarabhai> <Vikram_sarabhai> <is. Citizen. Of> <India> <Vikram_sarabhai> <has. Won. Prize> <Padm_Vibhushan> <Vikram_sarabhai> <Padma_Bhushan> <Vikram_sarabhai> <died. On. Date> <1971 -12 -31> <Vikram_sarabhai> <graduated. From> <University_of_cambridge> <Vikram_sarabhai> <has. Wikipedia. Url> <http: //www. wiki. . > <Vikram_sarabhai> <created> <Nehru_Foundation_for_develop ment> <Vikram_sarabhai> <has. Gender> <Male> (c) Dr. Kavi Mahesh; Do not copy<lives. In> or distribute <Vikram_sarabhai> <India> 2/23/2021
23 Ontology for Discourse Structuring (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
24 Classes (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
Subclasses 25 (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
26 Individuals (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
Ontology as a Chart 27 (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
33 (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
34 Subclasses and their Descriptions (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
35 Object properties (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
36 Data properties added (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
37 Linked Open Data Tools Pallavi Karanth (c) Dr. Kavi Mahesh; Do not copy or distribute ©KAn. OE, PES Institute of Technology 2/23/2021
Data 38 Data (c) Dr. Kavi Mahesh; Do not copy or distribute ©KAn. OE, PES Institute of Technology 2/23/2021
39 Web for Data Discovery (c) Dr. Kavi Mahesh; Do not copy or distribute ©KAn. OE, PES Institute of Technology 2/23/2021
40 Web for Data Discovery (c) Dr. Kavi Mahesh; Do not copy or distribute ©KAn. OE, PES Institute of Technology 2/23/2021
41 Machine Understandable Data (c) Dr. Kavi Mahesh; Do not copy or distribute ©KAn. OE, PES Institute of Technology 2/23/2021
Machine Understandable Data 42 Ram Nickname DOB Ram 19 -04 -78 Location Bangalore (c) Dr. Kavi Mahesh; Do not copy or distribute ©KAn. OE, PES Institute of Technology 2/23/2021
43 Open Data and Linked Data Open Data - open access Linked Data Semantic Machine Readable (c) Dr. Kavi Mahesh; Do not copy or distribute ©KAn. OE, PES Institute of Technology 2/23/2021
LOD - IT (Kappa) 45 For Software Developers Technical Helpdesk LOD-IT Video LOD-IT Demo (c) Dr. Kavi Mahesh; Do not copy or distribute ©KAn. OE, PES Institute of Technology 2/23/2021
LODScape 46 Ontology based Multiple LOD Object Browser Db. Pedia and Freebase datasets used LODScape Demo (c) Dr. Kavi Mahesh; Do not copy or distribute ©KAn. OE, PES Institute of Technology
47 Semantic Smart-Aleck Automatic Fact Generator Based on Interestingness Algorithm Uses Dbpedia and Yago datasets Semantic. Smart. Aleck Demo (c) Dr. Kavi Mahesh; Do not copy or distribute ©KAn. OE, PES Institute of Technology 2/23/2021
48 Acknowledgments (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021
49 Suggestions? Thank you! http: //www. kanoe. org http: //ontology. org. in ontology@pes. edu (c) Dr. Kavi Mahesh; Do not copy or distribute 2/23/2021