CIDOC CRM Family Harmonized models for the Digital

  • Slides: 30
Download presentation
CIDOC CRM Family Harmonized models for the Digital World: CIDOC CRM and extensions Martin

CIDOC CRM Family Harmonized models for the Digital World: CIDOC CRM and extensions Martin Dörr Center for Cultural Informatics, Institute of Computer Science Foundation for Research and Technology – Hellas Nuremberg, Germany May 19, 2015 1

The CRM Family Standards, Mapping and Data Transformation Making Standards The good with standards

The CRM Family Standards, Mapping and Data Transformation Making Standards The good with standards is there are so many! When you have a standard, You need to transform to the standard You need to renew and adapt the standard You need to transform to the renewed standards Why not just transform data? There are too many transformations, you need a standard Nuremberg, May 19, 2015 2

The CRM Family CIDOC, CRM and SIG CIDOC is the International Committee for Documentation

The CRM Family CIDOC, CRM and SIG CIDOC is the International Committee for Documentation of ICOM the International Council of Museums CIDOC CRM is the Conceptual Reference Model of CIDOC CRM SIG is the CIDOC CRM Special Interest Group ¡ ¡ ¡ founded Aug. 2000 as Working Group of CIDOC; open to non-ICOM members. Membership is by organisation and a representative, to develop the CIDOC CRM as ISO standard for information integration of cultural-historical data across institutions, to act as forum for dissemination and development of good practice of documentation for publication and global integration of cultural-historical data, to act as forum to foster adequate technology compatible with CIDOC recommendations Nuremberg, May 19, 2015 3

The CRM Family The CRM Supports Science and Humanities Phases of the scholarly/scientific process:

The CRM Family The CRM Supports Science and Humanities Phases of the scholarly/scientific process: ¡ ¡ collecting and organizing evidence (observation and primary sources) connecting facts via the things involved interpreting facts – contextualizing and hypothesis building (dependency & impact) presenting results - publication Problem: Billions of facts, artefacts and documents possibly shed light on the past in unexpected contexts across all disciplines and sciences The CIDOC CRM (ISO 21127: 2006, 2014) ¡ ¡ ¡ is made for integrating and connecting evidential data and derived facts It contains the most basic relationships to describe of what happened in the past at a human scale, i. e. , people and things meeting in space-time, parts and wholes, use, influence and reference. more detailed kinds of discourse require extensions… Nuremberg, May 19, 2015 4

The CRM Family Metadata Are Not Enough! Type: Title. Subtitle: Date: Creator: Text Protocol

The CRM Family Metadata Are Not Enough! Type: Title. Subtitle: Date: Creator: Text Protocol of Proceedings of Crimea Conference II. Declaration of Liberated Europe February 11, 1945 The Premier of the Union of Soviet Socialist Republics The Prime Minister of the United Kingdom The President of the United States of America State Department Postwar division of Europe and Japan Publisher: Subject: Metadata Documents About… Nuremberg, May 19, 2015 “The following declaration has been approved: The Premier of the Union of Soviet Socialist Republics, the Prime Minister of the United Kingdom and the President of the United States of America have consulted with each other in the common interests of the people of their countries and those of liberated Europe. They jointly declare their mutual agreement to concert… …. and to ensure that Germany will never again be able to disturb the peace of the world…… “ 5

The CRM Family Finding Aids Do Not Integrate Type: Title: Date: Publisher: Source: Copyright:

The CRM Family Finding Aids Do Not Integrate Type: Title: Date: Publisher: Source: Copyright: References: Image Allied Leaders at Yalta 1945 United Press International (UPI) The Bettmann Archive Corbis Churchill, Roosevelt, Stalin Photos, Persons Metadata About… Nuremberg, May 19, 2015 6

The CRM Family Explicit Events, Object Identity, Symmetry E 52 Time-Span E 39 Actor

The CRM Family Explicit Events, Object Identity, Symmetry E 52 Time-Span E 39 Actor E 53 Place 7012124 February 1945 P 11 par P 82 at some time within tici pat ed in P 7 took place at E 7 Activity “Crimea Conference” E 39 Actor E 38 Image P 6 P 86 falls within 7 i sr efe E 65 Creation Event E 39 Actor ed p P 14 rm o f r e * P 81 ongoing throughout E 52 Time-Span P 9 4 h as rre dt ob y E 31 Document cre “Yalta Agreement” ate d 1945 -02 -11 Nuremberg, May 19, 2015 the world ! the documents 7

The CRM Family Top-level classes useful for integration refer to / refine refer to

The CRM Family Top-level classes useful for integration refer to / refine refer to / identify E 41 Appellations E 55 Types E 39 Actors E 28 Conceptual Objects E 18 Physical Thing participate in affect or / refer to location E 2 Temporal Entities E 52 Time-Spans within Nuremberg, May 19, 2015 at E 53 Places 8

The CRM Family Temporal Entity: Main Properties ¡ E 2 Temporal Entity n Is.

The CRM Family Temporal Entity: Main Properties ¡ E 2 Temporal Entity n Is. A ¡ Properties: P 7 took place at (witnessed): P 9 consists of (forms part of): E 53 Place E 4 Period E 5 Event n Is. A E 52 Time-Span E 4 Period n ¡ Properties: P 4 has time-span (is time-span of): Properties: P 12 occurred in the presence of (was present at): E 77 Persistent Item P 11 had participant (participated in): E 39 Actor E 7 Activity n Properties: P 14 carried out by (performed): E 39 Actor P 20 had specific purpose (was purpose of): E 5 Event P 21 had general purpose (was purpose of): E 55 Type P 16 used specific object (was used for): E 70 Thing P 125 used object of type (was type of object used in) E 55 Type Nuremberg, May 19, 2015 9

The CRM Family CRM: “What happened (to…) ? ” Example: (inferred from inscription…) Nuremberg,

The CRM Family CRM: “What happened (to…) ? ” Example: (inferred from inscription…) Nuremberg, May 19, 2015 10

The CRM Family Extending the CRM The CRM standardizes only stable concepts for information

The CRM Family Extending the CRM The CRM standardizes only stable concepts for information sharing. local extensions are encouraged for subjective concepts and local practices using the CRM starts with 1 property and does not restrict data to CRM ¡ We have now created a modular structure Maintaining a core so that all extensions are (property) specializations All more detailed facts can be reached by querying core concepts For being interoperable, no more restriction of data to a “core vocabulary”! ¡ What is “core” is not historical, not community domination, but the dynamic result of applying functional principles. CRM is an open invitation to extend it by sharing, respecting and evolving common concepts: The CRM becomes an open “family of models” Nuremberg, May 19, 2015 11

The CRM Family Outcome: CRM compatible Extensions FRBRoo: modelling the new library practice of

The CRM Family Outcome: CRM compatible Extensions FRBRoo: modelling the new library practice of IFLA (approved) ¡ a causal model of intellectual creation and derivation ¡ how to identify intellectual content ¡ the thing and the word: integrating museum and library perspectives PRESSoo: modelling journals and serials (approved) CRMInf: who said that? – from data to knowledge (under review) ¡ integrating data with their scholarly justification ¡ being validated with scholarly annotations Nuremberg, May 19, 2015 12

The CRM Family Outcome: CRM compatible Extensions CRMsci: a Scientific Observation model (under review)

The CRM Family Outcome: CRM compatible Extensions CRMsci: a Scientific Observation model (under review) ¡ generalizes over INSPIRE, OBOE, SEEK, Darwin Core ¡ generalizes concepts of units of matter and their “(physical) genesis” ¡ introduces concept of observation and data evaluation ¡ validated in archeology, biodiversity and geology CRMarchaeo/CRMBA: an Excavation model (under review) ¡ introduces concepts of stratigraphy and excavation ¡ being validated by archaeological records Nuremberg, May 19, 2015 13

The CRM Family Outcome: CRM compatible Extensions CRMgeo: a Spatiotemporal model (to be reviewed)

The CRM Family Outcome: CRM compatible Extensions CRMgeo: a Spatiotemporal model (to be reviewed) ¡ integrates CRM with OGC standards ¡ a complete model of phenomena occupying spacetime (consistent with modern physics) ¡ integrates geometry- and semantics-derived topological relations ¡ core concepts being integrated into CRMdig: a model of Digitization processes (to be reviewed) ¡ validated in European & US projects, to be adapted to CRMsci …. give us your extension for review and approval ! Nuremberg, May 19, 2015 14

The CRM Family CIDOC CRM extension suite CIDOC Conceptual Reference Model (CRM) CRM Few

The CRM Family CIDOC CRM extension suite CIDOC Conceptual Reference Model (CRM) CRM Few concepts, high recall Event Thing Actor happened at oo was present at PR ES So eo o MG FR CRMInf CRMSci Nuremberg, May 19, 2015 CR BR Special concepts, high precision CRMArcheo CRMDig 15

The CRM Family FRBROO : “who’s idea was that? ” “do you have a

The CRM Family FRBROO : “who’s idea was that? ” “do you have a translation of…? ” Nuremberg, May 19, 2015 16

The CRM Family The Externalization A Causal Interpretation of FRBR E 65 Creation E

The CRM Family The Externalization A Causal Interpretation of FRBR E 65 Creation E 28 Conceptual Object F 1 Work F 28 Expression Creation R 4 carriers provided by (comprises carriers of) R 19 created a realization of (was realised through) F 22 Self Contained Expression E 24 Physical Man-Made Thing R 18 created (was created by) F 23 Expression Fragment F 32 Carrier Production Event R 28 produced (was produced by) F 2 Expression F 14 Individual Work Nuremberg, May 19, 2015 F 3 Manifestation Production Type R 17 created (was created by) R 9 is realised in (realises) F 15 Complex Work E 12 Production E 84 Information Carrier R 7 is example of (has example) F 4 Manifestation Singleton here it becomes real! F 5 Item 17

The CRM Family Performing Arts : An “Added Value” Chain F 15 Complex Work

The CRM Family Performing Arts : An “Added Value” Chain F 15 Complex Work “Henry IV ” Idea R 10 has member (is member of) F 15 Complex Work F 16 Container Work R 10 has member (is member of) F 15 Complex Work “Henry IV part 1 ” Idea “Henry IV part 2 ” Idea F 21 Recording Work F 20 Performance Work R 2 is derivative of (has derivative) “Henry IV part 1 ” Adaptation R 19 created a realisation of (was realised through) Idea F 28 Expression Creation Adaptation of Henry IV part 1 mise-en-scene action Nuremberg, May 19, 2015 R 20 recorded (was recorded though) F 31 Performance R 17 created (was created by) F 22 Self-Contained Expression Henry IV part 1 Adaptation Recording Performance 25/12/07 action R 12 is realised in (realises) R 9 is realised in (realises) Text R 22 realised (was realised through) F 29 Recording Event R 19 created a realisation of (was realised through) F 28 Expression Creation R 17 created (was created by) “Henry IV part 1 ” Idea of recording “Henry IV part 1 ” Idea of mise-en-scene F 15 Complex Work Performance 25/12/07 action R 25 performed (was performed in) F 25 Performance Plan R 14 incorporates Henry IV part 1 “mise-en-scène” R 14 incorporates (is incorporated in) guidelines R 13 is realised in (realises) R 21 created (was created by) F 26 Recording Henry IV part 1 Play 25/12/07 DVD 18

The CRM Family CRMinf : “why is it true that? ” Nuremberg, May 19,

The CRM Family CRMinf : “why is it true that? ” Nuremberg, May 19, 2015 19

The CRM Family The Three Sources of Scientific Knowledge has belief time E 13

The CRM Family The Three Sources of Scientific Knowledge has belief time E 13 Attribute Assignment Here is CRMInf: resulted in or confirms Belief Argumentation E 52 Time-Span Belief Value is (True, False, Unknown) that Proposition relates to Inference Making E 1 CRM Entity Belief Adoption is about E 1 CRM Entity Observation Data Evaluation Here is any Information System Simulation Here is CRMSci ! CRMSci: Knowledge from observation, data evaluation and (computer)simulation (engineered from OBOE, SEEK, INSPIRE Darwin Core etc) property Nuremberg, May 19, 2015 Is. A CRM Entity CRMInf Entity CRMSci Entity 20

The CRM Family E 39 Actor P 14 carried out by Stephen I 5

The CRM Family E 39 Actor P 14 carried out by Stephen I 5 Inference Making Evolution Decision J 1 used as premise I 2 Belief J 4 that I 4 Proposition Set is left of main margin J 2 concluded that I 2 Belief Stephen’s Belief in Evolution A of Misc 4 J 4 that J 3 applies I 3 Inference Logic Catalogue Text Rules BM Belief in Spatial Relations of Misc 4 Modelling my Beliefs in Evolutions A, B and C I 4 Proposition Set was written after J 5 holds to be I 6 Belief Value True Hans Sloane collection inventory entry Nuremberg, May 19, 2015 21

The CRM Family CRMsci “what have you seen there? ” “how did you calculate

The CRM Family CRMsci “what have you seen there? ” “how did you calculate that? ” Nuremberg, May 19, 2015 22

The CRM Family Scanning and 3 D Model Creation as Meetings t 3 D

The CRM Family Scanning and 3 D Model Creation as Meetings t 3 D model coherence volume of rendering coherence volume of mesh-creation mesh-data 2 nd Computer scanner museum object scan-data 1 st Computer operator coherence volume of acquisition Museum Nuremberg, May 19, 2015 It-Lab S

The CRM Family What means“Finding”: the Encounter Event S 19 Encounter Event Scope Note:

The CRM Family What means“Finding”: the Encounter Event S 19 Encounter Event Scope Note: Activities of S 4 Observation (substance) where an E 39 Actor encounters an instance of E 18 Physical Thing of a kind relevant for the mission of the observation or regarded as potentially relevant for some community (identity). This observation produces knowledge about the existence of the respective thing at a particular place in or on surrounding matter. This knowledge may be new to the group of people the actor belongs to. In that case we would talk about a discovery. E 7 Activity E 16 Measurement S 4 Observation S 21 Measurement S 19 Encounter Event O 21 has found at E 53 Place O 19 has found object E 18 Physical Thing E 92 Spacetime Volume S 20 / E 26 Physical Feature O 22 partly or completely contains O 23 is defined by E 25 Man-Made Feature Nuremberg, May 19, 2015 E 27 Site S 22 Segment of Matter property Is. A CRM Entity CRMSci Entity 24

The CRM Family Biodiversity App: “Occurrence Discourse” an S 19 Encounter Event urn: catalog:

The CRM Family Biodiversity App: “Occurrence Discourse” an S 19 Encounter Event urn: catalog: IOL: POLY: Sphaerosyllis -levantina-ALA-IL-7 -Oct. 2009 O 32 has P 1 25 us ed P 1 0 s ha do e 4 i r P car P 14 at place k o o t P 7 BC 14 Ecosystem Environment Haifa Bay Ecosystem No found o ob bject je ct of w E 21 Person S. Faulwetter y ut b lls tim fa p es E 53 Place Haifa Bay ith in E 52 Time-Span 11/10/2009 P 2 h as ty pe BC 38 Biotic Element Sphaero-levantina-003 BT 7 Ecosystem Type sandy - muddy sediments ty p e BT 11 Equipment Type WA 265/SS 214 Nuremberg, May 19, 2015 P 127 has broader term BT 11 Equipment Type Van Veen Grab

The CRM Family CRMArcheo / CRMBA: “what was here before? ” Nuremberg, May 19,

The CRM Family CRMArcheo / CRMBA: “what was here before? ” Nuremberg, May 19, 2015 26

The CRM Family CRMarcheo: Excavation is Observation A 1 Excavation Process Unit AP 4

The CRM Family CRMarcheo: Excavation is Observation A 1 Excavation Process Unit AP 4 created surface. S 2 with spit method AP 4 created surface. S 1 with stratigraphic method A 3 Stratigraphic Interface A 7 Embedding A 2 Stratigraphic Deposit Unit AP 7 produced A 4 Stratigraphic Genesis Nuremberg, May 19, 2015 AP 7 produced AP 13 has stratigraphic relation “after” A 4 Stratigraphic Genesis 27

The CRM Family Example: Embedding A 1 Excavation Process Unit E 5 Event P

The CRM Family Example: Embedding A 1 Excavation Process Unit E 5 Event P 9 consists of S 16 State “positioning” S 19 Encounter Event AP 17 is found by “a state, a refinement of position” A 7 Embedding AP 15 has found object AP 18 is embedding of E 18 Physical Thing “the Physical Object has a position at least up to the point of discovery” Nuremberg, May 19, 2015 AP 20 is embedding at E 53 Place AP 19 is embedding in A 2 Stratigraphic Volume Unit “reference space that is relative to the Context Stuff” 28

The CRM Family A Competitor: The PROV-O Ontology prov: was. Derived. From prov: value

The CRM Family A Competitor: The PROV-O Ontology prov: was. Derived. From prov: value Literal To prov: Entity uted s. Attrib a prov: acted. On. Behalf. Of prov: Agent prov: w pr As prov: Person prov: Organization prov: Software. Agent prov: specialization. Of prov: was. Invalidated. By prov: was. Started. By prov: was. Ended. By ov : wa s prov: alternate. Of so cia te d. W ith prov: used prov: was. Generated. By prov: Activity prov: started. At. Time prov: ended. At. Time date. Time prov: was. Informed. By property Nuremberg, May 19, 2015 Generalization

The CRM Family Conclusions The CIDOC CRM with its extensions allows to create global

The CRM Family Conclusions The CIDOC CRM with its extensions allows to create global networks of integrated knowledge about human history, its evidence and scientific observation regardless discipline and in surprising detail, . . (CRMSci is currently the most powerful generic e-Science ontology) …. and you can add more detail. But remember: KR technology (RDF) is made for data with potentially global relations. and The CRM does not tell you what to document! With 4 CRM properties only you can already describe a story! Nuremberg, May 19, 2015