Open Provenance Model Tutorial Session 5: OPM Emerging Profiles

Session 5: Aims
In this session, you will learn about:
– How to extend OPM through profiles
– The content of a profile
– Four emerging profiles for OPM
– How to get involved with your own profile

Session 5: Contents
• Profile Definition
• Essential Profiles
  – Collections Profile
  – Signature Profile
• Domain Profiles
  – Dublin Core Profile
  – D-Profile
• Feedback

OPM LAYERED ARCHITECTURE

OPM Layered Model
• OPM Domain Specialization: Workflow, Web
• OPM Essential Profiles: Collections, Attribution
• OPM Core
• OPM Sig
• OPM-based APIs: record, query
• Technology Bindings: XML, RDF

PROFILE DEFINITION

Concept of a Profile
• A specialisation of an OPM graph for a specific domain or to handle a specific problem
• Profile definitions are welcome!
• Note: profile multiplicity challenges interoperability

What’s in a profile
• A unique id
• Vocabulary of Annotations
• Guidance
• Profile Expansion Rules
• Syntactical Short-cuts

Vocabulary of Annotations
• Controlled Vocabulary
• Subtyping of edges & nodes
• Application specific properties
Easy!
[Figure labels: hasPhoto, Reviewer, reviewCreatedBy, Review]

Guidance
• Many ways to represent the same process within an OPM Graph
• System may expect a particular structure or associated vocabulary
[Figure labels: Reviewer, submittedReviewFrom, reviewCreatedBy, review draft, Review, Publishing System, reviewFinalisedFrom, Review]

Profile Expansion Rules
• Provide more compact representations of provenance
• Maintain OPM Compatibility
[Figure: compact graph (PS, Reviewer, draft 1, reviewCreatedBy, Review), then Rules, then expanded graph (Reviewer, submittedReviewFrom, review draft, Publishing System, reviewFinalisedFrom, Review)]
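A minimal Python sketch of how an expansion rule could rewrite a compact edge into plain graph edges. The edge names are taken from the figure labels on this slide; the exact shape of the expanded pattern, the `Edge` tuple, the `:draft` node id and the function name are assumptions made for illustration and are not taken from any published profile.

```python
from typing import List, NamedTuple


class Edge(NamedTuple):
    """A labelled edge pointing from an effect to its cause, as in OPM."""
    effect: str
    label: str
    cause: str


def expand_review_created_by(edges: List[Edge]) -> List[Edge]:
    """Replace each compact reviewCreatedBy edge with a two-edge pattern.

    Assumed (hypothetical) rule: Review -reviewCreatedBy-> Reviewer expands to
    Review -reviewFinalisedFrom-> draft and draft -submittedReviewFrom-> Reviewer.
    """
    expanded: List[Edge] = []
    for edge in edges:
        if edge.label != "reviewCreatedBy":
            expanded.append(edge)  # edges outside the rule pass through unchanged
            continue
        draft = f"{edge.effect}:draft"  # hypothetical id for the intermediate node
        expanded.append(Edge(edge.effect, "reviewFinalisedFrom", draft))
        expanded.append(Edge(draft, "submittedReviewFrom", edge.cause))
    return expanded


compact = [Edge("Review", "reviewCreatedBy", "Reviewer")]
expanded = expand_review_created_by(compact)
```

Applying the rule to an already expanded graph leaves it unchanged, which is one simple way to keep expansion safe to repeat.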

Profile Compliance
PROFILE
• Id
• Vocabulary
• Guidance
• Expansion directives
• Serialisation
[Figure labels: Profile Compliant Graph, Profile Expansion, Profile-expanded Graph]

Profile Compliance
[Figure labels: Profile Compliant Graph, Profile-expanded Graph, OPM Inference, Inferred Graph 1, Inferred Graph 2]

Syntactic Shortcuts
• Allow for parsimony in serializations
• Understand how to get back to the OPM model
Example: Paul Groth (Sept 18, 2010): review 1, review 2 for paper 12
[Figure labels: Paul Groth, r1; Paul Groth, r2; P12]
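To illustrate "getting back to the OPM model", here is a small Python sketch that expands the one-line shortcut from this slide into explicit statements. The line grammar is inferred from that single example, and the `reviewOf` edge and `date` annotation names are hypothetical; only `reviewCreatedBy` appears elsewhere in these slides.

```python
import re
from typing import List, Tuple

Triple = Tuple[str, str, str]  # (subject, label, object)

# Matches "<agent> (<date>): review N, review M for paper K".
SHORTCUT = re.compile(
    r"^(?P<agent>[^(]+?)\s*\((?P<date>[^)]+)\):\s*"
    r"(?P<reviews>.+?)\s+for paper\s+(?P<paper>\d+)$"
)


def expand_shortcut(line: str) -> List[Triple]:
    """Turn one shortcut line into one group of statements per review."""
    match = SHORTCUT.match(line.strip())
    if match is None:
        raise ValueError(f"not a review shortcut: {line!r}")
    agent, date = match["agent"], match["date"]
    paper = "P" + match["paper"]
    triples: List[Triple] = []
    for item in match["reviews"].split(","):
        review = "r" + item.split()[-1]  # "review 1" becomes "r1"
        triples.append((review, "reviewCreatedBy", agent))
        triples.append((review, "reviewOf", paper))  # hypothetical edge name
        triples.append((review, "date", date))  # hypothetical annotation name
    return triples


triples = expand_shortcut(
    "Paul Groth (Sept 18, 2010): review 1, review 2 for paper 12"
)
```

The shared agent, date and paper are written once in the shortcut and repeated per review after expansion, which is where the parsimony comes from.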

Profile Summary
• OPM is a top level representation
• Profiles allow for best practice & usage guidelines
• Defining community specific:
  – Vocabulary
  – Graph structure
  – Derivations from vocabulary
  – Serializations

COLLECTION PROFILE

Provenance?
[Image: http://www.flickr.com/photos/stripeyanne/3539864111/sizes/l/in/photostream/]

Collection Profile (draft)
with Paolo Missier, Paul Groth and Simon Miles
• Notion of collection (a kind of artifact)
• Collections can be nested
• Process types: constructor and artifact
• Edge types: contained, wasPartOf, wasIdenticalTo
• Completion guidance to derive dependencies on elements from collections

Collections

Collections
• From
  – c2 -> c1, a1i -> c1
• derive
  – a2i -> a1i, c2 -> a2i
• And likewise from
  – c2 -> c1, c2 -> a2i

SIGNATURE PROFILE

Some Provenance Security Concerns
• How can we ensure the integrity of an OPM graph?
  – Has it been tampered with? Is it authentic?
• Who created an OPM graph?
  – Is there non-repudiable evidence that an entity is its author?
• Note: many other security requirements, cf. [Tan 06], [Braun 08], [Moreau 10].

Signature of OPM Graphs
• Cryptographic signatures provide:
  – Non-repudiable evidence
  – Means to check authenticity
• Leveraging existing standards, e.g. XML-Signature
• Need to define a “normal form” for XML OPM graph before applying XML-Signature
• Implementation available from the OPM toolbox
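The need for a normal form can be illustrated with a short Python sketch using only the standard library. It is not the OPM toolbox implementation and does not implement XML-Signature: XML canonicalisation (C14N 2.0) stands in for the normal form, and an HMAC tag stands in for the signature. An HMAC uses a shared key and therefore gives no non-repudiation; a real deployment would use an asymmetric signature tied to a certificate. The element and attribute names in the sample documents are hypothetical, not the OPM XML schema.

```python
import hashlib
import hmac
from xml.etree.ElementTree import canonicalize  # available from Python 3.8


def normal_form(graph_xml: str) -> bytes:
    """Canonicalise the XML so that equivalent serialisations become identical."""
    return canonicalize(graph_xml, strip_text=True).encode("utf-8")


def sign(graph_xml: str, key: bytes) -> str:
    """Stand-in for a signature: an HMAC-SHA256 tag over the normal form."""
    return hmac.new(key, normal_form(graph_xml), hashlib.sha256).hexdigest()


def verify(graph_xml: str, key: bytes, tag: str) -> bool:
    """Recompute the tag and compare it in constant time."""
    return hmac.compare_digest(sign(graph_xml, key), tag)


# Hypothetical element and attribute names, for illustration only.
original = '<graph><artifact id="a1" label="draft"/></graph>'
reordered = '<graph>\n  <artifact label="draft" id="a1"></artifact>\n</graph>'
tampered = '<graph><artifact id="a1" label="final"/></graph>'

key = b"demo-key"
tag = sign(original, key)
```

Here `reordered` differs from `original` only in whitespace, attribute order and empty-element syntax, so it verifies against the same tag, while `tampered` does not.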

Attribution and Signatures
An annotation to an OPM graph that contains a signature
[Figure labels: Distinguished Name, Embedded Signature, X.509 Certificate, Timestamp and Replay Protection, Role]

Alternative implementation
• J. Myers (NCSA) implementation on top of RDF serialization
• More challenging since:
  – There is no standard way of serializing RDF
  – There is no standard RDF-Signature

DUBLIN CORE PROFILE

Dublin Core Profile (draft)
with Simon Miles and Joe Futrelle
• To many people, provenance is primarily about attribution, citation, bibliographic information
• DC provides terms to relate resources to such information
• DC profile aims to map Dublin Core terms to OPM concepts and graph patterns
• http://twiki.ipaw.info/pub/OPM/ChangeProposalDublinCoreMapping/dcprofile.pdf

Dublin Core Terms
• Accrual method
• Available
• Bibliographic citation
• Contributor
• Publisher
• Date
• Version
• …

dc:accrualMethod
The method by which items are added to a collection
[Figure labels: I, dc:accrualMethod, M; Collection Before; Method (M); New item (I); Addition; dc:versionOf; New Collection]

dc:publisher
[Figure labels: A1 (state=unpublished); used; publish; Ag (person, name=Luc); wasActionOf; P; wasSameResourceAs; wasGeneratedBy; A2 (state=published)]
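A rough Python sketch of how a single dc:publisher statement might be unfolded into the graph pattern on this slide. The node and edge labels are read from the slide; the edge directions (effect to cause), the node id scheme and the function name are assumptions, since the figure itself is not available here.

```python
from typing import Dict, List, Tuple

Edge = Tuple[str, str, str]  # (effect, label, cause)
Nodes = Dict[str, Dict[str, str]]


def expand_dc_publisher(resource: str, publisher: str) -> Tuple[Nodes, List[Edge]]:
    """Unfold "resource dc:publisher publisher" into an OPM-style pattern."""
    a1 = f"{resource}/A1"  # hypothetical id: the resource before publication
    a2 = f"{resource}/A2"  # hypothetical id: the resource after publication
    process = f"{resource}/P"  # hypothetical id: the publish process
    nodes: Nodes = {
        a1: {"kind": "artifact", "state": "unpublished"},
        a2: {"kind": "artifact", "state": "published"},
        process: {"kind": "process", "type": "publish"},
        publisher: {"kind": "agent", "type": "person"},
    }
    edges: List[Edge] = [
        (process, "used", a1),
        (a2, "wasGeneratedBy", process),
        (process, "wasActionOf", publisher),  # direction assumed
        (a2, "wasSameResourceAs", a1),
    ]
    return nodes, edges


nodes, edges = expand_dc_publisher("report", "Luc")
```

Because the publish step is an explicit process node, it can later be refined into finer-grained steps without changing the two artifact nodes around it.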

OPM benefit: refinement
[Figure labels: A1 (state=unpublished); review; used; publish; Ag (person, name=Luc); wasActionOf; P; wasSameResourceAs; wasGeneratedBy; approve; catalog; A2 (state=published)]

dc:contributor
[Figure labels: Ag; A1; used; contribution; P; dc:isVersionOf; wasGeneratedBy; A2]

OPM benefit: additional details
[Figure labels: Ag; A1; used; contribution; Contribution content; used; P; dc:isVersionOf; wasGeneratedBy; A2]

D-PROFILE

Provenance Across Application
[Figure labels: Application; Provenance Inter-Operability Layer; The Open Provenance Model (OPM)]

OPM Usage Thus Far
• OPM has been used for integration between monolithic systems
• Assumptions:
  – Agreement between applications on integration points
  – Little communication, mostly through the environment
  – Clear demarcation of functional components
  – The other party is “a good guy”

OPM in Distributed Systems
• Is OPM suitable for Distributed Systems?
• Can OPM deal with…
  – asynchronous / synchronous systems
  – failure, corruption, errors
  – transient processes
  – independent processes
  – defining applications across systems
• YES! (but we need some additions)

D-PROFILE
• A profile for modeling distributed systems within OPM
• Message-passing model
• Examples:
  – Web services
  – Pervasive systems
  – Mobile

Guidance: communication

Vocabulary
Edges
• WasConstructedFrom
• WasCopyOf
• WasSameMessageAs
• WasExtractedFrom
Properties
• attributedTo
• tracer

Compact Representation
• Subclass of Artifact: a D-Artifact
• Has annotations including:
  – Payload for sender & receiver
  – A message id
  – Tracers
  – Attribution
• Expansion Rules
• Save roughly half the nodes & edges
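A hedged Python sketch of what expanding a compact D-Artifact might look like. The annotation fields follow the list on this slide and the edge names come from the D-Profile vocabulary; the particular expansion pattern, the `@sender` and `@receiver` node ids and all function names are assumptions for illustration, not the profile's actual expansion rules.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

Edge = Tuple[str, str, str]  # (effect, label, cause)


@dataclass
class DArtifact:
    """Compact message node carrying the annotations listed on the slide."""
    message_id: str
    sender_payload: str
    receiver_payload: str
    attributed_to: str
    tracers: List[str] = field(default_factory=list)


def expand(message: DArtifact) -> Tuple[List[str], List[Edge]]:
    """Unfold one D-Artifact into separate sender-side and receiver-side nodes."""
    sent = f"{message.message_id}@sender"  # hypothetical node id
    received = f"{message.message_id}@receiver"  # hypothetical node id
    nodes = [message.sender_payload, sent, received, message.receiver_payload]
    edges: List[Edge] = [
        (sent, "WasConstructedFrom", message.sender_payload),
        (received, "WasSameMessageAs", sent),
        (message.receiver_payload, "WasExtractedFrom", received),
    ]
    return nodes, edges


compact = DArtifact("m1", "request-data", "parsed-request", attributed_to="client")
nodes, edges = expand(compact)
```

In this sketch one compact node unfolds into four nodes and three edges; attribution and tracers would travel as annotations and are left out of the expansion here.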

FEEDBACK: WHAT PROFILES ARE MISSING?

Extend OPM through a Profile
• Anyone can make a profile (Go for it!)
• Easiest route is through a Vocabulary
• Post to the wiki and gain a community following
  – Can also become endorsed…
• Lightweight Governance Model
  – http://twiki.ipaw.info/pub/OPM/WebHome/governance.pdf