Sample Use Cases for the Data Dictionary View

Sample Use Cases for the DataDictionary View in DDI Views (DDI 4)
Dan Gillman, Arofan Gregory, Larry Hoyle, Knut Wenzig
9/29/2020 NADDI 2017 Use Cases for the DataDictionary View in DDI Views

DDI Views - http://lion.ddialliance.org/
• DDIViews (DDI 4) has Functional Views, which are subsets of the whole model, typically with many fewer classes
• All Functional Views are compatible

DataDictionary Functional View
Focuses on physical file structure

Classes in the DataDictionary Functional View
Access, AttributeRole, CategorySet, CodeItem, CodeList, Concept, Coverage, DataPoint, DataRecord, DataStore, Datum, FundingInformation, IdentifierRole,
Individual, InstanceVariableMapping, Level, LogicalRecordLayout, Machine, MeasureRole, Organization, PhysicalLayoutOrderedPair, Population, RecordRelation, RectangularLayout, SegmentByText, SentinelConceptualDomain,
SentinelValueDomain, SpatialCoverage, StructureDescription, SubstantiveConceptualDomain, SubstantiveValueDomain, TemporalCoverage, TopicalCoverage, UnitType, Universe, ValueAndConceptDescription, ValueMapping, Viewpoint
This view has 43 of the current 858 classes in Lion

DDI-Views Packages (Example: FormatDescription)
Packages are useful when looking at, or developing, the overall model. They are not namespaces.

DDI-Views Packages (Example: FormatDescription Detail)
UML model features:
• Inheritance
• Realization
• Properties
• Relationships (aggregations, plain)

Describing a class in Lion – definition, notes, and examples
http://lion.ddialliance.org/ddiobjects/valuemapping

Describing a class in Lion – properties

Describing a class in Lion – relationships

Dagstuhl Week 2 Sprint 2016
https://ddi-alliance.atlassian.net/wiki/pages/viewpage.action?pageId=39911463
The Data Description working group tested the DataDescription View against these physical file layouts:
• CSV Simple rectangular
• Fixed Column rectangular
• CSV Segmented
• CSV Hierarchical
• CSV Aggregate (ncube)
We also looked at event history data and think it can be described with these elements too.
See: https://ddi-alliance.atlassian.net/wiki/display/DDI4/Data+Description+View+Team

Event History Data
• The event is the unit of observation
• Examples
  • A person visits a doctor: the dataset holds each person's id, the date of the visit, and the type of doctor; each row is one doctor's visit by one person
  • A petrol station registers each car when it refuels: the dataset holds the license plate number, the date/time of refuelling, and the amount of fuel
• Additions to DDI needed for event data
  • Which variable holds temporal information, and which variable holds the id of the object with multiple events (see Viewpoint)
  • A datatype of “pointer” or “reference”, the value of which is a DDI InstanceVariable or perhaps a ValueMapping
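The doctor-visit example can be sketched in Python. This is only an illustration, not DDI syntax: the column names (`person_id`, `visit_date`, `doctor_type`), the sample rows, and the helper `events_by_object` are all assumptions made for this sketch. It shows why a description of event data has to say which variable identifies the object and which one holds the time.

```python
from collections import defaultdict
from datetime import date

# Hypothetical event rows: each row is one doctor's visit by one person.
visits = [
    {"person_id": 2, "visit_date": date(2016, 3, 1), "doctor_type": "dentist"},
    {"person_id": 1, "visit_date": date(2016, 5, 9), "doctor_type": "GP"},
    {"person_id": 1, "visit_date": date(2016, 1, 20), "doctor_type": "dentist"},
]


def events_by_object(rows, id_var, time_var):
    """Group event rows by the object id and sort each group by time."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row[id_var]].append(row)
    return {
        key: sorted(group, key=lambda r: r[time_var])
        for key, group in grouped.items()
    }


# The caller must know which variable is the id and which is temporal;
# that is the information the slide says DDI needs to record.
histories = events_by_object(visits, id_var="person_id", time_var="visit_date")
```

Without the two role arguments the rows are just an unordered table; with them, each person's history can be reconstructed.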

Simple Rectangular CSV
https://ddi-alliance.atlassian.net/wiki/display/DDI4/Data+Description+View+Team
More details are available on the web in the report from the working group.

Simple Rectangular CSV – Australian Election Study
The example uses a small representative set of variables.
See: http://www.australianelectionstudy.org/about.html

Example DDI 4 XML

More XML

Variable Cascade in DDI
http://lion.ddialliance.org/package/conceptual
The InstanceVariable can reference a RepresentedVariable and a ConceptualVariable. It also inherits all properties and relationships from them.
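The inheritance side of the cascade can be pictured with a small Python sketch. The three class names come from the slide; the fields (`concept`, `unit_type`, `value_domain`, `name`, `physical_datatype`) are a simplified, assumed subset chosen for illustration, and the values are placeholders rather than real metadata.

```python
from dataclasses import dataclass


@dataclass
class ConceptualVariable:
    # What is measured, and on what kind of unit (assumed fields).
    concept: str
    unit_type: str


@dataclass
class RepresentedVariable(ConceptualVariable):
    # Adds how the values are represented (assumed field).
    value_domain: str


@dataclass
class InstanceVariable(RepresentedVariable):
    # Adds what is specific to one dataset (assumed fields).
    name: str
    physical_datatype: str


# Placeholder values; only the variable name appears in the slides.
divisnum = InstanceVariable(
    concept="placeholder concept",
    unit_type="placeholder unit type",
    value_domain="placeholder value domain",
    name="divisnum",
    physical_datatype="integer",
)
```

Because `InstanceVariable` sits at the bottom of the chain, an instance carries the conceptual and represented fields without repeating their definitions.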

Modeled to comply with GSIM
http://www1.unece.org/stat/platform/display/GSIMclick/Instance+Variable
http://www1.unece.org/stat/platform/display/GSIMclick/Represented+Variable

Logical and physical records – at the record level
Logical
• Variables
• Logical variable order
Physical
• LogicalRecord link
• Physical variable order
• Number of physical records per logical record
• Encoding etc.
• Record parameters (e.g. terminator)
• Delimited?/delimiter, quote char …

Physical Records – at the variable level
• Physical representation
  • Datatype, decimal separator, group separator …
• Location
  • Starting position, ending position, length
  • Record number
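To make the location information concrete, here is a minimal Python sketch that cuts variables out of one fixed-width record. The class `SegmentLocation`, the 1-based position convention, the column widths, and the sample record are assumptions for illustration; only the variable names come from the slides.

```python
from dataclasses import dataclass


@dataclass
class SegmentLocation:
    start: int   # 1-based starting position (assumed convention)
    length: int  # number of characters

    @property
    def end(self) -> int:
        """1-based, inclusive ending position derived from start and length."""
        return self.start + self.length - 1


def extract(record: str, loc: SegmentLocation) -> str:
    """Return the characters of `record` covered by `loc`."""
    return record[loc.start - 1 : loc.end]


# Hypothetical fixed-width layout for three variables.
layout = {
    "divisnum": SegmentLocation(start=1, length=3),
    "uniqueid": SegmentLocation(start=4, length=5),
    "mode": SegmentLocation(start=9, length=1),
}

record = "104000422"
values = {name: extract(record, loc) for name, loc in layout.items()}
```

Start, end, and length are redundant: any two determine the third, which is why the sketch stores two and derives the end.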

Describing a CSV
[Diagram: relationships among InstanceVariable, LogicalRecordLayout, ValueMapping, RectangularLayout (physical), and PhysicalLayoutOrder]

Instance Variable - Properties and Relationships
http://lion.ddialliance.org/ddiobjects/instancevariable
• Agency, ID, Version
• Name
• DisplayLabel
• DescriptiveText
• Annotation
• UnitOfMeasurement
• VariableRole
• PhysicalDatatype
• IntendedDatatype
• SubstantiveConceptualDomain
• SubstantiveValueDomain
• SentinelConceptualDomain
• SentinelValueDomain
• Concept
• UnitType
• Universe
• Population
• ConceptualVariable
• RepresentedVariable
• ExternalMaterial

LogicalRecordLayout - Properties and Relationships
http://lion.ddialliance.org/ddiobjects/logicalrecordlayout
• Agency, Id, Version
• Annotation
• Type
• Name
• Purpose
• InstanceVariable
• Viewpoint
• LogicalRecordLayout
• ExternalMaterial
• Realizes Collection
As a Collection, the LogicalRecordLayout allows for ordering InstanceVariables within the record. Viewpoints allow assigning roles to InstanceVariables: Identifiers, Measures, or Attributes.

RectangularLayout (PhysicalLayout) - Properties and Relationships
http://lion.ddialliance.org/ddiobjects/physicallayout
• Agency, ID, Version
• Annotation
• IsDelimited
• Delimiter
• IsFixedWidth
• EscapeCharacter
• LineTerminator
• QuoteCharacter
• CommentPrefix
• Encoding
• Header
• HeaderRowCount
• SkipBlankRows
• SkipDataColumns
• SkipInitialSpace
• SkipRows
• Trim
• NullSequence
• HeaderIsCaseSensitive
• ArrayBase
• TreatConsecutiveDelimitersAsOne
• Overview
• TableDirection
• TextDirection
• Type
• Name
• Purpose
• ExternalMaterial
• ValueMapping
• LogicalRecordLayout
• Realizes Collection
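A few of these properties map naturally onto settings of a CSV reader. The sketch below uses Python's standard `csv` module as a stand-in reader; the dictionary keys are hypothetical snake_case renderings of the property names above, the function `read_rectangular` is invented for illustration, and the sample data is made up. It is not a DDI processor and covers only six of the listed properties.

```python
import csv
import io

# Hypothetical layout description using a handful of the listed properties.
layout = {
    "delimiter": ",",
    "quote_character": '"',
    "header_row_count": 1,
    "skip_blank_rows": True,
    "skip_initial_space": True,
    "null_sequence": "NA",
}


def read_rectangular(text, layout):
    """Read delimited text according to the layout; return (headers, data)."""
    reader = csv.reader(
        io.StringIO(text),
        delimiter=layout["delimiter"],
        quotechar=layout["quote_character"],
        skipinitialspace=layout["skip_initial_space"],
    )
    rows = list(reader)
    header_count = layout["header_row_count"]
    headers = rows[:header_count]
    data = []
    for row in rows[header_count:]:
        # A blank line comes back from csv.reader as an empty list.
        if layout["skip_blank_rows"] and not any(cell.strip() for cell in row):
            continue
        # Cells equal to the null sequence become None.
        data.append(
            [None if cell == layout["null_sequence"] else cell for cell in row]
        )
    return headers, data


sample = 'divisnum,uniqueid,mode\n104, 42,"2"\n\n105,NA,1\n'
headers, data = read_rectangular(sample, layout)
```

Keeping the layout as data separate from the reading code mirrors the idea of the slide: the same reader can handle different files once their physical layout is described.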

ValueMapping - Properties and Relationships
http://lion.ddialliance.org/ddiobjects/valuemapping
• Agency, Id, Version
• PhysicalDatatype
• DefaultDecimalSeparator
• DefaultDigitalGroupSeparator
• NumberPattern
• DefaultValue
• NullSequence
• Format
• Length
• MinimumLength
• MaximumLength
• Scale
• DecimalPositions
• Required
• ExternalMaterial
• InstanceVariable
• PhysicalSegmentLocation (e.g. start, end, length)
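As an illustration of how separator and null-sequence properties affect the reading of a single value, here is a small Python sketch. The function `parse_value` and its parameter names are assumptions for this example; they loosely mirror the decimal separator, digit group separator, and null sequence properties listed above, and the sample inputs are made up.

```python
from decimal import Decimal


def parse_value(raw, decimal_separator=".", digit_group_separator=",",
                null_sequence=""):
    """Convert the stored text of one value into a Decimal, or None if null."""
    if raw == null_sequence:
        return None
    # Drop grouping characters first, then normalise the decimal mark.
    cleaned = raw.replace(digit_group_separator, "")
    cleaned = cleaned.replace(decimal_separator, ".")
    return Decimal(cleaned)


# The same number written with two different conventions.
continental = parse_value("1.234,5", decimal_separator=",",
                          digit_group_separator=".")
anglo = parse_value("1,234.5")
missing = parse_value("NA", null_sequence="NA")
```

The order of the two replacements matters: removing the grouping character before converting the decimal mark avoids deleting the decimal point when the conventions are swapped.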

PhysicalLayoutOrder - Properties and Relationships
http://lion.ddialliance.org/ddiobjects/valuemapping
• Order Criteria (alphabetical, numerical …)
• Semantics (before)
• Reflexivity (Anti-Reflexive: a variable is not before itself)
• Symmetry (Anti-Symmetric: A before B means B is not before A)
• Transitivity (Transitive: if A before B and B before C, then A before C)
• The PhysicalLayout it structures
• PhysicalLayoutOrderedPairs

PhysicalLayoutOrderedPair - Properties and Relationships
http://lion.ddialliance.org/ddiobjects/valuemapping
• Source InstanceVariable
• Target InstanceVariable
The source variable is “before” the target variable. The PhysicalLayoutOrder defines the semantics: “before”.
Example: divisnum before uniqueid before mode …
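The "before" pairs can be turned into a column order with a topological sort. The sketch below is one possible reading, assuming Python 3.9+ for `graphlib`; the function names are invented for illustration, and the pairs are the three variables from the example above. It also checks that a set of pairs is consistent with an anti-reflexive, anti-symmetric, transitive relation.

```python
from graphlib import TopologicalSorter

# Each pair is (source, target): the source variable is "before" the target.
pairs = [("divisnum", "uniqueid"), ("uniqueid", "mode")]


def transitive_closure(pairs):
    """Add every pair implied by transitivity (A<B and B<C gives A<C)."""
    closure = set(pairs)
    changed = True
    while changed:
        changed = False
        for a, b in list(closure):
            for c, d in list(closure):
                if b == c and (a, d) not in closure:
                    closure.add((a, d))
                    changed = True
    return closure


def is_strict_order(pairs):
    """True if no variable ends up before itself once transitivity is applied.

    In a transitive relation, anti-reflexivity also rules out A<B together
    with B<A, so this single check covers anti-symmetry as well.
    """
    return all(a != b for a, b in transitive_closure(pairs))


def physical_order(pairs):
    """Return the variables in an order that respects every 'before' pair."""
    sorter = TopologicalSorter()
    for source, target in pairs:
        sorter.add(target, source)  # source is a predecessor of target
    return list(sorter.static_order())


order = physical_order(pairs)
```

For a simple chain the result is unique; if the pairs leave some variables unordered relative to each other, a topological sort returns one of several valid orders.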

Typical CSV RectangularLayout in XML

InstanceVariable for variable “DivisNum” in XML
• Actually an “ExternalControlledVocabularyEntry”
• Much more about the ValueDomain for DivisNum could be described with SubstantiveValueDomain and SentinelValueDomain

CSV ValueMapping for variable “DivisNum” in XML
• Also an “ExternalControlledVocabularyEntry”

Fixed Layout ValueMapping for variable “DivisNum” in XML

Viewpoint
http://lion.ddialliance.org/ddiobjects/viewpoint
• Assigns one of three roles to variables:
  • Identifier
  • Measure
  • Attribute
• Useful for
  • Aggregate data (ncubes)
  • Event data (which variable holds temporal information, which variable holds the id of the object with multiple events)
  • Annotating variables (e.g. paradata)
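A Viewpoint can be pictured as a mapping from variables to one of the three roles. The Python sketch below is an assumption-laden illustration: the enum `Role`, the helper `variables_with_role`, and the four variable names of a made-up aggregate table are not taken from DDI or from the slides.

```python
from enum import Enum


class Role(Enum):
    IDENTIFIER = "identifier"
    MEASURE = "measure"
    ATTRIBUTE = "attribute"


# Hypothetical viewpoint on an aggregate table: the identifiers define a
# cell, the measure is the cell value, the attribute annotates it.
viewpoint = {
    "region": Role.IDENTIFIER,
    "year": Role.IDENTIFIER,
    "population_count": Role.MEASURE,
    "estimate_flag": Role.ATTRIBUTE,
}


def variables_with_role(viewpoint, role):
    """List the variables assigned to `role`, in their declared order."""
    return [name for name, assigned in viewpoint.items() if assigned is role]


identifiers = variables_with_role(viewpoint, Role.IDENTIFIER)
measures = variables_with_role(viewpoint, Role.MEASURE)
```

The same set of variables could be given a different Viewpoint for another purpose, which is why the roles are kept outside the variable definitions in this sketch.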