Introduction to DDI 3 0 Sanda Ionescu ICPSR
- Slides: 155
Introduction to DDI 3. 0 Sanda Ionescu ICPSR CESSDA Expert Seminar, September 2007
DDI Version 3. 0 • Radically different. • More complex… (…but certainly doable!) • Brings important benefits.
Workshop Schedule 14: 30 – 15: 10 – 15: 35 – 15: 45 – 16: 10 – 16: 30 – 16: 40 – 17: 10 – 17: 30 Overview (40) Structure and Technical Mechanisms (25) Break (10) Study Unit – Modules Content (25) Variable Markup Example (20) Break (10) Grouping – Modules Content and Examples (30) Getting Started (20)
DDI 3. 0 Overview
DDI Background Development History • 1995 – A grant-funded project initiated and organized by ICPSR proposes to create a new standard for documenting social science data, to replace OSIRIS tagged codebooks. • First drafts used SGML, then converted to Webfriendly XML. • 2000 – DDI Version 1. 0 published as a mainly document- and codebook-centric standard.
DDI Background Development History • 2003 – DDI Version 2. 0 published with extended scope: – Aggregate data coverage (based on matrix structure) – Additional geographic representation to assist geographic search systems and GIS users • Versions 1. 0 through 2. 1 (latest published) are backwards compatible, and based on the same structure.
DDI Background Development History • February 2003 – Formation of the DDI Alliance, a self-sustaining membership organization whose members have a voice in the development of the DDI specification. http: //www. ddialliance. org/
DDI Background Development History Version 3. 0: • • 2004 -2006: Planning and Development November 2006: Internal Review February 2007: Public Review July 2007: Candidate Draft Release http: //www. ddialliance. org/ddi 3/index. html
Benefits of using DDI as an XML-based standard • Interoperability: – Enables seamless exchange and reuse by other systems. • Repurposing: – Provides a core document from which different types of outputs can be generated. • Value-added documentation: – Tagging carries “intelligence” in the document by describing content. • Enhanced Data Discovery: – Increases precision and granularity of searches. • Support for Data Analysis: – Variables description is accepted as input by online analysis systems. • Multiple presentation formats: – ASCII – text; PDF; HTML; RTF. • Preservation-friendly: – Non-proprietary format.
Why DDI 3. 0? DDI 3. 0 presents new features in response to: • Perceived needs of: -Data users -Data producers -Data archivists/librarians • Developments in documenting and archiving data • Advances in XML technology
DDI 3. 0 and the Data Life Cycle Model DDI Versions 1/2 were codebook-centric: • Closely followed the structure of traditional print codebooks. • Captured data documentation at a single, “frozen” point in time – archiving.
DDI 3. 0 and the Data Life Cycle Model Version 3. 0 is Life Cycle oriented: -Designed to cover all stages in the life cycle of a data collection: pre-production post-production secondary use
Life Cycle Coverage in DDI 3. 0 Planning for the Study: Proposal / Design Study Purpose / Outline Concepts Study Population Author(s) Funding Sources Version 3. 1 Survey / Sample Design Pre-testing
Life Cycle Coverage in DDI 3. 0 Proposal becomes reality… Data Collection methodology: sampling, time, etc. Instrument characteristics Questionnaire Data cleaning, weighting, coding, etc.
Life Cycle Coverage in DDI 3. 0 Publishing the data… Intellectual content: Variables, Categories, Codes. Physical representation: Data format, Record structure, Statistics.
Life Cycle Coverage in DDI 3. 0 Archiving / (Re)Distributing the data collection… Processing checks Holdings, availability and access conditions
Life Cycle Coverage in DDI 3. 0 DDI becomes “visible” to the outside world… DDI Instance: Pulls together all life cycle stages Acquires its own identity as an object Becomes a tool for data discovery and analysis
Life Cycle Coverage in DDI 3. 0 Secondary use of data – new conceptual framework… New DDI Instance: New Purpose New Logical Product New Physical Description of Data
DDI 3. 0 and the Data Life Cycle Model Advantages of Life Cycle orientation: • Allows capture and preservation of metadata generated by different agents at different points in time. • Facilitates tracking changes and updates in both data and documentation.
DDI 3. 0 and the Data Life Cycle Model Advantages of Life Cycle orientation: • Enables investigators, data collectors and producers to document their work directly in DDI, thus increasing the metadata’s visibility and usability. • Benefits data users, who need information from the full data life cycle for optimal discovery, evaluation, interpretation, and re-use of data resources.
New / Extended Functionalities in DDI 3. 0: Questionnaire Versions 1/2: - No instrument coverage. - Question text only as part of variable description. - No documentation for question flow / conditions. Version 3. 0: - Full description of instrument as a separate entity. - Documents specific use of questions: flow, conditions, loops. - Compatible with Computer Assisted Interviewing software.
New / Extended Functionalities in DDI 3. 0: Complex Data Versions 1/2: - Inadequate representation of complex / hierarchical data Version 3. 0: - Detailed documentation for complex / hierarchical data Logical structure of records Record Types and Relationships Relevant variables: key-link, case identification, record type locator Physical layout of records Single “hierarchical” file for all records, multiple rectangular files, relational database, etc.
New / Extended Functionalities in DDI 3. 0: Aggregate Data Versions 1/2: - Initially designed for microdata only - Aggregate data section added in V 2. 1 to support limited representation (Census-type data, delimited files) Version 3. 0: - Adds support for tabular, spreadsheet-type, representation of aggregate data - Aggregate data transport option: cell content may be included inline with the data item description
New / Extended Functionalities in DDI 3. 0: Data Transport Versions 1/2: -None Version 3. 0: -In-line inclusion enabled for both aggregate data and microdata
New / Extended Functionalities in DDI 3. 0: Longitudinal / Time Series / Cross-national Data Comparability Versions 1/2: -None Version 3. 0: -Grouping structure documents studies related on one or several dimensions (time, geography, language, etc. ) as well as their comparability
New / Extended Functionalities in DDI 3. 0: Increased Multilingual Support Versions 1/2: - Limited <anytag xml: lang=“”> Version 3. 0: - Support for multiple language use and translations <International. String. Type xml: lang=“” translated=“” translatable=“”> <Variable> <Label xml: lang=“ger” translated=“false” translatable=“true”> Geburtsjahr</Label> <Label xml: lang=“eng” translated=“true”>Year of Birth</Label> </Variable>
DDI 3. 0 Specification: Schema-based Versions 1/2: - DTD-based Version 3. 0: - Schema-based: Data typing supports machine actionability Use of namespaces supports - Modularity - Extensibility and reuse - Alignment with / use of other standards
DDI 3. 0 Specification: Machine-actionable Versions 1/2: - Machine-readable Version 3. 0: - Machine-actionable: 1. Data typing: increased use of controlled vocabularies and standard codes 2. Larger set of required elements Predictable content = a more consistent base for programming
DDI 3. 0: Modular Structure Version 1/2: - Single file, hierarchical design Version 3. 0: - Modular design: - Facilitates reuse Facilitates versioning and maintenance Supports life cycle model Allows flexibility in organizing the DDI Instance Supports grouping and comparing studies Supports creation of metadata registries
DDI 3. 0: Alignment with other metadata standards Versions 1/2: - MARC, Dublin Core (bibliographic standards) Version 3. 0: - MARC, DC, but also… SDMX (Statistical Data and Metadata Exchange) ISO 11179 (Metadata Registries) FGDC (Digital Geospatial Metadata) ISO 19115 (Geographic Information Metadata)
DDI 1/2 or DDI 3. 0? • DDI 3. 0 will not supersede DDI 2. 1. • Both versions will – coexist – continue to be maintained – be used according to specific needs. • All DDI 1/2 markup will not have to be migrated to Version 3. 0.
DDI 3. 0 Structure and Mechanisms
DDI 3. 0 – Modular Structure Building blocks of DDI 3. 0: » Modules » Schemes
DDI 3. 0 – Modular Structure Modules: • Document different aspects of a study, or group of studies, following the data through their life cycle (Conceptual Components, Data Collection, Logical Product, Physical Instance, etc. ) Schemes: • Include collections of sibling “objects” that are traditionally components of a variable description: Concepts, Universes, Questions, Variable Labels and Names, Categories, Codes.
DDI 3. 0 – Modular Structure Modules: • Can live independently (have their own schemas) or connected to one another within a hierarchical structure. Schemes: • Can live semi-independently (need a higher-level wrapper as they do not have their own schemas) or in-line within a Study Unit or Group module.
DDI 3. 0 – Modular Structure DDI 3. 0 model = a multi-branched hierarchy Module level: DDI Instance Study Unit Conceptual Components Data Collection Group Archive Organizations Study Unit Resource Package Subgroup Study Unit Subgroup (Sub)group
DDI 3. 0 – Modular Structure DDI 3. 0 model = a multi-branched hierarchy Within modules: Data Collection Methodology Sampling Time Method Question Scheme Question Item Processing Weighting Coding
DDI 3. 0 – Modular Structure Relationships are established through: • In-line inclusion (Relational order is explicit) • Referencing Internal External (Relational order is implicit)
DDI 3. 0 – Structural mechanisms Enable modular design and help actualize its benefits. • Inheritance • Referencing • Identification
DDI 3. 0: Inheritance • Inheritance is based on the hierarchical structure of the model. • In DDI 3. 0 a number of elements are reused at different levels of the hierarchy. • When the same element is present at multiple levels, lower levels inherit content from the upper levels, and only need to specify differences (=local overrides).
DDI 3. 0 Inheritance Example • Instance: Coverage: Spatial: 50 US states -Study Unit A – no Spatial Coverage defined = will be inherited from Instance -Study Unit B – Coverage: Spatial: 48 coterminous states = supersedes definition in Instance
DDI 3. 0: Referencing • DDI 3. 0 modular structure is dependent upon creating relationships by reference. • Referencing implies bringing up the content of a DDI object within, or in association with, another object, by specifying its Unique Identifier. • Identifiers are the key links between DDI objects.
DDI 3. 0: Referencing Example Data Collection Module: Question Scheme: Question: ID: “Q 1” Conceptual Components Module: Concept Scheme: Concept: ID: “C 1” Text: “How many days in the past week did you watch the national network news on TV? ” Description: “Exposure to national TV news” Logical Product Module: Variable Scheme: Variable: ID: “V 1” Name: V 043014 Label: Days past week watch natl news on TV Question Reference: ID: “Q 1” Concept Reference: ID : “C 1”
DDI 3. 0: Referencing Example
DDI 3. 0: Identification Consistency in building and using identifiers is needed for: – Proper functioning of reference systems, enabling a smooth exchange and reuse of existing metadata. – Machine-actionability of DDI instances, allowing them to serve as a basis for running programs and processes.
DDI 3. 0: Identification Element types used in the Identification system: Maintainable Versionable Identifiable All elements
DDI 3. 0: Identification Element Types Non-identified elements: – Require context, which is provided by containing parents. Example: codes within code schemes – Are not reusable. Example: variable and category statistics
DDI 3. 0: Identification Element Types Identifiables – Carry their own ID – May be referenced / reused – Cannot be versioned or maintained, except as part of a complex parent element (Example: Variable – a change implies a new version of the entire scheme).
DDI 3. 0: Identification Element Types Versionables – Carry their own ID – Carry their own Version: content changes are important to note (Example: Concept – may be independently versioned within a scheme).
DDI 3. 0: Identification Element Types Maintainables – Are higher level DDI objects – Are both identifiable and versionable – Can also be published and maintained as separate entities (Example: all modules, schemes, comparison maps)
DDI 3. 0: Identification Structure • Maintainable elements: – URN and / or ID + Identifying Agency + Versioning Information: Version Date Version Responsibility Version Rationale • Versionable elements: – URN and / or ID + Versioning Information • Identifiable elements: – URN and / or ID
DDI 3. 0: Identification Structure Non-specified Identification information is inherited from the levels above. Example 1: Inheritance is assumed…. Maintainable: Variable Scheme: ID: Var. Scheme_A Identifying Agency: ICPSR Version: 1. 0 Identifiable: Variable: ID: Var_1 [Identifying Agency] [Version]
DDI 3. 0: Identification Structure Non-specified Identification information is inherited from the levels above. Example 1: Inheritance is assumed… Example 2: Inheritance is applied by default Maintainable: Variable Scheme: ID: Var. Scheme_A Identifying Agency: ICPSR Version: 1. 0 Identifiable: Variable: ID: V 1 [Identifying Agency] [Version] Maintainable: Logical Product ID: Logical. Prod_Y Identifying Agency: ICPSR Version: 1. 0 Maintainable: Variable Scheme: ID: Var. Scheme_A Identifying Agency: [ ] Version: [ ]
DDI 3. 0: Identification Structure: IDs Uniqueness of Identifiers is necessary for both internal and external referencing: 1) All IDs MUST be unique within a maintainable 2) All maintainables MUST have unique IDs across an Agency
DDI 3. 0: Identification Structure: Creating unique Identifiers A DDI Instance may include multiple maintainables at different hierarchical levels: Instance (maintainable) – unique ID within Identifying Agency Study Unit (maintainable) – unique ID within Identifying Agency Logical Product (maintainable) – unique ID within Identifying Agency Variable Scheme (maintainable) – unique ID within Identifying Agency
DDI 3. 0: Identification Structure: Creating Unique Identifiers Markup: Instance_A (unique at ICPSR) Study. Unit_1 Logical Product_1 Instance_B (unique at ICPSR) Study. Unit_1 Logical Product_1 Variable. Scheme_1 Variable_1 Post-markup: Variable ID: Instance_AStudy. Unit_1 Logical. Product_1 Variable. Scheme_1 Variable_1 Instance_BStudy. Unit_1 Logical. Product_1 Variable. Scheme_1 Variable_1
DDI 3. 0: Identification Structure: URNs • Have a fixed structure and MUST include object ID, Identifying Agency, and Version. • For versionable and identifiable elements, the containing maintainable is specified. • Take precedence when both a URN and the Identification sequence are used for the same object. • May be constructed post-markup from the Identification sequence.
DDI 3. 0: Identification: URN Structure Examples: • Object name Identifying Agency Object ID Object Version Maintainables: urn: ddi: 3. 0: Study. Unit: ddialliance. org: Study. Unit_ID: 1. 0 • Versionables: urn: ddi: 3. 0: Concept. Scheme: ddialliance. org: Concept. Scheme_ID: 1. 0: Concept_ID: 2. 1 • Identifiables: urn: ddi: 3. 0: Variable. Scheme: ddialliance. org: Variable. Scheme_ID: 1. 0: Variable_ID
DDI 3. 0: Referencing Reference structure: • URN, and/or: • [Referenced object’s] ID + Identifying Agency + Version + [Containing] Module ID + [Containing] Scheme ID
DDI 3. 0: Reuse of Information Referencing Mechanisms for REUSE Inheritance Reuse of Information: 1. 2. 3. 4. 5. Facilitates development of documentation throughout the study life cycle Promotes interoperability and standardization across organizations Saves markup time and effort Reduces the risk of human entry error Provides a basic level of implicit comparability
DDI 3. 0 Modules Content, Markup Examples
DDI Version 3. 0 Modules -- Structural Overview -- DDI Instance Study Unit Concepts Data Coll. Logical Pr. etc… Group Study Unit Subgroup Resource Package Study Unit Sub(Group)
Other “specialized” DDI 3. 0 modules • Aggregate Data: – NCube Logical Product – Inline NCube Record Layout – Tabular NCube Record Layout • Inline Microdata: – Dataset • User-specific Markup Templates: – DDI Profile
DDI Version 3. 0 Modules -- Structural Overview -DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
DDI 3. 0 Modules used to mark up a simple study
DDI 3. 0 modules for documenting a single, survey-type study DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
DDI 3. 0 modules for documenting a single, survey-type study • Instance • [Reusable] • [XHTML] – Study Unit • Conceptual Component • Data Collection • Logical product • Physical Data Product • Physical Instance • Archive – Organizations
DDI Version 3. 0 Modules -- Structural Overview -DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
DDI Instance -- wrapper for all modules - • Identification – URN – Identification Sequence – Name Citation … (+ optional DC Elements) • • Coverage – Topical – Spatial – Temporal • • • Group (module) – repeatable Resource Package (module) - repeatable Study Unit (module) - repeatable Other Material(s) Note(s) Translation Information
Coverage in DDI 3. 0 Study: American National Election Study (ANES), 2004 • Topical Coverage: – Subject: • Historical and Contemporary Electoral Processes – Keyword: • Electoral campaigns • Political attitudes • Political participation • Spatial Coverage: – Description: United States – Top level: nation – Lowest level: congressional district • Temporal Coverage: – Date: 2004
DDI Version 3. 0 Modules -- Structural Overview -DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
Study Unit -- documents a single “study” - • • • • Identification, Other Material(s), Note(s) Citation Abstract Universe Reference Funding Information Purpose Coverage Analysis Unit Embargo Conceptual Component (module) Data Collection (module) Logical Product (module) Physical Data Product (module) Physical Instance (module) Archive (module) – Organizations (module)
DDI Version 3. 0 Modules -- Structural Overview -DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
Conceptual Component -- lists concepts and universes - • Identification, Other Material(s), Notes • Coverage • Concept Scheme… or Reference to External Scheme – Vocabulary – describes vocabulary used – Concept • Label • Description • Similar Concept – Difference – Concept Group • Concept Reference (nestable) • Universe Scheme … or Reference to External Scheme – Universe • Human Readable • Machine Readable • Subuniverse – Subuniverse
DDI Version 3. 0 Modules -- Structural Overview -DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
Data Collection • Identification, Other Material(s), Note(s) • Coverage • Methodology – Time Method – Sampling • Collection Event – – Data Collector Data Source Collection Date (s) Mode of data collection Question Scheme – lists actual questions Instrument – documents question flow, conditions • • • Processing Event – – Control and cleaning operations Weighting Data Appraisal Information Coding
DDI Version 3. 0 Modules -- Structural Overview -DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
Logical Product -- documents intellectual content of data - • Identification, Other Material(s), Note(s) • Coverage • Category Scheme … or Reference to external category scheme – Category • Label • Derivation (if applicable) • Definition • Code Scheme – – … or Reference to external code scheme Category Scheme Reference Hierarchy Type Level (in the hierarchy) Code • Category Reference • Value • Code (nestable) • Variable Scheme … or Reference to external variable scheme
Logical Product Variable Scheme: Variable • Variable … or Reference to an externally documented variable – Identification • Name – – – – Label Definition Universe Reference Concept Reference Question Reference Embargo Reference Response Unit Analysis Unit – Representation • • Imputation Derivation Coding Instructions Value Representation: » Text » Date / Time » Numeric » Code
Logical Product Variable Scheme: Variable Group • Variable Group: – – – – Type Label Definition Universe Reference Concept Reference Variable Reference (lists variables in the group) Variable Group Reference (allows nesting of groups) • Variable Group Reference (use for externally documented Variable Group)
DDI Version 3. 0 Modules -- Structural Overview -DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
Physical Data Product -- Describes Physical Layout of Data - • Identification, Other Material(s), Note(s) • Logical Product Reference • Gross Record Structure: – Records Per Case – Variable Quantity – Logical Record Reference – Physical Record Reference • Related Logical Records • Record Layout: – Data Item – Variable Reference – Physical Location – Value Location » Start. Position » Width • Dataset (module)
DDI Version 3. 0 Modules -- Structural Overview -DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
Physical Instance -- Documents a specific data file -- • • Identification, Other Material(s), Note(s) Citation Coverage Physical Data Product Reference Data File Identification – Location – URI Gross File Structure – Creation Software – Case Quantity – Overall Record Count Statistics – Logical Product Reference – Variable Statistics • Variable Reference • Total Responses • Summary Statistics • Category Statistics » Value » Statistic
DDI Version 3. 0 Modules -- Structural Overview -DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
Archive • • Identification, Other Material(s), Note(s) Archive Specific – Item • • • Location Call Number URI Format Media Availability Status – Access • • Confidentiality Statement Access Permission Restrictions Citation Requirement Deposit Requirement Access Conditions Disclaimer Contact – Funding Information • Life Cycle Information – Event • • • Type Date Agency Description Organizations (module)
DDI Version 3. 0 Modules -- Structural Overview -DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
Organizations • • Identification Organization – URL – Individual • Individual – Organization – Title – Language • Name Description Location Telephone E-mail Relation Role – – – • • Entity Reference Organization Reference Individual Reference Description Period Relation – – Organization Reference Individual Reference Description Period
DDI 3. 0 Markup Example A Survey Variable
Version 2. 1 vs. Version 3. 0 Example: A survey variable ASCII codebook:
Version 2. 1 vs. Version 3. 0 Example: A survey variable in Version 2. 1 Data Description: Variable
Version 2. 1 vs. Version 3. 0 Example: A survey variable in Version 2. 1 name=“V 043015”
Version 2. 1 vs. Version 3. 0 Example: A survey variable in Version 3. 0 Logical Product: Variable Scheme Conceptual Component: Concept Scheme Universe Scheme Data Collection: Question Scheme Logical Product: Code Scheme Logical Product: Category Scheme Physical Instance: Statistics
Version 2. 1 vs. Version 3. 0 Example: A survey variable in Version 3. 0 Conceptual Component Concept Scheme: Concept: ID Universe Scheme: (Sub)Universe: ID Logical Product: Category Scheme: ID Category: ID Logical Product Variable Scheme: ID Variable: ID Logical Product: Code Scheme: ID Code Data Collection: Question Scheme: ID Question: ID Physical Instance: Statistics: Variable Statistic Category Statistics
DDI 3. 0 Markup: A Survey Variable Concept: Attention to Presidential Campaign on National TV Conceptual Component: Concept Scheme: Concept
DDI 3. 0 Markup: A Survey Variable Concept
DDI 3. 0 Markup: A Survey Variable Universe Conceptual Component: Universe Scheme: (Sub)Universe (A 7: How many days in the PAST WEEK did you watch the NATIONAL network news on TV? 0 -7; 8=DK; 9=RF)
DDI 3. 0 Markup: A Survey Variable Universe
DDI 3. 0 Markup: A Survey Variable Question ID, Question Text Data Collection: Question Scheme: Question Item
DDI 3. 0 Markup: A Survey Variable Question ID, Question Text Other Response Domains:
DDI 3. 0 Markup: A Survey Variable name, label, type of physical representation Logical Product: Variable Scheme: Variable
DDI 3. 0 Markup: A Survey Variable name, label, type of physical representation Other types of Representation:
DDI 3. 0 Markup: A Survey Variable Category labels, missing data information Logical Product: Category Scheme: Category
DDI 3. 0 Markup: A Survey Variable Category labels, missing data information missing=“true”
DDI 3. 0 Markup: A Survey Variable Category Values Logical Product: Code Scheme: Code
DDI 3. 0 Markup: A Survey Variable Category Values
DDI 3. 0 Markup: A Survey Variable Statistics Physical Instance: Statistics Variable Statistics: Category Statistic
DDI 3. 0 Markup: A Survey Variable Statistics
DDI 3. 0 Markup: A Survey Variable Logical Product Module
DDI 3. 0 Markup Modules used in a full variable description Concept Universe Question Values Value Labels Variable name Variable label Statistics Location: Physical Data Product
DDI 3. 0 Modular Approach Advantages • Modules and schemes can be independently maintained. • Pieces of information can be reused without being repeated.
DDI 3. 0 Modular Approach: Reusing information
Variable Markup in Version 2 -- carries redundant information--
Variable Markup in Version 3. 0 Modular Approach: Reusing Information
DDI 3. 0 Grouping
DDI 3. 0: Groups • Entirely new feature in DDI 3. 0. • Designed to document and compare related studies.
DDI 3. 0 Modules -- Structural Overview -DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
DDI Version 3. 0 Modules -- Structural Overview -DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
Group -- documents “families” of studies - • • • Identification, Other Material(s), Note(s) Citation Abstract Universe Funding Information Purpose Coverage Universe Reference Conceptual Component (module) Data Collection (module) Logical Product (module) Archive (module) – Organizations (module) • Study Unit (module) • Group (module) • Comparative (module)
DDI 3. 0 Grouping Attributes • Set of mandatory attributes indicate the nature of the relationships among group members • Group parameters: – – – Time Instrument Panel (population of respondents) Geography Datasets Language
DDI 3. 0 Grouping Attributes Example
DDI 3. 0: Types of Groups • Groups of studies may be: – Formal (“by design”): • Designed to be compared (longitudinal, time-series, or cross-national studies) • Documented and compared through use of Inheritance – Informal (“ad-hoc”): • Decision to group and compare is taken postproduction, or “after the fact”. • Comparability documented in the Comparative module
Formal Groups: Inheritance Example 1: Time-series: Same questions repeated over time, same resulting variables. Group (Studies A-C) Temporal Coverage_G 1: 1991 -1993 Data Collection: Question Scheme Logical Product: Variable Scheme Study A Temporal Coverage: 1991 Temporal 1991 (Replace Ref: G_1) …… Product Physical Data Product Physical Instance: Statistics Physical Instance Study B Study C Temporal Coverage: 1992 Temporal Coverage: 1993 (Replace Ref: G_1) …… Product. . . . Physical Data Product Physical Instance: Statistics Physical Instance
Formal Groups: Inheritance Attributes “Add”, “Replace”, “Delete”. • In a complex grouping structure inheritance paths may become quite intricate. • ID attributes ADD, REPLACE and DELETE are introduced to resolve potential inheritance ambiguities: – ADD = [empty] -> flags element as a new addition. – REPLACE = “Reference. Type” -> referenced element is being replaced at the lower level (“local override”). – DELETE = “Reference. Type” -> referenced element is being deleted at the lower level.
Formal Groups: Inheritance Example 2: Time-series: Same core questions repeated over time, different topical modules added to each iteration. Group (Studies A-C) Data Collection: Core Questions(Q 1 -Q 50) Logical Product: Core Variables (V 1 -V 50) Study A Study B Topical Module “Health Status” Topical Module “Gun Control” Data Collection: ADD: Questions (Q 51 A-Q 80 A) Logical Product: ADD: Variables (V 51 A-V 80 A) Data Collection: ADD: Questions (Q 51 B-Q 80 B) Logical Product: ADD: Variables (V 51 B-V 80 B) etc…
Formal Groups: Inheritance Example 3: Any group by design: some questions are not asked in some iterations. Group (Studies A-E) Data Collection: All Questions (Q 1 -Q 100) Logical Product: All Variables (V 1 -V 100) Study B Study A Data Collection: DELETE: Question Q 55 Logical Product: DELETE: Variable V 55 Group (Studies C-E) Data Collection: DELETE: Questions Q 60 -Q 69 Logical Product: DELETE: Variables V 60 -V 69 Study C Study D Study E
Formal Groups: Inheritance Example 4 (SOEP, Germany): Longitudinal: Same variables, with different name each year. (No name) ADD: Name only
Formal Groups: Inheritance Example 5 (SOEP, Germany): Longitudinal: In 2002 variable “Income” changes currency from DM to Euro: change in question wording. (No question) ADD: question only
Formal Groups: Inheritance Example 5 (SOEP, Germany) continued: variables also change names every year… These
Formal Groups: Inheritance Example 5 (SOEP, Germany) – the final picture: information is inherited down the hierarchy.
Inheritance in Formal Groups • Simplification of DDI Instances: common metadata is only entered once. • More efficient means of documentation: for new additions, only differences need to be specified. • Relational information embedded in the inheritance structure: comparison becomes machine-actionable.
DDI Version 3. 0 Modules -- Structural Overview -DDI Instance Study Unit Group Conceptual Component Data Collection Logical Product Physical Data Product Archive Physical Instance Comparative Archive Organizations Study Unit Group
Comparative -- documents comparability in ad-hoc groups - • Identification, Note(s) • Comparison Description (human-readable) • Concept Map – Source Scheme Reference – Target Scheme Reference – Item Map • • • Source Item Target Item Map Type Difference Variable Map Question Map Category Map Code Map Universe Map
DDI 3. 0 Using the Comparative Module Instructions on how to use the Comparative Module and build comparison maps: “DDI 3. 0 User Guide”, pp. 45 -49. http: //www. ddialliance. org/DDI/ddi 3
Producing DDI 3. 0 markup Getting started
DDI 3. 0: Tools projects DDI Toolkit: • Core library for developing open source tools • • Version 1/2 <-> Version 3. 0 converters DDI 3. 0 URN resolution tool DDI 3. 0 validation tool Version 3. 0 stylesheets with display and editing layers • Grouping tool • Concept management tool • Registry applications
Producing DDI 3. 0 markup -- Getting started -Software to assist in document creation: • De. Xtris: – XML browser – Converts DDI 1/2 to DDI 3. 0 http: //www. opendatafoundation. org/tools/dextris
DDI 3. 0 Tools: Using Dextris
DDI 3. 0 Tools: Using Dextris
DDI 3. 0 Tools: Using Dextris
DDI 3. 0 Tools: Using Dextris
DDI 3. 0 Tools: Using Dextris
DDI 3. 0 Tools: Using Dextris
DDI 3. 0 Tools: Using Dextris
DDI 3. 0 Tools: Using Dextris
DDI 3. 0 Tools: Using Dextris
Producing DDI 3. 0 markup -- Getting started -Software to assist in document creation: • SPSS system to DDI 3. 0 converter: (See description and link on DDI 3. 0 Proof of Concept page) http: //www. ddialliance. org/DDI/ddi 3/proof. html
Producing DDI 3. 0 markup -- Getting started -XML editors • • o. Xygen: Create new DDI instance Edit/update DDI instance Validate DDI instance View schemas
DDI 3. 0: Viewing Schemas in o. Xygen
DDI 3. 0: Viewing Schemas in o. Xygen
Producing DDI 3. 0 markup -- Getting started -Other tools to assist in producing DDI 3. 0 markup: • DDI “core” template • Version 3. 0 documentation: – Module descriptions – Field level documentation – DDI Help Center http: //www. ddialliance. org/ddi 3/index. html
Producing DDI 3. 0 markup -- Using multiple modules -- Resource: “Getting Started with DDI 3. 0” http: //www. ddialliance. org/DDI/ddi 3/getting-started. html
DDI Version 3. 0 Displaying Markup Stylesheets: • Basic: Web presentation in XHTML • Enhanced: Adds graphics for presenting frequencies Automated calculation of valid percentages http: //www. ddialliance. org/DDI/ddi 3/proof. html
DDI Version 3. 0 Questions? Comments? • Sanda Ionescu: sandai@umich. edu • DDI Users Listserv: ddi-users@icpsr. umich. edu http: //www. ddialliance. org/codebook/listserv. html
The End
- Sanda puljiz vidović
- Sanda oswald
- Raspravljački esej
- Sanda puljiz vidović
- Tamara ionescu
- Tamara ionescu veterinar
- Liceul ecologic pitesti
- Andrei ionescu deloitte
- Ddi heat exchangers
- Ddi corp
- Movie ddi.to
- Leap 360 interim assessment answer key
- Ddi assessment
- Cupom cambial
- Api driven ddi
- Ddi
- Ddi-cdi
- Ddi acronym
- Ddi 3
- Ddi compendium
- Ddi heat exchangers inc
- Ddi early identifier
- Intro paragraph outline
- Learning without burden pics
- Introduction to yacc
- Dare essay 5th grade
- Academic essay
- Abstract vs introduction
- How long is a introduction
- Topics to write an email
- Bridge introduction paragraph
- Introduction answer the question
- Informative essay examples
- Editorial by
- How to write disadvantage and advantages essay
- Introduction of research example
- How to write diary
- Character analysis thesis examples
- Jane schaffer introduction paragraph example
- Conclusion for mesopotamian civilization
- Prayer for work immersion portfolio
- Wireless communication introduction
- Band introduction speech
- Introduction of personal selling
- What is humor
- Ending of informal email
- Theme essay conclusion
- Good introduction paragraph
- Smart note taker seminar
- Introduction to e business
- Introduction to semiconductors
- Introduction of infosys
- How is water transported in humans
- Introduction to acids and bases webquest
- Introduction to dynamic web content
- Pollution introduction
- Introduction of aristotle
- Introduction of purification of water
- Victorian poetry introduction
- Victorian era literature characteristics
- Introduction les contemplations de victor hugo
- Vegetable production introduction
- Anecdote introduction example
- Introduction of urinary system
- Uric acid introduction
- Inside order takers
- Epic poem the odyssey
- Oltp vs olap
- Enduring issues essay graphic organizer
- Difference between primitive and non primitive data types
- Introduction to computer organization and architecture
- Unit 1 introduction to accounting
- Online ups block diagram
- Basic principle of ultrasonic testing
- Hook in essays
- Cathode ray oscilloscope applications
- Tuesdays with morrie aphorisms
- Introduction of clutch
- Transistor c b e
- Introduction of transformer
- Mechanical transducer
- Romeo juliet introduction
- Introduction to traffic management
- Touch screen introduction
- Introduction of tpr
- Maximum torque of induction motor
- They say, i say examples
- Introduction to thermochemistry
- Therapeutic communication introduction
- Theme of a paragraph
- Web presentation outline design and management
- Story short
- The cars model
- Conclusion prayer
- Formula de poisson
- Thesis statement examples for the odyssey
- Who is homer
- Introduction sur le microscope
- The last lesson introduction
- Hook for a paragraph
- High school introduction paragraph
- Charter act of 1813
- Iliad by homer full story
- The great gatsby introduction essay
- Foreshadowing in the great gatsby chapter 4
- The giver synopsis
- Hook introduction examples
- Human digestive system introduction
- Alternating method thesis
- Introduction to heart
- Introduction body and conclusion
- Do i need a title page for apa
- Roadmap for essay
- What is team teaching
- System analysis and design
- Symposium advantages and disadvantages
- Jaguar land rover swot analysis
- Argumentative introduction example
- Welcome address for webinar sample
- Meghalaya introduction
- Peer review examples for students
- Internship portfolio introduction
- Introduction to stoichiometry
- Stations of the cross introduction
- Spm jiscmail
- Specific purpose statements
- Spectrophotometry in chemistry
- Introduction to waste management
- Introduction to waste management
- Solar tree introduction
- Software testing concepts
- Max weber introduction
- Society and culture pip examples
- Social media introduction
- Soapstone paragraph
- Small group communication
- A power miter saw combines a miter box with a
- Module 00105-15 introduction to construction drawings
- What are engineered plans for motors pumps
- Single entry system introduction
- Blinds simple machine
- Simple distillation introduction
- Madame butterfly opera genre
- Introduction of shg
- Sheet metal workshop introduction
- Sheet metal introduction
- She stoops to conquer introduction
- Introduction of hamlet by william shakespeare
- Common themes in shakespeare sonnets
- Seven segment display introduction
- Ooa&d
- Molecular model of service marketing
- Introduction to server side programming
- Dilution and dilution factor microbiology
- Define plant breeding
- Introduction of selling