GSBPM and Data Life Cycle

Generic Statistical Business Process Model

GSBPM – Data Integration

Generic Statistical Information Model - GSIM

GSIM and GSBPM

GSIM – reference colours

GSBPM - Data Life Cycle

Specify Needs – Blue Path - Business

Functionality – Blue Path - Business
Overall analysis of all available data and metadata

Supported by interpretation and analysis layer

Design – Blue Path - Business

Functionality – Blue Path - Business
Interaction between the statistical framework and the interpretation layer, e.g. sampling
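As an illustration of the sampling example mentioned on this slide, here is a minimal sketch of drawing a simple random sample from a sampling frame. The frame contents, the function name `simple_random_sample` and the seed are hypothetical and not taken from the slides; a real design would normally use a stratified or otherwise more elaborate scheme.

```python
import random


def simple_random_sample(frame, n, seed):
    """Draw n distinct units from the frame without replacement."""
    if n > len(frame):
        raise ValueError("sample size exceeds frame size")
    rng = random.Random(seed)  # fixed seed so the draw can be reproduced
    return rng.sample(frame, n)


# Hypothetical sampling frame of 100 statistical units
frame = [f"unit_{i:03d}" for i in range(100)]
sample = simple_random_sample(frame, 10, seed=42)
```

Storing the seed together with the sample design is one way to make the selection repeatable when the design is reviewed.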

Functionality – Blue Path - Business
These sub-processes can create active and passive metadata

Build – Yellow Path - Structures

Functionality – Yellow Path - Structures
For statistical outputs produced on a regular basis, this phase usually occurs for:
• First iteration
• Following a review
• Following a change in methodology

Functionality – Yellow Path - Structures
Each new output production line is basically a workflow configuration
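The idea of a production line as a workflow configuration can be sketched as follows. This is only an assumed illustration: the step names, the `STEP_REGISTRY` dictionary and the `run_line` function are hypothetical and do not come from the slides. The point is that a new production line is defined by data (an ordered list of step names) rather than by new code.

```python
from typing import Callable, Dict, List

Step = Callable[[list], list]

# Hypothetical library of reusable processing steps
STEP_REGISTRY: Dict[str, Step] = {
    "drop_missing": lambda rows: [r for r in rows if r is not None],
    "deduplicate": lambda rows: list(dict.fromkeys(rows)),
    "sort": lambda rows: sorted(rows),
}

# A production line is just a named, ordered list of steps
production_line = {
    "name": "monthly_output",
    "steps": ["drop_missing", "deduplicate", "sort"],
}


def run_line(config: dict, rows: List) -> List:
    """Apply the configured steps to the rows, in order."""
    for step_name in config["steps"]:
        rows = STEP_REGISTRY[step_name](rows)
    return rows


result = run_line(production_line, [3, None, 1, 3, 2])
```

Under this assumption, a change in methodology would mean editing the `steps` list rather than rewriting the pipeline.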

Collect – Red Path - Exchange

Functionality – Red Path - Exchange
Typically this phase does not include any data transformations

Functionality – Red Path - Exchange
• Controlled Data Collection, e.g. our survey
• Uncontrolled Data Collection, e.g. administrative sources; may involve a mapping <T, S, m>:
  T – Target schema
  S – Source schema
  m – mapping
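The triple <T, S, m> can be sketched as a small data structure. This is a minimal sketch under assumptions: the class name `SchemaMapping`, the field names and the sample record are hypothetical, and m is modelled here as a plain field-renaming table so that no data values are transformed during collection.

```python
from dataclasses import dataclass
from typing import Any, Dict, Tuple

Record = Dict[str, Any]


@dataclass(frozen=True)
class SchemaMapping:
    target: Tuple[str, ...]  # T: field names of the target schema
    source: Tuple[str, ...]  # S: field names of the source schema
    m: Dict[str, str]        # m: target field -> source field

    def __post_init__(self) -> None:
        # Every target field needs a rule, and every rule must read a source field
        if set(self.m) != set(self.target):
            raise ValueError("m must define exactly one rule per target field")
        if not set(self.m.values()) <= set(self.source):
            raise ValueError("m refers to fields outside the source schema")

    def apply(self, record: Record) -> Record:
        """Rename the fields of a source record into the target schema."""
        return {t: record[s] for t, s in self.m.items()}


# Hypothetical administrative source mapped onto a hypothetical target schema
mapping = SchemaMapping(
    target=("unit_id", "turnover"),
    source=("vat_number", "declared_turnover", "office_code"),
    m={"unit_id": "vat_number", "turnover": "declared_turnover"},
)

mapped = mapping.apply(
    {"vat_number": "IT123", "declared_turnover": 12000, "office_code": "A7"}
)
```

Source fields that have no counterpart in T (here `office_code`) are simply not carried over.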

Process – Red Path - Exchange

Functionality – Red Path - Exchange
The typical ETL phase of a DWH happens in the Integration Layer
Sub-process 5.1 “integrate data” connects different sources and uses the provider management to update the business register status asynchronously
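A rough sketch of what an "integrate data" step might look like is given below. Everything in it is an assumption made for illustration: the two sources, the rule that survey values win on conflict, and the use of a simple queue as a stand-in for asynchronous updates of the business register. The slides do not specify how provider management works.

```python
from collections import deque


def integrate(survey: dict, admin: dict) -> dict:
    """Merge two sources keyed by unit id; survey values win on conflict."""
    integrated = {}
    for unit_id in sorted(set(survey) | set(admin)):
        record = dict(admin.get(unit_id, {}))
        record.update(survey.get(unit_id, {}))
        integrated[unit_id] = record
    return integrated


# Hypothetical inputs: a controlled survey and an administrative source
survey = {"U1": {"employees": 12}, "U2": {"employees": 5}}
admin = {"U2": {"employees": 4, "status": "active"}, "U3": {"status": "closed"}}

integrated = integrate(survey, admin)

# Status changes are queued first and applied to the register later,
# which stands in for the asynchronous update described on the slide.
pending = deque(
    (unit_id, record["status"])
    for unit_id, record in integrated.items()
    if "status" in record
)
register = {"U2": "unknown", "U3": "active"}  # hypothetical business register
while pending:
    unit_id, status = pending.popleft()
    register[unit_id] = status
```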

Analyze – Green Path – Concepts

Functionality – Green Path – Concepts
The flow is bidirectional: all non-consolidated concepts must first be created and tested directly in the interpretation and analysis layer
The Analysis phase includes:
• scrutinizing the primary data
• interpretation to support the data, evaluating how well the outputs fit the initial expectations

Disseminate – Red Path - Exchange

Functionality – Red Path - Exchange
Manages the release of the statistical products to customers
For statistical outputs produced regularly, this phase occurs in every iteration
Occurs in the Access Layer

GSBPM - Data Life Cycle

Creative Commons
Thanks to UNECE on GSIM: https://statswiki.unece.org/display/gsim/Generic+Statistical+Information+Model
Thanks to UNECE on GSBPM: https://statswiki.unece.org/display/GSBPM+Training+Materials
Thanks to Centre of Excellence on Data Warehousing: http://ec.europa.eu/eurostat/cros/content/centre-excellence-data-warehousing