2012.10 - workshop on semantic statistics - 1



Page 1: 2012.10 - Workshop on Semantic Statistics - 1

Leveraging the DDI Model for Linked Statistical Data in the Social, Behavioural, and Economic Sciences

Workshop on Semantic Statistics

15.10.2012 – 19.10.2012

Thomas Bosch

M.Sc. (TUM)

postgraduate student

http://boschthomas.blogspot.com

GESIS - Leibniz Institute for the Social Sciences

Page 2: 2012.10 - Workshop on Semantic Statistics - 1

Agenda

Page 3: 2012.10 - Workshop on Semantic Statistics - 1

Why DDI as Linked Data?

• No such ontology is currently available

• To increase visibility of data holdings using mainstream Web technologies

• To open DDI to the Linked Data community

• To process DDI-RDF with RDF tools

• To link DDI-RDF to other RDF data

• To better identify opportunities for merging datasets

• To enable inferencing

• To research microdata within the LOD cloud

Page 4: 2012.10 - Workshop on Semantic Statistics - 1

How was the DDI Ontology developed?

• DDI subset

  • of the most important DDI elements

• Use cases

  • Experts in the statistics domain formulated the use cases seen as most significant for solving frequent problems

  • Most important use case: discover microdata connected with multiple studies

• Map existing DDI-XML documents to DDI-RDF automatically

  • Direct mapping

  • Generic mapping (Bosch and Mathiak, 2011)
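
As an illustration of the direct-mapping idea above, the following minimal Python sketch walks a small XML fragment and emits one statement per element. The element names, the `ex:` prefix, the mapping rule, and the `direct_map` helper are all made up for this example; they are not taken from the DDI schema or the DDI ontology, and the generic mapping of Bosch and Mathiak (2011) is not shown.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified XML fragment; real DDI-XML is namespaced and far richer.
SAMPLE_XML = """
<Study id="study1">
  <Title>Example Survey 2012</Title>
  <Variable id="var1">
    <Name>AGE</Name>
    <Label>Age of respondent</Label>
  </Variable>
</Study>
"""

def direct_map(element, prefix="ex"):
    """Return (subject, predicate, object) triples for an element tree.

    Assumed rule (for illustration only): an element carrying an `id`
    attribute becomes a resource; a child without `id` becomes a
    literal-valued property of its parent.
    """
    subject = f"{prefix}:{element.get('id')}"
    triples = [(subject, "rdf:type", f"{prefix}:{element.tag}")]
    for child in element:
        if child.get("id") is not None:
            # Nested resource: link parent to child, then map the child recursively.
            child_subject = f"{prefix}:{child.get('id')}"
            triples.append((subject, f"{prefix}:has{child.tag}", child_subject))
            triples.extend(direct_map(child, prefix))
        else:
            # Leaf element: emit its text content as a literal.
            text = (child.text or "").strip()
            triples.append((subject, f"{prefix}:{child.tag.lower()}", text))
    return triples

triples = direct_map(ET.fromstring(SAMPLE_XML))
for s, p, o in triples:
    print(s, p, o)
```

A direct mapping of this kind mirrors the XML structure one-to-one, which keeps the conversion automatic but ties the resulting statements closely to the shape of the source document.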

Page 5: 2012.10 - Workshop on Semantic Statistics - 1

Discovery Use Case

• Which studies are connected with a specific coverage consisting of the 3 dimensions: time, country, and subject?

• What questions with a specific question text are contained in the study questionnaire?

• What questions are connected with a concept with a specific label?

• What questions are combined with a variable with an associated coverage consisting of the 3 dimensions: time, country, and subject?

• What concepts are linked to particular variables or questions?

• What representation does a specific variable have?

• What codes and what categories are part of this representation?

• What variable label does a variable with a particular variable name have?

• What is the maximum value of a certain variable?

• What are the absolute and relative frequencies of a specific code?

• What data files contain the entire dataset?
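
Several of the questions above amount to matching patterns against RDF statements. The sketch below uses plain Python tuples in place of an RDF store and a SPARQL engine; the `ex:` property names, the sample data, and the `match` and `questions_for_concept_label` helpers are hypothetical and do not reflect the DDI ontology's actual vocabulary.

```python
# Hypothetical triples; the ex: terms are placeholders, not DDI ontology terms.
TRIPLES = [
    ("ex:q1", "ex:questionText", "How old are you?"),
    ("ex:q1", "ex:hasConcept", "ex:age"),
    ("ex:q2", "ex:questionText", "In which country do you live?"),
    ("ex:q2", "ex:hasConcept", "ex:residence"),
    ("ex:age", "ex:label", "Age"),
    ("ex:residence", "ex:label", "Country of residence"),
    ("ex:var1", "ex:hasQuestion", "ex:q1"),
    ("ex:var1", "ex:variableName", "AGE"),
]

def match(triples, s=None, p=None, o=None):
    """Return the triples that agree with every non-None position."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]

def questions_for_concept_label(triples, label):
    """Which questions are connected with a concept that has the given label?"""
    # Step 1: find the concepts carrying the label.
    concepts = [s for s, _, _ in match(triples, p="ex:label", o=label)]
    # Step 2: find the questions linked to any of those concepts.
    return sorted(
        q
        for c in concepts
        for q, _, _ in match(triples, p="ex:hasConcept", o=c)
    )

print(questions_for_concept_label(TRIPLES, "Age"))
```

In an RDF setting the same lookup would typically be written as a SPARQL query with two triple patterns joined on the concept variable.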

Page 7: 2012.10 - Workshop on Semantic Statistics - 1

study | coverage

Page 9: 2012.10 - Workshop on Semantic Statistics - 1

instrument | question | concept

Page 12: 2012.10 - Workshop on Semantic Statistics - 1

values | value labels

Page 15: 2012.10 - Workshop on Semantic Statistics - 1

variable | descriptive statistics

Page 18: 2012.10 - Workshop on Semantic Statistics - 1

logical dataset | dataset | data file

Page 21: 2012.10 - Workshop on Semantic Statistics - 1

conceptual model

Page 23: 2012.10 - Workshop on Semantic Statistics - 1

Open Issues

• DDI Ontology URL and Prefix

• DC namespace

• Naming Conventions

• Cardinalities

• Consistency Check

• Universe vs. Coverage

• DescriptiveStatistics

• Study Groups

• Classes

• Datatype Properties

• Object Properties

Page 24: 2012.10 - Workshop on Semantic Statistics - 1

Thank you for your attention!