The Dataset Project · emmanuel.rousseaux.me/Rousseaux2012a.pdf
The Dataset Project
Emmanuel Rousseaux
Institut d’études démographiques et du parcours de vie
Université de Genève
1211 Genève 4, Suisse
Rousseaux E. – The Dataset Project May 24, 2012 1/30
Outline
Introduction
The Dataset package
Perspectives
Introduction
- About the speaker
- NCCR LIVES
- Goals of the thesis project
- What will be presented today
Teaching and Research Assistant at the Department of Economics, SES, Unige
PhD directed by
- Gilbert Ritschard, iDemo, Unige
- Michel Léonard, ISS, Unige
(some) Research interests
- Decision trees
- Association rules mining
- Nature-based optimization algorithms
- Health sociology
- Cognitive psychology
NCCR LIVES "Overcoming vulnerability: life course perspectives"
I work within IP14, "Measuring life sequences and the disorder of lives", led by Gilbert Ritschard
IP14 aims to provide ad hoc methods for life course analysis, in order to gain more insight into the dynamics of vulnerability
Overview
- Providing a software framework for handling survey data in R
- Providing a software framework for handling life courses as a whole
- Providing new mining tools for rare events
  - Decision trees for the discovery of vulnerable profiles
  - Multi-channel association rules mining
- Applying these tools to gain new insight into poor health situations
Motivation
- Currently no framework to handle survey data in R
  - Possible in SPSS, SAS, Stata
However
- In these commercial software packages, no rigorous consistency tests
- No real standard for sharing datasets
- Many state-of-the-art methods are available in R only
Goals
- store and manipulate life course data
- sophisticated management of missing values
- automatic consistency tests
- tests of representativity with respect to the initial population
- user-oriented functions
- automatic summaries
Representativity is central. Efforts have to be made to help the user:
- a real structure for handling weights in the database
- representativity checks on each variable
- computing new weights to correctly balance a subdataset
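The last point can be sketched in base R. This is only an illustration of one common reweighting technique (post-stratification on a single variable), not the method or API of the Dataset package; the data, the column names and the helper `weighted_shares` are all hypothetical.

```r
# Hypothetical survey: one design weight per respondent.
survey <- data.frame(
  sex    = c("F", "F", "F", "M", "M", "M", "M", "F"),
  region = c("A", "A", "B", "A", "B", "B", "A", "B"),
  weight = c(1.2, 0.8, 1.0, 1.5, 0.5, 1.0, 1.0, 1.0),
  stringsAsFactors = FALSE
)

# Weighted share of each category of x (hypothetical helper).
weighted_shares <- function(x, w) {
  totals <- tapply(w, x, sum)
  totals / sum(totals)
}

# Target: weighted distribution of `sex` in the full sample.
target <- weighted_shares(survey$sex, survey$weight)

# Subdataset: respondents of region "A" only. Its weighted
# distribution of `sex` no longer matches the target.
sub <- survey[survey$region == "A", ]
current <- weighted_shares(sub$sex, sub$weight)

# Post-stratification: rescale the weights within each category so
# that the weighted shares in the subdataset match the target.
# Assumes every category of the subdataset also exists in the target.
adjust <- target[names(current)] / current
sub$new_weight <- sub$weight * as.numeric(adjust[as.character(sub$sex)])

rebalanced <- weighted_shares(sub$sex, sub$new_weight)
```

With these adjusted weights the subdataset reproduces the target shares for `sex` while keeping the same total weight. Balancing several variables at once would need an iterative scheme such as raking, which this sketch does not cover.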
Proposal: the 'Dataset' project
- 2 libraries in R:
  - 'Dataset': for cross-sectional survey data
  - 'stDataset': for spatio-temporal survey data
- Full S4 implementation
- https://r-forge.r-project.org/projects/dataset/
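To illustrate why S4 suits the goal of automatic consistency tests, here is a minimal sketch of an S4 class whose validity function runs each time an object is built. The class `SurveyVariable` and its slots are hypothetical and chosen for illustration; they are not the classes defined by the Dataset package.

```r
library(methods)

# Hypothetical class, for illustration only.
setClass(
  "SurveyVariable",
  slots = c(
    codes       = "numeric",   # observed codes, one per respondent
    values      = "numeric",   # valid codes, named by their label
    missings    = "numeric",   # codes standing for a kind of missing value
    description = "character"
  ),
  validity = function(object) {
    problems <- character(0)
    # A code cannot be both a valid value and a missing value.
    if (length(intersect(object@values, object@missings)) > 0) {
      problems <- c(problems, "a code is declared both as valid and as missing")
    }
    # Every observed code must be declared.
    unknown <- setdiff(object@codes, c(object@values, object@missings))
    if (length(unknown) > 0) {
      problems <- c(
        problems,
        paste("undeclared codes:", paste(unknown, collapse = ", "))
      )
    }
    if (length(problems) == 0) TRUE else problems
  }
)

# A consistent variable is created normally.
ok <- new(
  "SurveyVariable",
  codes = c(1, 2, 2, -1),
  values = c(male = 1, female = 2),
  missings = c(refused = -1),
  description = "Sex of the respondent"
)

# An inconsistent one (code 3 is undeclared) is rejected by new().
bad <- tryCatch(
  new(
    "SurveyVariable",
    codes = c(1, 3),
    values = c(male = 1, female = 2),
    missings = c(refused = -1),
    description = "Sex of the respondent"
  ),
  error = function(e) conditionMessage(e)
)
```

Because `new()` calls the validity function whenever slots are supplied, an inconsistent object cannot be constructed silently; the same check can be rerun later with `validObject()` after a slot has been modified.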
IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided
The Dataset package
In the Dataset package we mainly defined:
- the Variable object: stores one measure on individuals
- the Dataset object: stores the full output of the survey
The Variable object is represented by:
- codes: vector of codes, one for each individual
- missings: vector specifying the coding of missing values
- values: vector specifying the coding of valid cases
- description: a label

The Variable class is then specialized into different kinds of measures.
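The four slots listed above can be sketched as an S4 class. This is a minimal illustration, not the package's actual definition: the class name `VariableSketch`, the helper `validCodes()`, the slot types, and the code `-9` for "no answer" are all assumptions made for the example.

```r
library(methods)

# Hypothetical stand-in for the package's Variable class.
# Slot names follow the slide; slot types are assumed.
setClass(
  "VariableSketch",
  representation(
    codes = "numeric",        # one code per individual
    missings = "numeric",     # named codes that stand for missing values
    values = "numeric",       # named codes that stand for valid cases
    description = "character" # a label
  )
)

# Hypothetical helper: return the codes with declared missing codes set to NA.
validCodes <- function(v) {
  out <- v@codes
  out[out %in% v@missings] <- NA
  out
}

srh <- new(
  "VariableSketch",
  codes = c(1, 2, -9, 3, 2),
  missings = c("no answer" = -9),
  values = c("poor" = 1, "fair" = 2, "good" = 3),
  description = "Self-rated health"
)

valid <- validCodes(srh)
```

Keeping the missing-value coding next to the codes, rather than overwriting the codes with `NA` at import time, means the original survey coding stays available after recoding.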
Figure: Class diagram of objects inheriting from the Variable class. Classes shown, with the builder function where one is given:
- Variable (V)
- Quantitative (V)
- Scale: svar()
- Weighting: wvar()
- Timestamp: tvar()
- Categorical (V): cvar()
- Ordered (V)
- Nominal: nvar()
- Binary: bvar()
- Ordinal: ovar()
Standard builders for Variables:
- svar
- cvar
- ovar
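A builder is typically a thin function around `new()` that coerces and checks its input. The sketch below shows that pattern only. The names `VarSketch`, `ScaleSketch`, `OrdinalSketch`, `svar_sketch()` and `ovar_sketch()` are hypothetical, the real signatures of `svar()` and `ovar()` are not shown in the slides, and reading "(V)" on the class diagram as "virtual class" is an assumption.

```r
library(methods)

# Virtual parent class: it cannot be instantiated directly.
setClass(
  "VarSketch",
  representation("VIRTUAL", codes = "numeric", description = "character")
)

# Two concrete children, loosely modelled on the Scale and Ordinal classes.
setClass("ScaleSketch", contains = "VarSketch")
setClass("OrdinalSketch", representation(values = "numeric"), contains = "VarSketch")

# Hypothetical builder for a scale (quantitative) variable.
svar_sketch <- function(x, description = "") {
  new("ScaleSketch", codes = as.numeric(x), description = description)
}

# Hypothetical builder for an ordinal variable: every code must be declared.
ovar_sketch <- function(x, values, description = "") {
  stopifnot(all(is.na(x) | x %in% values))
  new("OrdinalSketch", codes = as.numeric(x), values = values,
      description = description)
}

age <- svar_sketch(c(34, 51, 27), description = "Age in years")
edu <- ovar_sketch(c(1, 3, 2),
                   values = c(low = 1, medium = 2, high = 3),
                   description = "Education level")
```

Both objects answer `TRUE` to `is(x, "VarSketch")`, so a method written once for the parent class applies to every kind of variable.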
Demo 1
Working with Variable objects
The Dataset object is represented by:
- variables: list of variables
- name: name of the dataset
- description: a long label
- row.names: names for rows
- weights: a variable used for weighting
- control: some control variables
- infos: a list for storing other information the user wants to share
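The slot list above maps naturally onto an S4 class with a validity check. The following is a sketch under assumptions, not the package's definition: the class name `DatasetSketch` and the slot types are guesses, variables are stored as plain numeric vectors, and `weights` is simplified to a numeric vector although the slide describes it as a variable.

```r
library(methods)

setClass(
  "DatasetSketch",
  representation(
    variables = "list",
    name = "character",
    description = "character",
    row.names = "character",
    weights = "numeric",
    control = "list",
    infos = "list"
  ),
  validity = function(object) {
    # Every variable must describe the same set of individuals.
    n <- vapply(object@variables, length, integer(1))
    if (length(unique(n)) > 1) {
      return("all variables must have the same length")
    }
    if (length(n) > 0 && length(object@weights) > 0 &&
        length(object@weights) != n[1]) {
      return("weights must have one entry per individual")
    }
    TRUE
  }
)

ds <- new(
  "DatasetSketch",
  variables = list(age = c(34, 51, 27), srh = c(1, 3, 2)),
  name = "toy",
  description = "Toy survey extract",
  row.names = c("id1", "id2", "id3"),
  weights = c(1.2, 0.8, 1.0)
)

# A dataset whose variables differ in length is rejected at construction.
bad <- tryCatch(
  new("DatasetSketch", variables = list(a = 1:3, b = 1:2)),
  error = function(e) conditionMessage(e)
)
```

The validity function is where a rectangular-data guarantee would live, since a plain list of variables does not enforce it the way a data.frame does.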
How to create a Dataset object:
- from an existing data.frame
- from an SPSS file
- by hand (as a list of Variable objects)
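To illustrate the first route, here is one way a data.frame could be turned into a list of variable objects, column by column. It is a sketch only: the class `VarSketch` and the function `column_to_var()` are hypothetical, and the package's own conversion function is not named in the slides.

```r
library(methods)

# Hypothetical, simplified variable class for this example.
setClass(
  "VarSketch",
  representation(codes = "numeric", values = "numeric", description = "character")
)

# Convert one data.frame column. Factors keep their level labels as the
# names of the `values` slot; other columns are stored as plain codes.
column_to_var <- function(x, label) {
  if (is.factor(x)) {
    lv <- levels(x)
    new("VarSketch",
        codes = as.numeric(x),
        values = setNames(as.numeric(seq_along(lv)), lv),
        description = label)
  } else {
    new("VarSketch",
        codes = as.numeric(x),
        values = numeric(0),
        description = label)
  }
}

df <- data.frame(
  age = c(34, 51, 27),
  srh = factor(c("good", "poor", "good"), levels = c("poor", "good"))
)

# One variable object per column, named after the column.
vars <- Map(column_to_var, df, names(df))
```

The resulting named list is the kind of object the third route ("by hand") would assemble manually.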
Demo 2
Preprocessing step: starting from a data set in an SPSS file, we want to get the data into R and create the data set for our study
importing - recoding - exporting
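The three steps can be sketched in base R. The demo itself starts from an SPSS file and uses the package's own functions, neither of which is reproduced in the slides, so this example substitutes a small CSV file written to a temporary location. The column names and the missing-value code `-9` are assumptions.

```r
# Stand-in for the source file: a small CSV plays the role of the SPSS file.
raw_path <- tempfile(fileext = ".csv")
write.csv(data.frame(id = 1:4, srh = c(1, 5, -9, 3)), raw_path,
          row.names = FALSE)

# Importing
raw <- read.csv(raw_path)

# Recoding: declare -9 as missing, then collapse the scale into two groups
# (codes 4 and above count as poor self-rated health in this toy example).
srh <- ifelse(raw$srh == -9, NA, raw$srh)
study <- data.frame(id = raw$id, poor_srh = as.integer(srh >= 4))

# Exporting
out_path <- tempfile(fileext = ".csv")
write.csv(study, out_path, row.names = FALSE)

# Reading the export back shows that the recoded variable survives the trip.
reloaded <- read.csv(out_path)
```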
Native operations on Dataset/Variable objects:
- recoding Variables
- bivariate analysis
- logistic regression
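For orientation, the same three operations look as follows on plain vectors in base R. This is not the package's interface, which the slides do not show: the simulated data, the variable names and the cut points are all assumptions.

```r
set.seed(1)
n <- 200

# Simulated survey answers (hypothetical coding: 1 = very good ... 5 = very poor).
age <- round(runif(n, 20, 80))
srh_code <- sample(1:5, n, replace = TRUE)

# Recoding: a binary outcome and an age grouping.
poor_srh <- as.integer(srh_code >= 4)
age_group <- cut(age, breaks = c(20, 40, 60, 80), include.lowest = TRUE)

# Bivariate analysis: contingency table and chi-squared test of independence.
tab <- table(age_group, poor_srh)
test <- chisq.test(tab)

# Logistic regression of the binary outcome on age.
fit <- glm(poor_srh ~ age, family = binomial)
coefs <- coef(fit)
```

Operating on Variable objects instead of bare vectors would let labels and missing-value codings follow the data into the output tables.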
Demo 3
Launching analysis
bivariate analysis - logistic regression
Perspectives
Finish up implementing:
- Weights management tools
- Basic spatial tools
- Easy data capture of a survey
- (Detection/correction of bugs)
Taking spatial data into account
Figure: Poor/Good SRH ratio. PSM 2011, wave 2010 (unweighted)
stDataset: handling longitudinal survey data
- Same design as the Dataset package
- Longitudinal summary (in PDF)
- Manipulating trajectories directly
- Construction of a "life course" object ready for analysis
Presentations forthcoming
- (LACOSA) The 'Dataset' Project (poster)
- (SRSSS) Manipulating panel data with stDataset
- (SRSSS) Handling weights with Dataset and stDataset, illustration with the PSM
Elements of bibliography I
[Voorpostel et al.] Voorpostel, M., Tillmann, R., Lebert, F., Weaver, B., Kuhn, U., Lipps, O., Ryser, V.-A., Schmid, F., Rothenbühler, M., and Wernli, B. Swiss Household Panel Userguide (1999-2010), Wave 12. Lausanne: FORS (October 2011).
Thank you for your attention
Any questions?