policy meets social & decision informatics · social and decision analytics laboratory social...
TRANSCRIPT
SOCIAL AND DECISION ANALYTICS LABORATORY
SOCIAL AND DECISION ANALYTICS LABORATORY
Stephanie Shipp, Deputy Director and Research Professor [email protected]
Social and Decision Analytics Laboratory http://sdal.vbi.vt.edu/
Policy Meets Social & Decision Informatics
An Overview of
The Fields Institute for Research in Mathematical Sciences Workshop on Big
Data for Social Policy
June 12, 2015
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
Overview of Big Data and Social Policy Workshop
Day 1 – Big Data and Official/Government Statistics Day 2 – Network Models and Agent Based Modeling Day 3 – Living Analytics and Privacy Day 4 – Urban Analytics
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
Defining Big Data is not easy…..
• “The definition of Big Data is an imprecise description of a rich and complicated set of characteristics, practices, techniques, ethical issues, and outcomes all associated with data.” AAPOR 2015.
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
It all starts with data: The All Data Revolution
1. Volume 2. Velocity 3. Variety
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
Then there was 4
1. Volume 2. Velocity 3. Variety 4. Veracity
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
And 5
1. Volume 2. Velocity 3. Variety 4. Veracity 5. Value
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
Now 7
1. Volume 2. Velocity 3. Variety 4. Veracity 5. Value 6. Variability 7. Visualization
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
All Data & Social Informatics
• Slogging thru the Hard Stuff – People are in all the loops – Problems are not ‘puzzles’ to be
solved
• Need questions to drive the informatics
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
Big Data and Official/Government Statistics - Disruptive Innovations -- Game Changers
- • 1790 - First U.S. census • 1871 - First national census for country of Canada • 1894/1902 – U.S. Bureau of Labor Statistics & Census Bureau created • 1930s – Statistically designed surveys • 1971 – Statistics Canada created • 1970s – Quiet revolution (role of analysts for policy) • 1980s - Increasing deployment of longitudinal surveys • 1990s – Network analysis, ABMs, GIS, machine learning, ….. • 2010s – Big All Data revolution and applications to govt at all levels, companies, and
academia
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
Big Data and Social Policy – Opportunities Compared to survey data, big (digital, organic, all) data can improve:
• Small area estimates • Unit cost is low • Real-time availability • Finer granularity of data • Tracking and understanding behavior • Opportunities for natural experiments • Increased capacity - advances in computational methods - ABM, network
analysis, machine learning, GIS, other computer and statistical methods that can ingest and process massive amounts of data
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
Big Data and Social Policy Challenges • Lack of ‘common good’ approach to access data (Groves) • Lack of skills – need for a data savvy population • No control over data quality; unknown error structure • Lack of standards - data quality framework, replicability, other… • Lack of transparency • Privacy and confidentiality –
– Proposed Canadian legislation to amend human rights law to clarify that discrimination based on genetic tests is prohibited
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
Real Questions vs. Perceived Question Type
Jeffery T. Leek, and Roger D. Peng Science 2015;347:1314-1315
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
Lessons of the Past Keep Are Still Relevant Today!
Competing Ingredients • Random Innovation • Systematic Thinking for Social Action
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
Lessons of the Past Keep Getting Re-Printed!
Competing Ingredients • Random Innovation • Systematic Thinking for Social Action
"Theory without data is myth; data without theory is madness.” Phil Zuckerman
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
Mash It Up with Bioinformatics
ontariowildflower.com
Locust Photo by Ocean/Corbis
bioweb.uwlax.edu
DNA is Middleware
Dynamics will overrun global optimization
CAD CallSummary
Unit MovementSummary
IncidentSummary
NFIRS,Med Report
Incident OccursCall
Arrives
@ EOC
Unit(s)
Dispatc
hed
Unit(s)
Arrive
Inciden
t Close
d
Post Inc
ident
SocialMedia
MediaReports
Inte
rnal
Dat
aEx
tern
al D
ata
Flow
of E
vent
s
CAD CallSummary
Unit MovementSummary
IncidentSummary
NFIRS,Med Report
Incident OccursCall
Arrives
@ EOC
Unit(s)
Dispatc
hed
Unit(s)
Arrive
Inciden
t Close
d
Post Inc
ident
SocialMedia
MediaReports
Inte
rnal
Dat
aEx
tern
al D
ata
Flow
of E
vent
s
CAD CallSummary
Unit MovementSummary
IncidentSummary
NFIRS,Med Report
Incident OccursCall
Arrives
@ EOC
Unit(s)
Dispatc
hed
Unit(s)
Arrive
Inciden
t Close
d
Post Inc
ident
SocialMedia
MediaReports
Inte
rnal
Dat
aEx
tern
al D
ata
Flow
of E
vent
s
CAD CallSummary
Unit MovementSummary
IncidentSummary
NFIRS,Med Report
Incident OccursCall
Arrives
@ EOC
Unit(s)
Dispatc
hed
Unit(s)
Arrive
Inciden
t Close
d
Post Inc
ident
SocialMedia
MediaReports
Inte
rnal
Dat
aEx
tern
al D
ata
Flow
of E
vent
s
CAD CallSummary
Unit MovementSummary
IncidentSummary
NFIRS,Med Report
Incident OccursCall
Arrives
@ EOC
Unit(s)
Dispatc
hed
Unit(s)
Arrive
Inciden
t Close
d
Post Inc
ident
SocialMedia
MediaReports
Inte
rnal
Dat
aEx
tern
al D
ata
Flow
of E
vent
s
CAD CallSummary
Unit MovementSummary
IncidentSummary
NFIRS,Med Report
Incident OccursCall
Arrives
@ EOC
Unit(s)
Dispatc
hed
Unit(s)
Arrive
Inciden
t Close
d
Post Inc
ident
SocialMedia
MediaReports
Inte
rnal
Dat
aEx
tern
al D
ata
Flow
of E
vent
s
Loyal Customers or Loyal Loca)ons?
SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY
Ensor, et al., Circulation, Volume 127(11):1192-1199 American Heart Association
Expanding to Population Dynamics: Houston, TX
High Performance Computing
High Performance Computing
Data
Data Storage and
Warehousing
Air Quality Model
High Performance Computing
Data Data Storage and
Warehousing
Synthetic Population Information Model
Database
Database
Monitor Readings
LandScanHERENCESACS
U.S. Census
American Community
Survey
National Household
Travel Survey
Population synthesis using Iterative
Proportional Fitting
Activity assignments using Fitted Value
Methods
Location choice using Gravity Models
Baseline Synthetic
Population for Houston
Concentration Levels by Hour
and Activity Location
Temporal and Geographic
Concentrations using Inverse
Distance Weighting
in silico Platform for Environmental CouplingPersonal Exposure Model
Exposure Calculations by Individual and
Activity Location
Inputs
Exposure by
Individual
Individual Temporal and Geographic
Activity Patterns
Temporal and Geographic
Ozone Concentrations
10:00 am, August 26, 2008 High Performance Computing
High Performance Computing
Data
Data Storage and
Warehousing
Air Quality Model
High Performance Computing
Data Data Storage and
Warehousing
Synthetic Population Information Model
Database
Database
Monitor Readings
LandScanHERENCESACS
U.S. Census
American Community
Survey
National Household
Travel Survey
Population synthesis using Iterative
Proportional Fitting
Activity assignments using Fitted Value
Methods
Location choice using Gravity Models
Baseline Synthetic
Population for Houston
Concentration Levels by Hour
and Activity Location
Temporal and Geographic
Concentrations using Inverse
Distance Weighting
in silico Platform for Environmental CouplingPersonal Exposure Model
Exposure Calculations by Individual and
Activity Location
Inputs
Exposure by
Individual
Individual Temporal and Geographic
Activity Patterns
Temporal and Geographic
Ozone Concentrations
10:00 am, August 26, 2008 High Performance Computing
High Performance Computing
Data
Data Storage and
Warehousing
Air Quality Model
High Performance Computing
Data Data Storage and
Warehousing
Synthetic Population Information Model
Database
Database
Monitor Readings
LandScanHERENCESACS
U.S. Census
American Community
Survey
National Household
Travel Survey
Population synthesis using Iterative
Proportional Fitting
Activity assignments using Fitted Value
Methods
Location choice using Gravity Models
Baseline Synthetic
Population for Houston
Concentration Levels by Hour
and Activity Location
Temporal and Geographic
Concentrations using Inverse
Distance Weighting
in silico Platform for Environmental CouplingPersonal Exposure Model
Exposure Calculations by Individual and
Activity Location
Inputs
Exposure by
Individual
Individual Temporal and Geographic
Activity Patterns
Temporal and Geographic
Ozone Concentrations
26
!"#$%&'( !)*+',,)$-%* .*/%(&)$-%*
!)((-'( .*/%(&)$-%*
0-*-#1'2 3(%2"+$ .*4'*$%(5
!"#$%&'( 6',-4'(5 )*2 71-8&'*$!"#$%&'( 9'$"(*# )*2 !)*+',,)$-%*#
!"#$%&'( :(2'(#
3(%2"+$-%* 6)$)
9); <)$'(-),# .*/%(&)$-%*
0-*-#1'2 3(%2"+$ .*/%(&)$-%*
9); <)$'(-),# .*4'*$%(5
='*2%( .*/%(&)$-%*
!"#$%&'( .*/%(&)$-%*
!"#$%&'(
)&$( *
+,-.&$/
+0"!1-%
+0"!1-%2&! *
3,%-4
+0"!2!,56
,7
6%,0%2!,%(%&'( *
($!2!,%(%&'(
6(%
'%38
'%%0
,7/26!
,72)"6
6%"+6
%"%,)201$%&'(
02%9:
%"%,)21+%&'(
%"%,)2!"#$%&'(
02:
02%
6%"+62/0(,%(021%;
6%"+62/0(,%(021%9:
8,1)%26%,0%6
6107&7,)2;<:
6107&7,)2;;=
6"10-(28&)(
!"#$%&'%()*+
,-.'%/-*#/-(' 0
"*)/1,(#$/ 0
"*)/1,(#/'2,
31)4#()(-.#".-%(#2(),5#,2
21#()(-.#".-%(#2(),5
12/#()(-.#".-%(#2(),5
31)4#4*"#-&-$.-3.'#,2
31)4#61-.$(+#$%2"',($)%#,2
31)4#3.),5'/#,2
31)4#2-!'(+#2(),5#,2
31)4#4-⌧#2(),5
2)1*,'#!$.'
!"#"$%&'()*+',-.$
"./-+' 0
"./'$-
".&/(1*+,)
)233&.(.4)&
-2$&$()&*(324&*
324&*$()&*()233&.
)233&.(5"'.%
)233&.(%&"1%.
$,()&*()233&.
23)(!&2$"63&
$+,*-&(!"3& 00
!"#$%&'(&)*+,-.
#/$(0!&
+-1(0!&
.#%"&-(02/- 33
!"#$%&'()"#%!*
)"#%!(&#$+ ,
)"#%!($+*&"-!'-#.
*#%"&+(/-0+ ,,
!"##$%&%'!$(
!"##$%&%'!$ )
*++,&%'!$
,-.$/(-+/
(%"01"2#$
0+..$/%(
(+340$&5-#$ ))
!"#
#$%&'()* +
)*,'!-"%-(.
%-*#&/!(0"
-%*#&/!(0"
"$11*%,&(!&.(&"$11*%,
2-/2
3%4&"*!&1$4*!
3%4&"*!&"$11*%
5-)%2
)*"%2
"$11*%&%4"*
#$⌧&,%$"1*
0.-%&(7&0,$/*
)*7$01%&3$&,%$%0,
#$⌧&"$11*%,&)!-8*)&,$#*&%-#*
/!(0"&)*,'
8*.)(!&'()*
8*.)(!&.$#*
*30-"#*.%&7(!9&:&(!&'1$#"&;
,(0!'*&7-1* ++
!"#$%&'(%$")
!"#$%&'(%$"*
!"#$%&'#+,"
)%-&("'./0" **
!"#
$%&
'()*
"%+&*##,+%-*% .
"%+-/&*-,&#
"'0))*-,&#
-(11,&#
"&2,-(11,&#
"%+-/&2(+),#20%2
"%+-/&2(+),*)-
-(11,#20%2
-(11,*)-
3(2,!(##
()&'/-*-,*⌧&'/-*-
"'0))*-,"%+-/&2(+),#20%2
"'0))*-,"%+-/&2(+),*)-
5*0%,6**7
5*0%,!+)23
%*0#+),!(##
&+!!*)2,!(##
(-
5*0%
#+/%&*,1('*
!"#
!$#%&'$( )
&%*+"&,%
-&+,%**.+&/%& )
/%*,&'-#'+0
"+!
-($00%/.1#2.3+&.3-
/%('4%&%/.1#2.3+&.3-
*#$0/."*$5%.1#2
$,#"$(."*$5%
*,&$-.-,#
-&+/",#'+0."*$5%
*#$0/."*$5%.1#2.6+.*,&$-
-&+/."*5.6+.*,&-
1#2.4$&.6.*,&$-
1#2.4$&.6+.*,&$-
-,#.4$&.6.*,&$-
4$(.4$&.6.*,&$-
4$(.4$&.6+.*,&$-
-,#.4$&.6+.*,&$-
*#$0/$&/.-&',%
3'0'*7%/.-&+/",#
+&'5'0.5&+"-
2222!!
*+"&,%.3'(%
!"#$%&'(&)*!+
,-.(&/-!$/-)( 0
#-)$,*/( 0
#-)$/(1,
2*#
32*#$)*)-.$".-&)$1)*,4
21/$)*)-.$".-&)$1)*,4
32*#$1)*,4$%&$)!-&1%)
32*#$#!"$-'-%.-3.(
32*#$52-.%)+$%&1"(,)%*&
32*#$3.*,4(/
32*#$1-6()+$1)*,4
32*#$#-⌧$1)*,4
1*2!,($6%.(
!"#$"%&'()*
%)+$%&#!,-.$/0 1
%!+#&0!&2)"03
!"#$"&#)0$
"$4-$%0$#&#$+5'$"3&#)0$
/$0&')+-$
"$+$)%$&,"$#50&')+-$
.)0$"5)+&%0)65/6&)')5+)(5+503&#)0$
!"#$"%&'()*
%)+$%&#!,-.$/0 1
%)+$%&#!,-.$/0&20$. 1
.)0$"2)+&/!
/$0&')+-$&!"#$"&20$.
/$0&*"2,$
!"#$"&#)0$
!"#$"&3-)/0204
!"252/)+&!"#$"&3-)/0204
!"#$%"&'
!"#$%"&'($! )
!"#$%"&'($*"+ )
&",(!-.
&",($*"+
!/*"(-,(+/01,/.*1&"
+/*"&$/#(2*/3$03(/%/$#/4$#$*'
+/*"&$/#(2*/3$03(*$+"
!"#$%"&'(!/*"
/.*1/#(51/0*$*'(!"#$%"&"! !"#$%"&'()"*!"&
!"#$%"&'($! +
,)$-(./
,/#!(./(-*&.'
!"#$%"&'(!*."
.$0"(/1(!"#$%"&'
-$23$45(!*."
-$23$45(.$0"
#/*!$45 !*."
*2.6*#(5//!,(0/%"0"4.
.&*4,-/&.*.$/4(-#*44$45(
.&*4,-/&.*.$/4(-#*44$45(.
2/0-#".$/4(!*."
2/0-#".$/4(.$0"
!"#$%&'()"&*+&,
!"#$%&'()#+
+*(&),&-.,+)-,&*(&+
(#%&),&-.,+)-,&*(&+
&'+)./)$0*''#'1)+*(&
&'+)./)$0*''#'1)(#%&
$0*''&+)-"&-2)#')+*(&
$0*''&+)-"&-2)#')(#%&
*-(3*0)-"&-2)#')+*(&
-3,,&'()-"&-2)#')(#%&
-3,,&'()!(*,()./)0.*+#'1)+*(&
*-(3*0)0.*+#'1)!(*,()(#%&
*-(3*0)&'+)./)0.*+#'1)+*(&
*-(3*0)&'+)./)0.*+#'1)(#%&
-3,,&'()!"#$%&'()-.%$0&(#.')+*(&
$0*''&+)(,*'!$.,(*(#.')!(*,()(#%&
-3,,&'()!(*,()./)!"#$%&'()+*(&
*-(3*0)(,*!'$.,(*(#.')!(*,()(#%&
$0*''&+)(,*'!$.,(*(#.')&'+)(#%&
!"#$%"&'()*(+,$-."/)
!"#$%"&'($! 0
+,$-."/)($!
!"#$%&'(#
#)*+,$% -
!"#$%&'(,./&'
#$(''$
!*$0
!%".$(0
#%"(!',1*2' --
!"#$%&'%!'()%&'
!*(+!%',-*.#/*&#'. 0
!*(+!%',-%)+!1
12!&'3+,%!'()%&' 0
12!&'3+,%!'()%&'%)+!1
12!&'3+,%!"#$%&' 0
12!&'3+,%!"#$%&'%)+!1
!'2,1+%4#(+ 00
!"##$%#&!'(%)
!"##$%#&!'(% *
!"##$%#&+",%
-#./$+
)'0#!%&1$/% **
!"#$!%&
'(#")*$&#+,"!)*+,-.(/%#
!"#$!%)'"./0"!1)/'
*$&#+,"!)/'
-!+'$*#)/')+!/2/%(.
3#1)+!2/%(..1)&4/--"'
$+,)+!/2/%(.
!"#$!%)+!'"!)/'
+!2/%/(.)'"./0"!1)/'
*()&$5#+#(.
'/&-$#")*(&"
-+,*+'"
!!)4"('"!
'(#")'"./0"!1
4($."!
!#*/&
*+.."*#/+%)!#*/&
-!+'$*#)/')!"#$!%
3#1)!"#$!%"'
$+,)!"#$!%
3#1)-(.."#&
!"(&+%
-+&#
*+$%#!1 66
&+$!*")7/."66
!"#!$%%"&'(#)
*"&$
)+,-"!&(.
".&'!%$,#(
/.(*+!&,!"&$0(.1
*$)!.'/&'(#
0$)&.,2&1,'#,3$
'&$4,3$
!+&,2&1,'#,)+
)5'/4$#&,#(
!+)&(4$.
!+)&(4$.,'*
(--'!'"%,.((&,!"+)$,!(*$
!(..$!&$*,.((&,!"+)$,!(*$
5"%%,'#,6".$5(+)$
!+)&(4$.,.$%$3"#&
.((&,!"+)$
*$&"'%)
)(+.!$,-'%$77
!"#!$%%"&'(#)*+((&*!",)$
+((&*!",)$*!(-$.
-$)!+'/&'(#.
-$)!+'/&'(#*#$0
)(,+!$*1'%$..
!"#!$%%"&'(#)*)(%+*&(*),'-*&(
!.)&(/$0
!.)&(/$0*)(%+*&(1
!.)&(/$0*),'-*&(1
!.)&(/$0*&2-$
)(.0!$*3'%$11
!"#!$%%"&'(#)*+"%%$&)
!,)&(-$.*'/0
!,)&(-$.*#"-$
!,)&(-$.*%(!"&'(#
!,)&(-$.*1*22*+"%%$&)
)(,.!$*3'%$00
Social Informa)cs – Produc)on Flows & Human Interac)ons
Social Informa)cs: Distribu)on Flows !"#$%&'(
!)*+',,)$-%* .*/%(&)$-%*
!)((-'( .*/%(&)$-%*
0-*-#1'2 3(%2"+$ .*4'*$%(5
!"#$%&'( 6',-4'(5 )*2 71-8&'*$!"#$%&'( 9'$"(*# )*2 !)*+',,)$-%*#
!"#$%&'( :(2'(#
3(%2"+$-%* 6)$)
9); <)$'(-),# .*/%(&)$-%*
0-*-#1'2 3(%2"+$ .*/%(&)$-%*
9); <)$'(-),# .*4'*$%(5
='*2%( .*/%(&)$-%*
!"#$%&'( .*/%(&)$-%*
!"#$%&'(
)&$( *
+,-.&$/
+0"!1-%
+0"!1-%2&! *
3,%-4
+0"!2!,56
,7
6%,0%2!,%(%&'( *
($!2!,%(%&'(
6(%
'%38
'%%0
,7/26!
,72)"6
6%"+6
%"%,)201$%&'(
02%9:
%"%,)21+%&'(
%"%,)2!"#$%&'(
02:
02%
6%"+62/0(,%(021%;
6%"+62/0(,%(021%9:
8,1)%26%,0%6
6107&7,)2;<:
6107&7,)2;;=
6"10-(28&)(
!"#$%&'%()*+
,-.'%/-*#/-(' 0
"*)/1,(#$/ 0
"*)/1,(#/'2,
31)4#()(-.#".-%(#2(),5#,2
21#()(-.#".-%(#2(),5
12/#()(-.#".-%(#2(),5
31)4#4*"#-&-$.-3.'#,2
31)4#61-.$(+#$%2"',($)%#,2
31)4#3.),5'/#,2
31)4#2-!'(+#2(),5#,2
31)4#4-⌧#2(),5
2)1*,'#!$.'
!"#"$%&'()*+',-.$
"./-+' 0
"./'$-
".&/(1*+,)
)233&.(.4)&
-2$&$()&*(324&*
324&*$()&*()233&.
)233&.(5"'.%
)233&.(%&"1%.
$,()&*()233&.
23)(!&2$"63&
$+,*-&(!"3& 00
!"#$%&'(&)*+,-.
#/$(0!&
+-1(0!&
.#%"&-(02/- 33
!"#$%&'()"#%!*
)"#%!(&#$+ ,
)"#%!($+*&"-!'-#.
*#%"&+(/-0+ ,,
!"##$%&%'!$(
!"##$%&%'!$ )
*++,&%'!$
,-.$/(-+/
(%"01"2#$
0+..$/%(
(+340$&5-#$ ))
!"#
#$%&'()* +
)*,'!-"%-(.
%-*#&/!(0"
-%*#&/!(0"
"$11*%,&(!&.(&"$11*%,
2-/2
3%4&"*!&1$4*!
3%4&"*!&"$11*%
5-)%2
)*"%2
"$11*%&%4"*
#$⌧&,%$"1*
0.-%&(7&0,$/*
)*7$01%&3$&,%$%0,
#$⌧&"$11*%,&)!-8*)&,$#*&%-#*
/!(0"&)*,'
8*.)(!&'()*
8*.)(!&.$#*
*30-"#*.%&7(!9&:&(!&'1$#"&;
,(0!'*&7-1* ++
!"#$%&'(%$")
!"#$%&'(%$"*
!"#$%&'#+,"
)%-&("'./0" **
!"#
$%&
'()*
"%+&*##,+%-*% .
"%+-/&*-,&#
"'0))*-,&#
-(11,&#
"&2,-(11,&#
"%+-/&2(+),#20%2
"%+-/&2(+),*)-
-(11,#20%2
-(11,*)-
3(2,!(##
()&'/-*-,*⌧&'/-*-
"'0))*-,"%+-/&2(+),#20%2
"'0))*-,"%+-/&2(+),*)-
5*0%,6**7
5*0%,!+)23
%*0#+),!(##
&+!!*)2,!(##
(-
5*0%
#+/%&*,1('*
!"#
!$#%&'$( )
&%*+"&,%
-&+,%**.+&/%& )
/%*,&'-#'+0
"+!
-($00%/.1#2.3+&.3-
/%('4%&%/.1#2.3+&.3-
*#$0/."*$5%.1#2
$,#"$(."*$5%
*,&$-.-,#
-&+/",#'+0."*$5%
*#$0/."*$5%.1#2.6+.*,&$-
-&+/."*5.6+.*,&-
1#2.4$&.6.*,&$-
1#2.4$&.6+.*,&$-
-,#.4$&.6.*,&$-
4$(.4$&.6.*,&$-
4$(.4$&.6+.*,&$-
-,#.4$&.6+.*,&$-
*#$0/$&/.-&',%
3'0'*7%/.-&+/",#
+&'5'0.5&+"-
2222!!
*+"&,%.3'(%
!"#$%&'(&)*!+
,-.(&/-!$/-)( 0
#-)$,*/( 0
#-)$/(1,
2*#
32*#$)*)-.$".-&)$1)*,4
21/$)*)-.$".-&)$1)*,4
32*#$1)*,4$%&$)!-&1%)
32*#$#!"$-'-%.-3.(
32*#$52-.%)+$%&1"(,)%*&
32*#$3.*,4(/
32*#$1-6()+$1)*,4
32*#$#-⌧$1)*,4
1*2!,($6%.(
!"#$"%&'()*
%)+$%&#!,-.$/0 1
%!+#&0!&2)"03
!"#$"&#)0$
"$4-$%0$#&#$+5'$"3&#)0$
/$0&')+-$
"$+$)%$&,"$#50&')+-$
.)0$"5)+&%0)65/6&)')5+)(5+503&#)0$
!"#$"%&'()*
%)+$%&#!,-.$/0 1
%)+$%&#!,-.$/0&20$. 1
.)0$"2)+&/!
/$0&')+-$&!"#$"&20$.
/$0&*"2,$
!"#$"&#)0$
!"#$"&3-)/0204
!"252/)+&!"#$"&3-)/0204
!"#$%"&'
!"#$%"&'($! )
!"#$%"&'($*"+ )
&",(!-.
&",($*"+
!/*"(-,(+/01,/.*1&"
+/*"&$/#(2*/3$03(/%/$#/4$#$*'
+/*"&$/#(2*/3$03(*$+"
!"#$%"&'(!/*"
/.*1/#(51/0*$*'(!"#$%"&"! !"#$%"&'()"*!"&
!"#$%"&'($! +
,)$-(./
,/#!(./(-*&.'
!"#$%"&'(!*."
.$0"(/1(!"#$%"&'
-$23$45(!*."
-$23$45(.$0"
#/*!$45 !*."
*2.6*#(5//!,(0/%"0"4.
.&*4,-/&.*.$/4(-#*44$45(
.&*4,-/&.*.$/4(-#*44$45(.
2/0-#".$/4(!*."
2/0-#".$/4(.$0"
!"#$%&'()"&*+&,
!"#$%&'()#+
+*(&),&-.,+)-,&*(&+
(#%&),&-.,+)-,&*(&+
&'+)./)$0*''#'1)+*(&
&'+)./)$0*''#'1)(#%&
$0*''&+)-"&-2)#')+*(&
$0*''&+)-"&-2)#')(#%&
*-(3*0)-"&-2)#')+*(&
-3,,&'()-"&-2)#')(#%&
-3,,&'()!(*,()./)0.*+#'1)+*(&
*-(3*0)0.*+#'1)!(*,()(#%&
*-(3*0)&'+)./)0.*+#'1)+*(&
*-(3*0)&'+)./)0.*+#'1)(#%&
-3,,&'()!"#$%&'()-.%$0&(#.')+*(&
$0*''&+)(,*'!$.,(*(#.')!(*,()(#%&
-3,,&'()!(*,()./)!"#$%&'()+*(&
*-(3*0)(,*!'$.,(*(#.')!(*,()(#%&
$0*''&+)(,*'!$.,(*(#.')&'+)(#%&
!"#$%"&'()*(+,$-."/)
!"#$%"&'($! 0
+,$-."/)($!
!"#$%&'(#
#)*+,$% -
!"#$%&'(,./&'
#$(''$
!*$0
!%".$(0
#%"(!',1*2' --
!"#$%&'%!'()%&'
!*(+!%',-*.#/*&#'. 0
!*(+!%',-%)+!1
12!&'3+,%!'()%&' 0
12!&'3+,%!'()%&'%)+!1
12!&'3+,%!"#$%&' 0
12!&'3+,%!"#$%&'%)+!1
!'2,1+%4#(+ 00
!"##$%#&!'(%)
!"##$%#&!'(% *
!"##$%#&+",%
-#./$+
)'0#!%&1$/% **
!"#$!%&
'(#")*$&#+,"!)*+,-.(/%#
!"#$!%)'"./0"!1)/'
*$&#+,"!)/'
-!+'$*#)/')+!/2/%(.
3#1)+!2/%(..1)&4/--"'
$+,)+!/2/%(.
!"#$!%)+!'"!)/'
+!2/%/(.)'"./0"!1)/'
*()&$5#+#(.
'/&-$#")*(&"
-+,*+'"
!!)4"('"!
'(#")'"./0"!1
4($."!
!#*/&
*+.."*#/+%)!#*/&
-!+'$*#)/')!"#$!%
3#1)!"#$!%"'
$+,)!"#$!%
3#1)-(.."#&
!"(&+%
-+&#
*+$%#!1 66
&+$!*")7/."66
!"#!$%%"&'(#)
*"&$
)+,-"!&(.
".&'!%$,#(
/.(*+!&,!"&$0(.1
*$)!.'/&'(#
0$)&.,2&1,'#,3$
'&$4,3$
!+&,2&1,'#,)+
)5'/4$#&,#(
!+)&(4$.
!+)&(4$.,'*
(--'!'"%,.((&,!"+)$,!(*$
!(..$!&$*,.((&,!"+)$,!(*$
5"%%,'#,6".$5(+)$
!+)&(4$.,.$%$3"#&
.((&,!"+)$
*$&"'%)
)(+.!$,-'%$77
!"#!$%%"&'(#)*+((&*!",)$
+((&*!",)$*!(-$.
-$)!+'/&'(#.
-$)!+'/&'(#*#$0
)(,+!$*1'%$..
!"#!$%%"&'(#)*)(%+*&(*),'-*&(
!.)&(/$0
!.)&(/$0*)(%+*&(1
!.)&(/$0*),'-*&(1
!.)&(/$0*&2-$
)(.0!$*3'%$11
!"#!$%%"&'(#)*+"%%$&)
!,)&(-$.*'/0
!,)&(-$.*#"-$
!,)&(-$.*%(!"&'(#
!,)&(-$.*1*22*+"%%$&)
)(,.!$*3'%$00
• Ingests informa)on – Data – Judgment – Models (procedures)
• Protect informa)on • Capture interac)on paHerns • Capture and generate dynamics • Result: In-‐silica experimental
pla0orm
Need to Build the Infrastructure – Synthe'c Informa'on Pla2orm
SYNTHETIC PLATFORM
Individual
ECONOMY
Harnessing Real World Data
Exploratory Analysis
Textual Analysis
Network Analysis
Agent-Based Modeling Dynamic
Simulation
Machine Learning
Consumer Price Index GDP Unemployment Rate
To Empower Policy and Decision Making
References American Associa)on of Public Opinion Research, AAPOR Big Data Task Force, AAPOR Report on Big Data, February 12, 2015 Execu)ve Office of the President. 2014. “Big Data: Seizing Opportuni)es, Preserving Values.” Washington DC. Retrieved January 28, 2015 (hHp://1.usa.gov/1hqgibM). Keller, Sallie Ann, Steven E. Koonin, and Stephanie Shipp. 2012. “Big Data and City Living – What Can It Do For Us?” Significance, 9(4): 4-‐7 Lazer, David M., Ryan Kennedy, Gary King, and Alessandro Vespignani. 2014. “The parable of Google Flu: Traps in big data analysis.” Science, 343(6176): 1203-‐1205. Leek, Jeff. 2014a. “Why big data is in trouble: they forgot about applied sta)s)cs.” Simplystats blog, May 7. Retrieved January 28, 2015 (hHp://bit.ly/1fUzZO1). New York Times Editorial Board. 2014. “BeHer Governing Through Data.” New York Times, August 19. Retrieved January 28, 2015 (hHp://ny).ms/1qehhWr). OECD. 2013. New Data for Understanding the Social Condi)on.OECD Global Science Forum. February. Okun, Arthur M. Equality and efficiency, the big tradeoff. Brookings Ins)tu)on Press, 1975. Rivlin, Alice M. Systema)c thinking for social ac)on. Vol. 3. Brookings Ins)tu)on Press, 1971 United Na)ons. 2014. A Suggested Framework for the Quality of Big Data, Deliverables of the UNECE Big Data Quality Task Team, December, 2014