policy meets social & decision informatics · social and decision analytics laboratory social...

30
SOCIAL AND DECISION ANALYTICS LABORATORY SOCIAL AND DECISION ANALYTICS LABORATORY Stephanie Shipp, Deputy Director and Research Professor [email protected] Social and Decision Analytics Laboratory http://sdal.vbi.vt.edu/ Policy Meets Social & Decision Informatics An Overview of The Fields Institute for Research in Mathematical Sciences Workshop on Big Data for Social Policy June 12, 2015

Upload: others

Post on 28-May-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

SOCIAL AND DECISION ANALYTICS LABORATORY

SOCIAL AND DECISION ANALYTICS LABORATORY

Stephanie Shipp, Deputy Director and Research Professor [email protected]

Social and Decision Analytics Laboratory http://sdal.vbi.vt.edu/

Policy Meets Social & Decision Informatics

An Overview of

The Fields Institute for Research in Mathematical Sciences Workshop on Big

Data for Social Policy

June 12, 2015

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

Overview of Big Data and Social Policy Workshop

Day 1 – Big Data and Official/Government Statistics Day 2 – Network Models and Agent Based Modeling Day 3 – Living Analytics and Privacy Day 4 – Urban Analytics

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

Defining Big Data is not easy…..

•  “The definition of Big Data is an imprecise description of a rich and complicated set of characteristics, practices, techniques, ethical issues, and outcomes all associated with data.” AAPOR 2015.

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

It all starts with data: The All Data Revolution

1.  Volume 2.  Velocity 3.  Variety

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

Then there was 4

1.  Volume 2.  Velocity 3.  Variety 4.  Veracity

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

And 5

1.  Volume 2.  Velocity 3.  Variety 4.  Veracity 5.  Value

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

Now 7

1.  Volume 2.  Velocity 3.  Variety 4.  Veracity 5.  Value 6.  Variability 7.  Visualization

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

All Data & Social Informatics

•  Slogging thru the Hard Stuff –  People are in all the loops –  Problems are not ‘puzzles’ to be

solved

•  Need questions to drive the informatics

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

Big Data and Official/Government Statistics - Disruptive Innovations -- Game Changers

- •  1790 - First U.S. census •  1871 - First national census for country of Canada •  1894/1902 – U.S. Bureau of Labor Statistics & Census Bureau created •  1930s – Statistically designed surveys •  1971 – Statistics Canada created •  1970s – Quiet revolution (role of analysts for policy) •  1980s - Increasing deployment of longitudinal surveys •  1990s – Network analysis, ABMs, GIS, machine learning, ….. •  2010s – Big All Data revolution and applications to govt at all levels, companies, and

academia

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

Big Data and Social Policy – Opportunities Compared to survey data, big (digital, organic, all) data can improve:

•  Small area estimates •  Unit cost is low •  Real-time availability •  Finer granularity of data •  Tracking and understanding behavior •  Opportunities for natural experiments •  Increased capacity - advances in computational methods - ABM, network

analysis, machine learning, GIS, other computer and statistical methods that can ingest and process massive amounts of data

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

Big Data and Social Policy Challenges •  Lack of ‘common good’ approach to access data (Groves) •  Lack of skills – need for a data savvy population •  No control over data quality; unknown error structure •  Lack of standards - data quality framework, replicability, other… •  Lack of transparency •  Privacy and confidentiality –

–  Proposed Canadian legislation to amend human rights law to clarify that discrimination based on genetic tests is prohibited

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

Real Questions vs. Perceived Question Type

Jeffery T. Leek, and Roger D. Peng Science 2015;347:1314-1315

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

Lessons of the Past Keep Are Still Relevant Today!

Competing Ingredients •  Random Innovation •  Systematic Thinking for Social Action

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

Lessons of the Past Keep Getting Re-Printed!

Competing Ingredients •  Random Innovation •  Systematic Thinking for Social Action

"Theory without data is myth; data without theory is madness.” Phil Zuckerman

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

Mash It Up with Bioinformatics

ontariowildflower.com

Locust Photo by Ocean/Corbis

bioweb.uwlax.edu

DNA is Middleware

Dynamics will overrun global optimization

CAD CallSummary

Unit MovementSummary

IncidentSummary

NFIRS,Med Report

Incident OccursCall

Arrives

@ EOC

Unit(s)

Dispatc

hed

Unit(s)

Arrive

Inciden

t Close

d

Post Inc

ident

SocialMedia

MediaReports

Inte

rnal

Dat

aEx

tern

al D

ata

Flow

of E

vent

s

CAD CallSummary

Unit MovementSummary

IncidentSummary

NFIRS,Med Report

Incident OccursCall

Arrives

@ EOC

Unit(s)

Dispatc

hed

Unit(s)

Arrive

Inciden

t Close

d

Post Inc

ident

SocialMedia

MediaReports

Inte

rnal

Dat

aEx

tern

al D

ata

Flow

of E

vent

s

CAD CallSummary

Unit MovementSummary

IncidentSummary

NFIRS,Med Report

Incident OccursCall

Arrives

@ EOC

Unit(s)

Dispatc

hed

Unit(s)

Arrive

Inciden

t Close

d

Post Inc

ident

SocialMedia

MediaReports

Inte

rnal

Dat

aEx

tern

al D

ata

Flow

of E

vent

s

CAD CallSummary

Unit MovementSummary

IncidentSummary

NFIRS,Med Report

Incident OccursCall

Arrives

@ EOC

Unit(s)

Dispatc

hed

Unit(s)

Arrive

Inciden

t Close

d

Post Inc

ident

SocialMedia

MediaReports

Inte

rnal

Dat

aEx

tern

al D

ata

Flow

of E

vent

s

CAD CallSummary

Unit MovementSummary

IncidentSummary

NFIRS,Med Report

Incident OccursCall

Arrives

@ EOC

Unit(s)

Dispatc

hed

Unit(s)

Arrive

Inciden

t Close

d

Post Inc

ident

SocialMedia

MediaReports

Inte

rnal

Dat

aEx

tern

al D

ata

Flow

of E

vent

s

CAD CallSummary

Unit MovementSummary

IncidentSummary

NFIRS,Med Report

Incident OccursCall

Arrives

@ EOC

Unit(s)

Dispatc

hed

Unit(s)

Arrive

Inciden

t Close

d

Post Inc

ident

SocialMedia

MediaReports

Inte

rnal

Dat

aEx

tern

al D

ata

Flow

of E

vent

s

Loyal  Customers  or  Loyal  Loca)ons?  

SOCIAL AND DECISION ANALYTICS LABORATORYSOCIAL AND DECISION ANALYTICS LABORATORY

Ensor, et al., Circulation, Volume 127(11):1192-1199 American Heart Association

Expanding to Population Dynamics: Houston, TX

High Performance Computing

High Performance Computing

Data

Data Storage and

Warehousing

Air Quality Model

High Performance Computing

Data Data Storage and

Warehousing

Synthetic Population Information Model

Database

Database

Monitor Readings

LandScanHERENCESACS

U.S. Census

American Community

Survey

National Household

Travel Survey

Population synthesis using Iterative

Proportional Fitting

Activity assignments using Fitted Value

Methods

Location choice using Gravity Models

Baseline Synthetic

Population for Houston

Concentration Levels by Hour

and Activity Location

Temporal and Geographic

Concentrations using Inverse

Distance Weighting

in silico Platform for Environmental CouplingPersonal Exposure Model

Exposure Calculations by Individual and

Activity Location

Inputs

Exposure by

Individual

Individual Temporal and Geographic

Activity Patterns

Temporal and Geographic

Ozone Concentrations

10:00 am, August 26, 2008 High Performance Computing

High Performance Computing

Data

Data Storage and

Warehousing

Air Quality Model

High Performance Computing

Data Data Storage and

Warehousing

Synthetic Population Information Model

Database

Database

Monitor Readings

LandScanHERENCESACS

U.S. Census

American Community

Survey

National Household

Travel Survey

Population synthesis using Iterative

Proportional Fitting

Activity assignments using Fitted Value

Methods

Location choice using Gravity Models

Baseline Synthetic

Population for Houston

Concentration Levels by Hour

and Activity Location

Temporal and Geographic

Concentrations using Inverse

Distance Weighting

in silico Platform for Environmental CouplingPersonal Exposure Model

Exposure Calculations by Individual and

Activity Location

Inputs

Exposure by

Individual

Individual Temporal and Geographic

Activity Patterns

Temporal and Geographic

Ozone Concentrations

10:00 am, August 26, 2008 High Performance Computing

High Performance Computing

Data

Data Storage and

Warehousing

Air Quality Model

High Performance Computing

Data Data Storage and

Warehousing

Synthetic Population Information Model

Database

Database

Monitor Readings

LandScanHERENCESACS

U.S. Census

American Community

Survey

National Household

Travel Survey

Population synthesis using Iterative

Proportional Fitting

Activity assignments using Fitted Value

Methods

Location choice using Gravity Models

Baseline Synthetic

Population for Houston

Concentration Levels by Hour

and Activity Location

Temporal and Geographic

Concentrations using Inverse

Distance Weighting

in silico Platform for Environmental CouplingPersonal Exposure Model

Exposure Calculations by Individual and

Activity Location

Inputs

Exposure by

Individual

Individual Temporal and Geographic

Activity Patterns

Temporal and Geographic

Ozone Concentrations

26  

!"#$%&'( !)*+',,)$-%* .*/%(&)$-%*

!)((-'( .*/%(&)$-%*

0-*-#1'2 3(%2"+$ .*4'*$%(5

!"#$%&'( 6',-4'(5 )*2 71-8&'*$!"#$%&'( 9'$"(*# )*2 !)*+',,)$-%*#

!"#$%&'( :(2'(#

3(%2"+$-%* 6)$)

9); <)$'(-),# .*/%(&)$-%*

0-*-#1'2 3(%2"+$ .*/%(&)$-%*

9); <)$'(-),# .*4'*$%(5

='*2%( .*/%(&)$-%*

!"#$%&'( .*/%(&)$-%*

!"#$%&'(

)&$( *

+,-.&$/

+0"!1-%

+0"!1-%2&! *

3,%-4

+0"!2!,56

,7

6%,0%2!,%(%&'( *

($!2!,%(%&'(

6(%

'%38

'%%0

,7/26!

,72)"6

6%"+6

%"%,)201$%&'(

02%9:

%"%,)21+%&'(

%"%,)2!"#$%&'(

02:

02%

6%"+62/0(,%(021%;

6%"+62/0(,%(021%9:

8,1)%26%,0%6

6107&7,)2;<:

6107&7,)2;;=

6"10-(28&)(

!"#$%&'%()*+

,-.'%/-*#/-(' 0

"*)/1,(#$/ 0

"*)/1,(#/'2,

31)4#()(-.#".-%(#2(),5#,2

21#()(-.#".-%(#2(),5

12/#()(-.#".-%(#2(),5

31)4#4*"#-&-$.-3.'#,2

31)4#61-.$(+#$%2"',($)%#,2

31)4#3.),5'/#,2

31)4#2-!'(+#2(),5#,2

31)4#4-⌧#2(),5

2)1*,'#!$.'

!"#"$%&'()*+',-.$

"./-+' 0

"./'$-

".&/(1*+,)

)233&.(.4)&

-2$&$()&*(324&*

324&*$()&*()233&.

)233&.(5"'.%

)233&.(%&"1%.

$,()&*()233&.

23)(!&2$"63&

$+,*-&(!"3& 00

!"#$%&'(&)*+,-.

#/$(0!&

+-1(0!&

.#%"&-(02/- 33

!"#$%&'()"#%!*

)"#%!(&#$+ ,

)"#%!($+*&"-!'-#.

*#%"&+(/-0+ ,,

!"##$%&%'!$(

!"##$%&%'!$ )

*++,&%'!$

,-.$/(-+/

(%"01"2#$

0+..$/%(

(+340$&5-#$ ))

!"#

#$%&'()* +

)*,'!-"%-(.

%-*#&/!(0"

-%*#&/!(0"

"$11*%,&(!&.(&"$11*%,

2-/2

3%4&"*!&1$4*!

3%4&"*!&"$11*%

5-)%2

)*"%2

"$11*%&%4"*

#$⌧&,%$"1*

0.-%&(7&0,$/*

)*7$01%&3$&,%$%0,

#$⌧&"$11*%,&)!-8*)&,$#*&%-#*

/!(0"&)*,'

8*.)(!&'()*

8*.)(!&.$#*

*30-"#*.%&7(!9&:&(!&'1$#"&;

,(0!'*&7-1* ++

!"#$%&'(%$")

!"#$%&'(%$"*

!"#$%&'#+,"

)%-&("'./0" **

!"#

$%&

'()*

"%+&*##,+%-*% .

"%+-/&*-,&#

"'0))*-,&#

-(11,&#

"&2,-(11,&#

"%+-/&2(+),#20%2

"%+-/&2(+),*)-

-(11,#20%2

-(11,*)-

3(2,!(##

()&'/-*-,*⌧&'/-*-

"'0))*-,"%+-/&2(+),#20%2

"'0))*-,"%+-/&2(+),*)-

5*0%,6**7

5*0%,!+)23

%*0#+),!(##

&+!!*)2,!(##

(-

5*0%

#+/%&*,1('*

!"#

!$#%&'$( )

&%*+"&,%

-&+,%**.+&/%& )

/%*,&'-#'+0

"+!

-($00%/.1#2.3+&.3-

/%('4%&%/.1#2.3+&.3-

*#$0/."*$5%.1#2

$,#"$(."*$5%

*,&$-.-,#

-&+/",#'+0."*$5%

*#$0/."*$5%.1#2.6+.*,&$-

-&+/."*5.6+.*,&-

1#2.4$&.6.*,&$-

1#2.4$&.6+.*,&$-

-,#.4$&.6.*,&$-

4$(.4$&.6.*,&$-

4$(.4$&.6+.*,&$-

-,#.4$&.6+.*,&$-

*#$0/$&/.-&',%

3'0'*7%/.-&+/",#

+&'5'0.5&+"-

2222!!

*+"&,%.3'(%

!"#$%&'(&)*!+

,-.(&/-!$/-)( 0

#-)$,*/( 0

#-)$/(1,

2*#

32*#$)*)-.$".-&)$1)*,4

21/$)*)-.$".-&)$1)*,4

32*#$1)*,4$%&$)!-&1%)

32*#$#!"$-'-%.-3.(

32*#$52-.%)+$%&1"(,)%*&

32*#$3.*,4(/

32*#$1-6()+$1)*,4

32*#$#-⌧$1)*,4

1*2!,($6%.(

!"#$"%&'()*

%)+$%&#!,-.$/0 1

%!+#&0!&2)"03

!"#$"&#)0$

"$4-$%0$#&#$+5'$"3&#)0$

/$0&')+-$

"$+$)%$&,"$#50&')+-$

.)0$"5)+&%0)65/6&)')5+)(5+503&#)0$

!"#$"%&'()*

%)+$%&#!,-.$/0 1

%)+$%&#!,-.$/0&20$. 1

.)0$"2)+&/!

/$0&')+-$&!"#$"&20$.

/$0&*"2,$

!"#$"&#)0$

!"#$"&3-)/0204

!"252/)+&!"#$"&3-)/0204

!"#$%"&'

!"#$%"&'($! )

!"#$%"&'($*"+ )

&",(!-.

&",($*"+

!/*"(-,(+/01,/.*1&"

+/*"&$/#(2*/3$03(/%/$#/4$#$*'

+/*"&$/#(2*/3$03(*$+"

!"#$%"&'(!/*"

/.*1/#(51/0*$*'(!"#$%"&"! !"#$%"&'()"*!"&

!"#$%"&'($! +

,)$-(./

,/#!(./(-*&.'

!"#$%"&'(!*."

.$0"(/1(!"#$%"&'

-$23$45(!*."

-$23$45(.$0"

#/*!$45 !*."

*2.6*#(5//!,(0/%"0"4.

.&*4,-/&.*.$/4(-#*44$45(

.&*4,-/&.*.$/4(-#*44$45(.

2/0-#".$/4(!*."

2/0-#".$/4(.$0"

!"#$%&'()"&*+&,

!"#$%&'()#+

+*(&),&-.,+)-,&*(&+

(#%&),&-.,+)-,&*(&+

&'+)./)$0*''#'1)+*(&

&'+)./)$0*''#'1)(#%&

$0*''&+)-"&-2)#')+*(&

$0*''&+)-"&-2)#')(#%&

*-(3*0)-"&-2)#')+*(&

-3,,&'()-"&-2)#')(#%&

-3,,&'()!(*,()./)0.*+#'1)+*(&

*-(3*0)0.*+#'1)!(*,()(#%&

*-(3*0)&'+)./)0.*+#'1)+*(&

*-(3*0)&'+)./)0.*+#'1)(#%&

-3,,&'()!"#$%&'()-.%$0&(#.')+*(&

$0*''&+)(,*'!$.,(*(#.')!(*,()(#%&

-3,,&'()!(*,()./)!"#$%&'()+*(&

*-(3*0)(,*!'$.,(*(#.')!(*,()(#%&

$0*''&+)(,*'!$.,(*(#.')&'+)(#%&

!"#$%"&'()*(+,$-."/)

!"#$%"&'($! 0

+,$-."/)($!

!"#$%&'(#

#)*+,$% -

!"#$%&'(,./&'

#$(''$

!*$0

!%".$(0

#%"(!',1*2' --

!"#$%&'%!'()%&'

!*(+!%',-*.#/*&#'. 0

!*(+!%',-%)+!1

12!&'3+,%!'()%&' 0

12!&'3+,%!'()%&'%)+!1

12!&'3+,%!"#$%&' 0

12!&'3+,%!"#$%&'%)+!1

!'2,1+%4#(+ 00

!"##$%#&!'(%)

!"##$%#&!'(% *

!"##$%#&+",%

-#./$+

)'0#!%&1$/% **

!"#$!%&

'(#")*$&#+,"!)*+,-.(/%#

!"#$!%)'"./0"!1)/'

*$&#+,"!)/'

-!+'$*#)/')+!/2/%(.

3#1)+!2/%(..1)&4/--"'

$+,)+!/2/%(.

!"#$!%)+!'"!)/'

+!2/%/(.)'"./0"!1)/'

*()&$5#+#(.

'/&-$#")*(&"

-+,*+'"

!!)4"('"!

'(#")'"./0"!1

4($."!

!#*/&

*+.."*#/+%)!#*/&

-!+'$*#)/')!"#$!%

3#1)!"#$!%"'

$+,)!"#$!%

3#1)-(.."#&

!"(&+%

-+&#

*+$%#!1 66

&+$!*")7/."66

!"#!$%%"&'(#)

*"&$

)+,-"!&(.

".&'!%$,#(

/.(*+!&,!"&$0(.1

*$)!.'/&'(#

0$)&.,2&1,'#,3$

'&$4,3$

!+&,2&1,'#,)+

)5'/4$#&,#(

!+)&(4$.

!+)&(4$.,'*

(--'!'"%,.((&,!"+)$,!(*$

!(..$!&$*,.((&,!"+)$,!(*$

5"%%,'#,6".$5(+)$

!+)&(4$.,.$%$3"#&

.((&,!"+)$

*$&"'%)

)(+.!$,-'%$77

!"#!$%%"&'(#)*+((&*!",)$

+((&*!",)$*!(-$.

-$)!+'/&'(#.

-$)!+'/&'(#*#$0

)(,+!$*1'%$..

!"#!$%%"&'(#)*)(%+*&(*),'-*&(

!.)&(/$0

!.)&(/$0*)(%+*&(1

!.)&(/$0*),'-*&(1

!.)&(/$0*&2-$

)(.0!$*3'%$11

!"#!$%%"&'(#)*+"%%$&)

!,)&(-$.*'/0

!,)&(-$.*#"-$

!,)&(-$.*%(!"&'(#

!,)&(-$.*1*22*+"%%$&)

)(,.!$*3'%$00

Social  Informa)cs  –    Produc)on  Flows    &  Human  Interac)ons  

Social  Informa)cs:  Distribu)on  Flows  !"#$%&'(

!)*+',,)$-%* .*/%(&)$-%*

!)((-'( .*/%(&)$-%*

0-*-#1'2 3(%2"+$ .*4'*$%(5

!"#$%&'( 6',-4'(5 )*2 71-8&'*$!"#$%&'( 9'$"(*# )*2 !)*+',,)$-%*#

!"#$%&'( :(2'(#

3(%2"+$-%* 6)$)

9); <)$'(-),# .*/%(&)$-%*

0-*-#1'2 3(%2"+$ .*/%(&)$-%*

9); <)$'(-),# .*4'*$%(5

='*2%( .*/%(&)$-%*

!"#$%&'( .*/%(&)$-%*

!"#$%&'(

)&$( *

+,-.&$/

+0"!1-%

+0"!1-%2&! *

3,%-4

+0"!2!,56

,7

6%,0%2!,%(%&'( *

($!2!,%(%&'(

6(%

'%38

'%%0

,7/26!

,72)"6

6%"+6

%"%,)201$%&'(

02%9:

%"%,)21+%&'(

%"%,)2!"#$%&'(

02:

02%

6%"+62/0(,%(021%;

6%"+62/0(,%(021%9:

8,1)%26%,0%6

6107&7,)2;<:

6107&7,)2;;=

6"10-(28&)(

!"#$%&'%()*+

,-.'%/-*#/-(' 0

"*)/1,(#$/ 0

"*)/1,(#/'2,

31)4#()(-.#".-%(#2(),5#,2

21#()(-.#".-%(#2(),5

12/#()(-.#".-%(#2(),5

31)4#4*"#-&-$.-3.'#,2

31)4#61-.$(+#$%2"',($)%#,2

31)4#3.),5'/#,2

31)4#2-!'(+#2(),5#,2

31)4#4-⌧#2(),5

2)1*,'#!$.'

!"#"$%&'()*+',-.$

"./-+' 0

"./'$-

".&/(1*+,)

)233&.(.4)&

-2$&$()&*(324&*

324&*$()&*()233&.

)233&.(5"'.%

)233&.(%&"1%.

$,()&*()233&.

23)(!&2$"63&

$+,*-&(!"3& 00

!"#$%&'(&)*+,-.

#/$(0!&

+-1(0!&

.#%"&-(02/- 33

!"#$%&'()"#%!*

)"#%!(&#$+ ,

)"#%!($+*&"-!'-#.

*#%"&+(/-0+ ,,

!"##$%&%'!$(

!"##$%&%'!$ )

*++,&%'!$

,-.$/(-+/

(%"01"2#$

0+..$/%(

(+340$&5-#$ ))

!"#

#$%&'()* +

)*,'!-"%-(.

%-*#&/!(0"

-%*#&/!(0"

"$11*%,&(!&.(&"$11*%,

2-/2

3%4&"*!&1$4*!

3%4&"*!&"$11*%

5-)%2

)*"%2

"$11*%&%4"*

#$⌧&,%$"1*

0.-%&(7&0,$/*

)*7$01%&3$&,%$%0,

#$⌧&"$11*%,&)!-8*)&,$#*&%-#*

/!(0"&)*,'

8*.)(!&'()*

8*.)(!&.$#*

*30-"#*.%&7(!9&:&(!&'1$#"&;

,(0!'*&7-1* ++

!"#$%&'(%$")

!"#$%&'(%$"*

!"#$%&'#+,"

)%-&("'./0" **

!"#

$%&

'()*

"%+&*##,+%-*% .

"%+-/&*-,&#

"'0))*-,&#

-(11,&#

"&2,-(11,&#

"%+-/&2(+),#20%2

"%+-/&2(+),*)-

-(11,#20%2

-(11,*)-

3(2,!(##

()&'/-*-,*⌧&'/-*-

"'0))*-,"%+-/&2(+),#20%2

"'0))*-,"%+-/&2(+),*)-

5*0%,6**7

5*0%,!+)23

%*0#+),!(##

&+!!*)2,!(##

(-

5*0%

#+/%&*,1('*

!"#

!$#%&'$( )

&%*+"&,%

-&+,%**.+&/%& )

/%*,&'-#'+0

"+!

-($00%/.1#2.3+&.3-

/%('4%&%/.1#2.3+&.3-

*#$0/."*$5%.1#2

$,#"$(."*$5%

*,&$-.-,#

-&+/",#'+0."*$5%

*#$0/."*$5%.1#2.6+.*,&$-

-&+/."*5.6+.*,&-

1#2.4$&.6.*,&$-

1#2.4$&.6+.*,&$-

-,#.4$&.6.*,&$-

4$(.4$&.6.*,&$-

4$(.4$&.6+.*,&$-

-,#.4$&.6+.*,&$-

*#$0/$&/.-&',%

3'0'*7%/.-&+/",#

+&'5'0.5&+"-

2222!!

*+"&,%.3'(%

!"#$%&'(&)*!+

,-.(&/-!$/-)( 0

#-)$,*/( 0

#-)$/(1,

2*#

32*#$)*)-.$".-&)$1)*,4

21/$)*)-.$".-&)$1)*,4

32*#$1)*,4$%&$)!-&1%)

32*#$#!"$-'-%.-3.(

32*#$52-.%)+$%&1"(,)%*&

32*#$3.*,4(/

32*#$1-6()+$1)*,4

32*#$#-⌧$1)*,4

1*2!,($6%.(

!"#$"%&'()*

%)+$%&#!,-.$/0 1

%!+#&0!&2)"03

!"#$"&#)0$

"$4-$%0$#&#$+5'$"3&#)0$

/$0&')+-$

"$+$)%$&,"$#50&')+-$

.)0$"5)+&%0)65/6&)')5+)(5+503&#)0$

!"#$"%&'()*

%)+$%&#!,-.$/0 1

%)+$%&#!,-.$/0&20$. 1

.)0$"2)+&/!

/$0&')+-$&!"#$"&20$.

/$0&*"2,$

!"#$"&#)0$

!"#$"&3-)/0204

!"252/)+&!"#$"&3-)/0204

!"#$%"&'

!"#$%"&'($! )

!"#$%"&'($*"+ )

&",(!-.

&",($*"+

!/*"(-,(+/01,/.*1&"

+/*"&$/#(2*/3$03(/%/$#/4$#$*'

+/*"&$/#(2*/3$03(*$+"

!"#$%"&'(!/*"

/.*1/#(51/0*$*'(!"#$%"&"! !"#$%"&'()"*!"&

!"#$%"&'($! +

,)$-(./

,/#!(./(-*&.'

!"#$%"&'(!*."

.$0"(/1(!"#$%"&'

-$23$45(!*."

-$23$45(.$0"

#/*!$45 !*."

*2.6*#(5//!,(0/%"0"4.

.&*4,-/&.*.$/4(-#*44$45(

.&*4,-/&.*.$/4(-#*44$45(.

2/0-#".$/4(!*."

2/0-#".$/4(.$0"

!"#$%&'()"&*+&,

!"#$%&'()#+

+*(&),&-.,+)-,&*(&+

(#%&),&-.,+)-,&*(&+

&'+)./)$0*''#'1)+*(&

&'+)./)$0*''#'1)(#%&

$0*''&+)-"&-2)#')+*(&

$0*''&+)-"&-2)#')(#%&

*-(3*0)-"&-2)#')+*(&

-3,,&'()-"&-2)#')(#%&

-3,,&'()!(*,()./)0.*+#'1)+*(&

*-(3*0)0.*+#'1)!(*,()(#%&

*-(3*0)&'+)./)0.*+#'1)+*(&

*-(3*0)&'+)./)0.*+#'1)(#%&

-3,,&'()!"#$%&'()-.%$0&(#.')+*(&

$0*''&+)(,*'!$.,(*(#.')!(*,()(#%&

-3,,&'()!(*,()./)!"#$%&'()+*(&

*-(3*0)(,*!'$.,(*(#.')!(*,()(#%&

$0*''&+)(,*'!$.,(*(#.')&'+)(#%&

!"#$%"&'()*(+,$-."/)

!"#$%"&'($! 0

+,$-."/)($!

!"#$%&'(#

#)*+,$% -

!"#$%&'(,./&'

#$(''$

!*$0

!%".$(0

#%"(!',1*2' --

!"#$%&'%!'()%&'

!*(+!%',-*.#/*&#'. 0

!*(+!%',-%)+!1

12!&'3+,%!'()%&' 0

12!&'3+,%!'()%&'%)+!1

12!&'3+,%!"#$%&' 0

12!&'3+,%!"#$%&'%)+!1

!'2,1+%4#(+ 00

!"##$%#&!'(%)

!"##$%#&!'(% *

!"##$%#&+",%

-#./$+

)'0#!%&1$/% **

!"#$!%&

'(#")*$&#+,"!)*+,-.(/%#

!"#$!%)'"./0"!1)/'

*$&#+,"!)/'

-!+'$*#)/')+!/2/%(.

3#1)+!2/%(..1)&4/--"'

$+,)+!/2/%(.

!"#$!%)+!'"!)/'

+!2/%/(.)'"./0"!1)/'

*()&$5#+#(.

'/&-$#")*(&"

-+,*+'"

!!)4"('"!

'(#")'"./0"!1

4($."!

!#*/&

*+.."*#/+%)!#*/&

-!+'$*#)/')!"#$!%

3#1)!"#$!%"'

$+,)!"#$!%

3#1)-(.."#&

!"(&+%

-+&#

*+$%#!1 66

&+$!*")7/."66

!"#!$%%"&'(#)

*"&$

)+,-"!&(.

".&'!%$,#(

/.(*+!&,!"&$0(.1

*$)!.'/&'(#

0$)&.,2&1,'#,3$

'&$4,3$

!+&,2&1,'#,)+

)5'/4$#&,#(

!+)&(4$.

!+)&(4$.,'*

(--'!'"%,.((&,!"+)$,!(*$

!(..$!&$*,.((&,!"+)$,!(*$

5"%%,'#,6".$5(+)$

!+)&(4$.,.$%$3"#&

.((&,!"+)$

*$&"'%)

)(+.!$,-'%$77

!"#!$%%"&'(#)*+((&*!",)$

+((&*!",)$*!(-$.

-$)!+'/&'(#.

-$)!+'/&'(#*#$0

)(,+!$*1'%$..

!"#!$%%"&'(#)*)(%+*&(*),'-*&(

!.)&(/$0

!.)&(/$0*)(%+*&(1

!.)&(/$0*),'-*&(1

!.)&(/$0*&2-$

)(.0!$*3'%$11

!"#!$%%"&'(#)*+"%%$&)

!,)&(-$.*'/0

!,)&(-$.*#"-$

!,)&(-$.*%(!"&'(#

!,)&(-$.*1*22*+"%%$&)

)(,.!$*3'%$00

•  Ingests  informa)on  –  Data  –  Judgment  –  Models  (procedures)  

•  Protect  informa)on  •  Capture  interac)on  paHerns  •  Capture  and  generate  dynamics  •  Result:  In-­‐silica  experimental  

pla0orm  

Need  to  Build  the  Infrastructure  –    Synthe'c  Informa'on  Pla2orm  

SYNTHETIC PLATFORM

Individual

ECONOMY

Harnessing Real World Data

Exploratory Analysis

Textual Analysis

Network Analysis

Agent-Based Modeling Dynamic

Simulation

Machine Learning

Consumer Price Index GDP Unemployment Rate

To Empower Policy and Decision Making

References    American  Associa)on  of  Public  Opinion  Research,  AAPOR  Big  Data  Task  Force,  AAPOR  Report  on  Big  Data,  February  12,  2015  Execu)ve  Office  of  the  President.  2014.  “Big  Data:  Seizing  Opportuni)es,  Preserving  Values.”  Washington  DC.  Retrieved  January  28,  2015  (hHp://1.usa.gov/1hqgibM).  Keller,  Sallie  Ann,  Steven  E.  Koonin,  and  Stephanie  Shipp.  2012.  “Big  Data  and  City  Living  –  What  Can  It  Do  For  Us?”  Significance,  9(4):  4-­‐7    Lazer,  David  M.,  Ryan  Kennedy,  Gary  King,  and  Alessandro  Vespignani.  2014.  “The  parable  of  Google  Flu:  Traps  in  big  data  analysis.”  Science,  343(6176):  1203-­‐1205.  Leek,  Jeff.  2014a.  “Why  big  data  is  in  trouble:  they  forgot  about  applied  sta)s)cs.”  Simplystats  blog,  May  7.  Retrieved  January  28,  2015  (hHp://bit.ly/1fUzZO1).  New  York  Times  Editorial  Board.  2014.  “BeHer  Governing  Through  Data.”  New  York  Times,  August  19.  Retrieved  January  28,  2015  (hHp://ny).ms/1qehhWr).  OECD.  2013.  New  Data  for  Understanding  the  Social  Condi)on.OECD  Global  Science  Forum.  February.    Okun,  Arthur  M.  Equality  and  efficiency,  the  big  tradeoff.  Brookings  Ins)tu)on  Press,  1975.  Rivlin,  Alice  M.  Systema)c  thinking  for  social  ac)on.  Vol.  3.  Brookings  Ins)tu)on  Press,  1971  United  Na)ons.  2014.    A  Suggested  Framework  for  the  Quality  of  Big  Data,  Deliverables  of  the  UNECE  Big  Data  Quality  Task  Team,  December,  2014