matt wood, chief data scientist, amazon web services

Post on 14-Feb-2017

221 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

a presentation at the UNITED NATIONS STATISTICAL COMMISSION

by

DR. MATT WOOD

introducing

BIG DATA ANALYTICS

Hello.

Thank you.

IData, data everywhere

I IIData, data everywhere

Data timeline

I II IIIData

securityData, data everywhere

Data timeline

I II III IVData

movementData, data everywhere

Data security

Data timeline

I II III IVData

movementData, data everywhere

Data security

Data timeline

0.Amazon web

Services

Compute, storage & databases.

Retail Merchantservices

Web services

Blinding flash of the obvious.

Available.

Low cost.

Flexible.

1.3 trillion objects835k peak requests/second

300 government agencies.1,500 educational institutions.

Data, data everywhereI

Generation

Collection & storage

Analytics & computation

Collaboration & sharing

Cost of data generation is falling.

Generation

Collection & storage

Analytics & computation

Collaboration & sharing

lower cost,increased throughput

Generation

Collection & storage

Analytics & computation

Collaboration & sharing

highly constrained

Gap.

1990 2000 2010 2020

The Data Analysis Gap

Enterprise Data Data in Warehouse

Gartner: User Survey Analysis: Key Trends Shaping the Future of Data Center Infrastructure Through 2011 IDC: Worldwide Business Analytics Software 2012–2016 Forecast and 2011 Vendor Shares

Generated data

Available for analysis

Data volume

Utility.

Remove constraints.

Generation

Collection & storage

Analytics & computation

Collaboration & sharing

highly constrained

Generation

Collection & storage

Analytics & computation

Collaboration & sharing

Close the gap.

Technologies and techniques for working productively with data, at any scale.

Data timelineII

Lots of data.Lots of users.Lots of uses.

Lots of locations.

Cost.

Multipliers.

Generation challenge.

Analytics challenge.

Co-evolution.

Co-evolution.

software

Co-evolution.

software

utility computing

Hadoop.

Availability challenge.

Beautiful and unique.

Snowflake Statistics

Data has gravity.

Move data to users.

Move data to users.X

Move tools to data.

Place data where it can be easily consumed.

Reusable environment.

Always more people outside your team, than within it.

Technologies and techniques for working productively with data, at any scale.

Data security.III

Security is our number one priority.

Shared responsibility.

Choose your region.

Availability zones.

ITAR

FIPS 140-2

MPAAISO 27001

SOC 2 ISAE 3402 PCI DSS

HIPAA

FISMA Moderate

Virtual Private Cloud.

Network isolated environment.

Data movement.IV

“How do I get my data into the cloud?”

Generated and stored in the AWS cloud.

Inbound transfer if free.

Multipart upload.

Physical media.

AWS Direct Connect.

1Gbps or 10Gbps

Built in AZ replication.

Regional replication.

aws.amazon.com

IData, data everywhere

I IIData, data everywhere

Data timeline

I II IIIData

securityData, data everywhere

Data timeline

I II III IVData

movementData, data everywhere

Data security

Data timeline

Thank you.

top related