strata online_road_to_enterprise_data_2011

Post on 28-May-2015

978 Views

Category:

Technology

2 Downloads

Preview:

Click to see full reader

DESCRIPTION

Deck for Strata Online Dec 2011 - for more see my blog at www.LynnLangit.com

TRANSCRIPT

@LynnLangitPractioner, Author, Instructor

The Road for Enterprise DataFrom Traditional BI to Big Data

BI = ‘Current State’ Questions

• What did we sell?• When did we sell it?• Where did we sell it?• What did we sell with it?

Collecting Transactional

data

BTW…Do you use Data Mining?

BI Data Landscape

StorageProcessing

Query

Presentation

Mix-in #1 -- the Cloud and…

• Host Data in the Cloud• Process & Query Data in the Cloud– Click to query and (data) mine– Return the data locally– Use Self-service BI visualizers

• Mash-up Cloud data – Combine with local data

NoSQL and the Cloud

• The Elephant in the room…Hadoop• Over 120+ types of noSQL databases– http://nosql-database.org/

Can’t We All Play Together?

Data in the Cloud - Microsoft

Windows Azure DataMarket

Amazon AWS

Google App Engine Data

New on Google – MySQL++

Comparing RDBMS and MapReduce

Traditional RDBMS MapReduce

Data Size Gigabytes (Terabytes) Petabytes (Hexabytes)

Access Interactive and Batch Batch

Updates Read / Write many times Write once, Read many times

Structure Static Schema Dynamic Schema

Integrity High (ACID) Low

Scaling Nonlinear Linear

DBA Ratio 1:40 1:3000

Reference: Tom White’s Hadoop: The Definitive Guide

BTW…NoSQL is 50x CHEAPER

BigData = ‘Next State’ Questions

• What could happen?• Why didn’t this happen?• When will the next new thing

happen?• What will the next new thing

be?

Collecting behavioral

data

Splunk

Mining Log Files

Presenting the results

Freebase

Mix-in #2 - Data Scientists

• Who asks the ‘right’ questions now? • Who understands the languages? • Who can understand the results?

Is Data Science your next Career?

Becoming a Data Scientist

• Conferences– Strata – Data Scientist

Summit– CloudCamps

• Practice– here

Mix-in #3 - Presentation

• New Devices – iPad, Kindle Fire• New User Experiences – touch, Kinect• EVERYTHING on the phone

HortonWorks, Cloudera…

Karmasphere Studio for Amazon Elastic MapReduce

More PowerPivot

Cloud-based Data Mining Predixion

QlikView

QlikView on iPad

BI >BigData ‘To Do ListStore some (more) data on the cloud• Relational and non-relational• Transaction AND Behavioral

Process some data in the cloud• Try data mining• Learn about Data Science

Update your client tools• New UI (touch, gestures)• Click to Query• New form factors (phone, tablet)

Hadoop Connector to Excel - Demo

www.TeachingKidsProgramming.org

• Do a Recipe Teach a Kid (Ages 10 ++)• Microsoft SmallBasic Free Courseware (recipes)

Keep up with Big Data

Follow me @LynnLangit

RSS my blog www.LynnLangit.com

Hire me• To help build your BI/Big Data solution• To teach your team next gen BI

top related