2adada5d7c1791db35640595765c19d1 osbc2011 big data m200 michael driscoll

Upload: jason-cole

Post on 07-Apr-2018


TRANSCRIPT

  • 8/4/2019 2adada5d7c1791db35640595765c19d1 OSBC2011 Big Data M200 Michael Driscoll

    1/44

    5/19/2011

    1

    Picking Winners and Losers

    in the Big Data Market

    Michael E. Driscoll, CTO, Metamarkets (@medriscoll)

    Open Source Business Conference | May 16, 2011


    I. The Winning Opportunity


    Information is the oil of the 21st century.


    Big Data Force #1: The Attack of the Exponentials


    Big Data Force #2: The Growth of Sensor Networks


    Big Data Force #3: The Rise of Cloud Computing


    The Intersection of These Three Forces Yields an Explosion of Data

    [Diagram labels: exponential economics, sensor networks, cloud computing]


    II. Winning Skills


    "The sexy job in the next ten years will be statisticians."

    Hal Varian


    [Slide image: a regular-expression match beginning "= if ($foo =~"; surviving pattern fragments: 2,3  A-Z  5,7  2,5]


    = statistics


    1000 bytes, 2 bytes


    The Promise of Predictive Analytics

    extract, learn, predict


    Recommend


    reven


    = storytelling


    credit: Miguel Rios, Twitter


    credit: Paul Butler, Facebook


    credit: Joe Reisinger, Metamarkets


    III. Winning Technologies


    The Emerging Big Data Stack

    Actions (Fraud Detection, Rec Engines)

    (R, SPSS, SAS, SAP)

    Data (RDBMS, Hadoop, CEP)


    The Emerging Big Data Stack

    [Competitive Matrix; axes: speed, data scale; plotted: Kdb, Vertica, Netezza, Esper, MySQL, Greenplum, InfoBright, Aster, MapR]


    The Emerging Big Data Stack

    [Competitive Matrix; axes: speed, data scale; plotted: custom GPU, Matlab, R, Revolution R, Excel, custom distributed, SciPy, SAP, SPSS]


    The Emerging Big Data Stack

    [Competitive Matrix; axes: cost, focus; plotted: (McKinsey, BCG), fraud detection]


    Winners: Focused, Big, and Fast

    Apps & Services: focused applications & services

    Analytics: bigger analytics

    Data: faster data platforms


    Thank You. Questions?

    Michael E. Driscoll, CTO, Metamarkets (@medriscoll)

    Open Source Business Conference | May 16, 2011


    I. Data Privacy & Ownership


    II. Scalability of Algorithms


    III. User Data & Analytics


    IV. Data Exchanges & Ecosystems


    Predictive Analytics:

    The Consumer Crystal Ball

    Omar Tawakol, Chief Executive Officer, Bluekai

    Scott Burke, Senior Vice President, Yahoo!

    Theresia Gouw Ranzetta, Partner, Accel Partners

    Moderator: Michael E. Driscoll, Metamarkets

    MIT/Stanford Venture Lab | February 15, 2011