big data & the importance of data science

18
Big Data & the importance of Data Science 18 december 2014 @wimvanleuven [email protected] 1

Upload: wim-van-leuven

Post on 21-Apr-2017

9.591 views

Category:

Data & Analytics


0 download

TRANSCRIPT

Page 1: Big Data & the importance of Data Science

Big Data & the importance of Data Science

18 december 2014

@wimvanleuven [email protected]

1

Page 2: Big Data & the importance of Data Science

2

http://www.slideshare.net/kuonen/big-tent-bddsunigenov2014

Page 3: Big Data & the importance of Data Science

–Edd Dumbill

“Big data is data that exceeds the processing capacity of conventional database systems.

The data is too big, moves too fast, or doesn’t fit the strictures of your database architectures.”

3

http://radar.oreilly.com/2012/01/what-is-big-data.html

What is Big Data?

Page 4: Big Data & the importance of Data Science

The 3 V’s of Big Data4

• Volume

• Velocity

• Variety

• (Veracity)

Page 5: Big Data & the importance of Data Science

…too big…5

IOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIO

Page 6: Big Data & the importance of Data Science

… moves to fast …6

Page 7: Big Data & the importance of Data Science

… doesn’t fit …7

Page 8: Big Data & the importance of Data Science

… what?8

Page 9: Big Data & the importance of Data Science

New tools and technologies to store and process all data on a cluster of commodity hardware so that the system acts as one, is

resilient and scales linearly.

9

What is Big Data? — revisited

Page 10: Big Data & the importance of Data Science

So what?10

the data lake is a large data pool in which the schema and data requirements are not defined

until the data is queried, processed, analysed or delivered as information to the end-user

Page 11: Big Data & the importance of Data Science

–???

“We don’t do Hadoop because we have Big Data; we do Big Data because we have

Hadoop.”

11

So what?

Page 12: Big Data & the importance of Data Science

–Matt Ehrlichman

“In the years ahead, the same power that big data awards enterprise companies will be the

norm for small business.”

12

So what?

http://blogs.wsj.com/accelerators/2014/10/31/matt-ehrlichman-big-data-for-small-firms/

Page 13: Big Data & the importance of Data Science

13

What does Big Data enable?

• Combine data from within and without your organisation

• Build new products and services

• Analyse all data (e.g. 5TB historic event data at rest in Oracle db)

Page 14: Big Data & the importance of Data Science

Big Data is no panacea14

• First decide what problem you want to solve; pick a real business problem to add immediate value

• Start small, the technology is made for linear scalability (a 3-node cluster is a cluster!)

• Then become lean: learn through experimentation

Page 15: Big Data & the importance of Data Science

Big Data challenges• Beware of hype, Big Data - washing and fad

• Tech infancy

• IT | Biz

• Data is hard

• Lack of skills! shameless self plug: BigBoards!

15

Page 16: Big Data & the importance of Data Science

Big Data opportunity

• Big Data is here to stay

• Vendor market is HUGE and will grow massively as Big Data will blend in within the datacenter

• However, the Practitioner market can deliver EXPONENTIALLY more value

16

Page 17: Big Data & the importance of Data Science

17

It is time to band together and build these systems that deliver this kind of value

for fun

for profit for good

for Belgium?

Call for Action

Page 18: Big Data & the importance of Data Science

https://www.ted.com/talks/susan_etlinger_what_do_we_do_with_all_this_big_data

“Data doesn't create meaning. We do.”

–Susan Etlinger

18

Data Science FTW