practical advice to build a data driven company

29
50 AVENUE DES CHAMPS-ÉLYSÉES 75008 PARIS > FRANCE > WWW.OCTO.COM HADOOP SUMMIT 2016 - DUBLIN PRACTICAL ADVICE TO BUILD A DATA DRIVEN COMPANY Simon MABY @simonmaby

Upload: dataworks-summithadoop-summit

Post on 16-Apr-2017

485 views

Category:

Technology


2 download

TRANSCRIPT

Page 1: Practical advice to build a data driven company

50 AVENUE DES CHAMPS-ÉLYSÉES 75008 PARIS > FRANCE > WWW.OCTO.COM

HADOOP SUMMIT 2016 - DUBLIN

PRACTICAL ADVICE TO BUILD A DATA DRIVEN COMPANY

Simon MABY@simonmaby

Page 2: Practical advice to build a data driven company

2OCTO TECHNOLOGY > THERE IS A BETTER WAY

Story : Data Driven E-Commerce

Page 3: Practical advice to build a data driven company

3

A continuous improvement of all business processes, through a smart use of the data, all the

time, everywhere and to all purposes

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Page 4: Practical advice to build a data driven company

4

BEING DATA DRIVEN IS BEING LEAN

OCTO TECHNOLOGY > THERE IS A BETTER WAY

IDEA

CODEDATA

BUILD

MEASURE

LEARN

Page 5: Practical advice to build a data driven company

5

REQUIREMENTS

OCTO TECHNOLOGY > THERE IS A BETTER WAY

IDEA

CODE

DATA Data must be easily accessible

Business must be aware of opportunities to use algorithms

Datascience projects should have the lowest time to market possible

Page 6: Practical advice to build a data driven company

6

DATA

Page 7: Practical advice to build a data driven company

7

DATAData must be easily accessible

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Page 8: Practical advice to build a data driven company

8

Your Datalake is a service to your company. It should be managed like a startup

Your employees are you first clients. The more they use it, the more you are Data Driven

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Page 9: Practical advice to build a data driven company

9

FOCUS ON USABILITY OVER ARCHITECTURE

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Services

Datalake

Datalake Team :OPS - DEVs - DESIGNERS

End Users and projects

Design services for usability and grant support

Gather requirements

and usage metrics

Page 10: Practical advice to build a data driven company

10

FOCUS ON USABILITY OVER ARCHITECTURE : EXAMPLES

How simple is it to share data to other projects?

How simple is it to suscribe to a data feed?

Is it possible to run a full search on available datasets?

Is it possible to ask other projects for details about their data through a social network?

Auto-completion over SQL request from other projects?

Bookmarking, sharing, upvoting datasets, tagging metadata…OCTO TECHNOLOGY > THERE IS A BETTER WAY

Page 11: Practical advice to build a data driven company

11

CODE

Page 12: Practical advice to build a data driven company

12

CODEDatascience projects should have the lowest time

to market possible

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Page 13: Practical advice to build a data driven company

13

EXPLORATION VERSUS PREDICTION

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Explore as quickly as possible

Deliver frequently in production

Page 14: Practical advice to build a data driven company

14OCTO TECHNOLOGY > THERE IS A BETTER WAY

(Not so) Big Data Infrastructure(For exploration)

Page 15: Practical advice to build a data driven company

15

WHAT IF WE GIVE LESS DATA TO OUR ALGORITHMS?

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Cf.  Zoltan Prekopcsak, Hadoop Summit EU. 2015

Page 16: Practical advice to build a data driven company

16

FEATURE TEAMS TO DELIVER CODE READY FOR PRODUCTION

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Business rep.

Developer

Data Sc.

Page 17: Practical advice to build a data driven company

17

MESSAGE BROKER TO REUSE DATA FLOWS

OCTO TECHNOLOGY > THERE IS A BETTER WAY

App A App B

DWDB X

App A App B

DW DB X

Kafka

App C

? ? ?- Custom dev- Data formats?- SLA?- Scheduling?…

- Standard format- Prod Ready- Exploration and prod will share same formats

Page 18: Practical advice to build a data driven company

18

KAPPA ARCHITECTURE : EVERYTHING IS A STREAM

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Stream Data Stream Processing Serving DB

Topic Streaming app v1

Streaming app v2

Result data v1

Result data v2Kafka

Batch jobs are just historical data you send into a streaming app Application code is decoupled from technical requirements One shot exploration code respecting the stream abstraction can go in

production easily

Page 19: Practical advice to build a data driven company

19

IDEAS

Page 20: Practical advice to build a data driven company

20

IDEASBusiness must be aware of the opportunities to

use algorithms

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Page 21: Practical advice to build a data driven company

21

MIX THESE PEOPLE

OCTO TECHNOLOGY > THERE IS A BETTER WAY

BusinessKnows what is

valuable

Data ScientistKnows what is

feasible

Culture &Collaboration

Page 22: Practical advice to build a data driven company

22

FEATURE TEAMS ONCE AGAIN

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Business rep.

Developer

Data Sc.

Page 23: Practical advice to build a data driven company

23

EXPLAIN THEM THAT MACHINE LEARNING IS EASY (IT’S METHODOLOGY)

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Page 24: Practical advice to build a data driven company

24

EXPLAIN THEM THAT MACHINE LEARNING IS EASY (IT’S MAGIC)

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Page 25: Practical advice to build a data driven company

25

SPEND TIME TOGETHER

Show them the data

Pair Programming

Swap roles for one day

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Page 26: Practical advice to build a data driven company

26

SOFTWARE IS EATING THE WORLD : MAKE THEM CODE

Page 27: Practical advice to build a data driven company

27OCTO TECHNOLOGY > THERE IS A BETTER WAY

Story : Octo Datascience Competition Platform

Page 28: Practical advice to build a data driven company

HOW WIDELY DATADRIVEN IS YOUR COMPANY?

Everybody is willing to make value out of the available data

Data serves not only the core business but every single function

Data is used in day-to-day activity in real-time

OCTO TECHNOLOGY > THERE IS A BETTER WAY

Page 29: Practical advice to build a data driven company

HOW DEEPLY DATADRIVEN IS YOUR COMPANY?

OCTO TECHNOLOGY > THERE IS A BETTER WAY

You are using cutting edges algorithms to automate processes

You are used to A/B testing based on data every week

You cross multiple data sources to build insights and models