Cloud Austin Hadoop Automation Lightning Talk 2014.11.18

Upload: datafundamentals

Post on 02-Jul-2015

Category:

Data & Analytics


DESCRIPTION

Hadoop ETL Automation - How to get to the fun part of big data in the shortest amount of time.

TRANSCRIPT

Page 1: Cloud Austin Hadoop Automation Lightning Talk 2014.11.18

dataFundamentals

Hadoop Automation in 15 Minutes

Or how to get to the fun stuff before your boss pulls the plug.

ETL is not the Fun Stuff in Big Data

❖ Analytics

❖ Machine Learning

❖ Spark

❖ [even just Building APIs]

But you can’t do the fun stuff until your corporate data is in place to work against. It’s a chicken-and-egg problem.

Quick! Before your boss turns off the spigot!

❖ Automate your ETL processes.

❖ Automate your server instances.

What kind of code to automate?

❖ Clean code. Super clean.

❖ Well-designed code.

Other pitfalls?

❖ NIH, Not Invented Here

How to get the fun tasks?

❖ 2-week P.O.C. (proof of concept)

❖ Your sample data

Code, Content, Contacts

❖ This Slide Deck: http://www.slideshare.net/petecarapetyan/cloud-austin-hadoop-automationlightingtalk141118

❖ or just remember slideshare.net/datafundamentals

❖ YouTube - 11-minute code demo, without slides - https://www.youtube.com/playlist?list=PLO_T9AjxEaYeByfqBqHVCmg4GbLFkYCJe

❖ Dev Code

❖ Carrie (Ruby UI and generator) https://github.com/datafundamentals/df_ui_carrie

❖ Avro from delimited https://bitbucket.org/datafundamentals/avro_from_delimited

❖ Camel-Avro https://bitbucket.org/datafundamentals/camel-avro-etl

❖ Ops Code - cookbook recipes

❖ https://github.com/datafundamentals

❖ Contact

❖ Jeff: Twitter @devopsjeff

❖ Pete: Twitter @appwritercom

❖ Site: datafundamentals.com

Be careful! It’s a competitive world out there!