4 hadoop for-the-disillusioned

15

@wattsteve Hadoop for the disillusioned Steve Watt, Red Hat CC flickr rubenswieringa

Upload: bigdatacamp

Post on 14-Jul-2015

67 views

Category:

Technology

0 download

Report

Download

Tags:

Embed Size (px):

TRANSCRIPT

Page 1: 4 hadoop for-the-disillusioned

@wattsteve

Hadoop for the disillusioned Steve Watt, Red Hat

CC flickr rubenswieringa

Page 2: 4 hadoop for-the-disillusioned

@wattsteve

Page 3: 4 hadoop for-the-disillusioned

@wattsteve

Wired Magazine - July 2008

Page 4: 4 hadoop for-the-disillusioned

@wattsteve

Hadoop in 2013

CC flickr lowfatbrains

Platform Layers Technologies

Computational Runtimes

YARN, GiRAPH, MapReduce, HBase, Phoenix, Spark/BDAS, Drill, Impala, Stinger & more

FileSystems Azure, CassandraFS, CephFS, CleverSafe, GlusterFS, GridGain, HDFS, LustreMapR FS, S3, SWIFT, Quantcast FS, Symantec VCFS & more

Infrastructures System on a Chip, x86, Virtualization and Cloud

Distributions Cloudera, Hortonworks, IBM, Intel, MapR, WanDisco

Page 5: 4 hadoop for-the-disillusioned

@wattsteveSource: Gartner Hype Cycle

Page 6: 4 hadoop for-the-disillusioned

@wattsteveCC flickr kakadu

Your data is growing beyond your ability to manage & query it

Page 7: 4 hadoop for-the-disillusioned

@wattsteveCC flickr martijnsnels

Save money when asking the same questions of your data

Page 8: 4 hadoop for-the-disillusioned

@wattsteve

Geoffrey Moore’s Technology Adoption Lifecycle

CHASM

Innovators EarlyAdopters

EarlyMajority

LateMajority

Laggards

Hadoop Customer, “Great, but now what?”

Page 9: 4 hadoop for-the-disillusioned

@wattsteveCC flickr cbcastro

new

and build data products

Page 10: 4 hadoop for-the-disillusioned

@wattsteveCC flickr birdwatcher63

Ask your domain experts and LOB folks what unanswered questions they have Where can you get the data you need to answer that question? (domain experts should know

where to get it) Some of this data may be outside your organization (Social Media, Sensor Data, Data

brokerages/Marketplaces, Web Pages) and some of it may be inside. If the data for the query doesn’t exist, figure out how to instrument or gather it. Pair your domain experts with your data engineers so they can work out how to obtain and

massage the data given the types of queries desired

Page 11: 4 hadoop for-the-disillusioned

@wattsteveCC flickr syume

• Building data products is a similar exercise except that it involves typical product planning, such as identifying a market.

• This is also a great way for an organization to explore what assets they have within their data

Page 12: 4 hadoop for-the-disillusioned

@wattsteve

Mapping the night sky

CC flickr bobfamiliar

Page 13: 4 hadoop for-the-disillusioned

@wattsteveCC flickr oxfam

Analyzing farm soil content to predict human conflict

Page 14: 4 hadoop for-the-disillusioned

@wattsteveCC flickr flodigrip

Crisis Management for the Chilean Earthquake

Page 15: 4 hadoop for-the-disillusioned

@wattsteve

Thanks for listening

Steve Watt [email protected]

Hadoop Operations Powered By ... Hadoop (Hadoop Summit 2014 Amsterdam)

Hadoop , Hadoop , Hadoop !!!

On the Energy (In)efficiency of Hadoop: Scale-down Efficiencycsl.stanford.edu/~christos/publications/2009.hadoopenergy.hotpowe… · Hadoop crash-course 4 Hadoop == Distributed Processing

Introduction Apache oozie (Hadoop workflow engine)€¦ · Hadoop Professional Training 4. Apache OOZie HandsOn Professional Training INTRODUCTION APACHE OOZIE (HADOOP WORKFLOW ENGINE)

4. v sphere big data extensions hadoop

Analyzing Hadoop with Hadoop

Hadoop Installation Guide | Hadoop Configuration

Disillusioned of the Dream.pdf

Лекция 4. MapReduce в Hadoop (введение)

ANALISIS INTELIGENTE DE DATOS: MINERIA DE … · analisis multidimensional olap herramientas bi cuadros de mando, kpis 4- big data 5- hadoop ecosistema hadoop distribuciones hadoop

Hadoop Deployment Manual - Hyadespleiades.ucsc.edu/doc/bright/hadoop-deployment-manual.pdf2.2 Ncurses Installation Of Hadoop Using cm-hadoop-setup ... •The Hadoop Deployment Manual

Curso Hadoop. FcoJavierLahozSevilla v1.0.pdf · Introducción+a Hadoop. InstalaciónenAWS • Parte+1.+Introducción+a Hadoop+ – ¿Que+es+Hadoop?+ – Versionesde+Hadoop+ – Gesón

Hadoop Workflow Automatisierung - ca.com · 4 • WHITE PAPER • HADOOP-WORKFLOW-AUTOMATISIERUNG ca.com SECTION 1 Warum eine Workflow-Engine für Hadoop so wichtig ist In der Welt

Continuous Delivery for Linux/Windows/Hadoop...Beta Cluster Hadoop JobTracker Jenkins Slave Hadoop node Hadoop node Hadoop node Hadoop node Slave Node Gateway Prod. Cluster PigServer

Hadoop Present - Open Enterprise Hadoop

Hadoop : The Definitive Guide Chap. 4 Hadoop I/O

ShmStreaming: A Shared Memory Approach for Improving Hadoop Streaming … · 2019. 6. 4. · Hadoop Streaming is a set of extra utilities provided by Hadoop for developing applications

Hadoop Data Analytics 4

Hadoop - fnac-static.com · 2017. 4. 14. · Introduction • Contexte de création d’Hadoop • Architecture in-frastructurelle d’Hadoop • MapReduce • Hadoop • HDFS •

Build your own 4 node virtualized hadoop cluster

On the Energy (In)efficiency of Hadoop: Scale-down …kozyraki/publications/2009...Hadoop crash-course 4 Hadoop == Distributed Processing Framework 1000s of nodes, PBs of data Hadoop

Die 10 wichtigsten Big Data-Technologien · Hadoop - Ein bewährtes Konzept 4 2. Cloudera – Hadoop für Unternehmen 4 3. Apache Hive - Das Data Warehouse für Hadoop 5 4. Cloudera

The Rise of Fascism. Italy after WWI After WWI, most people in Italy were very disillusioned. After WWI, most people in Italy were very disillusioned

Hadoop, Hadoop, Hadoop!!! Jerome Mitchell Indiana University

The Hadoop Ecosystem & HBase - Meetupfiles.meetup.com/3137102/WHUG 4. Hadoop Ecosystem... · 2012-07-13 · The Hadoop Ecosystem & HBase Kai Voigt, Cloudera Inc. Warsaw Hadoop User

Discovering & Protecting Sensitive Data in Hadoop & Protecting Sensitive Data in Hadoop ... Overview © 2014 Dataguise ... • The 4 approaches to address security within Hadoop (Perimeter,

, a disillusioned author obsessed To Love the Coming End In

Hadoop Virtualization: VMware, Inc.Introduction 2 MapReduce 2 Hadoop 2 Virtualizing Hadoop 4 Another Form of Virtualization: Aggregation 5 Benefits of Hadoop in a Private Cloud 7 Agility

Big Data and Analytics - ADA Universityaadamov/sources/slides/bigdata/week-4-BDA-Hadoop-Ecosystem.pdfHadoop 2.0 vs Hadoop 1.0 – Processing The Hadoop Ecosystem Hadoop. Hortonworks

Hadoop operation chaper 4

4.hadoop 2.x feature

Hadoop Administrator … · Hadoop Administration: 1. Types Of Data and Tools used 2. Characteristics Of Big Data 3. Hadoop And Traditional Rdbms 4. Hadoop Core Services and Daemons

Disillusioned Manatees [Piano Duet]

Increasing Hadoop Performance with SanDisk ® SSDs · Increasing Hadoop Performance with SanDisk® Solid State Drives (SSDs) 5 4. Terasort Benchmark Terasort is a standard Hadoop

Configuring the Hadoop Cluster for Use by …support.sas.com/rnd/scalability/grid/hadoop/SGMforHadoop...Configuring the Hadoop Cluster for Use by ... Set Up Shared File System.....4