beyond tco

27
2016-06-29 Beyond TCO Architecting Hadoop for adoption and data applications Reid Levesque – Head, Solution Engineering

Upload: dataworks-summithadoop-summit

Post on 16-Apr-2017

355 views

Category:

Technology


0 download

TRANSCRIPT

Page 1: Beyond TCO

2016-06-29

Beyond TCOArchitecting Hadoop for adoption and data applications

Reid Levesque – Head, Solution Engineering

Page 2: Beyond TCO

Introduction

Page 3: Beyond TCO

Topics

Technology

Use cases

Deployment Impact Next

steps

Page 4: Beyond TCO

Technology – Let’s talk Hadoop

Page 5: Beyond TCO

Every company is a technology company…

some just don’t know it yet.

Page 6: Beyond TCO

Traditional systems under pressure

Conventional wisdom• Put the code on an Application Server• Move the data to/from database• Move the data to/from NASReality check• This works well for small amounts of data• As data volumes increase this design falls apart

Page 7: Beyond TCO

Hadoop to the rescue

Enterprise

Page 8: Beyond TCO

How do we get Hadoop into the organization?

Page 9: Beyond TCO

How about these use cases?

File archive +Hadoop

Data-intensive grid compute analytics

Database replacement

ETL off-load +Hadoop

+Hadoop

+Hadoop

• Data is online; no need for tape backup

• Cheaper than NAS / SAN

• Increased performance / scalability

• Metadata is easier to get; all the data is in one spot

• Improved performance

• Lower TCO

• Reduced dependence on proprietary software

• Reduce RDBMS licensing

• Reduced operational cost for analysis

• Improved functionality with stored XML

• Lower TCO

• Additional analytic capability

• Better hardware utilization

• Lower platform management

Page 10: Beyond TCO

Not so much

File archive +Hadoop

Data-intensive grid compute analytics

Database replacement

ETL off-load +Hadoop

+Hadoop

+HadoopTCO

Page 11: Beyond TCO

Which use case did work?

Current batch was taking 4 hours; which limited the way they did their job

Users wanted interactive response times to design and test their financial models

This was net new functionality that could only be achieved in Hadoop

Page 12: Beyond TCO

Now TCO makes more sense

File archive +Hadoop

Data-intensive grid compute analytics

Database replacement

ETL off-load +Hadoop

+Hadoop

+Hadoop

With Hadoop TCO covered, previous use cases are now more compelling.

Page 13: Beyond TCO

How do we deploy this?

Page 14: Beyond TCO

Which distribution?

Pick one:

Page 15: Beyond TCO

Time to pick the hardware

Is this true?

Page 16: Beyond TCO

Commodity hardware + commodity networking = bad architecture

Page 17: Beyond TCO

Before there was Hadoop, there were enterprise IT standards

To name a few conflicts during the rollout…

• Local account UID / names• OS settings• Root access• File locations• Standard mount sizes• Enterprise Active Directory• Monitoring systems

Hadoop is NOT flexible on deployment requirements

Page 18: Beyond TCO

Who does the work?

Single team including:• Dedicated infrastructure team (Compute, Network, Data Center, Operations)• Dedicated Hadoop team (sysadmin/operations, engineering)• Hardware vendor engineers• Hadoop distribution engineers

Page 19: Beyond TCO

Into production we go!

Page 20: Beyond TCO

What was the impact?

Page 21: Beyond TCO

Changing perceptions

Page 22: Beyond TCO

Impact across the organization

Infrastructure• Networking / Data Center designs• Relationship with storage, cloud,

virtualization capabilities• Generating analytic use cases

Development• Mega-attractor for talent• Application consolidation• Shifting from IT to business focus

Management• Understanding (or accepting) new

paradigm• Cross-department architecture

alignment• Data-focus rather than application-

focus

Business• Continuously evolving understanding of

capability / possibilities• Next generation IT w/ rapidly evolving

ecosystem• Self-service innovation for business

users

Page 23: Beyond TCO

Lessons Learned

Hadoop doesn’t remove hardware maintenance

Hadoop development is still development!

New paradigm – requires skilled developers

A whole new set of error messages to decode

There aren’t that many experts

Page 24: Beyond TCO

Where do we go next?

Page 25: Beyond TCO

Self-service tools

Page 26: Beyond TCO

Selling Hadoop internally• This journey has taught me a lot about Hadoop; more than most people at the organization• The biggest tasks are educating the organization and doing simple things as a first step

Page 27: Beyond TCO

Thank You