Improving DevOps through better monitoring Leon Fayer @papa_fire

Upload: leon-fayer

Post on 07-Jul-2015

DESCRIPTION

Some developers believe that monitoring is a function of the operations team. Some operations teams firmly believe that monitoring the systems they maintain is sufficient to run the business successfully. Most of them are wrong. The complexity of today’s applications has gone far beyond the capabilities of “traditional” system-level monitoring tools and approaches, and requires a much broader knowledge of the business and the application as a whole. The goal of DevOps is to connect all aspects of application development and operations, and monitoring provides the visibility and troubleshooting tools to accomplish that goal. This talk provides real-world examples of common gaps in monitoring approaches and explains why holistic instrumentation of business and functionality monitors should be part of any project scope.

TRANSCRIPT

Page 1: Improving DevOps through better monitoring

Improving DevOps through better monitoring

Leon Fayer

@papa_fire

Page 2: Improving DevOps through better monitoring

Who am I?

• 20+ years of development and operations of large systems

• currently Vice President at OmniTI

• can be found online:

• @papa_fire

• http://fayerplay.com

• github:lfayer

Page 3: Improving DevOps through better monitoring

So …

what is DevOps?

Page 4: Improving DevOps through better monitoring

What is DevOps?

philosophy of collaboration

Page 5: Improving DevOps through better monitoring

… and more

to enable business goals

Page 6: Improving DevOps through better monitoring

Not DevOps

dev ops

Page 7: Improving DevOps through better monitoring

Not DevOps either

devops

Page 8: Improving DevOps through better monitoring

DevOps

devops

Page 9: Improving DevOps through better monitoring

General consensus

Damon Edwards (http://dev2ops.org)

Page 10: Improving DevOps through better monitoring

Missing link

Damon Edwards (http://dev2ops.org)

Page 11: Improving DevOps through better monitoring

Finally, monitoring

enter monitoring

Page 12: Improving DevOps through better monitoring

What to monitor?

“in God we trust

all others we monitor”

Page 13: Improving DevOps through better monitoring

What to monitor specifically?

• systems

• databases

• application

• integration points

• performance

• user behavior

• business processes

Page 14: Improving DevOps through better monitoring

Perfect quote

“I don’t give a **** if the

datacenter is on fire as long as

I am still making money”

- CEO

Page 15: Improving DevOps through better monitoring

Example: Twitter

serves over 20 million unique visitors a day

… legendary for downtime

• servers are up and running

• HTTP checks return 200

• tweets lost
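
The gap on this slide (servers up, HTTP checks green, messages still lost) can be illustrated with a small sketch. This is not Twitter's monitoring or any real API: `FakeMessageService`, `http_check` and `functional_check` are hypothetical names, and the service is an in-memory stand-in so the example runs without a network. It only shows why a check that exercises the business function catches a failure that a status-code check misses.

```python
import uuid
from dataclasses import dataclass, field


@dataclass
class FakeMessageService:
    """Hypothetical in-memory stand-in for a message-posting service."""
    drop_writes: bool = False  # when True, simulates messages being lost
    _store: dict = field(default_factory=dict)

    def health(self) -> int:
        # Shallow endpoint: the process is up, so it always answers 200.
        return 200

    def post(self, message_id: str, text: str) -> int:
        if not self.drop_writes:
            self._store[message_id] = text
        # The caller sees success even when the write was silently dropped.
        return 200

    def get(self, message_id: str):
        return self._store.get(message_id)


def http_check(service: FakeMessageService) -> bool:
    """System-level check: passes whenever the endpoint answers 200."""
    return service.health() == 200


def functional_check(service: FakeMessageService) -> bool:
    """Business-level check: post a probe message and read it back."""
    probe_id = f"probe-{uuid.uuid4()}"
    service.post(probe_id, "monitoring probe")
    return service.get(probe_id) == "monitoring probe"


if __name__ == "__main__":
    for drop in (False, True):
        svc = FakeMessageService(drop_writes=drop)
        print(f"drop_writes={drop}: http_ok={http_check(svc)}, "
              f"functional_ok={functional_check(svc)}")
```

With `drop_writes=True` the HTTP check still passes while the functional check fails, which is the situation the slide describes.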

Page 16: Improving DevOps through better monitoring

Why monitor?

• software is never perfect

• systems are more and more complex

• proactive is better than reactive

• external dependencies are a worry

• …

Page 17: Improving DevOps through better monitoring

Why really monitor?

things change

Page 18: Improving DevOps through better monitoring

… and when things change

changes affect business

Page 19: Improving DevOps through better monitoring

And now for a real example

:case study:

Page 20: Improving DevOps through better monitoring

Setting the stage

• online marketing company

• major e-commerce component

• 90+ million users

• 1 billion emails/month

• 300,000+ lines of code

• ~ 50 physical devices

• 5600+ metrics collected

Page 21: Improving DevOps through better monitoring

It all starts with …

Page 22: Improving DevOps through better monitoring

Let the hunt begin

revenue

Page 23: Improving DevOps through better monitoring

Direct cause check

revenue + traffic

Page 24: Improving DevOps through better monitoring

Going down the stack

revenue + traffic + load time

Page 25: Improving DevOps through better monitoring

Still descending

revenue + traffic + load time + db

Page 26: Improving DevOps through better monitoring

Got ya!

revenue + traffic + load time + db + email
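
The walk down the stack on the last few slides (revenue, then traffic, load time, db, email) might be sketched roughly as follows. The layer names come from the slides; everything else is an assumption made for illustration: the `Reading` type, the `deviates` and `drill_down` helpers, the 20% tolerance, and all of the numbers, which are invented and are not data from the case study.

```python
from typing import List, NamedTuple


class Reading(NamedTuple):
    """One layer of the stack with a normal value and the current value."""
    name: str
    baseline: float
    current: float


def deviates(reading: Reading, tolerance: float = 0.2) -> bool:
    """True when current differs from baseline by more than `tolerance` (a fraction)."""
    if reading.baseline == 0:
        return reading.current != 0
    change = abs(reading.current - reading.baseline) / abs(reading.baseline)
    return change > tolerance


def drill_down(stack: List[Reading], tolerance: float = 0.2) -> List[str]:
    """Walk the stack from the business metric downward; list layers that deviate."""
    return [r.name for r in stack if deviates(r, tolerance)]


# Invented numbers, ordered top-down as on the slides.
STACK = [
    Reading("revenue", 100.0, 60.0),
    Reading("traffic", 1000.0, 620.0),
    Reading("load time", 1.2, 1.25),
    Reading("db", 500.0, 510.0),
    Reading("email", 1000.0, 400.0),
]

if __name__ == "__main__":
    print(drill_down(STACK))
```

The point of the ordering is that the search starts at the metric the business cares about and only descends to lower layers to explain a deviation already seen above.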

Page 27: Improving DevOps through better monitoring

Keys to monitoring

1. understand the business

2. approach top-down

3. correlate data
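
One way to read "correlate data" is to rank candidate metrics by how closely they move with the business metric. The sketch below uses a hand-written Pearson correlation for that purpose. It is an illustration, not the method used in the talk: the function names, the metric names and the sample series are all hypothetical.

```python
from math import sqrt
from typing import Dict, List, Sequence, Tuple


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length series."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("series must have equal length of at least 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a flat series carries no correlation signal
    return cov / sqrt(var_x * var_y)


def rank_by_correlation(
    target: Sequence[float], candidates: Dict[str, Sequence[float]]
) -> List[Tuple[str, float]]:
    """Sort candidate metrics by absolute correlation with the target, strongest first."""
    scores = [(name, pearson(target, series)) for name, series in candidates.items()]
    return sorted(scores, key=lambda item: abs(item[1]), reverse=True)


if __name__ == "__main__":
    # Invented sample series, one value per time interval.
    revenue = [100, 98, 101, 70, 65, 62]
    candidates = {
        "emails_sent": [50, 49, 51, 30, 28, 27],
        "cpu_load": [0.40, 0.55, 0.35, 0.50, 0.38, 0.52],
    }
    for name, score in rank_by_correlation(revenue, candidates):
        print(f"{name}: {score:+.2f}")
```

A high score only marks a metric as worth a closer look; correlation on its own does not establish the cause.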

Page 28: Improving DevOps through better monitoring

Questions?

For more tips & examples:

http://omniti.com/explains/monitoring-the-big-picture