Transcript
Page 1: Improving DevOps through better monitoring

Improving DevOpsthrough better monitoring

Leon Fayer

@papa_fire

Page 2: Improving DevOps through better monitoring

Who am I ?

• 20+ years of development and operations of large systems

• currently Vice President at OmniTI

• can be found online:

• @papa_fire

• http://fayerplay.com

• github:lfayer

Page 3: Improving DevOps through better monitoring

So …

what is DevOps?

Page 4: Improving DevOps through better monitoring

What is DevOps?

philosophy of collaboration

Page 5: Improving DevOps through better monitoring

… and more

to enable business goals

Page 6: Improving DevOps through better monitoring

Not DevOps

dev ops

Page 7: Improving DevOps through better monitoring

Not DevOps either

devops

Page 8: Improving DevOps through better monitoring

DevOps

devops

Page 9: Improving DevOps through better monitoring

General consensus

Damon Edwards (http://dev2ops.org)

Page 10: Improving DevOps through better monitoring

Missing link

Damon Edwards (http://dev2ops.org)

Page 11: Improving DevOps through better monitoring

Finally, monitoring

enter monitoring

Page 12: Improving DevOps through better monitoring

What to monitor?

“in God we trust

all others we monitor”

Page 13: Improving DevOps through better monitoring

What to monitor specifically?

• systems

• databases

• application

• integration points

• performance

• user behavior

• business processes

Page 14: Improving DevOps through better monitoring

Perfect quote

“ I don’t give a **** if the

datacenter is on fire as long as

I am still making money ”

- CEO

Page 15: Improving DevOps through better monitoring

Example: Twitter

serves over 20 million unique visitors a day

… legendary for downtime

. servers are up and running

. HTTP checks return 200

. tweets lost

Page 16: Improving DevOps through better monitoring

Why monitor?

• software is never perfect

• systems are more and more complex

• proactive is better than reactive

• external dependency worry

• …

Page 17: Improving DevOps through better monitoring

Why really monitor?

things change

Page 18: Improving DevOps through better monitoring

… and when things change

changes effect business

Page 19: Improving DevOps through better monitoring

And now for real example

:case study:

Page 20: Improving DevOps through better monitoring

Setting the stage

• online marketing company

• major e-commerce component

• 90+ million users

• 1 billion emails/months

• 300,000+ lines of code

• ~ 50 physical devices

• 5600+ metrics collected

Page 21: Improving DevOps through better monitoring

It all starts with …

Page 22: Improving DevOps through better monitoring

Let the hunt begin

revenue

Page 23: Improving DevOps through better monitoring

Direct cause check

revenue + traffic

Page 24: Improving DevOps through better monitoring

Going down the stack

revenue + traffic + load time

Page 25: Improving DevOps through better monitoring

Still descending

revenue + traffic + load time + db

Page 26: Improving DevOps through better monitoring

Got ya!

revenue + traffic + load time + db + email

Page 27: Improving DevOps through better monitoring

Keys to monitoring

1. understand business

2. approach top-down

3. correlate data

Page 28: Improving DevOps through better monitoring

Questions?

For more tips & examples:

http://omniti.com/explains/monitoring-the-big-picture


Top Related