appsphere 15 - achieving stability and end-to-end monitoring

44
Achieving stability and End to End Monitoring Stace Baal, eHarmony Inc. Director of Engineering

Upload: appdynamics

Post on 15-Jan-2017

368 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Achieving stability and End to End Monitoring Stace Baal, eHarmony Inc.Director of Engineering

Page 2: AppSphere 15 - Achieving Stability and End-to-End Monitoring

The Problem

The Mess or Miss of Data

What we’re working on

The Rollout

The Business Transaction

The Drill Down

Copyright © 2015 AppDynamics. All rights reserved. 2

Dashboards

Health Rules

Proactive

That shouldn’t happen?!

Out of the datacenter – EUM

What’s next?

Page 3: AppSphere 15 - Achieving Stability and End-to-End Monitoring

THE PROBLEM

Page 4: AppSphere 15 - Achieving Stability and End-to-End Monitoring

The Problem

Page 5: AppSphere 15 - Achieving Stability and End-to-End Monitoring

The Problem

Mature, large brand in the midst of changing architecture• Instability from Monolithic legacy application• Reactive not proactive with production issues• Little visibility into how new services interacted with legacy apps• Lack of data on customer experience

Page 6: AppSphere 15 - Achieving Stability and End-to-End Monitoring

The Problem

Page 7: AppSphere 15 - Achieving Stability and End-to-End Monitoring

THE MESS OR MISS OF DATA

Page 8: AppSphere 15 - Achieving Stability and End-to-End Monitoring
Page 9: AppSphere 15 - Achieving Stability and End-to-End Monitoring

The mess of data

We needed one source to truth to quickly diagnose issues• Fragmentation of monitoring systems• Lots of holes, lack of time for maintenance as services evolve• Ineffective alerting, noise• User facing outages

Page 10: AppSphere 15 - Achieving Stability and End-to-End Monitoring

The mess of data

Copyright © 2015 AppDynamics. All rights reserved. 10

Page 11: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Why APM?

Challenges that caused us to choose AppDynamics• Stability• Time to Market: Needed a turnkey solution• Minimal internal resources• Engineering driven solution• Ability to track and trend baselines• Single pane of glass

Page 12: AppSphere 15 - Achieving Stability and End-to-End Monitoring

THE ROLLOUT

Page 13: AppSphere 15 - Achieving Stability and End-to-End Monitoring

How do we roll this out?

Challenges that caused us to choose AppDynamics• Deploy• Define• Diagnose• Notify• Discovery

Tiers

Business Transactions

Dashboards

Health Rules

Snapshots

Page 14: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Rolling out AppDynamics

Decide what to cover and how to divide it up• Create applications• Define and organize tiers• Deploy

– Start Front to Back– Chef Integration– Staging – Production, rolling deploy

• Wow, that went fast… – Time to market vs. homgrown solutions

Page 15: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Example call graph

Page 16: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Rolling out AppDynamics

Copyright © 2015 AppDynamics. All rights reserved. 16

• 4 Applications– User Front– User Back– Batch– Staging

• 38 Tiers (and counting)• EUM, Web & Mobile

Page 17: AppSphere 15 - Achieving Stability and End-to-End Monitoring

THE BUSINESS TRANSACTION

Page 18: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Defining Business Transactions

Copyright © 2015 AppDynamics. All rights reserved. 18

Page 19: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Defining Business Transactions

BT: Business transaction correlated across service tiers• Visibility into what is most important• Start simple, broad transactions key to the business• Add granularity over time• May take some time, but it’s worth it

Page 20: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Defining Business Transactions

Copyright © 2015 AppDynamics. All rights reserved. 20

One Transaction

Several URLs

Page 21: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Defining Business Transactions

Copyright © 2015 AppDynamics. All rights reserved. 21

One Transaction

Correlated Tiers

Page 22: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Defining Business Transactions

BT is the foundation for all diagnostic tools• Correlation• Trends and dynamic baselines• Health rules• Drilldown and discovery

Page 23: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Defining Business Transactions

Copyright © 2015 AppDynamics. All rights reserved. 23

Page 24: AppSphere 15 - Achieving Stability and End-to-End Monitoring

THE DRILLDOWN

Page 25: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Drilling down into Transaction Snapshots

Your diagnostic window into a transaction• Single pane view across tiers• Deep code-level visibility• Database and exit calls• Custom diagnostic data• Server health and utilization

Page 26: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Drilling down into Transaction Snapshots

Copyright © 2015 AppDynamics. All rights reserved. 26

Page 27: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Drilling down into Transaction Snapshots

Copyright © 2015 AppDynamics. All rights reserved. 27

Page 28: AppSphere 15 - Achieving Stability and End-to-End Monitoring

DASHBOARDS

Page 29: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Dashboards

Page 30: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Dashboards

Your tool to bring everything together• At a glance diagnostic and status• Comprehensive tier and node monitoring• Connecting the dots between systems• Trending over time• Easily surface data between teams

Page 31: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Dashboards

Copyright © 2015 AppDynamics. All rights reserved. 31

Traffic

Performance

Page 32: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Dashboards

Copyright © 2015 AppDynamics. All rights reserved. 32

WEB

API

Service

Page 33: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Dashboards

Copyright © 2015 AppDynamics. All rights reserved. 33

Page 34: AppSphere 15 - Achieving Stability and End-to-End Monitoring

PROACTIVE

Page 35: AppSphere 15 - Achieving Stability and End-to-End Monitoring

So now what?

• Data and tools drive a cultural shift• Alert and surface issues before than can become outages• Health Rules are able to be predictive• Socialize the system beyond on-call • Empower teams and increase visibility• Encourage engineers to drilldown and optimize

Page 36: AppSphere 15 - Achieving Stability and End-to-End Monitoring

THAT SHOULDN’T HAPPEN?!

Page 37: AppSphere 15 - Achieving Stability and End-to-End Monitoring

That shouldn’t happen?!

Page 38: AppSphere 15 - Achieving Stability and End-to-End Monitoring

That shouldn’t happen?!

That Random Spike

Copyright © 2015 AppDynamics. All rights reserved. 38

Page 39: AppSphere 15 - Achieving Stability and End-to-End Monitoring

That shouldn’t happen?!

After the release

Copyright © 2015 AppDynamics. All rights reserved. 39

Page 40: AppSphere 15 - Achieving Stability and End-to-End Monitoring

OUT OF THE DATACENTER - EUM

Page 41: AppSphere 15 - Achieving Stability and End-to-End Monitoring

End user Monitoring

Extending visibility to the end user experience• Web: Pages & Ajax Requests• Mobile: Network Requests• Mobile: Crash Reporting• Custom Events & Timers• User snapshots correlated with BT snapshots• Justify and prioritize infrastructure and application enhancements• Correlate performance to business metrics • CDN and 3rd party monitoring

Page 42: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Stability & End to End Monitoring

Copyright © 2015 AppDynamics. All rights reserved. 42

Page 43: AppSphere 15 - Achieving Stability and End-to-End Monitoring

WHAT’S NEXT?

Page 44: AppSphere 15 - Achieving Stability and End-to-End Monitoring

Thank You