customer driven devops at work: operating the digital turnstile [futurestack16 nyc]

36
MLB Advanced Media Customer Driven DevOps at Work: Operating the Digital Turnstile FutureStack ’16 - 09 Aug 2016 Sean Curtis, SVP Software Engineering

Upload: new-relic

Post on 14-Apr-2017

159 views

Category:

Technology


1 download

TRANSCRIPT

Page 1: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

MLB Advanced MediaCustomer Driven DevOps at Work:

Operating the Digital Turnstile

FutureStack ’16 - 09 Aug 2016 Sean Curtis, SVP Software Engineering

Page 2: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Agenda• MLBAM - Brief Intro

• The New Digital Turnstile

• The Evolution of DevOps at MLBAM

• Our Platform

• Our Customers

• NewRelic Helping Power Customer-Driven DevOps

Page 3: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

MLBAM - Who we are• Formed in 2000 to centralize all digital media

distribution on the Internet

• Over 1200 employees, predominantly in NY, San Francisco

• Over half of our staff are in engineering, systems, and operations

• Operate all digital functions for 30 MLB teams, as well as MLB.com league portal

Page 4: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

The Beginning• 2002 - First sports league to live stream on the

internet.

• 30K viewers

• One Device platform: PC

• 280 Kbps

• 244x182

Page 5: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Where we are now…

• 2016 - #1 OTT Sports Streaming Service

• 400+ Devices

• 4.5 Mbps

• 720p HD

Page 6: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

The Digital TurnstileAn Evolution

Page 7: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Turnstiles: Early Stage• One ticket source (theatre)

would sell exclusive tickets to get into a show

• The “bigger” or more popular the movie, the longer the line for access.

• Customer interest inversely correlated to Customer experience…

Page 8: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Friction!

Page 9: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Evolution• Schedule information made widely

available. Customers have greater choice in location and showing time

• Purchase tickets ahead of time from the box office

• Multiple forms of payment or billing.• Purchase tickets online from

multiple sources. Pay ahead of time.

• Pick your particular showing and save your seat!

• The system now scales better to meet customer demand.

Page 10: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

The Digital Turnstile

video, events, data,

media rights

user, location, device,

App Store, etc.

Page 11: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

DevOps at MLBAM

Page 12: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

MLBAM DevOps: A Timeline

2001 2013

MLBAM, like many other Dev shops, has a “silo-ed” approach to

build vs. deploy / test & monitor

2016

MLBAM moves toward DevOps using tools for

build/deploy, configuration and

monitoring

2014

Page 13: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

“Silo”-ed Ownership• Engineers build and test locally, then check in

artifacts to shared repository. Developers then document (when possible) through release notes & run books

• Systems actually does deployment, installing artifacts into test environments

• Developers/Project Managers then publish release to QA to coordinate testing. QA reports findings, eventually signing off on release. Load tests ensue, etc.

• If all testing passes, release to production by Systems.

• Operations now owns first-tier customer response.

Page 14: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

“Silo”-ed Ownership

Dev

Systems/QA

Operations

Page 15: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Challenges we faced• Who actually owns the customer?

• How do we drive transparency (e.g. ubiquitous metrics) across all tiers?

• Does anybody or everybody have deep knowledge of the overall architecture?

• How do we properly escalate issues and priority between tiers?

• What portion of the system has trouble scaling as business demands more aggressive timelines?

Page 16: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

A Shift in Architecture (and more)

Page 17: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Change in Thinking• MLBAM becomes a trusted partner to many large-scale

sports leagues and media brands (just to name a few)

Page 18: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Leverage proven solutions

• Spend more time on the things that matter to your customers.

• Rely on vendors for building / scaling infrastructure

• Opportunity to reimagine our existing systems while iterating on scalability, security and resiliency

Page 19: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

MLBAM DevOps• MLBAM also formalizes an internal team, “Core

Engineering” to productize internal development tracks such as: toolset selection, configuration management, and infrastructure deployment

• Core Engineering engages with customer-facing teams to help “on-board” them to these various products and tools.

• Also helps spread best practices, architectural reviews, lessons learned across all dev teams

Page 20: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Engineers are motivated by…

• Being challenged to learn a new set of skills

• Owning more of the customer facing solution

• Paying down technical debt and making their products and services more scalable and resilient

• Building with transparency

Page 21: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Systems & Operations folks enjoy…

• Crisp architecture with time-tested methods for scaling (bonus points for automation!)

• Redundancy - enough said.

• An explicit networking and security model (and someone who understands ipchains)

• Automated tools for building and configuring infrastructure. (Bonus points for VCS integration!)

• A clear escalation path and chain-of-custody model for customer-facing issues

Page 22: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

QA folks want…• The ability to spin up testing resources at a

moments notice

• Deeper metrics that describe system performance as a function of a recent change

• Sure-fire way of ensuring that iterations to code indeed satisfy the original requirements in all phases of testing

Page 23: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Business leaders want…

• Ubiquitous set of metrics that describe customer response and / or impact

• If / when issues arise, the fastest path to root cause and resolution for the customer

• An approach to scaling that aligns with business growth

Page 24: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Our Platform

+

Page 25: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Platform: AWS• Refactor hosted infrastructure (JMS Brokers, etc.) for

SNS, SQS - Improved scaling and resiliency

• Leverage IAM and other AWS products for access control, security, auditing, white/blacklist, networking, etc.

• Implement autoscaling of microservices for resource-level granularity of usage and cost

• Manage TCO per microservice, understand overprovisioning at the instance and/or product layer

Page 26: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Platform: NewRelic• Leverage full NewRelic platform across all tiers (mobile,

web, service)

• Foster collaboration between operations, systems, engineering, QA, and business teams by providing dashboards with ubiquitous metrics

• Establish fine-grained tolerance thresholds per metric observed to define business-critical incidents

• Integrate incident creation with other internal tools to identify, route, triage, resolve, and provide postmortem metrics

Page 27: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

The New Turnstile

video, events, data,

media rights

user, location, device,

App Store, etc.

Page 28: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Our Customers• Similar to the movie ticketing example, MLBAM

has partnered with many third parties to provide unprecedented selection in how baseball fans can access our teams and league

Page 29: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Turnstile complexity• Authenticating users (AuthN)

and Authorizing them for access (AuthZ) become distributed operations

• In some cases, MLBAM is record holder, but in many cases, third parties sell MLB.tv to their subscribers (e.g. IOS, Pay-TV companies)

Page 30: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Turnstile complexity• We have to scale our turnstile

in order to meet peak demand (i.e. event start time) which differs dramatically between events

• Partner response times are not always equal or predictable

• From time to time, partners will have outages, both scheduled and unscheduled

Page 31: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

How does New Relic help here?

• APM - provides implicit and explicit application performance monitoring

• Helps describe the overall health of each of our services

• Will create incidents and send them to downstream alerting/monitoring systems

• Drives more rapid and accurate insight as to root cause

• Provides single “pane of glass” for Ops, Systems, and Engineers at BAM

Page 32: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

A brief example• The Rangers and Penguins are

playing in a late April NHL matchup that has playoff implications

• Hockey fans are trying to join the game live

• If any of our systems, or our partner systems has a hiccup, how can we quickly isolate any issues and take mitigation steps?

Page 33: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

A Happy Turnstile (Green)

Page 34: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Operational Effects• NewRelic has become a default standard method of

providing simplified views of the state of our infrastructure

• Engineering now first class citizen in operational aspects of running our services, vs code/document-handoff, and have a greater understanding because they own infrastructure as well.

• SysOps now has fullly transparent view of the health of each service, but more importantly, the relationships between services and their dependencies

• Overlapping ownership of the customer!

Page 35: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Overlapping Ownership

Engineering Systems/QA Operations Business

Page 36: Customer Driven DevOps at Work: Operating the Digital Turnstile [FutureStack16 NYC]

Thank You!