(APP203) How Sumo Logic and Anki Build Highly Resilient Services on AWS to Manage Massive Usage...

November 12, 2014 | Las Vegas, NV | APP203 | Ben Whaley, Anki | Christian Beedgen, Sumo Logic

Upload: amazon-web-services

Post on 01-Jul-2015

Category: Technology

DESCRIPTION

In just two years, Sumo Logic's multitenant log analytics service has scaled to query more than 10 trillion logs each day. Christian, Sumo Logic's cofounder and CTO, shares the three most important lessons he has learned in building such a massive service on AWS. Ben Whaley is an AWS Community Hero who works for Anki as an AWS cloud architect. Ben uses hundreds of millions of logs to troubleshoot and improve Anki Drive, the coolest battle robot racing game on the planet. This is an ideal session for cloud architects who are constantly looking to improve scalability and application performance on AWS. Sponsored by Sumo Logic.

TRANSCRIPT

Page 1: (APP203) How Sumo Logic and Anki Build Highly Resilient Services on AWS to Manage Massive Usage Spikes | AWS re:Invent 2014

November 12, 2014 | Las Vegas, NV

APP203

Ben Whaley, Anki

Christian Beedgen, Sumo Logic

Page 2:

Introductions

Chasing Infinity

Obvious Versus Hard

Page 3:

Co-Founder & CTO Chief Architect

Page 4:

Applications

Internet of Things

Network

Mobile

Search

Visualize

Predict

Page 5:

[Chart: monthly growth from May 2012 to October 2014. Series: GB per Day, Searches. Axis ranges: 0 to 1,200,000 and 0 to 16,000.]

Page 6:

AWS Infrastructure Lead Cloud Architect

Page 7:
Page 8:
Page 9:
Page 10:

In 2010, we knew that success would look something like this…


Page 12:
Page 13:
Page 14:

Just one typical Sumo Logic customer - 8x Variance!


Page 16:
Page 17:
Page 18:
Page 19:
Page 20:

Datacenter As An API!!!

Page 21:

A++++++++++++!! Will Buy Again

Page 22:
Page 23:

$ bin/receiver prod.service-registry.sumologic.com

Page 24:
Page 25:
Page 26:
Page 27:
Page 28:
Page 29:

Here We Are In 2014

Page 30:
Page 31:
Page 32:
Page 33:
Page 34:

[Diagram: service architecture. Groups: Deployment wide services, Ingest, Search, Internal tools. Services shown: receiver, hornetq-forge, forge, cqsplitter, search, cloud collector, service, api, concierge, stream, katta, glass, ganglia, bill, mix, meta, config, zookeeper, appvault, org, raw, hornetq-inbound, cocoa, bloom filter, analyticscsi, cqmerger, rework, view, autoview, depman, hornetq-internal, hornetq-metadata, nrt.]

Page 35:
Page 36:

Embrace Change Or Die

Page 37:
Page 38:
Page 39:
Page 40:
Page 41:

[Diagram: grid of 25 Index nodes, unlabeled.]

Page 42:

[Diagram: grid of 25 Index nodes, each labeled 1.]

Page 43:

[Diagram: grid of 25 Index nodes, each labeled 1 and 2.]

Page 44:

[Diagram: grid of 25 Index nodes, each labeled 1 through 8.]

Page 45:

[Diagram: grid of 25 Index nodes; label 1 appears on four of them.]

Page 46:

[Diagram: grid of 25 Index nodes; label 1 appears on four of them and label 2 on six.]

Page 47:

[Diagram: grid of 25 Index nodes; labels 1 through 8 are spread across the grid, each label appearing on only some of the nodes.]

Page 48:

(Set[Customer], Set[Instance]) => Map[Instance, Set[Customer]]
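
The types on this slide read as a function that takes a set of customers and a set of instances and returns, for each instance, the set of customers it serves. The slides do not say how that mapping is computed, so the following is only a minimal sketch under stated assumptions: it uses rendezvous (highest-random-weight) hashing, which is a swapped-in technique and not necessarily what Sumo Logic runs. The names `assign`, `_score`, and `per_customer`, and the `customer-N` / `index-N` identifiers, are hypothetical.

```python
import hashlib
from typing import Dict, Set


def _score(customer: str, instance: str) -> int:
    """Deterministic pseudo-random score for a (customer, instance) pair."""
    digest = hashlib.sha256(f"{customer}|{instance}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def assign(
    customers: Set[str], instances: Set[str], per_customer: int = 2
) -> Dict[str, Set[str]]:
    """Map each instance to the set of customers placed on it.

    Assumption: each customer goes to the `per_customer` instances with the
    highest score (rendezvous hashing), so a customer touches only a subset
    of the cluster instead of every instance.
    """
    if per_customer < 1:
        raise ValueError("per_customer must be at least 1")
    placement: Dict[str, Set[str]] = {instance: set() for instance in instances}
    # Never ask for more instances than exist.
    k = min(per_customer, len(instances))
    for customer in customers:
        # Rank instances for this customer; the instance name breaks ties.
        ranked = sorted(
            instances,
            key=lambda inst: (_score(customer, inst), inst),
            reverse=True,
        )
        for instance in ranked[:k]:
            placement[instance].add(customer)
    return placement


if __name__ == "__main__":
    # Hypothetical sizes: 8 customers spread over 25 index instances.
    customers = {f"customer-{n}" for n in range(1, 9)}
    instances = {f"index-{n}" for n in range(1, 26)}
    for instance, assigned in sorted(assign(customers, instances, 3).items()):
        if assigned:
            print(instance, sorted(assigned))
```

One reason to pick a hash-based placement in a sketch like this is that the result depends only on the two input sets, so any process that knows the same customers and instances computes the same map without coordination.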

Page 49:
Page 50:
Page 51:
Page 52:
Page 53:

http://bit.ly/awsevals