lambda architecture with cassandra (vaibhav puranik, gumgum) | c* summit 2016

21
Vaibhav Puranik Lambda Architecture with Cassandra

Upload: datastax

Post on 16-Apr-2017

375 views

Category:

Software


2 download

TRANSCRIPT

Page 1: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

Vaibhav Puranik

Lambda Architecture with Cassandra

Page 2: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

About me

Page 3: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

Invented In Image advertising in 2008http://gumgum.com/gallery

Computer VisionProcessing 2.6B image impressions / month10B events / day2000 premium publishers165 employees in 3 continents

Page 4: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016
Page 5: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016
Page 6: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

Lambda Architecture

Page 7: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

Lambda Architecture

Page 8: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

Architecture Before Cassandra

500 Ad Servers

Page 9: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

Caps

Geo Filter

Keyword Filter

Cap Filter

Other Filters

Sort by revenue and pick one

Page 10: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

Real Time Pipeline - The Problem500 Ad Servers

Page 11: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

The Problem

6 minsroundtrip time

Page 12: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

Enter Cassandra - Phase 1500 Ad Servers

Cassandra: US East - 21 Nodes, US West: 18 Nodes, EU West: 12 Nodes - r3.2xlarge

Page 13: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

Caps

Geo Filter

Other Filters

Cassandra Cap Filter

Old Cap Filter

Sort by revenue and pick one

Page 14: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

The Solution

6 mins 10 ms

Page 15: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

Enter Cassandra - Phase 2500 Ad Servers

Cassandra: US East - 21 Nodes, US West: 18 Nodes, EU West: 12 Nodes - r3.2xlarge

Load data every 8 mins async way

Page 16: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

Caps

Geo Filter

Other Filters

Cassandra RealtimeCap Filter

Cassandra Batch Data Filter

Sort by revenue and pick one

Page 17: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016
Page 18: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016
Page 19: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

Scale

21 + 18 + 12 + 9 = 60 r3.2xlarge instances (Metadata)

9 + 6 + 3 = 18 r3.2xlarge instances (Tally)

400,000 records loaded every 8 mins by each Ad Server

20,000 read requests per second

10,000 write requests per second

Page 20: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

Batch Layer

Realtime View

Batch Views

Lambda Architecture

Speed Layer

Page 21: Lambda Architecture with Cassandra (Vaibhav Puranik, GumGum) | C* Summit 2016

gumgum.com/careers