die lmax-architecture with disruptors: 6m transactions per ...donatas/vadovavimas/temos... · costs...

Die LMAX-Architecture with Disruptors: 6M Transactions per Second

Stephan Schmidt, Vice CTO, brands4friends

Me Stephan Schmidt Vice CTO brands4friends

@codemonkeyism www.codemonkeyism.com stephan.schmidt@brands4friends.de

brands4friends No.1 Shopping Club in Germany > 360k daily visitors > 4.5M Users eBay company

20.04.12 5 WJAX 2011

Development at brands4friends Team Java and web developers, data warehouse developers Process Scrum since 2009 Kanban for DWH since 2012

LMAX - The London Multi-Asset Exchange

20.04.12 Fußzeilentext 9

"We aim to build the highest performance financial exchange in the world"

High Performance Transaction Processing

20.04.12 10 Fußzeilentext

Service / Transaction Processor

Receive Unmarshal ReplicateJournal Business Logic Marshall Send

Ghz CPU

Actors? SEDA?

Stuff that did not work for various reasons

1.  RDBMS

2.  Actors

3.  SEDA

4.  J2EE …

LMAX Architecture

Node Node Node Node

Linked List Queue

Add Remove

Array Queue

Cache Line Cache Line

AddRemove

Queue as a data structure Problems with Queues

1.  Reading (Take) and Writing (Add) are both write access => Write Contention

2.  Write Contention solves with Locks 1.  Other solutions include Deques

3.  Locks lead to context switches to the kernel 1.  Context switches lead to CPU cache misses etc.

2.  Kernel might use opportunity to do other stuff as well

Locks Costs according to LMAX Paper

Method Time in ms Single Thread 300 Single Thread mit Lock 10.000 Zwei Threads mit Lock 224.000 Single Thread mit CAS 5.700 Zwei Threads mit CAS 30.000 Single Thread/ Volatile Write

“Compare And Swap” Atomic Reference etc. in Java => No Context Switch Memory Read/Write Barrier

LMAX Data Structure – Ring Buffer

Ring Buffer

Publisher Event Processor

Pre-Allocation of Buckets

Ring Buffer

Publisher

30 29 28272625

17161514131211

Event Processor

2^5•  No (less) GC problems •  Objects are near each other in memory

=> cache friendly

Coordination

Ring Buffer

Publisher

30 29 28272625

17161514131211

Event Processor

Claim Strategy

1.Claim 2.Write 3.Make Public by advancing sequence

Wait Strategy

Latency

Receive Message

Journal

Replicate

Unmarshall

Business Logic

Datenstruktur

Ouput DisruptorOuput DisruptorInput Disruptor Ouput Disruptor

Business Logic Handler

LMAX Architektur

Input Disruptor

Receiver

Journaler

Replicator

Un-Marshaller

Output Disruptor

Publisher

Marshaller

HA Node

File System

Jede Stage kann mehrere Threads haben

Receiver

Journaler

Replicator

Receiver writes on 31. Journaler and Replicator read on 24 and can move up the sequence to 30.

Business Logic Handler needs to stay behind all others.

Un-Marshaller can move beyond Journaler and Replicator up to 30.

Un-Marshaller

Java API

LMAX Low Level Ideas

1.  Simple Code

2.  Everything in memory

3.  Single threaded per CPU for business logic

4.  Business logic has no I/O, I/O is done somewhere else

5.  Scheduler “knows” dependencies of handlers

6M TPS? How did LMAX do it?

10K+ TPS

If you don't do anything stupid

3 billions of instructions on modern CPU

100K+ TPS

Clean organized code

Standard libraries

1000K+ TPS

Custom, cache friendly collections

Performance Testing

Controlled GC

Very well modeled domain

We’re looking for very good developers

Thanks! @codemonkeyism stephan.schmidt@brands4friends.de

Images CC from Flickr: nimboo, imjustcreative, gremionis, justonlysteve, John_Scone, Matthias Wicke, irisgodd3ss, TunnelBug, alandd, seasonal wanderer, raulbarraltamayo, Gilmoth, Dunechaser, graftedno1

Sources

“Disruptor: High performance alternative to bounded queues for exchanging data between concurrent threads”, Martin Thompson, Dave Farley, Michael Barker, Patricia Gee, Andrew Stewart, 2011

"The LMAX Architecture”, Martin Fowler, 2011

http://martinfowler.com/articles/lmax.html

“How to do 100K+ TPS at less than 1ms latency”, Martin Thompson, Michael Barker, 2010

die lmax-architecture with disruptors: 6m transactions per ...donatas/vadovavimas/temos... · costs...

Documents

hour start leq (dba) lmax (dba) lmin (dba) l90 (dba) l50...

lmax low profile/high current power...

lmax architecture

lmax global trading manual 2018 · pdf filelmax global is a...

thread gauges | thread ring gages | thread check

lmax global api faqsengland and wales (number 10819525). our...

threads thread synchronization - computer...

gewindedrehen thread turning gewindefrÄsen thread …

terms of business - lmax digital · lmax digital is a...

lmax disruptor 3 - slides.yowconference.com · lmax...

lmax exchange rulebook

l. Šimanskienės, a. seiliaus knygos „komandos: samprata,...

threads and fasteners thread symbols. screw thread terms:...

trading manual - lmax exchange · lmax global is a trading...

scalability, availability & stability...

powerpoint presentation · average roundtrip latency of...

2015 vhm / pm− hss− werkzeuge · -thread milling cutter...

allocating manpower to minimize lmax in a job shop - a...

lmax global api faqs · account management team using...

assunto d michelson contrasto: lmax-lmin lmax+lmin lmax+lmin...