large scale accumulo - koverse data platform - koverse orders of magnitude: running large scale...

172
Four Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014

Upload: trankien

Post on 10-Mar-2018

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Four Orders of Magnitude: Running Large Scale Accumulo Clusters

Aaron Cordova Accumulo Summit, June 2014

Page 2: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Scale, Security, Schema

Page 3: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Scale

Page 4: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

to scale1 - (vt) to change the size of something

Page 5: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

“let’s scale the cluster up to twice the original size”

Page 6: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

to scale2 - (vi) to function properly at a large scale

Page 7: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

“Accumulo scales”

Page 8: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

What is Large Scale?

Page 9: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Notebook Computer

• 16 GB DRAM

• 512 GB Flash Storage

• 2.3 GHz quad-core i7 CPU

Page 10: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Modern Server

• 100s of GB DRAM

• 10s of TB on disk

• 10s of cores

Page 11: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Large ScaleLaptop Server 10 Node

Cluster100

Nodes1000

Nodes10,000 Nodes

10 GB

100 GB

1 TB

10 TB

100 TB

1 PB

10 PB

100 PB

In RAM On Disk

Page 12: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Data Composition

0

45

90

135

180

January February March April

Original Raw Derivative QFDs Indexes

Page 13: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Accumulo Scales

• From GB to PB, Accumulo keeps two things low:

• Administrative effort

• Scan latency

Page 14: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Scan Latency

0

0.013

0.025

0.038

0.05

0 250 500 750 1000

Page 15: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Administrative Overhead

0

3

6

9

12

0 250 500 750 1000

Failed Machines Admin Intervention

Page 16: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Accumulo Scales

• From GB to PB three things grow linearly:

• Total storage size

• Ingest Rate

• Concurrent scans

Page 17: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Ingest Benchmark

0

25

50

75

100

0 250 500 750 1000

Milli

ons

of e

ntrie

s pe

r sec

ond

Page 18: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

AWB Benchmark

http://sqrrl.com/media/Accumulo-Benchmark-10312013-1.pdf

Page 19: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

1000 machines

Page 20: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

100 M entries written per second

Page 21: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

408 terabytes

Page 22: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

7.56 trillion total entries

Page 23: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Graph Benchmark

http://www.pdl.cmu.edu/SDI/2013/slides/big_graph_nsa_rd_2013_56002v1.pdf

Page 24: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

1200 machines

Page 25: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

4.4 trillion vertices

Page 26: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

70.4 trillion edges

Page 27: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

149 M edges traversed per second

Page 28: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

1 petabyte

Page 29: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Graph Analysis

Billions of Edges

1

100

10000

Twitter Yahoo! Facebook Accumulo

70,000

1,000

6.61.5

Page 30: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Accumulo is designed after Google’s BigTable

Page 31: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

BigTable powers hundreds of applications at Google

Page 32: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

BigTable serves 2+ exabytes

http://hbasecon.com/sessions/#session33

Page 33: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

600 M queries per second organization wide

Page 34: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

From 10 to 10,000

Page 35: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Starting with ten machines 101

Page 36: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

One rack

Page 37: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

1 TB RAM

Page 38: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

10-100 TB Disk

Page 39: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Hardware failures rare

Page 40: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Test Application Designs

Page 41: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Designing Applications for Scale

Page 42: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Keys to Scaling

1. Live writes go to all servers

2. User requests are satisfied by few scans

3. Turning updates into inserts

Page 43: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Keys to Scaling

Writes on all servers Few Scans

Page 44: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Hash / UUID KeysRowID Col Value

af362de4 Bob

b23dc4be Annie

b98de2ff Joe

c48e2ade $30

c7e43fb2 $25

d938ff3d 32

e2e4dac4 59

e98f2eab3 43

Key Value

userA:name Bob

userA:age 43

userA:account $30

userB:name Annie

userB:age 32

userB:account $25

userC:name Joe

userC:age 59

Uniform writes

Page 45: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

MonitorParticipating Tablet Servers

MyTable

Servers Hosted Tablets … Ingest

r1n1 1500 200k

r1n2 1501 210k

r2n1 1499 190k

r2n2 1500 200k

Page 46: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Hash / UUID KeysRowID Col Value

af362de4 Bob

b23dc4be Annie

b98de2ff Joe

c48e2ade $30

c7e43fb2 $25

d938ff3d 32

e2e4dac4 59

e98f2eab3 43

3 x 1-entry scans on 3 servers

get(userA)

Page 47: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Keys to Scaling

Writes on all servers Few Scans

Hash / UUID Keys

Page 48: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Group for LocalityKey Value

userA:name Bob

userA:age 43

userB:name Annie

userB:age 32

userC:name Fred

userC:age 29

userD:name Joe

userD:age 59

Key Value

userA:name Bob

userA:age 43

userA:account $30

userB:name Annie

userB:age 32

userB:account $25

userC:name Joe

userC:age 59

RowID Col Value

af362de4 name Annie

af362de4 age 32

af362de4 account $25

c48e2ade name Joe

c48e2ade age 59

e2e4dac4 name Bob

e2e4dac4 age 43

e2e4dac4 account $30

Still fairly uniform writes

Page 49: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Group for LocalityRowID Col Value

af362de4 name Annie

af362de4 age 32

af362de4 account $25

c48e2ade name Joe

c48e2ade age 59

e2e4dac4 name Bob

e2e4dac4 age 43

e2e4dac4 account $30

1 x 3-entry scan on 1 server

get(userA)

Page 50: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Keys to Scaling

Writes on all servers Few Scans

Grouped Keys

Page 51: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Temporal KeysKey Value

userA:name Bob

userA:age 43

userB:name Annie

userB:age 32

userC:name Fred

userC:age 29

userD:name Joe

userD:age 59

Key Value

20140101 44

20140102 22

20140103 23

RowID Col Value

20140101 44

20140102 22

20140103 23

Page 52: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Temporal KeysKey Value

userA:name Bob

userA:age 43

userB:name Annie

userB:age 32

userC:name Fred

userC:age 29

userD:name Joe

userD:age 59

Key Value

20140101 44

20140102 22

20140103 23

20140104 25

20140105 31

RowID Col Value

20140101 44

20140102 22

20140103 23

20140104 25

20140105 31

Page 53: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Temporal KeysKey Value

userA:name Bob

userA:age 43

userB:name Annie

userB:age 32

userC:name Fred

userC:age 29

userD:name Joe

userD:age 59

Key Value

20140101 44

20140102 22

20140103 23

20140104 25

20140105 31

20140106 27

20140107 25

20140108 17

RowID Col Value

20140101 44

20140102 22

20140103 23

20140104 25

20140105 31

20140106 27

20140107 25

20140108 17

Always write to one server

Page 54: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

No write parallelism

Page 55: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Temporal KeysRowID Col Value

20140101 44

20140102 22

20140103 23

20140104 25

20140105 31

20140106 27

20140107 25

20140108 17

Fetching ranges uses few scans

get(20140101 to 201404)

Page 56: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Keys to Scaling

Writes on all servers Few Scans

Temporal Keys

Page 57: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Binned Temporal KeysKey Value

userA:name Bob

userA:age 43

userB:name Annie

userB:age 32

userC:name Fred

userC:age 29

userD:name Joe

userD:age 59

Key Value

20140101 44

20140102 22

20140103 23

RowID Col Value

0_20140101 44

1_20140102 22

2_20140103 23

Uniform Writes

Page 58: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Binned Temporal KeysKey Value

userA:name Bob

userA:age 43

userB:name Annie

userB:age 32

userC:name Fred

userC:age 29

userD:name Joe

userD:age 59

Key Value

20140101 44

20140102 22

20140103 23

20140104 25

20140105 31

20140106 27

RowID Col Value

0_20140101 44

0_20140104 25

1_20140102 22

1_20140105 31

2_20140103 23

2_20140106 27

Uniform Writes

Page 59: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Binned Temporal KeysKey Value

userA:name Bob

userA:age 43

userB:name Annie

userB:age 32

userC:name Fred

userC:age 29

userD:name Joe

userD:age 59

Key Value

20140101 44

20140102 22

20140103 23

20140104 25

20140105 31

20140106 27

20140107 25

20140108 17

RowID Col Value

0_20140101 44

0_20140104 25

0_20140107 25

1_20140102 22

1_20140105 31

1_20140108 17

2_20140103 23

2_20140106 27

Uniform Writes

Page 60: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Binned Temporal KeysRowID Col Value

0_20140101 44

0_20140104 25

0_20140107 25

1_20140102 22

1_20140105 31

1_20140108 17

2_20140103 23

2_20140106 27

One scan per bin

get(20140101 to 201404)

Page 61: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Keys to Scaling

Writes on all servers Few Scans

Binned Temporal Keys

Page 62: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Keys to Scaling

• Key design is critical

• Group data under common row IDs to reduce scans

• Prepend bins to row IDs to increase write parallelism

Page 63: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Splits

• Pre-split or organic splits

• Going from dev to production, can ingest a representative sample, obtain split points and use them to pre-split a larger system

• Hundreds or thousands of tablets per server is ok

• Want at least one tablet per server

Page 64: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Effect of Compression

• Similar sorted keys compress well

• May need more data than you think to auto-split

Page 65: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Inserts are fast 10s of thousands per second per

machine

Page 66: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Updates *can* be …

Page 67: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Update Types

• Overwrite

• Combine

• Complex

Page 68: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Update - Overwrite

• Performance same as insert

• Ignore (don’t read) existing value

• Accumulo’s Versioning Iterator does the overwrite

Page 69: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Update - OverwriteRowID Col Value

af362de4 name Annie

af362de4 age 32

af362de4 account $25

c48e2ade name Joe

c48e2ade age 59

e2e4dac4 name Bob

e2e4dac4 age 43

e2e4dac4 account $30

userB:age -> 34

Page 70: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Update - OverwriteRowID Col Value

af362de4 name Annie

af362de4 age 34

af362de4 account $25

c48e2ade name Joe

c48e2ade age 59

e2e4dac4 name Bob

e2e4dac4 age 43

e2e4dac4 account $30

userB:age -> 34

Page 71: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Update - Combine

• Things like X = X + 1

• Normally one would have to read the old value to do this, but Accumulo Iterators allow multiple inserts to be combined at scan time, or compaction

• Performance is same as inserts

Page 72: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Update - CombineRowID Col Value

af362de4 name Annie

af362de4 age 34

af362de4 account $25

c48e2ade name Joe

c48e2ade age 59

e2e4dac4 name Bob

e2e4dac4 age 43

e2e4dac4 account $30

userB:account -> +10

Page 73: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Update - CombineRowID Col Value

af362de4 name Annie

af362de4 age 34

af362de4 account $25

af362de4 account $10

c48e2ade name Joe

c48e2ade age 59

e2e4dac4 name Bob

e2e4dac4 age 43

e2e4dac4 account $30

userB:account -> +10

Page 74: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Update - CombineRowID Col Value

af362de4 name Annie

af362de4 age 34

af362de4 account $25

af362de4 account $10

c48e2ade name Joe

c48e2ade age 59

e2e4dac4 name Bob

e2e4dac4 age 43

e2e4dac4 account $30

getAccount(userB) $35

Page 75: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Update - Combine

After compaction

RowID Col Value

af362de4 name Annie

af362de4 age 34

af362de4 account $35

c48e2ade name Joe

c48e2ade age 59

e2e4dac4 name Bob

e2e4dac4 age 43

e2e4dac4 account $30

Page 76: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Update - Complex

• Some updates require looking at more data than Iterators have access to - such as multiple rows

• These require reading the data out in order to write the new value

• Performance will be much slower

Page 77: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Update - ComplexuserC:account = getBalance(userA) + getBalance(userB)

RowID Col Value

af362de4 name Annie

af362de4 age 34

af362de4 account $35

c48e2ade name Joe

c48e2ade age 59

c48e2ade account $40

e2e4dac4 name Bob

e2e4dac4 age 43

e2e4dac4 account $30

35+30 = 65

Page 78: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Update - ComplexuserC:account = getBalance(userA) + getBalance(userB)

RowID Col Value

af362de4 name Annie

af362de4 age 34

af362de4 account $35

c48e2ade name Joe

c48e2ade age 59

c48e2ade account $65

e2e4dac4 name Bob

e2e4dac4 age 43

e2e4dac4 account $30

35+30 = 65

Page 79: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Planning a Larger-Scale Cluster 102 - 104

Page 80: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Storage vs Ingest

1

1000

1000000

10 100 1000 10000

Ingest Rate 1x1TB 12x3TB

120,000

12,000

1,200

120

10,000

1,000

100

10 Stor

age

Tera

byte

s

Milli

ons

of E

ntrie

s pe

r sec

ond

Page 81: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Model for Ingest Rates

A = 0.85log2N * N * S

N - Number of machines S - Single Server throughput (entries / second) A - Aggregate Cluster throughput (entries / second)

Expect 85% increase in write rate when doubling the size of the cluster

Page 82: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Estimating Machines Required

N = 2 (log2(A/S) / 0.7655347)

N - Number of machines S - Single Server throughput (entries / second) A - Target Aggregate throughput (entries / second)

Expect 85% increase in write rate when doubling the size of the cluster

Page 83: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Predicted Cluster SizesN

umbe

r of M

achi

nes

0

3000

6000

9000

12000

Millions of Entries per Second

0 150 300 450 600

Page 84: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

100 Machines 102

Page 85: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Multiple racks

Page 86: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

10 TB RAM

Page 87: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

100 TB - 1PB Disk

Page 88: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Some hardware failures in the first week

(burn in)

Page 89: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Expect 3 failed HDs in first 3 mo

Page 90: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Another 4 within the first year

http://static.googleusercontent.com/media/research.google.com/en/us/archive/disk_failures.pdf

Page 91: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Can process the 1000 Genomes data set

260 TB

www.1000genomes.org

Page 92: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Can store and index the Common Crawl Corpus

!

2.8 Billion web pages 541 TB

commoncrawl.org

Page 93: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

One year of Twitter 182 trillion tweets

483 TB

http://www.sec.gov/Archives/edgar/data/1418091/000119312513390321/d564001ds1.htm

Page 94: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Deploying an ApplicationTablet ServersClientsUsers

Page 95: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

May not see the affect of writing to disk for a while

Page 96: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

1000 machines 103

Page 97: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Multiple rows of racks

Page 98: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

100 TB RAM

Page 99: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

1-10 PB Disk

Page 100: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Hardware failure is a regular occurrence

Page 101: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Hard drive failure about every 5 days (average).

!

Will be skewed towards beginning of the year

Page 102: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Can traverse the ‘brain graph’ 70 trillion edges, 1 PB

http://www.pdl.cmu.edu/SDI/2013/slides/big_graph_nsa_rd_2013_56002v1.pdf

Page 103: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Facebook Graph 1s of PB

http://www-conf.slac.stanford.edu/xldb2012/talks/xldb2012_wed_1105_DhrubaBorthakur.pdf

Page 104: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Netflix Video Master Copies 3.14 PB

http://www.businessweek.com/articles/2013-05-09/netflix-reed-hastings-survive-missteps-to-join-silicon-valleys-elite

Page 105: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

World of Warcraft Backend Storage 1.3 PB

http://www.datacenterknowledge.com/archives/2009/11/25/wows-back-end-10-data-centers-75000-cores/

Page 106: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Webpages, live on the Internet 14.3 Trillion

http://www.factshunt.com/2014/01/total-number-of-websites-size-of.html

Page 107: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Things like the difference between two compression algorithms start

to make a big difference

Page 108: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Use range compactions to affect changes on portions of table

Page 109: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Lay off Zookeeper

Page 110: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Watch Garbage Collector and Namenode ops

Page 111: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Garbage Collection > 5 minutes?

Page 112: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Start thinking about NameNode Federation

Page 113: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Accumulo 1.6

Page 114: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Multiple NameNodes

Accumulo

Namenode Namenode

DataNodesDataNodes

Multiple HDFS Clusters

Page 115: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Multiple NameNodes

Accumulo

DataNodes

Multiple NameNodes, shared DataNodes (Federation. Requires Hadoop 2.0)

Namenode Namenode

Page 116: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

More Namenodes = higher risk of one going down.

!

Can use HA Namenodes in conjunction w/ Federation

Page 117: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

10,000 machines 104

Page 118: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

You, my friend, are here to kick a** and chew bubble gum

Page 119: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

1 PB RAM

Page 120: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

10-100 PB Disk

Page 121: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

1 hardware failure every hour on average

Page 122: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Entire Internet Archive 15 PB

http://www.motherjones.com/media/2014/05/internet-archive-wayback-machine-brewster-kahle

Page 123: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

A year’s worth of data from the Large Hadron Collider

15 PB

http://home.web.cern.ch/about/computing

Page 124: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

0.1% of all Internet traffic in 2013 43.6 PB

http://www.factshunt.com/2014/01/total-number-of-websites-size-of.html

Page 125: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Facebook Messaging Data 10s of PB

http://www-conf.slac.stanford.edu/xldb2012/talks/xldb2012_wed_1105_DhrubaBorthakur.pdf

Page 126: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Facebook Photos 240 billion

High 10s of PB

http://www-conf.slac.stanford.edu/xldb2012/talks/xldb2012_wed_1105_DhrubaBorthakur.pdf

Page 127: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Must use multiple NameNodes

Page 128: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Can tune back heartbeats, periodicity of central processes in

general

Page 129: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Can combine multiple PB data sets

Page 130: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Up to 10 quadrillion entries in a single table

Page 131: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

While maintaining sub-second lookup times

Page 132: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Only with Accumulo 1.6

Page 133: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Dealing with data over time

Page 134: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Data Over Time - Patterns

• Initial Load

• Increasing Velocity

• Focus on Recency

• Historical Summaries

Page 135: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Initial Load

• Get a pile of old data into Accumulo fast

• Latency not important (data is old)

• Throughput critical

Page 136: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Bulk Load RFiles

Page 137: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Bulk Loading

MapReduce

RFiles Accumulo

Page 138: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Increasing velocity

Page 139: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

If your data isn’t big today, wait a little while

Page 140: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Accumulo scales up dynamically, online. No downtime

Page 141: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

The first scale, ‘can change size’

Page 142: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Scaling UpClients

Accumulo

HDFS

3 physical servers Each running

a Tablet Server process and a Data Node process

Page 143: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Scaling UpClients

Accumulo

HDFS Start 3 new Tablet Server procs

3 new Data node processes

Page 144: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Scaling UpClients

Accumulo

HDFS master immediately assigns tablets

Page 145: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Scaling UpClients

Accumulo

HDFS

Clients immediately begin querying new

Tablet Servers

Page 146: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Scaling UpClients

Accumulo

HDFS

new Tablet Servers read data from old Data nodes

Page 147: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Scaling UpClients

Accumulo

HDFS

new Tablet Servers write data to new Data Nodes

Page 148: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Never really seen anyone do this

Page 149: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Except myself

Page 150: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

20 machines in Amazon EC2

Page 151: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

to 400 machines

Page 152: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

all during the same MapReduce job reading data out of Accumulo, summarizing, and writing back

Page 153: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Scaled back down to 20 machines when done

Page 154: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Just killed Tablet Servers

Page 155: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Decommissioned Data Nodes for safe data consolidation to

remaining 20 nodes

Page 156: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Other ways to go from 10x to 10x+1

Page 157: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Accumulo Table Export

Page 158: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

followed by HDFS DistCP to new cluster

Page 159: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Maybe new replication feature

Page 160: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Newer Data is Read more Often

Page 161: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Accumulo keeps newly written data in memory

Page 162: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Block Cache can keep recently queried data in memory

Page 163: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Combining Iterators make maintaining summaries of large

amounts of raw events easy

Page 164: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Reduces storage burden

Page 165: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Historical Summaries

0

2000

4000

6000

8000

April May June July

Unique Entities Stored Raw Events Processed

Page 166: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Age-off iterator can automatically remove data over a certain age

Page 167: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

IBM estimates 2.5 exabytes of data is created every day

http://www-01.ibm.com/software/data/bigdata/what-is-big-data.html

Page 168: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

90% of available data created in last 2 years

http://www-01.ibm.com/software/data/bigdata/what-is-big-data.html

Page 169: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

25 new 10k node Accumulo clusters per day

Page 170: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Accumulo is doing it’s part to get in front of the big data trend

Page 171: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

Questions ?

Page 172: Large Scale Accumulo - Koverse Data Platform - Koverse Orders of Magnitude: Running Large Scale Accumulo Clusters Aaron Cordova Accumulo Summit, June 2014 Scale, Security, Schema Scale

@aaroncordova