CouchConf Israel 2013: Couchbase Server in Production


Couchbase Server 2.0 in Production

Perry Krug, Sr. Solutions Architect

Typical Couchbase production environment

Application users → Load Balancer → Application Servers → Couchbase Servers


We’ll focus on App-Couchbase interaction …


… at each step of the application lifecycle

Dev/Test → Size → Deploy → Monitor → Manage


KEY CONCEPTS

Couchbase Single Node Architecture

[Diagram] Each node runs two components side by side:
• Data Manager: object-managed cache, storage engine, and query engine; data access on ports 11210/11211, query API on port 8092 (HTTP)
• Cluster Manager (Erlang/OTP): replication, rebalance, and shard state manager; REST management API/Web UI (admin console) on port 8091

Couchbase Single Node: Data Flow

[Diagram] A write from the app server lands in the managed cache and is acknowledged there, then flows through per-purpose queues in the background:
• Disk queue → local disk
• Replication queue → other nodes
• XDCR queue → other clusters
• View engine (indexing)
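A minimal Python model of this flow (illustrative only, not Couchbase code): the write is acknowledged from the managed cache and then fanned out to the background queues.

```python
from collections import deque

class Node:
    def __init__(self):
        self.cache = {}                   # object-managed cache
        self.disk_queue = deque()         # -> local storage engine
        self.replication_queue = deque()  # -> other nodes in the cluster
        self.xdcr_queue = deque()         # -> other clusters

    def write(self, key, doc):
        self.cache[key] = doc             # client is acknowledged here
        # the queues are drained asynchronously in the real system
        for q in (self.disk_queue, self.replication_queue, self.xdcr_queue):
            q.append((key, doc))

node = Node()
node.write("doc1", {"name": "example"})
print(len(node.disk_queue))  # 1 item awaiting persistence
```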

Couchbase Deployment

[Diagram] Each web application embeds the Couchbase client library. Data flows directly between the client library and the Couchbase Server nodes; cluster management and replication flow between the server nodes themselves.

COUCHBASE SERVER CLUSTER

Couchbase in a Cluster

User-configured replica count = 1

[Diagram] Three servers, each holding active documents and replica documents. Every document has one active copy and one replica copy spread across the nodes (e.g., Doc 3 is active on Server 2 and a replica on Server 1). Each app server's Couchbase client library holds a cluster map and sends reads/writes/updates and queries directly to the node holding the active copy.
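To illustrate how that direct routing works, here is a hedged Python sketch. Couchbase does use 1,024 vBuckets and a CRC32-based key hash, but the exact hash handling and the three-server map below are simplifications; the real mapping is defined by the client library and cluster.

```python
import zlib

NUM_VBUCKETS = 1024  # fixed vBucket count in Couchbase

servers = ["server1:11210", "server2:11210", "server3:11210"]
# Simplified cluster map: vbucket -> [active server index, replica server index]
cluster_map = {vb: [vb % 3, (vb + 1) % 3] for vb in range(NUM_VBUCKETS)}

def route(key: str) -> str:
    """Hash the key to a vBucket (simplified CRC32 scheme) and
    return the server holding its active copy."""
    vb = zlib.crc32(key.encode("utf-8")) % NUM_VBUCKETS
    return servers[cluster_map[vb][0]]

print(route("user::1234"))
```

When nodes are added or removed, only the cluster map changes; keys keep hashing to the same vBuckets, which is what makes rebalancing transparent to the application.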

NODE AND CLUSTER SIZING

Dev/Test → Size → Deploy → Monitor → Manage

Size Couchbase Server

Sizing == performance:
• Serve reads out of RAM
• Enough I/O for writes and disk operations
• Mitigate inevitable failures

[Diagram] Reading data: the app server asks a node "Give me document A" and receives it from RAM. Writing data: the app server asks "Please store document A" and the node acknowledges "OK, I stored document A".

[Diagram] Application servers → network → Couchbase servers: scaling out permits matching of aggregate flow rates so queues do not grow.

How many nodes?

5 key factors determine the number of nodes needed:
1) RAM
2) Disk
3) CPU
4) Network
5) Data distribution/safety

RAM sizing

1) Total RAM:
• Managed document cache:
  • Working set
  • Metadata
  • Active + replicas
• Index caching (I/O buffer)

Keep the working set in RAM for the best read performance.

The working set ratio depends on your application:

• Late-stage social game: many users are no longer active; few are logged in at any given time → working/total set ≈ 0.01
• Ad network: any cookie can show up at any time → working/total set ≈ 1
• Business application: users are logged in during the day, and the day moves around the globe → working/total set ≈ 0.33
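For a back-of-the-envelope feel for how the working set ratio drives RAM needs, here is a minimal Python sketch. The constants (64 bytes of metadata per document, 30% headroom) are illustrative assumptions, not Couchbase-published figures.

```python
def ram_needed_gb(num_docs, avg_value_bytes, working_set_ratio,
                  replicas=1, metadata_bytes=64, headroom=0.30):
    """Rough RAM estimate: metadata for ALL docs stays in RAM,
    only the working set of values needs caching, replicas
    multiply both, and headroom covers overhead above quota."""
    copies = 1 + replicas
    meta = num_docs * metadata_bytes * copies
    values = num_docs * avg_value_bytes * working_set_ratio * copies
    return (meta + values) * (1 + headroom) / 1024**3

# e.g. 100M docs, 2KB values, ad-network-style working set of 100%
print(round(ram_needed_gb(100_000_000, 2048, 1.0), 1), "GB")
```

Rerunning with working_set_ratio=0.01 (the social-game case) shows why two applications with identical datasets can need very different amounts of RAM.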

RAM sizing – working set in the managed cache

As memory fills, some cached data is ejected from RAM to make space:
• Active and replica data share RAM
• Ejection is threshold-based (NRU, favoring active data)
• Only cleanly persisted data can be "ejected"
• Only data values can be "ejected", which means RAM can fill up with metadata

RAM sizing – view/index cache (disk I/O)

• File system cache availability for the index has a big impact on performance
• Test runs were based on 10 million items with a 16GB bucket quota, and 4GB vs. 8GB of system RAM available for indexes
• Results show that doubling system cache availability halves query latency and increases throughput by 50%
• Leave RAM free for the file system cache by keeping quotas below physical RAM

Disk sizing: space and I/O

2) Disk
I/O:
• Sustained write rate
• Rebalance capacity
• Backups
• XDCR
• Compaction
Space:
• Total dataset (active + replicas + indexes)
• Append-only file format

[Diagram] Writing data: the app server stores document A; the node acknowledges the write, then persists it to disk.

Disk sizing: I/O

Factors impacting the disk I/O needed:
• Peak write load
• Sustained write load
• Compaction
• XDCR
• Views/indexing

Configurable paths/partitions for data and indexes allow separation of space and I/O.

Disk sizing: space

Factors impacting the amount of disk space needed:
• Total data set
• Indexes
• Overhead for compaction (~3x): both data and indexes are append-only

Configurable paths/partitions for data and indexes allow separation of space and I/O.

Disk sizing: impact of views on I/O and space

• Number of design documents
  • Extra space for each DD
  • Extra I/O to process each DD
  • Segregate views by DD
• Complexity of views (I/O)
• Amount of view output (space)
  • Emit as little as possible
  • The doc ID is automatically included
• Use development views and extrapolate

Disk sizing: append-only

• The append-only file format puts all new/updated/deleted items at the end of the on-disk file
  – Better performance and reliability
  – No more fragmentation!
• This can leave invalidated data in the "back" of the file
• So the data needs to be compacted

Disk compaction

Initial file layout:  Doc A | Doc B | Doc C
Update some data:     Doc A | Doc B | Doc C | Doc A' | Doc D | Doc B' | Doc A''
After compaction:     Doc C | Doc B' | Doc A'' | Doc D
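A toy Python model of the example above, assuming a simple list of (key, version) records standing in for the on-disk file:

```python
def compact(append_only_file):
    """Keep only the latest version of each key; everything else
    in the 'back' of the file is invalidated and reclaimed."""
    latest = {}
    for key, version in append_only_file:  # later records win
        latest[key] = version
    return list(latest.items())

f = [("A", 1), ("B", 1), ("C", 1)]             # initial layout
f += [("A", 2), ("D", 1), ("B", 2), ("A", 3)]  # updates appended
print(compact(f))  # [('A', 3), ('B', 2), ('C', 1), ('D', 1)]
```

The real compactor rewrites the live records into a new file and swaps it in, which is why extra disk space is needed while compaction runs.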

Disk compaction

• Compaction happens automatically:
  – Settings for the "threshold" of stale data
  – Settings for time of day
  – Split by data and index files
  – Per-bucket or global
• Reduces the size of on-disk files – data files AND index files
• Temporarily increased disk I/O and CPU, but no downtime!

CPU sizing

3) CPU
• Disk writing
• Views/compaction/XDCR
• RAM read/write performance is not impacted

1.8 used VERY little CPU. Under the same workloads, 2.0 should not be much different.

New 2.0 features will require more CPU.

Network sizing

4) Network
• Client traffic (reads + writes)
• Replication (multiplies writes)
• Rebalancing
• XDCR

Consistent low latency with varying doc sizes

[Figure] Consistently low latencies, in microseconds, for varying document sizes with a mixed workload.

Data Distribution

5) Data distribution/safety (assuming one replica):
• 1 node = BAD
• 2 nodes = …better…
• 3+ nodes = BEST!

Note: Many applications will need more than 3 nodes.

Servers fail; be prepared. The more nodes, the less impact a failure will have.

How many nodes? (recap)

New 2.0 features will affect sizing requirements:
• Views/indexing/querying
• XDCR
• Append-only file format

5 key factors still determine the number of nodes needed (a rough sizing sketch follows this list):
1) RAM
2) Disk
3) CPU
4) Network
5) Data distribution
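To make the recap concrete, here is a minimal node-count sketch in Python combining only the RAM and disk factors. The per-node quota, disk size, and ~3x compaction overhead are illustrative assumptions; CPU, network, and data-safety requirements can each raise the answer further.

```python
import math

def nodes_needed(cluster_ram_gb, per_node_quota_gb,
                 data_gb, per_node_disk_gb,
                 compaction_overhead=3.0, min_nodes=3):
    """Take the max of the per-resource node counts, with a floor
    of 3 nodes for data distribution/safety."""
    by_ram = math.ceil(cluster_ram_gb / per_node_quota_gb)
    # append-only files need ~3x the dataset for compaction headroom
    by_disk = math.ceil(data_gb * compaction_overhead / per_node_disk_gb)
    return max(by_ram, by_disk, min_nodes)

print(nodes_needed(cluster_ram_gb=180, per_node_quota_gb=48,
                   data_gb=400, per_node_disk_gb=800))  # -> 4
```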

MONITORING

Dev/Test → Size → Deploy → Monitor → Manage

Key resources: RAM, Disk, Network, CPU

[Diagram] Each application server talks over the network to each Couchbase server; every server's RAM and disk, and the network between them, must keep up.

Monitoring

Once in production, the heart of operations is monitoring:
• RAM usage
• Disk space and I/O: write queues / read activity / indexing
• Network bandwidth, replication queues
• CPU usage
• Data distribution (balance, replicas)

Monitoring

An IMMENSE amount of information is available:
• Real-time traffic graphs
• Accessible via REST API
• Per-bucket, per-node, and aggregate statistics
• Application and inter-node traffic
• RAM <-> disk
• Inter-system timing

Key stats to monitor

• Working set doesn't fit in RAM
  – Cache miss rate / disk fetches
• Disk I/O not keeping up
  – Disk write queue size
• Internal replication lag
  – TAP queues
• Indexing not keeping up
• XDCR lag

(A sketch of polling these stats over the REST API follows this list.)
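These stats are reachable over the REST API mentioned above. A minimal polling sketch in Python — the host, bucket, credentials, and exact stat names (ep_bg_fetched, ep_queue_size) are assumptions to verify against your version's documentation:

```python
import requests  # third-party HTTP client

BASE = "http://cb-node1:8091"            # hypothetical node
AUTH = ("Administrator", "password")     # placeholder credentials

# Per-bucket stats come back as arrays of recent samples
samples = requests.get(
    f"{BASE}/pools/default/buckets/default/stats", auth=AUTH
).json()["op"]["samples"]

gets = sum(samples["cmd_get"]) or 1      # avoid division by zero
misses = sum(samples["ep_bg_fetched"])   # reads fetched from disk
print("cache miss rate: %.2f%%" % (100.0 * misses / gets))
print("disk write queue:", samples["ep_queue_size"][-1])
```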


MANAGEMENT AND MAINTENANCE

Dev/Test → Size → Deploy → Monitor → Manage

Management/Maintenance

• Scaling

• Upgrading/Scheduled maintenance

• Backup/Restore

• Dealing with Failures


Scaling

Couchbase Scales out Linearly:

Need more RAM? Add nodes…

Need more Disk IO or space? Add nodes…

Couchbase also makes it easy to scale up by swapping in larger nodes for smaller ones, without any disruption.

Couchbase + Cisco + Solarflare

[Figure] Operations per second vs. number of servers in the cluster: linear throughput scalability, with high throughput at a 1.4 GB/sec data transfer rate using 4 servers.

Additional benchmark details

• Cluster of 8 nodes running Couchbase Server 1.8.0
• One server used as the client to run the workload
• The workload was Couchbase's streaming load generator
• GET and SET operations were performed in a 70:30 ratio

Test system and parameters:
• Couchbase Server 1.8.0
• Cisco Nexus 5548UP switch
• Solarflare SFN5122F 10 Gigabit Ethernet Enhanced Small Form-Factor Pluggable (SFP+) server adapters
• Solarflare OpenOnload
• Servers: nine Cisco UCS C200 M2 High-Density Rack Servers with Intel Xeon X5670 six-core 2.93-GHz CPUs, running Red Hat Enterprise Linux (RHEL) 5.5 x86 64-bit, with 100 GB RAM and four 2-TB hard drives

Upgrade

Upgrade an existing Couchbase Server 1.8 cluster to Couchbase Server 2.0:

1. Add nodes of the new version, rebalance…
2. Remove nodes of the old version, rebalance…
3. Done!

No disruption. The same approach works generally for software upgrades, hardware refreshes, and planned maintenance.

Easy to maintain Couchbase

• Use remove+rebalance on a "malfunctioning" node:
  – Protects data distribution and "safety"
  – Replicas are recreated
  – Best to "swap" with a new node to maintain capacity and move the minimal amount of data

Backup

[Diagram] cbbackup reads from each server over the network and writes the cluster's data to backup data files.

Restore

[Diagram] cbrestore reads the backup data files and restores them into a live (and possibly different) cluster. A sketch of both tools follows.
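Representative invocations of both tools, wrapped in Python for consistency with the other sketches. The host, backup path, bucket name, and credentials are placeholders, and the flags reflect the Couchbase 2.0-era tools; check them against your version's docs.

```python
import subprocess

# Back up a cluster to a local directory
subprocess.run(["cbbackup", "http://cb-node1:8091", "/backups/cb",
                "-u", "Administrator", "-p", "password"], check=True)

# Restore the backup into a (possibly different) live cluster
subprocess.run(["cbrestore", "/backups/cb", "http://cb-node1:8091",
                "-b", "default",           # bucket to restore into
                "-u", "Administrator", "-p", "password"], check=True)
```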


Failures Happen!

Hardware. Network. Bugs.

Easy to manage failures with Couchbase

• Failover (automatic or manual; a manual sketch follows this list):
  – Replica data and indexes are promoted for immediate access
  – Replicas are NOT recreated
  – Do NOT fail over a healthy node
  – Perform a rebalance after returning the cluster to full or greater capacity
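For the manual case, failover and the follow-up rebalance can be driven over the REST API. A hedged Python sketch — the host, credentials, and otpNode names are assumptions, and the endpoints should be verified against your version's REST documentation:

```python
import requests

BASE = "http://cb-node1:8091"
AUTH = ("Administrator", "password")

# Fail over an unhealthy node (never a healthy one)
requests.post(f"{BASE}/controller/failOver", auth=AUTH,
              data={"otpNode": "ns_1@cb-node3"}).raise_for_status()

# After restoring capacity, rebalance the cluster
nodes = [n["otpNode"] for n in
         requests.get(f"{BASE}/pools/default", auth=AUTH).json()["nodes"]]
requests.post(f"{BASE}/controller/rebalance", auth=AUTH,
              data={"knownNodes": ",".join(nodes),
                    "ejectedNodes": ""}).raise_for_status()
```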

Fail Over

[Diagram] A five-server cluster: Server 3 fails, and the replicas of its active documents (e.g., Doc 7, Doc 1) held on the surviving servers are promoted to active. The app servers' client libraries pick up the updated cluster map and route around the failed node.

Conclusion

Dev/Test → Size → Deploy → Monitor → Manage


Want more?

Lots of details and best practices in our documentation:

http://www.couchbase.com/docs/


QUESTIONS?

perry@couchbase.com / @perrykrug
