beyond the cluster … · probably, higher gc pressure ... powerpoint presentation author: paula...

47
BEYOND THE CLUSTER: WAN DATA REPLICATION WITH GRIDGAIN YAKOV ZHDANOV

Upload: others

Post on 23-Aug-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

BEYOND THE CLUSTER:WAN DATA REPLICATION WITH GRIDGAIN

YAKOV ZHDANOV

Page 2: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

WHO?

Yakov Zhdanov:- GridGain’s Product Development VP - With GridGain since 2010- Apache Ignite committer and PMC- Passion for performance & scalability- Finding ways to make product better- St. Petersburg, Russia

Page 3: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

PLAN

1)Why replicate?

Page 4: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

PLAN

1)Why replicate?2)How do DBs solve this?

Page 5: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

PLAN

1)Why replicate?2)How do DBs solve this?3)Replication: Monolith vs Distributed4)

Page 6: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

PLAN

1)Why replicate?2)How do DBs solve this?3)Replication: Monolith vs Distributed4)GridGain DR overview – roles, features, process

Page 7: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

PLAN

1)Why replicate?2)How do DBs solve this?3)Replication: Monolith vs Distributed4)GridGain DR overview – roles, features, process5)Future plans – Sync/Async TX replication

Page 8: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

WHY REPLICATE YOUR DATA?

Page 9: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

WHY REPLICATE?

Data security

Page 10: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

WHY REPLICATE?

Data security Failover

Page 11: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

WHY REPLICATE?

Data security Failover Data warehousing

Page 12: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

WHY REPLICATE?

Data security Failover Data warehousing Load balancing

Page 13: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

WHY REPLICATE?

Data security Failover Data warehousing Load balancing Increasing system capacity

Page 14: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

POSTGRESQL REPLICATION

PostgreSQL is an object-relational database management system (ORDBMS)

Pioneered many things and concepts High maturity level Opensource and widely used

Page 15: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

POSTGRESQL REPLICATION

Shared disk storage

https://www.postgresql.org/docs/10/static/different-replication-solutions.html

Page 16: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

POSTGRESQL REPLICATION

Shared disk storage File system replication

https://www.postgresql.org/docs/10/static/different-replication-solutions.html

Page 17: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

POSTGRESQL REPLICATION

Shared disk storage File system replication Write-Ahead Log Shipping

https://www.postgresql.org/docs/10/static/different-replication-solutions.html

Page 18: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

POSTGRESQL REPLICATION

Shared disk storage File system replication Write-Ahead Log Shipping Logical Replication

https://www.postgresql.org/docs/10/static/different-replication-solutions.html

Page 19: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

POSTGRESQL REPLICATION

Shared disk storage File system replication Write-Ahead Log Shipping Logical Replication Trigger-Based Master-Standby Replication

https://www.postgresql.org/docs/10/static/different-replication-solutions.html

Page 20: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

POSTGRESQL REPLICATION

Shared disk storage File system replication Write-Ahead Log Shipping Logical Replication Trigger-Based Master-Standby Replication Statement-Based Replication Middleware

https://www.postgresql.org/docs/10/static/different-replication-solutions.html

Page 21: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

POSTGRESQL REPLICATION

Shared disk storage File system replication Write-Ahead Log Shipping Logical Replication Trigger-Based Master-Standby Replication Statement-Based Replication Middleware Async Multimaster Replication

https://www.postgresql.org/docs/10/static/different-replication-solutions.html

Page 22: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

POSTGRESQL REPLICATION

Shared disk storage File system replication Write-Ahead Log Shipping Logical Replication Trigger-Based Master-Standby Replication Statement-Based Replication Middleware Async Multimaster Replication Sync Multimaster Replication

https://www.postgresql.org/docs/10/static/different-replication-solutions.html

Page 23: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN DISTRIBUTED SYSTEMS

Monolith Distributed

Data security + ?

Failover + +

Load balancing + ?/+

Increasing system capacity + ?

Data warehousing + ?

Page 24: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN

Introduced in 2012/2013 Completely new feature Required a lot of engineering

efforts Required revisiting of existing

logic Async KEY/VALUE mode available Sync/Async TX replication under

development

https://docs.gridgain.com/docs/data-center-replication

Page 25: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: ROLES

Sender cache Sender hub Receiver hub Receiver cache

https://docs.gridgain.com/docs/data-center-replication

Page 26: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: FEATURES

Complex topologies (up to 32 datacenters) Failover Pluggable conflict resolution Filtering Pause/Resume Full state transfer

https://docs.gridgain.com/docs/data-center-replication

Page 27: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: HOW IT WORKS

Page 28: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: COMPLEX TOPOLOGIES

Page 29: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG?

Node failure in sending topology

Page 30: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG?

Node failure in sending topology

Page 31: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG?

Sender hub(s) failure

Page 32: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG?

Sender hub(s) failure

Page 33: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG?

Receiver hub(s) failure

Page 34: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG?

Receiver hub(s) failure

Page 35: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG?

Node failure in receiving topology

Page 36: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG?

Node failure in receiving topology

Page 37: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: WHAT CAN BE BETTER?

Batching on per-node basis vs per-partition basis

Per-node batching More efficient from memory standpoint Batches collected quickly

Per-partition batching No need for additional processing on receiving side Less contention – honors thread-per-partition model Probably, higher GC pressure But still expected to perform better

Page 38: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: FUTURE PLANS

TX Replication

Page 39: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: FUTURE PLANS

TX Replication – over stretched cluster

Page 40: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: FUTURE PLANS

TX replication mechanism

Page 41: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

REPLICATION IN GRIDGAIN: FUTURE PLANS

TX Replication – sync modes

Strict SYNC modeMain primary node (PN) → Sender → Receiver → Stand-In PN →Receiver →Sender → Main PN

Merciful SYNC modeMain PN → Sender → Receiver → Receiver's WAL → Sender → Main PN

ASYNC modeMain PN->Sender->Sender WAL->Main PN

Page 42: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

LESSONS LEARNED

Is replication able to solve your problem?

Page 43: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

LESSONS LEARNED

Is replication able to solve your problem? Pickup proper settings: sync/async, physical vs logical changes.

Page 44: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

LESSONS LEARNED

Is replication able to solve your problem? Pickup proper settings: sync/async, physical vs logical changes.

Be aware of internals – know what makes it work.

Page 45: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

LESSONS LEARNED

Is replication able to solve your problem? Pickup proper settings: sync/async, physical vs logical changes.

Be aware of internals – know what makes it work. Make sure to test, tune and monitor.

Page 46: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

CONTACTS

[email protected]

http://ignite.apache.org

[email protected]@ignite.apache.org

Page 47: BEYOND THE CLUSTER … · Probably, higher GC pressure ... PowerPoint Presentation Author: Paula Adduci Created Date: 11/8/2017 5:59:15 PM

QUESTIONS?

ANY QUESTIONS?