sle12 sp2 : high availability et geo cluster

16
High Availability + GEO SLE12 SP2 Antoine Giniès Project Manager / Release Manager SUSE / [email protected] Expert Days Paris Feb 2017

Upload: suse

Post on 11-Apr-2017

28 views

Category:

Software


5 download

TRANSCRIPT

Page 1: SLE12 SP2 : High Availability et Geo Cluster

High Availability + GEOSLE12 SP2

Antoine Giniès

Project Manager / Release Manager

SUSE / [email protected]

Expert Days Paris

Feb 2017

Page 2: SLE12 SP2 : High Availability et Geo Cluster

2

SUSE Entreprise ServerHA SLE12 SP2

Main Features

• Policy Driven Cluster

• Cluster Aware FS

• Continuous Data Replication

• Setup and Installation bootstrap

• Simple

Page 3: SLE12 SP2 : High Availability et Geo Cluster

3

HA Cluster Stack Architecture

Page 4: SLE12 SP2 : High Availability et Geo Cluster

4

Cluster Scenarios

Main Features

• A / A

• A / P

• Hybrid (Phy / virt)

• Geo Cluster

Page 5: SLE12 SP2 : High Availability et Geo Cluster

5

Key Concept

Terminology

• Primitive

• RA ( React )

• Group ( other Node, order )

• Clone ( multiple Node, ie: FS )

• Constraint ( condition )

Page 6: SLE12 SP2 : High Availability et Geo Cluster

6

STONITH / FENCING

Shoot The Other Node In The Head

• UPS (Uninterruptible Power Supply)

• PDU (Power Distribution Unit)

• Blade Power Control

• Light-out devices (iLo, Darc etc...)

• SBD Devices (>1)

• Crm configure ra list/info stonith:XXX

Page 7: SLE12 SP2 : High Availability et Geo Cluster

7

HAWK2 (new)

High Availability Web Konsole

• Status

• Dashboard

• History

• Resources

• Constraint

• Wizard ( resources,

• Configuration ( constraints, cluster etc...)

• Logs ( debug )

• Access Control (Target = users / Role = access to CIB)

• Batch mode ( staging / simulate )

Page 8: SLE12 SP2 : High Availability et Geo Cluster

8

GEO Concept

• Ticket ~= service

• Boothd ~= manage ticket

• Arbitrator ~= take decision

• Dead Man Dep ~= Fence?

Page 9: SLE12 SP2 : High Availability et Geo Cluster

9

HA features by Difference

Page 10: SLE12 SP2 : High Availability et Geo Cluster

10

Maintenance / Standby

How to Update

• Standby– still a R

– Elligible as a R

• Maintenance– Un-managed

– No more a R

– monitor

Page 11: SLE12 SP2 : High Availability et Geo Cluster

11

Cluster-MD (new)

Cluster Multi-device

• Software based raid storage

• Improve performance compare to CLVM mirroring

• RAID1 (redundancy)

• Replace at Runtime

• Requires: corosync / DLM

• On top of 2 SAN storage → no more SPOF

Page 12: SLE12 SP2 : High Availability et Geo Cluster

12

DRBD

Distributed Replicated Block Device

• Replication DATA

• Mirror of 2 block Devices

• Stacked DRBD

• 8 → 9 ! (meta data)

Page 13: SLE12 SP2 : High Availability et Geo Cluster

13

DRBD VS Cluster-MD

• DRBD – SAN storage

– 2 nodes only

– Possible Regular FS

– Primary / Primary with cluster aware FS

• Cluster-MD– Classical or SAN storage

– > 2 nodes

– Cluster Aware FS

Page 14: SLE12 SP2 : High Availability et Geo Cluster

14

OCFS2 VS GFS2

• OCFS2 – Fast: small/large data files on different nodes

– No quota support

– No online-resize (no mounted)

• GFS2– Fast: Large data files

– Perf issue accessing small files on different nodes

– Quota support

– Online-resize (mounted)

Page 15: SLE12 SP2 : High Availability et Geo Cluster

15

Testing HA easily

Get everything on github

• HA cluster testing using VM

• Fully automatic

• Testing scenarios

• https://github.com/krig/Deploy_HA_SLE_cluster

Page 16: SLE12 SP2 : High Availability et Geo Cluster