oracle maximum availability architecture (maa) with oracle ... · – oracle goldengate ! automated...

21
1/16/14 1 Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1 Oracle Maximum Availability Architecture (MAA) with Oracle Database 12c Markus Michalewicz Director of Product Management Oracle Real Application Clusters (RAC) @ OracleRACpm http://www.linkedin.com/in/markusmichalewicz http://www.slideshare.net/MarkusMichalewicz

Upload: vuongthuan

Post on 03-May-2018

242 views

Category:

Documents


0 download

TRANSCRIPT

1/16/14  

1  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1

Oracle Maximum Availability Architecture (MAA) with Oracle Database 12c Markus Michalewicz Director of Product Management Oracle Real Application Clusters (RAC)

@OracleRACpm http://www.linkedin.com/in/markusmichalewicz

http://www.slideshare.net/MarkusMichalewicz

1/16/14  

2  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 3

Is This HA?

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 4

Is This Better?

What if the second engine isn’t started until after the first one fails?

1/16/14  

3  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 5

Agenda

§  High Availability (HA) Business Challenge

§  Oracle Maximum Availability Architecture (MAA)

§  Oracle MAA Reference Architectures

§  Customer Deployments

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 6

Reduce cost and increase return on investment

High Availability (HA) Business Challenges

Eliminate risk of downtime and data loss

1/16/14  

4  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 7

Generic volume manager & file system…

Generic backup software

Generic cold failover cluster

Failover Server Production Server

Identical storage

Idle DR

Storage mirroring

Traditional Availability Cold Start = High Risk Idle Assets = High Cost

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 8

Battle Scars

8-day outage http://www.computerworld.com/

s/article/9182159/American_Eagle_Outfitters_learns_a_painful_service_provider_

lesson

American Eagle Outfitters - retail –  Disk failure, followed by mirrored disk failure. Restore from

local backup failed. Restore using copy at DR site also failed.

5-day outage http://www.computerworld.com/s/

article/9182719/Update_Virginia_s_IT_outage_continues_3_agencies_still_affecte

d

State of Virginia - government –  SAN memory failure, problem mirrored to standby SAN

Tieto - cloud infrastructure provider in Sweden –  Storage array failed, unable to read tape backups used for

DR, inadequate DR testing

5-day outage http://

www.channelregister.co.uk/2012/01/16/tieto_vnx5700/

Examples Where Traditional HA Infrastructure and Processes Failed

1/16/14  

5  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 9

Enterprises Need a Better Approach to HA

Requirement Solution Profile Protect from outages Any type, anywhere Reduce recovery time Zero, seconds or minutes Prevent data loss Zero or seconds Minimize risk Continuous validation, test whenever Eliminate complexity Simpler, pre-integrated Increase ROI Reduce cost, utilize all assets

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 10

Agenda

§  High Availability (HA) Business Challenge

§  Oracle Maximum Availability Architecture (MAA)

§  Oracle MAA Reference Architectures

§  Customer Deployments

1/16/14  

6  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 11

Edition-based Redefinition, Online Redefinition, Data Guard, GoldenGate –  Minimal downtime maintenance, upgrades, migrations

Active Data Guard – Data Protection, DR – Query Offload

GoldenGate – Active-active replication – Heterogeneous

Active Replica

Oracle Maximum Availability Architecture (MAA)

RMAN, Oracle Secure Backup – Backup to disk, tape or cloud

Enterprise Manager Cloud Control – Coordinated Site Failover Application Continuity – Application HA Global Data Services – Service Failover / Load Balancing

RAC – Scalability – Server HA

Flashback – Human error

correction

Production Site

ASM – ASM mirroring

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 12

Edition-based Redefinition, Online Redefinition, Data Guard, GoldenGate –  Minimal downtime maintenance, upgrades, migrations

Active Data Guard – Data Protection, DR – Query Offload

GoldenGate – Active-active replication – Heterogeneous

Active Replica

Oracle Maximum Availability Architecture (MAA)

RMAN, Oracle Secure Backup – Backup to disk, tape or cloud

Enterprise Manager Cloud Control – Coordinated Site Failover Application Continuity – Application HA Global Data Services – Service Failover / Load Balancing

RAC – Scalability – Server HA

Flashback – Human error

correction

Production Site

Edition-based Redefinition, Online Redefinition, Data Guard, GoldenGate –  Minimal downtime maintenance, upgrades, migrations

Active Data Guard – Data Protection, DR – Query Offload

GoldenGate – Active-active replication – Heterogeneous

Active Replica

RMAN, Oracle Secure Backup – Backup to disk, tape or cloud

Enterprise Manager Cloud Control – Coordinated Site Failover Application Continuity – Application HA Global Data Services – Service Failover / Load Balancing

RAC – Scalability – Server HA

Flashback – Human error

correction

Production Site

ASM – ASM mirroring

1/16/14  

7  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 13

Oracle MAA Design Principles Eliminate Risk and Increase Return on Investment

Data Protection at Every Level

Strong Fault Isolation

Real-time DR with All Active Components

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 14

Principle #1: Data Protection at Every Level

§  Oracle Data blocks have a well-defined structure –  Block header is kept

consistent with payload –  Enables validation of

both physical and logical intra-block consistency

§  Oracle ensures block validity is maintained as it traverses I/O path

–  Extensive corruption checks

Oracle-Aware Data Validation

1/16/14  

8  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 15

System Memory (SGA)

Oracle Database

Architecture To Standby Databases

TCP/IP

•  Better performance since no disk I/O •  Better isolation from lower layer faults •  Better network utilization: only redo sent •  Transactional consistency: always •  Corrupted blocks auto-repaired •  Database-integrated application failover

Principle #2: Strong Fault Isolation

§  Data Guard transmits redo blocks directly from SGA: like a memcpy over the network

§  Redo received / applied by running Oracle instance: continuous Oracle-integrated data validation

Oracle-Aware Database Replication

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 16

Data Guard Does What Storage Mirroring Can’t Isolate Corruption, Protect Data, Maintain Availability

“…the SRDF model is a remote mirroring model…any potential data corruption would be copied faithfully and expeditiously to the other side”

VP Global Marketing CTO, EMC

“…when something happens in the I/O stack and a database write is malformed Symmetrix A happily replicates the faulty data to site B and the corruption goes undetected (SRDF is a data mirroring tool; so it’s intentionally based on GIGO)”

EMC BLOG with Integrity

1/16/14  

9  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 17

Principle #3: Real-time DR

§ All components active –  Servers –  Storage –  Remote sites

§ Easy scale-out –  Add capacity online

§ Rolling maintenance § Best recovery time: already hot § Least risk: you know it is working

Oracle-Aware Active Clustering and Offload to DR Systems for High ROI

Secondary Site Production Site

LAN/WAN

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 18

Agenda

§  High Availability (HA) Business Challenge

§  Oracle Maximum Availability Architecture (MAA)

§  Oracle MAA Reference Architectures

§  Customer Deployments

1/16/14  

10  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 19

Applying MAA Principles

§  Assess impact of downtime and data loss

§  Define service level objectives –  Recovery time (RTO): how long

can you afford to be down –  Recovery Point (RPO): how much

data can you afford to lose –  Performance: pre and post failure

Begin with a Business Impact Analysis

RAC

GoldenGate

RMAN Data Guard

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 20

Reduce cost

Reduce risk

Consolidate Standardize Simplify

Set of reference HA architectures

Applying MAA Principles … contd. Implementation Approach

1/16/14  

11  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 21

Oracle Database 12c MAA

Three Standard Reference Architectures

BRONZE Minutes to days of downtime Data protected as of last backup

SILVER Seconds to minutes of downtime Near-zero data loss

GOLD Zero application outage Zero data loss

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 22

BRONZE

1/16/14  

12  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 23

Bronze HA Tier: Low Cost Single Instance

§ Minimize the cost of HA –  Use HA features included with Oracle

Database –  Utilize backups to protect against

media and site failures –  Secure offsite tape storage for DR

§ Optionally –  Consolidate with Oracle Multitenant –  Improve HA with RAC One Node –  Self-Service provisioning with

Enterprise Manager Database as a Service

RTO of Minutes to Days, RPO From Last Backup

Backup Appliance

Single Instance Database Off-site tape storage for archival and DR

(on premise / cloud)

Tape

Cloud

Tape

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 24

Physical Data Protection Data Corruption Protection ASM Mirroring

Human Error Correction Flashback Technologies: Drop, Query, Transaction, Table, and Database

Planned Maintenance Online Redefinition Online Reorganization Edition Based Redefinition Online File Move Online Patching

Fast Backup & Recovery Recovery Manager (RMAN) Fast Recovery Area Oracle Secure Backup

Bronze

High Availability at Bronze Tier Features Integrated with the Oracle Database

1/16/14  

13  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 25

Bronze Evolution: A Preview of Sub-Second RPO An Oracle-Optimized Alternative to Generic Backup Appliances

§  Real time redo shipping: near zero data loss

§  Fast restore to any time point §  Delta-only architecture:

minimize production overhead and network load

§  End-to-end RMAN visibility: cloud & DR replica, disk to tape option

§  Scalable appliance: backs up 1000s databases

1000s of Databases

Validated & Compressed

Change Deltas

Database Backup Logging Recovery Appliance

Real-time Log & Change Deltas

Optional Archive to Tape

www.oracle.com/databasebackupappliance

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 26

SILVER

1/16/14  

14  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 27

Silver HA Tier: Real-Time Recovery

Bronze, plus: § Server and instance HA

–  RAC One Node –  Oracle RAC

§ Database and site HA/DR –  Active Data Guard –  GoldenGate –  Site Guard

RTO of Seconds to Minutes, RPO of Near-Zero

Backups

Site A Oracle RAC,

RAC One Node Site B

Active Data Guard

Oracle RAC, RAC One Node

Backups

Active Data Guard

Active Data Guard

GoldenGate

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 28

Active Data Guard Best Protection, Highest Performance, All Data Types & Applications

Real-time Queries

Active Standby open read-only

Production Database

Active Data Guard sync/async physical replication

Offload Fast Incremental Backups

Offload Readers

Automatic Block Repair

Data Guard Broker Enterprise Manager Cloud Control

1/16/14  

15  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 29

Data Guard Redo Transport

Role-based database services start automatically

2

Integrated Database and Application Failover Data Guard Fast-Start Failover

Database Tier

Application Tier

Database Services

Primary Site Standby Site

Primary Database

FAN breaks clients out of TCP timeout, TAF/FCF causes applications to quickly reconnect to new primary

3

Data Guard automatic failover

1

Standby Database Primary Database

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 30

Oracle GoldenGate

§ Many to one replication: operational data store § Subset replication: data integration § Active/Active update anywhere: distributed high availability

Flexible Logical Replication

Source & Target Oracle & Non-Oracle Database(s)

Target & Source Oracle & Non-Oracle Database(s)

Capture

Delivery

Trail Files

Pump

Trail Files

Pump

Delivery

Capture

Bi-directional

LAN / WAN / Internet Over TCP/IP

Trail Files

Trail Files

1/16/14  

16  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 31

Minimize Planned Downtime

Active Data Guard § Primary and standby begin at version n

Database Rolling for Maintenance that can’t be done Online

Primary version n

Standby version n

§ Defer replication, upgrade standby to n+1 § Resynchronize standby with primary § Switch production – only downtime § Upgrade original primary to version n+1

and resynchronize Standby

version n+1 Standby version n

Primary version n+1

Standby version n+1

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 32

DR Automation with Oracle Site Guard

§  Automates entire stack-level (web, app, db, storage) switchover/failover between sites

–  Integrates with Data Guard and storage replication

–  Manages start order dependencies of components

–  Licensed with WLS Management and DB Lifecycle Management packs

§  Validated with: –  Oracle Databases –  Oracle Fusion Middleware –  Oracle Fusion App deployments

§  Extensible to integrate with 3rd party components (IBM WebSphere, SAP etc.)

A

DB Host1

DB Host2

ZFS Storage

Appliance

Web Tier

App Tier

DB Tier

A App

Host1 App

Host2

DB Host1

DB Host2

ZFS Storage

Appliance

Web Tier

App Tier

DB Tier

Site A Site B

Data Guard Redo Transport

Storage Replication

EMCC Oracle Site Guard

Web Host1

Web Host2

App Host1

App Host2

Web Host1

Web Host2

Web Host1

Web Host2

App Host1

App Host2

ZFS Storage

Appliance OR ACFS

Replication

ZFS Storage

Appliance OR ACFS

Replication

App Host1

App Host2

Web Host1

Web Host2

1/16/14  

17  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 33

GOLD

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 34

Gold HA Tier: Maximum Availability

Silver, plus: §  Zero application outage

–  Application Continuity

§  Zero data loss over WAN –  Active Data Guard Far Sync

§  Zero downtime maintenance –  Oracle GoldenGate

§  Automated workload management for replicas

–  Global Data Services

Zero Application Outage, Zero Data Loss at any Distance

Site A Oracle RAC

Site B Oracle RAC

Backups

Backups

GoldenGate

Global Data Services Application Continuity

Active Data Guard

Active Data Guard Far Sync

1/16/14  

18  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 35

Primary

Standby

Active Data Guard Far Sync Operational Flow

Far Sync Instance

SYNC ASYNC

Zero data loss failover

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 36

GoldenGate Zero Downtime Maintenance

GoldenGate § Source and target begin at version n

Bi-Directional Replication

Source version n

Target version n

§ Defer replication, upgrade target to n+1 § Start bi-directional replication

§ Synchronize source and target § Target is ready to accept new connections § Migrate users when they make new connections

§ Zero downtime § Upgrade or decommission original source

Target version n+1

1/16/14  

19  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 37

Global Data Services (GDS)

§ Unified management § Workload routing & runtime

load balancing § Global service failover &

management

§ Benefits: –  Higher availability –  Improved performance –  Better manageability

Automated Workload Management for Replicated Databases

GoldenGate

Primary read-only

read-only

Active Data Guard

APAC Data Center

Human Resources

Order Entry

EMEA Data Center

read-write read-write

Global Data Services

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 38

BRONZE SILVER GOLD

RTO = Minutes to days RTO = Seconds to minutes Zero application outage

RPO = Since last backup RPO = Near zero Zero data loss

Minimize CapEx Minimize OpEx

•  Scalable performance •  No single point of failure •  Real-time data protection

•  Fast failover •  Minimal planned downtime

•  High ROI

•  Zero application outage •  Zero data loss at any distance

•  Zero downtime for maintenance •  Automated workload

management for replicated databases

MAA Deployment Architectures Address the Complete Range of Enterprise HA Requirements

1/16/14  

20  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 39

ü MAA active architectures reduce cost and increase ROI

MAA Solves HA Business Challenges

ü MAA eliminates risk of downtime and data loss

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 40

Agenda

§  High Availability (HA) Business Challenge

§  Oracle Maximum Availability Architecture (MAA)

§  Oracle MAA Reference Architectures

§  Customer Deployments

1/16/14  

21  

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 41

MAA in Action At World Class Customers* 1.  PayPal: Oracle Database 12c MAA for Mission-Critical Web App

2.  Wells Fargo: Active Data Guard at one of World’s Largest Banks

3.  Fidelity: Robust Data Protection with Exadata

4.  Symantec: Enabling Consolidation & High Availability with Exadata

5.  Sanger: Oracle Database 12c Multitenant & MAA Best Practices

6.  Amway: Deploying Oracle Database 12c Global Data Services

7.  Boeing: Oracle Database 12c Flex Cluster & Flex ASM

8.  Thomson Reuters: MAA Design Patterns in Action

9.  Paychex: Achieving the Best RPO & RTO with MAA & Exadata

10.  State Street: RMAN: Foundation for Exadata Backup & Recovery

11.  US Cellular: Eliminate Performance Bottlenecks with Exadata Flash

12.  CCI: Eliminate Performance Bottlenecks with Exadata

13.  WestJet: Siebel + MAA with Exadata

14.  Sherwin-Williams: Oracle E-Business Suite + MAA with Exadata

15.  State of Texas: Journey to Database As A Service

* Ref. http://www.oracle.com/technetwork/database/features/availability/ow-papers-089200.html

Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 42