oracle maximum availability architecture (maa) with oracle ... · – oracle goldengate ! automated...
TRANSCRIPT
1/16/14
1
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1
Oracle Maximum Availability Architecture (MAA) with Oracle Database 12c Markus Michalewicz Director of Product Management Oracle Real Application Clusters (RAC)
@OracleRACpm http://www.linkedin.com/in/markusmichalewicz
http://www.slideshare.net/MarkusMichalewicz
1/16/14
2
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 3
Is This HA?
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 4
Is This Better?
What if the second engine isn’t started until after the first one fails?
1/16/14
3
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 5
Agenda
§ High Availability (HA) Business Challenge
§ Oracle Maximum Availability Architecture (MAA)
§ Oracle MAA Reference Architectures
§ Customer Deployments
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 6
Reduce cost and increase return on investment
High Availability (HA) Business Challenges
Eliminate risk of downtime and data loss
1/16/14
4
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 7
Generic volume manager & file system…
Generic backup software
Generic cold failover cluster
Failover Server Production Server
Identical storage
Idle DR
Storage mirroring
Traditional Availability Cold Start = High Risk Idle Assets = High Cost
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 8
Battle Scars
8-day outage http://www.computerworld.com/
s/article/9182159/American_Eagle_Outfitters_learns_a_painful_service_provider_
lesson
American Eagle Outfitters - retail – Disk failure, followed by mirrored disk failure. Restore from
local backup failed. Restore using copy at DR site also failed.
5-day outage http://www.computerworld.com/s/
article/9182719/Update_Virginia_s_IT_outage_continues_3_agencies_still_affecte
d
State of Virginia - government – SAN memory failure, problem mirrored to standby SAN
Tieto - cloud infrastructure provider in Sweden – Storage array failed, unable to read tape backups used for
DR, inadequate DR testing
5-day outage http://
www.channelregister.co.uk/2012/01/16/tieto_vnx5700/
Examples Where Traditional HA Infrastructure and Processes Failed
1/16/14
5
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 9
Enterprises Need a Better Approach to HA
Requirement Solution Profile Protect from outages Any type, anywhere Reduce recovery time Zero, seconds or minutes Prevent data loss Zero or seconds Minimize risk Continuous validation, test whenever Eliminate complexity Simpler, pre-integrated Increase ROI Reduce cost, utilize all assets
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 10
Agenda
§ High Availability (HA) Business Challenge
§ Oracle Maximum Availability Architecture (MAA)
§ Oracle MAA Reference Architectures
§ Customer Deployments
1/16/14
6
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 11
Edition-based Redefinition, Online Redefinition, Data Guard, GoldenGate – Minimal downtime maintenance, upgrades, migrations
Active Data Guard – Data Protection, DR – Query Offload
GoldenGate – Active-active replication – Heterogeneous
Active Replica
Oracle Maximum Availability Architecture (MAA)
RMAN, Oracle Secure Backup – Backup to disk, tape or cloud
Enterprise Manager Cloud Control – Coordinated Site Failover Application Continuity – Application HA Global Data Services – Service Failover / Load Balancing
RAC – Scalability – Server HA
Flashback – Human error
correction
Production Site
ASM – ASM mirroring
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 12
Edition-based Redefinition, Online Redefinition, Data Guard, GoldenGate – Minimal downtime maintenance, upgrades, migrations
Active Data Guard – Data Protection, DR – Query Offload
GoldenGate – Active-active replication – Heterogeneous
Active Replica
Oracle Maximum Availability Architecture (MAA)
RMAN, Oracle Secure Backup – Backup to disk, tape or cloud
Enterprise Manager Cloud Control – Coordinated Site Failover Application Continuity – Application HA Global Data Services – Service Failover / Load Balancing
RAC – Scalability – Server HA
Flashback – Human error
correction
Production Site
Edition-based Redefinition, Online Redefinition, Data Guard, GoldenGate – Minimal downtime maintenance, upgrades, migrations
Active Data Guard – Data Protection, DR – Query Offload
GoldenGate – Active-active replication – Heterogeneous
Active Replica
RMAN, Oracle Secure Backup – Backup to disk, tape or cloud
Enterprise Manager Cloud Control – Coordinated Site Failover Application Continuity – Application HA Global Data Services – Service Failover / Load Balancing
RAC – Scalability – Server HA
Flashback – Human error
correction
Production Site
ASM – ASM mirroring
1/16/14
7
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 13
Oracle MAA Design Principles Eliminate Risk and Increase Return on Investment
Data Protection at Every Level
Strong Fault Isolation
Real-time DR with All Active Components
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 14
Principle #1: Data Protection at Every Level
§ Oracle Data blocks have a well-defined structure – Block header is kept
consistent with payload – Enables validation of
both physical and logical intra-block consistency
§ Oracle ensures block validity is maintained as it traverses I/O path
– Extensive corruption checks
Oracle-Aware Data Validation
1/16/14
8
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 15
System Memory (SGA)
Oracle Database
Architecture To Standby Databases
TCP/IP
• Better performance since no disk I/O • Better isolation from lower layer faults • Better network utilization: only redo sent • Transactional consistency: always • Corrupted blocks auto-repaired • Database-integrated application failover
Principle #2: Strong Fault Isolation
§ Data Guard transmits redo blocks directly from SGA: like a memcpy over the network
§ Redo received / applied by running Oracle instance: continuous Oracle-integrated data validation
Oracle-Aware Database Replication
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 16
Data Guard Does What Storage Mirroring Can’t Isolate Corruption, Protect Data, Maintain Availability
“…the SRDF model is a remote mirroring model…any potential data corruption would be copied faithfully and expeditiously to the other side”
VP Global Marketing CTO, EMC
“…when something happens in the I/O stack and a database write is malformed Symmetrix A happily replicates the faulty data to site B and the corruption goes undetected (SRDF is a data mirroring tool; so it’s intentionally based on GIGO)”
EMC BLOG with Integrity
1/16/14
9
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 17
Principle #3: Real-time DR
§ All components active – Servers – Storage – Remote sites
§ Easy scale-out – Add capacity online
§ Rolling maintenance § Best recovery time: already hot § Least risk: you know it is working
Oracle-Aware Active Clustering and Offload to DR Systems for High ROI
Secondary Site Production Site
LAN/WAN
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 18
Agenda
§ High Availability (HA) Business Challenge
§ Oracle Maximum Availability Architecture (MAA)
§ Oracle MAA Reference Architectures
§ Customer Deployments
1/16/14
10
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 19
Applying MAA Principles
§ Assess impact of downtime and data loss
§ Define service level objectives – Recovery time (RTO): how long
can you afford to be down – Recovery Point (RPO): how much
data can you afford to lose – Performance: pre and post failure
Begin with a Business Impact Analysis
RAC
GoldenGate
RMAN Data Guard
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 20
Reduce cost
Reduce risk
Consolidate Standardize Simplify
Set of reference HA architectures
Applying MAA Principles … contd. Implementation Approach
1/16/14
11
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 21
Oracle Database 12c MAA
Three Standard Reference Architectures
BRONZE Minutes to days of downtime Data protected as of last backup
SILVER Seconds to minutes of downtime Near-zero data loss
GOLD Zero application outage Zero data loss
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 22
BRONZE
1/16/14
12
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 23
Bronze HA Tier: Low Cost Single Instance
§ Minimize the cost of HA – Use HA features included with Oracle
Database – Utilize backups to protect against
media and site failures – Secure offsite tape storage for DR
§ Optionally – Consolidate with Oracle Multitenant – Improve HA with RAC One Node – Self-Service provisioning with
Enterprise Manager Database as a Service
RTO of Minutes to Days, RPO From Last Backup
Backup Appliance
Single Instance Database Off-site tape storage for archival and DR
(on premise / cloud)
Tape
Cloud
Tape
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 24
Physical Data Protection Data Corruption Protection ASM Mirroring
Human Error Correction Flashback Technologies: Drop, Query, Transaction, Table, and Database
Planned Maintenance Online Redefinition Online Reorganization Edition Based Redefinition Online File Move Online Patching
Fast Backup & Recovery Recovery Manager (RMAN) Fast Recovery Area Oracle Secure Backup
Bronze
High Availability at Bronze Tier Features Integrated with the Oracle Database
1/16/14
13
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 25
Bronze Evolution: A Preview of Sub-Second RPO An Oracle-Optimized Alternative to Generic Backup Appliances
§ Real time redo shipping: near zero data loss
§ Fast restore to any time point § Delta-only architecture:
minimize production overhead and network load
§ End-to-end RMAN visibility: cloud & DR replica, disk to tape option
§ Scalable appliance: backs up 1000s databases
1000s of Databases
Validated & Compressed
Change Deltas
Database Backup Logging Recovery Appliance
Real-time Log & Change Deltas
Optional Archive to Tape
www.oracle.com/databasebackupappliance
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 26
SILVER
1/16/14
14
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 27
Silver HA Tier: Real-Time Recovery
Bronze, plus: § Server and instance HA
– RAC One Node – Oracle RAC
§ Database and site HA/DR – Active Data Guard – GoldenGate – Site Guard
RTO of Seconds to Minutes, RPO of Near-Zero
Backups
Site A Oracle RAC,
RAC One Node Site B
Active Data Guard
Oracle RAC, RAC One Node
Backups
Active Data Guard
Active Data Guard
GoldenGate
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 28
Active Data Guard Best Protection, Highest Performance, All Data Types & Applications
Real-time Queries
Active Standby open read-only
Production Database
Active Data Guard sync/async physical replication
Offload Fast Incremental Backups
Offload Readers
Automatic Block Repair
Data Guard Broker Enterprise Manager Cloud Control
1/16/14
15
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 29
Data Guard Redo Transport
Role-based database services start automatically
2
Integrated Database and Application Failover Data Guard Fast-Start Failover
Database Tier
Application Tier
Database Services
Primary Site Standby Site
Primary Database
FAN breaks clients out of TCP timeout, TAF/FCF causes applications to quickly reconnect to new primary
3
Data Guard automatic failover
1
Standby Database Primary Database
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 30
Oracle GoldenGate
§ Many to one replication: operational data store § Subset replication: data integration § Active/Active update anywhere: distributed high availability
Flexible Logical Replication
Source & Target Oracle & Non-Oracle Database(s)
Target & Source Oracle & Non-Oracle Database(s)
Capture
Delivery
Trail Files
Pump
Trail Files
Pump
Delivery
Capture
Bi-directional
LAN / WAN / Internet Over TCP/IP
Trail Files
Trail Files
1/16/14
16
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 31
Minimize Planned Downtime
Active Data Guard § Primary and standby begin at version n
Database Rolling for Maintenance that can’t be done Online
Primary version n
Standby version n
§ Defer replication, upgrade standby to n+1 § Resynchronize standby with primary § Switch production – only downtime § Upgrade original primary to version n+1
and resynchronize Standby
version n+1 Standby version n
Primary version n+1
Standby version n+1
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 32
DR Automation with Oracle Site Guard
§ Automates entire stack-level (web, app, db, storage) switchover/failover between sites
– Integrates with Data Guard and storage replication
– Manages start order dependencies of components
– Licensed with WLS Management and DB Lifecycle Management packs
§ Validated with: – Oracle Databases – Oracle Fusion Middleware – Oracle Fusion App deployments
§ Extensible to integrate with 3rd party components (IBM WebSphere, SAP etc.)
A
DB Host1
DB Host2
ZFS Storage
Appliance
Web Tier
App Tier
DB Tier
A App
Host1 App
Host2
DB Host1
DB Host2
ZFS Storage
Appliance
Web Tier
App Tier
DB Tier
Site A Site B
Data Guard Redo Transport
Storage Replication
EMCC Oracle Site Guard
Web Host1
Web Host2
App Host1
App Host2
Web Host1
Web Host2
Web Host1
Web Host2
App Host1
App Host2
ZFS Storage
Appliance OR ACFS
Replication
ZFS Storage
Appliance OR ACFS
Replication
App Host1
App Host2
Web Host1
Web Host2
1/16/14
17
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 33
GOLD
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 34
Gold HA Tier: Maximum Availability
Silver, plus: § Zero application outage
– Application Continuity
§ Zero data loss over WAN – Active Data Guard Far Sync
§ Zero downtime maintenance – Oracle GoldenGate
§ Automated workload management for replicas
– Global Data Services
Zero Application Outage, Zero Data Loss at any Distance
Site A Oracle RAC
Site B Oracle RAC
Backups
Backups
GoldenGate
Global Data Services Application Continuity
Active Data Guard
Active Data Guard Far Sync
1/16/14
18
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 35
Primary
Standby
Active Data Guard Far Sync Operational Flow
Far Sync Instance
SYNC ASYNC
Zero data loss failover
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 36
GoldenGate Zero Downtime Maintenance
GoldenGate § Source and target begin at version n
Bi-Directional Replication
Source version n
Target version n
§ Defer replication, upgrade target to n+1 § Start bi-directional replication
§ Synchronize source and target § Target is ready to accept new connections § Migrate users when they make new connections
§ Zero downtime § Upgrade or decommission original source
Target version n+1
1/16/14
19
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 37
Global Data Services (GDS)
§ Unified management § Workload routing & runtime
load balancing § Global service failover &
management
§ Benefits: – Higher availability – Improved performance – Better manageability
Automated Workload Management for Replicated Databases
GoldenGate
Primary read-only
read-only
Active Data Guard
APAC Data Center
Human Resources
Order Entry
EMEA Data Center
read-write read-write
Global Data Services
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 38
BRONZE SILVER GOLD
RTO = Minutes to days RTO = Seconds to minutes Zero application outage
RPO = Since last backup RPO = Near zero Zero data loss
Minimize CapEx Minimize OpEx
• Scalable performance • No single point of failure • Real-time data protection
• Fast failover • Minimal planned downtime
• High ROI
• Zero application outage • Zero data loss at any distance
• Zero downtime for maintenance • Automated workload
management for replicated databases
MAA Deployment Architectures Address the Complete Range of Enterprise HA Requirements
1/16/14
20
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 39
ü MAA active architectures reduce cost and increase ROI
MAA Solves HA Business Challenges
ü MAA eliminates risk of downtime and data loss
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 40
Agenda
§ High Availability (HA) Business Challenge
§ Oracle Maximum Availability Architecture (MAA)
§ Oracle MAA Reference Architectures
§ Customer Deployments
1/16/14
21
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 41
MAA in Action At World Class Customers* 1. PayPal: Oracle Database 12c MAA for Mission-Critical Web App
2. Wells Fargo: Active Data Guard at one of World’s Largest Banks
3. Fidelity: Robust Data Protection with Exadata
4. Symantec: Enabling Consolidation & High Availability with Exadata
5. Sanger: Oracle Database 12c Multitenant & MAA Best Practices
6. Amway: Deploying Oracle Database 12c Global Data Services
7. Boeing: Oracle Database 12c Flex Cluster & Flex ASM
8. Thomson Reuters: MAA Design Patterns in Action
9. Paychex: Achieving the Best RPO & RTO with MAA & Exadata
10. State Street: RMAN: Foundation for Exadata Backup & Recovery
11. US Cellular: Eliminate Performance Bottlenecks with Exadata Flash
12. CCI: Eliminate Performance Bottlenecks with Exadata
13. WestJet: Siebel + MAA with Exadata
14. Sherwin-Williams: Oracle E-Business Suite + MAA with Exadata
15. State of Texas: Journey to Database As A Service
* Ref. http://www.oracle.com/technetwork/database/features/availability/ow-papers-089200.html
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 42