vmware - vmug montreal

43
© 2009 VMware Inc. All rights reserved VMware for Business Continuity What’s new with SRM 5 Vadim Shvarts Sr. Systems Engineer VMware Canada [email protected]

Upload: 1cloudroadcom

Post on 29-Oct-2014

36 views

Category:

Technology


0 download

DESCRIPTION

 

TRANSCRIPT

Page 1: VMware - VMUG Montreal

© 2009 VMware Inc. All rights reserved

VMware for Business Continuity

What’s new with SRM 5

Vadim Shvarts Sr. Systems EngineerVMware [email protected]

Page 2: VMware - VMUG Montreal

2 Confidential

Introduction – VMware for Business Continuity

Page 3: VMware - VMUG Montreal

3

43% of companies experiencing disasters never re-open, and 29% close within two years.

(McGladrey and Pullen)

93% of business that lost their data center for 10 days went bankrupt within one year.

(National Archives & Records Administration)

40% of all companies that experience a major disaster will go out of business if they cannot

gain access to their data within 24 hours.(Gartner)

Top executives say 10 hours to recovery;IT managers say up to 30 hours.

(Harris Interactive)

Disasters Happen. Do You Need Protection?

Page 4: VMware - VMUG Montreal

4

Business-Critical Applications Require Business Continuity

Availability Expectations on vSphere Continue to IncreaseRTO’s decreasing from >24 hours to <12 hours

38%

43%

53%

25% 25%

18%

% of Application Instances Running on VMware in Customer Base

MSExchange

MS SQL

MS SharePoint

OracleMiddleware

OracleDB

SAP

Source: VMware customer survey, Jan 2010 and April 2011 interim results,Data: Total number of instances of that workload deployed in your organization and the percentage of those instances that are virtualized

2010

2011

42%

47%

67%

34% 28% 28%

Page 5: VMware - VMUG Montreal

5

Drawbacks Of Traditional Business Continuity Solutions

Middleware / Java

Oracle RAC

Oracle DataGuard DB Mirroring

MS Clustering DB Access Groups

CCR / SCR

App Server Cluster

Session State Replication

Backup Data replication

Application-level availability silos:Complex and expensive

Shared availability services:Longer RTOs and RPOs

Availability requirements

Local Availability

Data Protection

Disaster Recovery

Page 6: VMware - VMUG Montreal

6

Improving Business Continuity At All Levels

Local Availability

vSphere High Availability

vSphere Fault Tolerance

vMotion and Storage vMotion

Data Protection

vSphere Data Recovery

Storage APIs for Data Protection

Local Site Failover Site

Disaster Recovery

vCenter Site Recovery Manager

Includes vSphere Replication

Newin 2011

Improved in 2011

Improved in 2011

vSphere vSpherevSphere vSphere vSphere

Improved in 2011

Page 7: VMware - VMUG Montreal

7

Transforming Cost And Complexity Of Business Continuity

Continuous

Hours

Days

RTO / RPO

Cost ($ per app)

$10,000

Minutes

$1,000$100

Shared availability services

(traditional backup, replication)

VMware business continuity

(HA, FT, vMotion, SRM, VDR)

App-level availability(Oracle RAC, MSCS, …)

• Much better RTOs than traditional backup and replication• Similar or lower cost

• Similar RTOs to app-level availability solutions• Much lower cost / complexity

Page 8: VMware - VMUG Montreal

8

Better Business Continuity Is #1 Objective For Virtualization

Top Five Objectives for Virtualization

Use virtualization to improve Business Continuity and Disaster Recovery (BCDR) 46%

Improve virtual machine performance 33%

Increase the server consolidation ratio 32%

Improve VM environment management 31%

More mission-critical applications 24%

Source: WW VMware customer survey, January 2010

N=1083

Page 9: VMware - VMUG Montreal

9 Confidential

Simple and Reliable DR with vSphere and SRM

Page 10: VMware - VMUG Montreal

10

Challenges of Traditional Disaster Recovery

ExpensiveComplex

Recovery Plans

?

?

?

??

??

?

Unreliable Failovers

Apps

Hosts

Storage

Network

Software

Hosts

Storage

Facilities

>$10K per app

Failure to meet business requirements• Long RTOs – days to weeks• Too much time and resources consumed=

+ +

Page 11: VMware - VMUG Montreal

11

vSphere Provides The Best Foundation For Disaster Recovery

Flexible Infrastructure• Eliminate need for identical hardware across

sites• Enable waterfalling of equipment to recovery site

Simple Application Protection• Entire system – including application, OS,

and data – is stored as virtual machine files• Entire system can be protected with data

protection tools

Cost-Efficient Infrastructure• Reduced hardware requirements at recovery

site• Use recovery hardware to run low-priority apps

Encapsulation

Consolidation

HardwareIndependence

vSphere

vSphere vSphere

Page 12: VMware - VMUG Montreal

12

Encapsulation Simplifies Application Protection And Recovery

Simplify recovery• No operating system re-install or bare-metal recovery

• No time spent reconfiguring hardware

Standardize recovery process• Consistent process independent of applications,

operating systems and hardware

Configure hardware

Install OS

Configure OS

Install backup agent

Start “Single-step automatic recovery”

RestoreVM

Poweron VM

Physical

Virtual

40+ Hrs.

< 4 Hrs.

Page 13: VMware - VMUG Montreal

13

vCenter Site Recovery Manager Ensures Simple, Reliable DR

Provide cost-efficient replication of applications to failover site• Built-in vSphere Replication• Broad support for storage-based

replication

Simplify management of recovery and migration plans• Replace manual runbooks with

centralized recovery plans• From weeks to minutes to set up new

plan

Automate failover and migration processes for reliable recovery• Enable frequent non-disruptive testing• Ensure fast, automated failover• Automate failback processes

Site Recovery Manager Complements vSphere to provide the simplest and most reliable disaster protection and site migration for all applications

VMware vSphere

VMwarevCenter Server

Site RecoveryManager

VMwarevCenter Server

Site RecoveryManager

VMware vSphere

Site A (Primary) Site B (Recovery)

Servers Servers

Page 14: VMware - VMUG Montreal

14

SRM Momentum

Introduced in Q2’ 2008

125,000+ units sold

5,000+ customers

50% annual growth in 2010

“If your organization is already taking advantage of virtualization, then adding Site Recovery Manager to handle disaster recovery is a no-brainer.”

― Jerry Wilkin Senior Systems Administrator, Dayton Superior Corp

Page 15: VMware - VMUG Montreal

15

Key Components Of SRM 5

Storage

vCenter ServerSite

Recovery Manager

Choice of Replication Options

Required at Both Protected and Recovery Sites

vSphere

Site Recovery Manager• Manages recovery plans

• Automates failovers and failbacks

• Tightly integrated with vCenter and replication

vSphere Replication• Bundled with SRM

• Replicates virtual machines between vSphere clusters

Storage-Based Replication (3rd party)• Provided by replication vendor

• Integrated via replication adapters created, certified and supported by replication vendor

Page 16: VMware - VMUG Montreal

16

Site Recovery Manager Complements vSphere For DR

Traditional DR VMware

Consolidation to reduce costs X

Hardware independence at failover site X

Encapsulation for simple recovery of entire systems X

vSphere Replication X

Simple management of recovery and migration plans X

Automated DR failover and non-disruptive testing X

Streamline planned migrations and automated failback X

SRMFunctionality

vSphereFunctionality

Page 17: VMware - VMUG Montreal

17

SRM Provides Broad Application Coverage

Continuous

Hours

Days

App-level geo-clustering / load balancing

RTO

RTO: 30 minutes to hoursRPO: Flexible based on storage replication

RPOSynchronousHoursDays

Site Recovery Manager

Tier 1

Tier 2

Tier 3

Page 18: VMware - VMUG Montreal

18

SRM Supports Flexible Topologies

Active-PassiveFailover

Active-ActiveFailover

Bi-directional Failover

Shared Recovery Sites

Production

Recovery

Production

Recovery

Production

Production

• Most common traditional scenario

• Expensive dedicated resources

• Leverage recovery infrastructure for test, development, training

• Utilize sunk cost of recovery site

• Production applications at both sites

• Each site acts as the recovery site for the other

• Many-to-one failover

• Particularly useful for Remote Office / Branch Office

Page 19: VMware - VMUG Montreal

19

What’s New In Site Recovery Manager 5.0?

vSphere Replication Bundled with SRM at no additional cost Provides simple, cost-efficient replication

between vSphere clusters

Automated failback Bi-directional recovery plans Automates failback to original site

Planned migration New workflow that can be applied to any

recovery plan Ensures no data-loss, application-

consistent migrations of virtual machines

Others More granular control over VM startup order Protection-side APIs IPv6 support

Expand DR coverage to Tier 2 apps and smaller sites

Streamline planned migrations(for disaster avoidance, planned maintenance, …)

Page 20: VMware - VMUG Montreal

20 Confidential

Cost-Efficient Replication To Expand DR Coverage

Page 21: VMware - VMUG Montreal

21

DR Coverage Often Limited Due To High Protection Costs

Tier 1 Apps - Protected

Tier 2 / 3 Apps – Backup only

Corporate Datacenter

Small Sites – Backup only

Small BusinessRemote Office / Branch Office

Need to expand DR protection

• Tier 2 / 3 applications in larger datacenters

• Small and medium businesses

• Remote office / branch offices

OS

APP

OS

APP

OS

APP

OS

APP

OS

APP

OS

APP

OS

APP

OS

APP

OS

APP

OS

APP

Page 22: VMware - VMUG Montreal

22

SRM Provides Broad Choice of Replication Options

vSphere Replication Simple, cost-efficient replication for Tier 2 applications and smaller sites

Storage-based ReplicationHigh-performance replication for business-critical applications in larger sites

vCenter ServerSite

Recovery Manager

vSphere

vCenter ServerSite

Recovery Manager

vSpherevSphere

Replication

Storage-based replication

Site A (Primary) Site B (Recovery)

Page 23: VMware - VMUG Montreal

23

vSphere Replication For Cost-Efficient, Simple Replication

Reduce storage costs by 2X• Support for heterogeneous

storage across sites, including non-replicating storage

• Use lower-end or older storage at failover site

Eliminate replication software costs

• vSphere Replication included with Site Recovery Manager at no additional cost

Manage replication directly from vCenter

• Eliminate complex interactions with storage teams

Manage replication at the individual VM level

• Eliminate need for complicated VM-to-LUN mapping

15 minute RPOs• Set RPOs between 15

minutes and 24 hours

Efficient network utilization• Replicate only changed disk

areas

Highly scalable• 500 virtual machines

Limitations• No automated failback• File-level consistency only

(except planned migration)• No FT, templates, linked

clones, physical RDMs

Cost-efficient Simple Powerful

Page 24: VMware - VMUG Montreal

24

Storage Replication

Expand DR Protection To Tier 2 Apps And Small Sites

Tier 1 Apps

Tier 2 / 3 Apps

Corporate Datacenter

Small Sites

Small BusinessRemote Office / Branch Office

vSphere Replication

vSphere Replication

vSphere

$1,000

$2,000$2,000/VM

Tier 1 Storage Failover Site

Replication SW

SRMEnterprise

$600/VM

Tier 2 Storage Failover Site

SRM Standard

Storage, Replication, and SRM Costs per Protected VM

Storage ReplicationLarge site

vSphere ReplicationSmall site

Page 25: VMware - VMUG Montreal

25

Simplify Replication Management With vSphere Replication

Overview

Benefits

vSphere Replication provides simple management of replication Managed directly from vCenter Managed at the individual VM-level

Eliminate complex interactions between vSphere and storage teams to set up replication

Eliminate need to shuffle VMs between datastores to map applications to replicated LUNs

Hub

LUN 1

LUN 2

VMFS A

Datastore Group

Web

SharePoint

SQL

App

vSphere Replication

Web

SharePoint

SQL

App

vSphere Admin

Storage Admin

vSphere Admin

Storage-based Replication

Datastore

VMFS BDatastore

Page 26: VMware - VMUG Montreal

26 Confidential

ESXi

Recovery SiteProtected Site

ESXESXESXi

VSR Agent vSphere Replication

Server

Tightly Integrated With SRM, vCenter and ESX

Site Recovery Manager

Site Recovery Manager

vSphere Replication Management Server

vSphere Replication Management Server

Any storage supported by

vSphere

Any storage supported by

vSphere

vCenter Server vCenter Server

vSphere Replication Architecture

Page 27: VMware - VMUG Montreal

27 Confidential

Simple Recovery and Migration Plans

Page 28: VMware - VMUG Montreal

28

Simple Setup And Management of Recovery And Migration Plans

Weeks or months to set up

Error-prone

Quickly falls out of sync with apps and infrastructure changes

Simple recovery plan set up in minutes

Fewer steps means far less room for errors

Simple to keep in sync with changes

…to Simple Recovery PlansFrom Complex Runbooks…

Page 29: VMware - VMUG Montreal

29

Step 2

Step 3

Step 4

Step 5

Five Simple Steps To Create Recovery And Migration Plans

Create Recovery Plans in 5 Steps…

Step 1

Map production site resources to recovery site• Resource pools• vSwitches• VM folders

Select virtual machine protection groups to include in recovery

Specify boot sequence of recovered VMs

Customize IP addresses of recovered VMs

Select low-priority VMs to suspend at recovery site

…And Eliminate Manual Steps of Traditional Recovery

Coordinate storage and replication processes for recovery

• Stop replication and make replicated LUNs writable

• Present data to applications• Present VMs to vSphere

Reconfigure individual hosts

Reconfigure physical switching infrastructure

Recover entire systems including OSand application binaries

X

X

X

X

Add messages and custom scriptsOptional

Page 30: VMware - VMUG Montreal

30

Application Consistent Recovery With SRM

Storage-based replication: application consistency widely available

• Enabled by replication management software

• Typically relies on agents in the VMs to properly quiesce applications

• For both DR failover and planned migrations

vSphere Replication: Application consistency for planned migrations only

• File-system consistency for DR failover via VSS requester in VMware Tools

Application Consistency Enabled by Replication Provider

Quiesce application

Replicate app-consistent VM

App-consistent VM presented

to SRM

Replication management

Page 31: VMware - VMUG Montreal

31 Confidential

Fully Automated Disaster Failovers and Planned Migrations

Page 32: VMware - VMUG Montreal

32

Beyond DR: Disaster Avoidance And Planned Migrations

Recover from unexpected site failure

• Full or partial site failure

The most critical but least frequent use-case

• Unexpected site failures do not happen often

• When they do, fast recovery is critical to the business

Anticipate potential datacenter outages

• For example: in case of planned hurricane, floods, forced evacuation, etc.

Initiate preventive failover for smooth migration

• Leverage SRM ‘planned migration’ to ensure no data-loss

• ‘Automated failback’ enables easy return to original site

Most frequent SRM use case• Planned datacenter

maintenance• Global load balancing

Streamline routine migrations across sites

• Test to minimize risk• Execute partial failovers• Leverage SRM ‘planned

migration’ to ensure no data-loss

• ‘Automated failback’ enables bi-directional migrations

Disaster Failover Disaster Avoidance Planned Migration

3 typical use-cases for SRM

Page 33: VMware - VMUG Montreal

33

SRM Reduces Recovery Risk With Frequent Testing

During the testing gap, organizations can’t be sure that they can recover the current IT environment

A failover scenario may take days or weeks to complete, leaving the business at extreme risk

SRM provides assurance that DR objectives will be met.

Lack of confidence in DR process

TimeDR Test DR Test

Changes to Applications and

Infrastructure Configuration

TESTING GAP

RecoveryRisk

Traditional Disaster Recovery

RecoveryRisk

DR Test DR TestTime

Site Recovery Manager

Frequent DR Testing

Page 34: VMware - VMUG Montreal

34

SRM Enables Frequent Non-Disruptive Testing

Overview

Benefits

Automate test execution• Execute recovery plan• Customizable for testing with extra callouts

and breakpoints• Log results of the test

Isolated test environment• Snapshot replicated LUNs• Launch VMs in fenced network• Reset environment after test

Confidence and documentation that DR requirements are satisfied

Quickly identify and remediate potential issues

Reduce cost and resources required for DR testing• Eliminate traditional ‘DR testing weekends’

Non-disruptive TestingRecovery Site

Isolated test environment

LUN snapshot

vSphere

Recovery Site

Replication

Page 35: VMware - VMUG Montreal

35

Automate DR Failover Processes

Overview

Benefits

Automatically detect site failures Require user to manually initiate failover

Automate recovery process Stop replication and present replicated LUNs

to vSphere Execute user-defined recovery plan

Ensure fast and predictable failovers and migrations

Consistently meet business requirements

Minimize risk of user errors

Site BSite A

Replication

1 Raise alert when hearbeat lost

2 User initiates failover

X3

Stop replication and present LUNs to vSphere

4 Recover VMs

DR Failover

vSphere vSphere

Page 36: VMware - VMUG Montreal

36

Testing and Executing Recovery Plans

Steps in recovery plan Status and time

stamps

When to execute

User confirmation

message

Page 37: VMware - VMUG Montreal

37

Planned Migrations For App Consistency & No Data Loss

Overview

Benefits

Two workflows can be applied to recovery plans: DR failover Planned migration

Planned migration ensures application consistency and no data-loss during migration Graceful shutdown of production VMs in

application consistent state Data sync to complete replication of VMs Recover fully replicated VMs

Better support for planned migrations

No loss of data during migration process

Recover ‘application-consistent’ VMs at recovery site

Planned Migration

Site BSite A

Replication

1 Shut down production VMs

2 Sync data, stop replication and present LUNs to vSphere

3 Recover app-consistent VMs

vSphere vSphere

Page 38: VMware - VMUG Montreal

38

Simplify failback process Automate replication management Eliminate need to set up new recovery plan

Streamline frequent bi-directional migarations

Automated Failback To Streamline Bi-Directional Migrations

Re-protect VMs from Site B to Site A Reverse replication Apply reverse resource mapping

Automate failover from Site B to Site A Reverse original recovery plan

Restrictions Does not apply if Site A has undergone major

changes / been rebuilt Not available with vSphere Replication

Overview

Benefits

Automated Failback

Site BSite A

Reverse Replication

Reverse original recovery plan

vSphere vSphere

Page 39: VMware - VMUG Montreal

39 Confidential

Next Steps

Page 40: VMware - VMUG Montreal

40

Successful Business Continuity Requires Careful Planning

Business Requirements / Business Impact Analysis (BIA)• Map service Tiers by availability requirements and cost

• For each service, identify Availability requirements, Recovery Time Objectives (RTO), Recovery Point Objectives (RPO)

Application Dependency Mapping• Identify dependencies between application

components

• Weakest link in the chain? (AD, DNS, etc)

Business Continuity Design• App-specific solutions / virtualization

for HA and DR / backup only

• Budget ahead of time

• Project planning / phasing

Use Professional Services• VMware PSO

• VMware BCDR Competency partners (300+ highly qualified partners)

Page 41: VMware - VMUG Montreal

41

SRM 5 Editions Lineup

SRM 5

Standard Enterprise

Price per protected virtual machine (license only)

$195 $495

Scalability Limits

• Maximum protected VMs 75 virtual machines (1) Unlimited(2)

Features

• Support for storage-based replication

• Centralized recovery plans

• Non-disruptive testing

• Automated DR failover

• vSphere Replication

• Automated failback

• Planned migration

New in SRM 5.01. Maximum of 75 VMs per site and per SRM instance

2. Subject to the product’s technical scalability limits

Page 42: VMware - VMUG Montreal

42

VMware BC/DR Service Offerings

VMware vCenter Site Recovery Manager Jumpstart

• The VMware vCenter Site Recovery Manager Jumpstart provides you with a proof-of-concept, on-site installation and configuration of SRM

• 3 days on-site, 5 participants max

Custom BCDR Plan and Design Service

• Comprehensive architectural design for BCDR, covering data protection, local availability, and disaster recovery.

• Address customer-specific requirements

• Flexible engagement model and duration

Page 43: VMware - VMUG Montreal

© 2009 VMware Inc. All rights reserved

Questions?