aws-enabled disaster recovery and business continuity for sifis

30
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Felix Candelario Global Financial Services Solutions Architect March 23, 2017 AWS-Enabled Disaster Recovery and Business Continuity for SIFIs

Upload: amazon-web-services

Post on 07-Apr-2017

102 views

Category:

Technology


2 download

TRANSCRIPT

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

Felix Candelario – Global Financial Services Solutions Architect

March 23, 2017

AWS-Enabled Disaster Recovery

and Business Continuity for SIFIs

“Everything fails,

all the time”

Werner Vogels

(CTO, Amazon.com)

ELB/Appliance

EC2/Auto Scaling

Route 53

Load Balancers

Web/App Servers

Your Data

Centers

DNS

Amazon RDS

Security Groups / ACL

Availability Zones / VPC

Multi-regionGeographical

Redundancy

Data Centers

Firewall

Database Servers

DR Terminology Map

Architecting for Recovery &

Resiliency

DisasterRecovery point Recovery time

Data loss Down time

Backup &

Restore

Pilot Light

Warm

Standby

Hot-Site

Backup of on-premises

data to AWS to use in a

DR event

Replicate data and

minimal running

services into AWS,

ready to take over and

flare up

Replicate data and

services into AWS

ready to take over

Replicated and load

balanced environments

that are both actively

taking production traffic

RPO

aRTO

COST

24 hours 24 hours

$

RPO

aRTO

COST

12 hours 4 hours

$$

RPO

aRTO

COST

1-4 hours 15 min

$$$

RPO

aRTO

COST

<15 min 0-5 min

$$$

Ascending Levels of DR Options

~$200 / Month

In US-EAST

+VPN

On-premises

Active Production

www.example.com

Corporate data center AWS region

AWS DR failover

App

Servers

DB

Server

VPN

Connection

Storage

GatewayiSCSI

Backup

System

S3 / Bucket

Glacier / Archive

Web

Servers Internet traffic

S3 (1TB)

$31/Month

Glacier (2TB)

$22/Month

Storage Gateway

$125/Month

S3 / Bucket

S3 (1TB)

$31/Month1TB Data

Volume

Backup and Restore Architecture

Suitable for

• Solutions that can sustain higher technical debt

• Lower business critical nature

• Low cost DR option

Leverage existing investments in

• De-duplication

• Compression

• WAN Acceleration

Backup and Restore Details

Pilot light

Subordinate

database

server

Pilot light–prepwww.example.com

Data mirroring replication

Not running

Pilot light system

Reverse

proxy/

caching

server

Datavolume

Application

server

Corporate data center

Reverse proxy/ caching server

Application server

MasterDatabase

server

Database

server

Pilot light–recoverywww.example.com

Start in minutes

Add additional

capacity,

if needed

Reverse

proxy/

caching

server

Datavolume

Application

server

Corporate data center

Reverse proxy/ caching server

Application server

MasterDatabase

server

Considerations

Suitable for:

• Solutions that need lower RTO & RPO

• higher business critical nature

• Mid-range cost DR option

Pilot Light Details

Warm standby

Warm standby–prep

Mirroring /replication

Application data source

cut over

Elastic loadbalancer

ActiveNot active for

production traffic

Route 53

www.example.com

Scaled down

standbyCorporate data center

Datavolume

Applicationserver

Subordinatedatabase

server

Reverse proxy/ caching server

AWS region

Reverse proxy/ caching server

Application server

MasterDatabase

server

Warm standby–recover

Elastic loadbalancerActive

Route 53

www.example.com

Scaled-up

production

Corporate data center

Datavolume

Applicationserver

Databaseserver

AWS region

Reverse proxy/ caching server

Application Server

MasterDatabase

server

Reverse proxy/ caching server

Hot site

Hot site – prep

Mirroring /replication

Application data source

cut over

Elastic loadbalancer

ActiveRoute 53

www.example.com

Corporate data center

Datavolume

Applicationserver

Subordinate database

server

AWS region

Reverse proxy/ caching server

Application server

MasterDatabase

server

Active

Reverse proxy/ caching server

Hot site – recovery

Elastic loadbalancer

Route 53

www.example.com

Corporate data center

Datavolume

Applicationserver

Databaseserver

Reverse proxy/ caching server

AWS region

Reverse proxy/ caching server

Application server

MasterDatabase

server

Active

Scaled up

for production

use

Considerations

Suitable for:

• Solutions that require RTO & RPO in minutes

• Core business critical functions

• Higher cost DR option

Warm Standby and Multi-site Details

Disaster Recovery Testing &

Assurance

Continuous Testing of Infrastructure

• Continuously and constantly test.

• Regularly execute tests in stable, production &

production-like test environments.

• Infrastructure as Code

• CI/CD Test in Infrastructure Build Pipeline

• Testing of infrastructure during Integration Test

Warm Standby – Testing

Mirroring /replication

Application data source

cut over

Elastic loadbalancer

ActiveNot active for

production traffic

Route 53

www.example.com

Scaled down

standbyCorporate data center

Datavolume

Applicationserver

Subordinatedatabase

server

Reverse proxy/ caching server

AWS region

Reverse proxy/ caching server

Application server

MasterDatabase

server

Warm Standby – Testing

Mirroring /replication

Application data source

cut over

Elastic loadbalancer

ActiveNot active for

production traffic

Route 53

www.example.com

Scaled down

standbyCorporate data center

Datavolume

Applicationserver

Subordinatedatabase

server

Reverse proxy/ caching server

AWS region

Reverse proxy/ caching server

Application server

MasterDatabase

server

Warm Standby – Testing

Mirroring /replication

Application data source

cut over

Elastic loadbalancer

ActiveNot active for

production traffic

Route 53

www.example.com

Scaled down

standbyCorporate data center

Datavolume

Applicationserver

Subordinatedatabase

server

Reverse proxy/ caching server

AWS region

Reverse proxy/ caching server

Application server

MasterDatabase

server

Warm Standby – Testing

Mirroring /replication

Application data source

cut over

Elastic loadbalancer

ActiveNot active for

production traffic

Route 53

www.example.com

Scaled down

standbyCorporate data center

Datavolume

Applicationserver

Subordinatedatabase

server

Reverse proxy/ caching server

AWS region

Reverse proxy/ caching server

Application server

MasterDatabase

server

aws rds reboot-db-instance --db-instance-identifier

dbInstanceID --force-failover

Architecting for Cloud

Architecting for Resiliency

Cloud-Based Architectures

• High level of control over the environment

• Automate Everything! – Utilize AWS APIs

• Infrastructure as code – CloudFormation

• Parallel environment

• Rolling Update / All at Once

• Blue / Green Deployments

- Significant difference between physical and cloud is the

control and visibility cloud provides

Benefits of deployment on AWS

AWS:

• Agile deployments

• Flexible options

• RPO/RTO & Business

Continuity objectives

• Scalable capacity

• Pay for what you use

• Automation capabilities