designing efficient systems services and primitives for ... · designing efficient systems services...

18
Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers K. Vaidyanathan, S. Narravula, P. Balaji and D. K. Panda Network Based Computing Laboratory (NBCL) Computer Science and Engineering Ohio State University NSF #CNS - 0509452

Upload: others

Post on 25-Sep-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

K. Vaidyanathan, S. Narravula, P. Balaji and D. K. PandaNetwork Based Computing Laboratory (NBCL)

Computer Science and EngineeringOhio State University

NSF #CNS - 0509452

Page 2: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Introduction and Motivation

• Interactive Data-driven Applications– Scientific as well as Enterprise/Commercial Applications

• Static Datasets: Medical Imaging Modalities• Dynamic Datasets: Stock value datasets, E-commerce, Sensors

– Need for interacting, synthesizing and visualizing large datasets– Data-centers enable such capabilities

• Clients initiate queries (over the web) to process specific datasets– Data-centers process data and reply to queries

Page 3: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Typical Multi-Tier Data-center Environment

• Requests are received from clients over the WAN

• Proxy nodes perform caching, load balancing, resource monitoring, etc.

• If not cached, the request is forwarded to the next tiers Application Server

• Application server performs the business logic (CGI, Java servlets, etc.)– Retrieves appropriate data from the database to process the requests

ProxyServer

Web-server(Apache)

Application Server (PHP)

DatabaseServer

(MySQL)

WANWAN

ClientsStorage

More Computation and CommunicationRequirements

Page 4: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Overview of Research• Propose a novel framework for next generation data-centers

– Delivering performance and scalability

– Providing advanced features such as active caching, fine-grain resource monitoring, dynamic resource adaptation, etc

• Novel approaches using the advanced features of InfiniBand and other RDMA-enabled Networks

– Resilient to the load on the back-end servers

– Order of magnitude performance gain for several scenarios

– Exploit features like RDMA and remote atomic operations for new primitives and services

• Three-layer Architecture– Advanced Communication Protocol Support

– Data-Center Primitives

– Data-Center Services

Page 5: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Proposed ArchitectureExisting Data-Center Components

RDMA Atomic Multicast

Sockets Direct Protocol

ProtocolOffload

PacketizedFlow-control

GlobalMemory

Aggregator

DistributedLock

Manager

PointTo

Point

Advanced System Services

Data-CenterService

Primitives

AdvancedCommunication Protocols

and Subsystems

Network

ActiveCaching

SoftSharedState

Async. Zero-copyCommunication

CooperativeCaching

DynamicReconfiguration

ResourceMonitoring

Dynamic Content Caching Active Resource Adaptation

Distributed Data Sharing Substrate

ActiveCaching

CooperativeCaching

DynamicReconfiguration

ResourceMonitoring

SoftSharedState

DistributedLock

Manager

Distributed Data Sharing Substrate

Async. Zero-copyCommunication

Page 6: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Publications (So Far)• Architecture for Caching Responses with Multiple Dynamic

Dependencies in Multi-Tier Data-Centers over InfiniBand, CCGrid 2005• On the Provision of Prioritization and Soft QoS in Dynamically

Reconfigurable Shared Data-Centers over InfiniBand, ISPASS 2005• Asynchronous Zero-copy Communication for Synchronous Sockets in

the Sockets Direct Protocol (SDP) over InfiniBand, CAC 2006• Designing Efficient Cooperative Caching Schemes for Multi-Tier Data-

Centers over RDMA-enabled Networks, CCGrid 2006• Exploiting RDMA operations for Providing Efficient Fine-Grained

Resource Monitoring in Cluster-Based Servers, RAIT 2006• DDSS: A Low-Overhead Distributed Data Sharing Substrate for

Cluster-Based Data-Centers over Modern Interconnects, HiPC 2006• High Performance Distributed Lock Management Services using

Network-based Remote Atomic Operations, CCGrid 2007

http://nowlab.cse.ohio-state.edu/projects/data-centers/index.html

Page 7: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Sockets Direct Protocol: Throughput and OverlapT hro ug hp ut

0

2 0 0 0

4 0 0 0

6 0 0 0

8 0 0 0

1 0 0 0 0

1 2 0 0 0

1 4 1 6 6 42 5 6 1 K 4 K1 6 K6 4 K

2 5 6 K 1 MM e s s a g e S ize (B yte s )

Thro

ughp

ut (M

bps)

B S D P

ZS D P

A Z -S D P

C o m p ./C o m m . O ve rla p

0

1 0 0 0

2 0 0 0

3 0 0 0

4 0 0 0

5 0 0 0

6 0 0 0

7 0 0 0

8 0 0 0

9 0 0 0

1 0 0 0 0

0 2 0 4 0 6 0 8 01 0 01 2 01 4 01 6 01 8 02 0 0

D e la y (us e c )

Thro

ughp

ut (M

bps)

B S D P

ZS D P

A Z S D P

Asynchronous Zero-copy Communication for Synchronous Sockets in the Sockets DirectProtocol (SDP) over InfiniBand, P. Balaji, S. Bhagvat, H. –W. Jin and D. K. Panda. Workshop on Communication Architecture for Clusters (CAC); with IPDPS ‘06.

Page 8: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Presentation Layout

Introduction and Motivation

Cooperative Caching Services

Resource Monitoring Services

Conclusions and Ongoing Work

Page 9: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Cooperative Caching Services

• Aggregate cache benefits – well known!!• Performance considerations

– Two-sided operation vs. One-sided RDMA operations– Placement of data ( Local Vs. Remote)– Controlling data redundancy– Utilize available remote memory– Load sensitive Protocols

• Objective– Can we design efficient cooperative caching schemes

utilizing the idle resources in the Data-Centers and the RDMA capabilities in networks and eliminate redundancy to optimize available system cache size?

Page 10: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Data-Center Throughput with Cooperative Caching

0

5 0 0 0

1 0 0 0 0

1 5 0 0 0

2 0 0 0 0

2 5 0 0 0

3 0 0 0 0

3 5 0 0 0

A p a c h eC a c h in g

Ba s icC o o p e r a t iv e

C a c h in g

C o o p e r a t iv eC a c h in gW it h o u t

Re d u n d a n c y

M u lt i- T ie rA g g r e g a t e

C o o p e r a t iv eC a c h in g

Hy b r idC o o p e r a t iv e

C a c h in g

8 k 1 6 k 3 2 k 6 4 k

8-Proxy nodes

• Our schemes achieve significant performance gain over basic Apache Caching (AC)

Designing Efficient Cooperative Caching Schemes for Multi-Tier Data-Centers over RDMA-enabled Networks, S. Narravula, H. -W. Jin, K. Vaidyanathanand D. K. Panda. In International Symposium on Cluster Computing and the Grid (CCGrid), 2006

TPS

Page 11: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Presentation Layout

Introduction and Motivation

Data-center Service Primitives

Cooperative Caching Services

Resource Monitoring Services

Conclusions and Ongoing Work

Page 12: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Resource Monitoring Services

• Traditional approaches– Coarse-grained in nature– Assume resource usage is consistent throughout the monitoring

granularity (in the order of seconds)

• This assumption is no longer valid– Resource usage is becoming increasingly divergent

• Fine-grained monitoring is desired but has additional overheads – High overheads, less accurate, slow in response

• Can we design fine-grained resource monitoring scheme with low overhead and accurate resource usage?

Page 13: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Synchronous Resource Monitoring using RDMA (RDMA-Sync)

/proc

KernelSpace

UserSpace

KernelSpace

UserSpace

Front-endNode Memory Memory

CPU CPU

AppThreads

Front-endMonitoringProcess

KernelData Structures

AppThreads

Back-endNode

RDMA

Page 14: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Impact of Fine-grained Monitoring with Applications

Im p a c t o n R U B iS a n d Zip f Tra c e s

0 %

5 %

1 0 %

1 5 %

2 0 %

2 5 %

3 0 %

3 5 %

4 0 %

α = 0 .9 α = 0 .7 5 α = 0 .5 α = 0 .2 5

Zip f a lp h a va lu e s

% Im

prov

emen

t

S o c k e t -S y n c R D M A -A s y n c R D M A -S y n c e -R D M A -S y n c

Exploiting RDMA operations for Providing Efficient Fine-Grained Resource Monitoring in Cluster-Based Servers, K. Vaidyanathan, H. –W. Jin and D. K. Panda. Workshop on Remote Direct Memory Access (RDMA): Applications, Implementations and Technologies, 2006

• Our schemes (RDMA-Sync and e-RDMA-Sync) achieve significant performance gain over existing schemes

Page 15: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Work-in-Progress• Data-Center Primitives

– Efficient Global Memory Aggregator Mechanisms

• Advanced Communication Protocol Mechanisms

– Efficient Packetized Flow-Control

• Detailed Data-Center Evaluation with the proposed framework

• Software release of several data-center components

– Have received multiple requests from organizations for such a

release including a large financial company

Page 16: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Conclusions• Proposed new protocols, primitives and services for next

generation data-centers– Use advanced features of InfiniBand and other RDMA-Enabled

interconnects

– Significant performance gains and scalability for several scenarios

• Potential for designing next generation scalable and high

performance data-center architectures

Page 17: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Challenges and Discussion Bullet• Challenges

– Benefits of all these components and services in an integrated

manner for handling

• Petabytes of data and Multi-thousand users

– Redesigning middleware and applications on next generation

data-centers

• Significance to the SMA and PDOS components of the CSR

program

• Discussion Bullet

– How to re-architect next generation data-center architectures,

software services, middleware and applications with advances in

modern networking technologies and capabilities?

Page 18: Designing Efficient Systems Services and Primitives for ... · Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers ... Typical Multi-Tier Data-center

Web Pointers

Website: http://www.cse.ohio-state.edu/~panda

Group Homepage: http://nowlab.cse.ohio-state.edu

Email: [email protected]

NBCL