the 6d mesh/torus interconnect of k computer · tofu: 6d mesh/torus interconnect architecture high...

The 6D Mesh/Torus Interconnect of K Computer Tomohiro Inoue FUJITSU LIMITED

Post on 12-Aug-2020


The 6D Mesh/Torus Interconnect of K Computer

Tomohiro Inoue FUJITSU LIMITED


Contents

Introduction

Architecture

Node construction

Network construction

Routing function

Conclusion


Interconnect of K computer

Tofu: Fujitsu’s original 6D mesh/torus interconnect

High communication performance

High system scalability

High fault-tolerance


Node construction

Single CPU and single interconnect controller

10 links for inter-node connection

10 GB/s per link

Total 100 GB/s of off-chip bandwidth

Feeds sufficient data to the high-performance CPU

[Figure: node block diagram. A CPU (SPARC64 VIIIfx, 128 GFlops) is attached to the interconnect controller, which provides 10 links at 10 GB/s each.]
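The bandwidth figures on this slide can be checked with a few lines of arithmetic. The sketch below uses only the numbers stated above (10 links, 10 GB/s per link, 128 GFlops). The bytes-per-flop ratio is a derived illustration, not a figure from the slides, and the function names are hypothetical.

```python
# Values taken from the slide.
LINKS_PER_NODE = 10
LINK_BANDWIDTH_GBS = 10.0   # GB/s per link
CPU_PEAK_GFLOPS = 128.0     # peak GFlops of one CPU


def off_chip_bandwidth_gbs(links=LINKS_PER_NODE, per_link=LINK_BANDWIDTH_GBS):
    """Aggregate off-chip bandwidth of one node in GB/s."""
    return links * per_link


def bytes_per_flop(bandwidth_gbs, peak_gflops=CPU_PEAK_GFLOPS):
    """Interconnect bytes available per peak floating-point operation."""
    return bandwidth_gbs / peak_gflops


total = off_chip_bandwidth_gbs()
print(f"off-chip bandwidth: {total} GB/s")
print(f"bytes per flop: {bytes_per_flop(total)}")
```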


Network construction

6 links: scalable xyz 3D torus

4 links: fixed-size abc 3D mesh/torus

|a| = 2, |b| = 3, |c| = 2: 12 nodes

Total topology is 6D mesh/torus

Cartesian product of xyz and abc mesh/torus

[Figure: a node (CPU and interconnect controller) with its links into the abc 3D mesh/torus and the xyz 3D torus.]
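The link counts above can be illustrated by listing the neighbours of a node in the Cartesian product of the two networks. This is a minimal sketch, not Fujitsu's implementation. It assumes coordinates are ordered (x, y, z, a, b, c), that a and c are meshes of size 2 while b is a torus of size 3 (consistent with the 4 abc links stated above), and that each xyz dimension has at least 3 nodes so both torus neighbours are distinct. All names are hypothetical.

```python
ABC_SIZES = (2, 3, 2)  # |a|, |b|, |c| from the slide

# Assumption: x, y, z and b wrap around (torus); a and c do not (mesh).
WRAPS = (True, True, True, False, True, False)


def axis_neighbors(value, size, wrap):
    """Distinct neighbouring coordinates of `value` along one axis."""
    if wrap:
        candidates = {(value - 1) % size, (value + 1) % size}
    else:
        candidates = {v for v in (value - 1, value + 1) if 0 <= v < size}
    candidates.discard(value)
    return sorted(candidates)


def neighbors(node, xyz_sizes, abc_sizes=ABC_SIZES):
    """All nodes directly linked to `node`, a 6-tuple (x, y, z, a, b, c)."""
    sizes = tuple(xyz_sizes) + tuple(abc_sizes)
    result = []
    for axis, (size, wrap) in enumerate(zip(sizes, WRAPS)):
        for v in axis_neighbors(node[axis], size, wrap):
            result.append(node[:axis] + (v,) + node[axis + 1:])
    return result


links = neighbors((0, 0, 0, 0, 0, 0), xyz_sizes=(4, 4, 4))
print(len(links))  # 6 xyz links + 4 abc links
```

With a size of 2, an axis contributes one link whether it is treated as a mesh or a torus, so the mesh assumption for a and c does not change the count.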


Network construction cont.

From another perspective:

Twelve overlaid xyz tori

X × Y × Z array of abc mesh/torus units

Twelve times higher scalability than the 3D torus network
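The "twelve overlaid tori" view can be made concrete by grouping all node coordinates by their abc part. The sketch below assumes coordinates are ordered (x, y, z, a, b, c). The xyz sizes in the usage line are arbitrary example values, not the dimensions of any real system.

```python
from collections import defaultdict
from itertools import product

ABC_SIZES = (2, 3, 2)  # |a|, |b|, |c| from the slides


def overlaid_tori(xyz_sizes, abc_sizes=ABC_SIZES):
    """Map each abc coordinate to the xyz positions that share it."""
    sizes = tuple(xyz_sizes) + tuple(abc_sizes)
    tori = defaultdict(list)
    for node in product(*(range(s) for s in sizes)):
        tori[node[3:]].append(node[:3])
    return dict(tori)


tori = overlaid_tori((4, 3, 2))
nodes_in_3d_torus = 4 * 3 * 2
total_nodes = sum(len(positions) for positions in tori.values())
print(len(tori), total_nodes // nodes_in_3d_torus)
```

Each of the twelve groups is a full X × Y × Z torus, so the node count is twelve times that of a 3D torus with the same xyz sizes.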


Network construction cont.

Each pair of adjacent abc mesh/torus units is interconnected by twelve links
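The twelve links follow from the product structure: each of the 12 nodes in one abc unit has an xyz link to the node with the same abc coordinates in the adjacent unit. The sketch below enumerates those links under that assumption. The function name and the (x, y, z, a, b, c) ordering are hypothetical.

```python
from itertools import product

ABC_SIZES = (2, 3, 2)  # |a|, |b|, |c| from the slides


def links_between(cell_p, cell_q, xyz_sizes, abc_sizes=ABC_SIZES):
    """Links joining the abc unit at xyz position cell_p to the one at cell_q.

    Returns an empty list when the two units are not adjacent in the
    xyz torus.
    """
    differing = [axis for axis in range(3) if cell_p[axis] != cell_q[axis]]
    if len(differing) != 1:
        return []
    axis = differing[0]
    size = xyz_sizes[axis]
    step = (cell_q[axis] - cell_p[axis]) % size
    if step not in (1, size - 1):  # one hop forward or backward, with wraparound
        return []
    # Assumption: nodes with equal abc coordinates are the ones linked.
    return [(cell_p + abc, cell_q + abc)
            for abc in product(*(range(s) for s in abc_sizes))]


print(len(links_between((0, 0, 0), (1, 0, 0), xyz_sizes=(4, 4, 4))))
```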


Routing algorithm

Extended dimension-order routing

Additional abc traversal

abc → xyz → abc

The first abc traversal selects the path


Video Demonstration

Example routing

Routing from (x=0, y=0, z=0, a=0, b=0, c=0) to (x=3, y=2, z=1, a=1, b=1, c=1)

Traverses +b, +a, +x ×3, +y ×2, +z, +c
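The example can be reproduced with a small sketch of abc → xyz → abc routing. Several details here are assumptions rather than facts from the slides: the intermediate abc coordinate is taken to be (a=1, b=1, c=0), the per-phase axis orders (b, c, a, then x, y, z, then a, c, b) were chosen because they reproduce the hop sequence above, x, y, z and b are treated as tori while a and c are meshes, and the xyz sizes are arbitrary example values. All names are hypothetical.

```python
AXES = ("x", "y", "z", "a", "b", "c")
ABC_SIZES = (2, 3, 2)
# Assumption: x, y, z and b wrap around (torus); a and c do not (mesh).
WRAPS = {"x": True, "y": True, "z": True, "a": False, "b": True, "c": False}


def axis_hops(axis, start, target, size, wrap):
    """Unit hops (e.g. '+x') that move one coordinate from start to target."""
    if wrap:
        forward = (target - start) % size
        backward = (start - target) % size
        if forward <= backward:
            return ["+" + axis] * forward
        return ["-" + axis] * backward
    delta = target - start
    return [("+" if delta > 0 else "-") + axis] * abs(delta)


def route(src, dst, via_abc, xyz_sizes):
    """Hop list for abc -> xyz -> abc routing through intermediate via_abc."""
    sizes = dict(zip(AXES, tuple(xyz_sizes) + ABC_SIZES))
    pos = dict(zip(AXES, src))
    goal = dict(zip(AXES, dst))
    via = dict(zip("abc", via_abc))
    phases = [
        [(axis, via[axis]) for axis in "bca"],   # first abc traversal
        [(axis, goal[axis]) for axis in "xyz"],  # dimension-order xyz
        [(axis, goal[axis]) for axis in "acb"],  # second abc traversal
    ]
    hops = []
    for phase in phases:
        for axis, target in phase:
            hops += axis_hops(axis, pos[axis], target, sizes[axis], WRAPS[axis])
            pos[axis] = target
    return hops


print(route((0, 0, 0, 0, 0, 0), (3, 2, 1, 1, 1, 1),
            via_abc=(1, 1, 0), xyz_sizes=(8, 8, 8)))
```

Choosing a different `via_abc` changes which xyz torus carries the long-distance hops, which is how the first abc traversal acts as path selection.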


Detouring faulty nodes

Multipath routing allows packets to detour around faulty nodes

[Figure: alternative paths from source to destination that avoid a faulty node.]
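One way to picture the detour is to try each of the twelve possible intermediate abc coordinates and keep the first path that touches no faulty node. This is an illustrative sketch only; the slides do not say how a path is actually chosen. It assumes (x, y, z, a, b, c) ordering, the phase orders b, c, a then x, y, z then a, c, b, and torus wraparound on x, y, z and b. All names are hypothetical.

```python
from itertools import product

ABC_SIZES = (2, 3, 2)
WRAP = (True, True, True, False, True, False)  # x, y, z, a, b, c (assumed)


def walk(pos, axis, target, size, wrap):
    """Yield successive nodes while moving pos along one axis to target."""
    pos = list(pos)
    while pos[axis] != target:
        if wrap:
            forward = (target - pos[axis]) % size
            step = 1 if forward <= size - forward else -1
            pos[axis] = (pos[axis] + step) % size
        else:
            pos[axis] += 1 if target > pos[axis] else -1
        yield tuple(pos)


def path_nodes(src, dst, via_abc, xyz_sizes):
    """Nodes visited by abc -> xyz -> abc routing through via_abc."""
    sizes = tuple(xyz_sizes) + ABC_SIZES
    # Axis indices: x=0, y=1, z=2, a=3, b=4, c=5.
    plan = [(4, via_abc[1]), (5, via_abc[2]), (3, via_abc[0]),
            (0, dst[0]), (1, dst[1]), (2, dst[2]),
            (3, dst[3]), (5, dst[5]), (4, dst[4])]
    nodes = [src]
    for axis, target in plan:
        for node in walk(nodes[-1], axis, target, sizes[axis], WRAP[axis]):
            nodes.append(node)
    return nodes


def detour(src, dst, xyz_sizes, faulty):
    """First (via_abc, path) whose nodes avoid every faulty node, else None."""
    for via_abc in product(*(range(s) for s in ABC_SIZES)):
        nodes = path_nodes(src, dst, via_abc, xyz_sizes)
        if not faulty.intersection(nodes):
            return via_abc, nodes
    return None


src, dst = (0, 0, 0, 0, 0, 0), (3, 0, 0, 0, 0, 0)
via, nodes = detour(src, dst, (8, 8, 8), faulty={(1, 0, 0, 0, 0, 0)})
print(via, nodes)
```

In this example the direct path along x is blocked, so the search settles on a path that first steps in c, travels along x in a neighbouring torus, and then steps back.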


Conclusion

Tofu: 6D mesh/torus interconnect architecture

High communication performance

100 GB/s of off-chip bandwidth feeds sufficient data to the high-performance CPU

High system scalability

12x higher scalability compared with 3D torus

High fault-tolerance

Multipath routing algorithm allows packets to detour around faulty nodes
