高性能计算机和曙光 ghpc1000 集群系统

Click here to load reader

Upload: braima

Post on 28-Jan-2016

118 views

Category:

Documents


0 download

DESCRIPTION

高性能计算机和曙光 GHPC1000 集群系统. 技术支持中心 张新凤 曙光信息产业(北京)有限公司. 目 录. 1 高性能计算简介 1.1 什么是高性能并行计算机 1.2 流行的高性能计算机架构 1. 3 集群技术的趋势 2 本项目 GHPC1000 集群系统介绍. 1.1 什么是高性能并行计算机. 由众多部件组成,具有运算速度快、存储容量大、可靠性高的特性。. 也称为:巨型计算机、超级计算机 目前任何高性能计算和超级计算都离不开使用并行技术,所以高性能计算机肯定是并行计算机。. 1. 2 流行的高性能计算机架构. 并行向量机 - PowerPoint PPT Presentation

TRANSCRIPT

  • GHPC1000

  • 1 1.1 1.2 1.3 2 GHPC1000

  • 1.1

  • 1.2 SMPDSMNUMAMPPSMPDSMCluster

  • 19932006 TOP500

  • 1.3 MPP/PVP

    CC-NUMACPU

    SMP 64

  • 1 2 GHPC1000

  • CPU + GPU

    GPU183TFlops()

  • 1(A620r-T) 432=86 GPUGTX2952(A620r-T) 162=32 GPUC1060IO(A620-H)1DS6310EE 1 16TB Infiniband 1 36IB 196IB 1 20Gb IB 119 1 48 3

    6SKVM 1 1 GridViewPowerconfGNUCUDA

  • A620r-T GPUA620r-T43GPU:1Nvidia GTX295 GPU1AMD Opteron 2378 2.4G 16G1160GB SATA 21000MInfinibandDDR 20Gb/s HCA

  • 2A620r-T GPUA620r-T16GPU:1Nvidia C1060 GPU1AMD Opteron 2378 2.4G 16G1160GB SATA 21000MInfinibandDDR 20Gb/s HCA

  • -GPU21122GPU1

  • Form Factor 16.7x6.8 42.3cm x 17.3cmCPU:2AMD barcelona or shanghai Chipset:Nvidia nForce360016 DIMMDDR2 533/667 ECC REGLAN:2 Gigabit LANInfiniband: Mellanox InfiniHost III Lx DDR MT25204A0-FCC-D single portSATA:4-SATA2 Support Raid 0,1,5PCIE: 1 PCI-Ex16 (2IPMI 2.0

  • GPU-SERVER

  • I/OA620r1 2AMD Opteron 2378 2.4G16GB DDR2-6671146GB SAS HBA112Gb/s SAS 4x HBA2IB20Gb IB HCADS6310EE(16T):Raid4SAS 4x80Cache161TB SATA

  • InfinibandnfsIO116T

  • A620r-H :

    2Opteron 2000L2/L3512K / core2MB L34L2/L3512K / core6MB L34NVIDIA nForce3600/Max16DIMMs / 64GBDDR2 533/667 ECCRegDVD-RWUSB-DVDUSBSAS HostRAID011ESAS RAIDRAID56SATA HostRAID0156HostRAID12SATAIISAS21000MNvidia2PCI-E x16x83PCI-X 133/1001PCI 32Low Profile ES1000 32MB 600W 11IPMI

  • DS6310EE/DS6312EESAS-SAS/Intel IOP 3411.2GHz 4SAS 41SAS 4 SAS SAS/SATARaid011E565060 DS6310EEDS6312EE 512 MB - 2048 MB Cache Cache 3U 16 SAS 441680SAS/SATA2Dawning RAID Manager SMART condition pollingRAIDRAIDRAID0510501E

  • IP10.0.0.1255.255.255.0administratorpassword DS6310IPIPIPIPIP10.0.0.1255.255.255.0IP10.0.0.2/3255.255.255.0

  • DS6310

  • Super4 View MaintenancePDM PowerRAIDLUNRAIDStirpe sizeRAIDLUN Super SuperSuper userSuperSuper user

  • DS6310IPIPIPIP IP IP

  • IP IP

  • FIRMWARE

  • HTTPHTTP

  • 100%

  • RAID DS6310RAID01101E5506RAIDRAIDRAIDStripe Size

  • RAID1RAID 2RAIDRAID0RAIDRAID

  • RAID3RAIDCTRLShift>>

  • RAID4DS6310RAIDLUNLUNRaidIDLUNLUN

  • RAID5RAID0

  • RAID1

    2

  • RAID3RAIDconfirmOK

  • DS6310DS6310DS6310LUN

    1HBA 2LUN 3LUNHBA

  • 1HBA1HBAWWNHBAWWN>

  • 2LUN2LUNLUNLUN

  • 33LUN LUNLUNWWNLUNLUNLUNLUNLUNHBALUN0

  • LUN

  • RAIDRAIDLUN xx

  • RAIDRAID RAID

  • RAIDLUNLUNRaidIDLUNDS63105.9LUN5-38 RAIDLUN

  • RAIDLUN LUNconfirmOK

  • /RAID

    RAID

    RAID

    PDMDS6310PDMPredictive Data MigrationPDMRAIDMedia PatrolPDMPDM

    RAID

  • LUN

  • GlobalDedicatedRAIDDedicatedRAIDRAID

  • confirmOK

  • RAIDLUN2-UpdateLUNconfirmOK

  • IBVoltaireRoadRunner5000A196IB 20Gbps136IB 20Gbps IO

  • #bytes #repetitions t[usec] Mbytes/sec 0 1000 1.47 0.00 1 1000 1.57 0.61 2 1000 1.56 1.22 4 1000 1.53 2.49 8 1000 1.55 4.92 16 1000 1.60 9.52 32 1000 1.62 18.86 64 1000 1.61 37.90 128 1000 1.80 67.65 256 1000 2.05 119.26 512 1000 2.67 183.08 1024 1000 3.74 260.15 2048 1000 6.15 317.20 4096 1000 10.66 366.34 8192 1000 16.52 472.94 16384 1000 17.49 893.52 32768 1000 27.55 1134.41 65536 640 47.72 1309.74 131072 320 88.68 1409.62 262144 160 170.73 1464.31 524288 80 334.62 1494.24 1048576 40 662.45 1509.54 2097152 20 1318.55 1516.82 4194304 10 2637.10 1516.82 20Gb/40Gb Infinihost IV InfinibandInfinihost IVConnectXHPC5000HPL

    Infinihost III2.7-3.5usInfinihost IV(ConnectX)1.26us

  • * Page *9xx0 InfiniBand8

  • * Page *9xx0- Firmware,

  • * Page *9xx0-

  • 48 IO

  • 2U*17node/rack17node

  • 2000 mm 600 mm 1200mm 35 U340kg9.9kg42U35U25KW553+1GRIDVIEW

  • +

  • KVM/KVM

  • / GridViewPowerconf64 SUSE LinuxBlasLapackFFTWGNU C/C++/F77/F90 Compiler

    CUDAOpenMPIMPICH

  • GridView IPMI

  • PowerConf

    PowerConf---HPC,

  • ************