opennebulaconf2015 2.14 cloud service experience in teidehpc infrastructure - carlos ignacio...

32
OpenNebula implementation in TeideHPC infrastructure Carlos I. González Vila OpenNebulaConf 2015 - Barcelona

Upload: opennebula-project

Post on 20-Mar-2017

513 views

Category:

Technology


2 download

TRANSCRIPT

Page 1: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

OpenNebula implementation in TeideHPC infrastructure

Carlos I. González Vila

OpenNebulaConf 2015 - Barcelona

Page 2: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

ITERFounded by the Cabildo Insular de Tenerife in 1990, the island's administrative authority. Its objectives are to promote research activities and technological development, especially those related with the use of the renewable energies.

Page 3: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

ITER

ITER

MA

NA

GE

ME

NT

RENEWABLE ENERGIES

Departamento de Fotovoltaica Departamento de Eólica

DIS

EM

INA

TIO

N A

RE

A

MA

INT

EN

AN

CE

Photovoltaics

Sustainable architecture

Wind Energy

INFORMATION AND NEW TECHNOLOGIES

Electronics

Informatics

Groundwater resources

Environmental quality

Volcanology

ENVIRONMENT SCIENCE

Page 4: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

TeideHPC

Page 5: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

TeideHPC

● Weather forecasting– Wind power generation

– Volcanic smoke dispersion prediction

– Solar power generation

● Bioinformatics● Computational Fluid Dynamics

– Aerodynamics, aeroacoustics

– Industrial design

Page 6: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

TeideHPC

● Astrophysics– Solar composition

– Cosmological Parameters Estimation

– Analysis of solar spectro-polarimetric observations

● Computer sciences● Streaming services● Render ● Cloud computing

Page 7: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

OpenNebula

● Implementation● Study cases

– Infiniband virtualization

– Interoperability

● Use cases– Multidatastore (lbox)

– Remote visualization

– Render farm

Page 8: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

OpenNebula Implementation

● API● Cobbler● Chef● NetApp Storage● Monitoring ● Repository

Page 9: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

OpenNebula Implementation

High Performance Computing Cloud computing

● Ethernet network configuration● Infiniband configuration● User auth & permissions

API

Page 10: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

OpenNebula Implementation

● API● Cobbler● Chef● NetApp Storage● Monitoring● Repository

Page 11: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

OpenNebula Implementation

● API● Cobbler● Chef● NetApp Storage● Monitoring● Repository

Page 12: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

OpenNebula Implementation

● API● Cobbler● Chef● NetApp Storage● Monitoring● Repository

Page 13: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

OpenNebula Implementation

● API● Cobbler● Chef● NetApp Storage● Monitoring● Repository

Page 14: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Study case – Infiniband virtualization

Page 15: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Study case – Infiniband virtualization

● OpenNebula Ecosystem● KVM SR-IOV Driver

– Single root I/O Virtualization

Virtual functions

PCIe device

Page 16: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Study case – Infiniband virtualization

● OpenNebula Ecosystem● KVM SR-IOV Driver

– Single root I/O Virtualization

Virtual functions

PCIe device

# lspcib0:00.0 Network controller: Mellanox Technologies MT27500 Family [ConnectX-3]b0:00.1 Network controller: Mellanox Technologies MT27500/MT27520 Family [ConnectX-3/ConnectX-3 Pro Virtual Function]b0:00.2 Network controller: Mellanox Technologies MT27500/MT27520 Family [ConnectX-3/ConnectX-3 Pro Virtual Function]

● Enable option in BIOS● Enable IO virtualization in kernel boot line● Define number of Virtual Functions in kernel module load● Burn SR-IOV capable firmware in the HCA (OEM)● Upgrade OFED version (2.3-2.0.5-rhel6.6)

Page 17: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Study case – Infiniband virtualization

● OpenNebula Ecosystem● KVM SR-IOV Driver● Great A'Tuin

– GPU devices support was funded by SURFsaraOpenNebula 4.14 Release notes... Support for GPU consumables, giving the ability to give exlcusive PCI passthrough access to VMs to GPU cards, for HPC computing.

Page 18: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Study case – Interoperability limitations

Page 19: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Study case – Interoperability limitations

● 2 research centers– Share resources

● Conectivity● Federation● Cloud bursting

Page 20: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Study case – Interoperability limitations

● 2 research centers– Share resources

● Conectivity● Federation● Cloud bursting

● End-to-end connection

– L2-L3

– Jumbo frames

● 1 lambda - 10 Gbps

● Ping < 40 ms

● VPN IPsec

● +600Mbps between 1Gbps hosts

Page 21: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Study case – Interoperability limitations

● 2 research centers– Share resources

● Conectivity● Federation● Cloud bursting

Tightly coupled

Page 22: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Study case – Interoperability limitations

● 2 research centers– Share resources

● Conectivity● Federation● Cloud bursting

Page 23: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Study case – Interoperability limitations

● 2 research centers– Share resources

● Conectivity● Federation● Cloud bursting

OpenNebula 4.8 docs... “The remote provider could be a commercial Cloud service, such as Amazon EC2, IBM SoftLayer or Microsoft Azure, or a partner infrastructure running a different OpenNebula instance”

Page 24: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Use case – Multiple datastore

Page 25: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Use case – Multiple datastore

● Host load balancing required

● Migrate / live migrate● Time of migration (live)

– VM RAM size

– Host bandwith (IB)

– Network

Page 26: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Use case – Multiple datastore

● Storage overloaded● Delete-recreate & Deploy to new

datastore● Take image snapshot (same DS)

& create template & deploy– VM HDD size

● Manual migration: – Stop VM + copy image to new DS

+ delete-recreate + deploy + replace image

Page 27: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Use case – Remote visualization

● Lack of rendering hardware● Low performance with ssh -X (even with

compression)● Spice

– Audio+video

– Low BW requirements

● QXL driver (variable desktop resolution)– 1024 x 768 … 2560 x 1600

Page 28: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Use case – Remote visualization

Page 29: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Use case – Remote visualization

Page 30: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Capture the Flag

● 404 VMs● Connected to client private

infrastructure through IPsec tunnel

● 20,8 TB HDD● 12,8 TB RAM● 12800 cores● 100 KW

Page 31: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

Thank you

[email protected] @teidehpc teideHPC

Page 32: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila

References

● http://blog.scottlowe.org/2009/12/02/what-is-sr-iov/

● http://docs.opennebula.org/4.12/advanced_administration/data_center_federation/introf.html

● http://docs.opennebula.org/4.14/release_notes/release_notes/whats_new.html#opennebula-core