Open Architecture Lab
PROTOTYPING
Performance Prototype Platform
John Taylor
University of Cambridge
SDP F2F Jodrell Bank
13-02-2017
Performance Prototype
● Provide a managed service function to SDP that supports horizontal and vertical prototyping activities as a means of risk reduction.
○ UIS@UCAM is building a service portal to provide access to the prototype and to report infrastructure status.
● Horizontal prototyping is currently conducted through a number of separate prototyping activities,
○ e.g. SIP
● Vertical prototyping is focused on specific COTS technologies.
● Provide a prototyping infrastructure to address specific risks, engaging with industry where appropriate.
Current Requirements
● Support of SIP work
● Support of COTS Execution Framework
● Support of UCAM work on high performance I/O
● Support of Receive function - BDN
● Provisioning Management and Control function using OpenStack (Middleware)
● Other prototyping activities ()
Performance Prototype
STATUS
• Grant award for hardware with service and integration from UCAM.
• Order placed on 20/12/16
• Racking and Stacking to begin 20/02
• Installation to begin 28/02
• System Availability Mid-March
• Rack-view (to the right). TBC
Compute/Storage (ARM)
CEPH APPLIANCE
1x 4TB
• 2x 2TB SSD per node
• 1x 8-core AMD Opteron A1100 per node
• 2x RDIMM 32GB DDR3 DRAM
• 2x 10GBase-T Ethernet
1x 4TB
• 2x 2TB SSD per node
• 1x 8-core AMD Opteron A1100 per node
• 2x RDIMM 32GB DDR3 DRAM
• 2x 10GBase-T Ethernet
3x 76TB (228TB total)
• 12x 6TB HDD per node
• 2x 2TB SSD per node
• 1x 8-core AMD Opteron A1100 per node
• 2x RDIMM 16GB DDR3 DRAM
• 2x 10GBase-T Ethernet
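The capacity figures above follow directly from the drive counts. A minimal sketch of that arithmetic is given here as a cross-check; it covers raw capacity only, since the slides do not state the Ceph replication or erasure-coding settings that determine usable capacity. The function and variable names are illustrative.

```python
# Raw capacity per node type, using the drive counts from the slide.
# Names are illustrative; usable Ceph capacity is not modelled here.

def node_raw_tb(hdd_count, hdd_tb, ssd_count, ssd_tb):
    """Raw capacity of one node in TB: HDD capacity plus SSD capacity."""
    return hdd_count * hdd_tb + ssd_count * ssd_tb

# SSD-only nodes: 2x 2TB SSD
ssd_node_tb = node_raw_tb(0, 0, 2, 2)

# Storage nodes: 12x 6TB HDD plus 2x 2TB SSD
storage_node_tb = node_raw_tb(12, 6, 2, 2)

# Three storage nodes in total
storage_total_tb = 3 * storage_node_tb

print(ssd_node_tb, storage_node_tb, storage_total_tb)
```

The three values match the 4TB, 76TB and 228TB figures quoted on the slide.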
Networking
• 2 x Mellanox EDR configured as a stack (back-to-back cables)
• 32-port 100GbE Dell switch configured with 4x25GbE break-out cables
• 10/40 GbE switch for connection to additional network
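As a rough sizing aid, the break-out arrangement on the 100GbE switch can be expressed as simple arithmetic. The slides do not say how many of the 32 ports are broken out, so the sketch below treats that number as a parameter; the value 8 in the example is purely an assumption.

```python
# Port arithmetic for a 32-port 100GbE switch with 4x25GbE break-out cables.
SWITCH_PORTS = 32    # 100GbE ports on the switch (from the slide)
BREAKOUT_LINKS = 4   # each break-out cable splits one port into four links
LINK_GBPS = 25       # speed of each break-out link

def breakout_links(ports_broken_out):
    """Number of 25GbE links obtained by breaking out the given number of ports."""
    if not 0 <= ports_broken_out <= SWITCH_PORTS:
        raise ValueError("port count must be between 0 and 32")
    return ports_broken_out * BREAKOUT_LINKS

# Upper bound: every port broken out.
max_links = breakout_links(SWITCH_PORTS)

# Hypothetical split: 8 ports broken out, the rest left at 100GbE.
example_links = breakout_links(8)
remaining_100g_ports = SWITCH_PORTS - 8

print(max_links, example_links, remaining_100g_ports)
```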
OpenStack
https://docs.google.com/document/d/1b-1GV5L4NSEtyoG_0wNdc0a-fAxzyox7cx6G91bHW8w/view
OPENSTACK AND THE SOFTWARE-DEFINED SUPERCOMPUTER
TCP BANDWIDTH - BARE METAL WITH IRONIC COMPARED TO VM
OpenStack Kolla
● Deployment of OpenStack in containers
● Multiple base & container OS distributions
○ Including RHEL & CentOS
● Supports a long list of OpenStack projects
● Tools to build Docker images
● Ansible orchestrates configuration & deployment of services
○ Deployment is (relatively) fast
● Provides a minimal & functional default configuration
● Highly flexible configuration
○ Set any configuration option
● Rolling upgrades via recreation of containers with new images
OpenStack Ironic
● Bare metal management
● Supports many types of hardware via drivers
● Typically interacts with a Baseboard Management Controller (BMC)
○ E.g. Dell iDRAC
● Manages nodes’ power state
● Provisions images
○ Deployment image PXE booted
○ Image transferred via iSCSI direct to disk or downloaded from Swift
● Provides access to nodes’ serial consoles
● Integrates with OpenStack Nova
○ Nova Ironic virtualization driver presents bare metal nodes as ‘hypervisors’
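The bare metal lifecycle described above (enrol a node, make it available, deploy an image, release it) can be pictured as a small state machine. The sketch below is a toy model and not the Ironic API: it uses a simplified subset of provisioning states, omits intermediate states such as cleaning and deploying, and the class, verb and node names are illustrative assumptions.

```python
# Toy model of a bare metal node lifecycle. Simplified: intermediate
# states are omitted and the verb names are illustrative.

class BareMetalNode:
    # (current state, verb) -> next state
    TRANSITIONS = {
        ("enroll", "manage"): "manageable",
        ("manageable", "provide"): "available",
        ("available", "deploy"): "active",
        ("active", "undeploy"): "available",
    }

    def __init__(self, name):
        self.name = name
        self.provision_state = "enroll"
        self.power_state = "power off"

    def apply(self, verb):
        """Apply a verb, raising ValueError if it is not valid in the current state."""
        key = (self.provision_state, verb)
        if key not in self.TRANSITIONS:
            raise ValueError(f"cannot {verb!r} from state {self.provision_state!r}")
        self.provision_state = self.TRANSITIONS[key]
        # Simplification: a deployed node is powered on, a released node is off.
        if verb == "deploy":
            self.power_state = "power on"
        elif verb == "undeploy":
            self.power_state = "power off"
        return self.provision_state


node = BareMetalNode("arm-node-0")  # hypothetical node name
for verb in ("manage", "provide", "deploy"):
    node.apply(verb)
print(node.provision_state, node.power_state)
```

In a real deployment the power changes would go through the node's BMC and the image would be written during the deploy step; neither is modelled here.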
Operating Environment
• Automated deployment of OpenStack control plane
  – Services deployed in containers on bare metal
  – OpenStack Kolla
    • Container images
    • Ansible playbooks for configuration and orchestration (
  – Simplified control plane updates and reconfiguration
    • Innovate quickly
• Compute nodes managed as a bare metal cloud
  – OpenStack Ironic (Kayobe)
  – Avoid overhead of virtualisation
  – Hardware heterogeneity managed via Nova flavors
• Software Defined Networking (SDN)
  – OpenStack Neutron
  – OpenDaylight
• Monitoring & log aggregation for control & compute plane
  – OpenStack Monasca
  – Grafana dashboards
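To illustrate how flavors can manage hardware heterogeneity in a bare metal cloud, the sketch below matches flavors to nodes by their hardware properties. It is a simplified model, not Nova's scheduler: the flavor and node names are hypothetical, and the exact-match rule is an assumption that reflects the fact that a bare metal node is handed out whole and is not subdivided.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Flavor:
    name: str
    cpus: int
    ram_gb: int
    cpu_arch: str

@dataclass(frozen=True)
class Node:
    name: str
    cpus: int
    ram_gb: int
    cpu_arch: str

def nodes_for_flavor(flavor, nodes):
    """Return the nodes whose hardware exactly matches the flavor."""
    wanted = (flavor.cpus, flavor.ram_gb, flavor.cpu_arch)
    return [n for n in nodes if (n.cpus, n.ram_gb, n.cpu_arch) == wanted]

# Hypothetical inventory loosely based on the ARM nodes described earlier:
# 8 cores each, with either 2x 32GB or 2x 16GB of DRAM.
inventory = [
    Node("arm-0", 8, 64, "aarch64"),
    Node("arm-1", 8, 64, "aarch64"),
    Node("arm-2", 8, 32, "aarch64"),
]
flavors = [
    Flavor("arm.64g", 8, 64, "aarch64"),
    Flavor("arm.32g", 8, 32, "aarch64"),
]

matches = {f.name: [n.name for n in nodes_for_flavor(f, inventory)] for f in flavors}
print(matches)
```

One flavor per hardware class keeps the user-facing choice simple: a user asks for a flavor and receives a whole node of that class.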
User Environment
• Users can interact with OpenStack via:
  – Horizon web dashboard
  – Command Line Interface (CLI)
  – REST APIs
• Several options for provisioning user applications
  – OpenStack Nova
    • Provision individual servers
  – OpenStack Heat
    • Provision stacks composed of multiple resources
  – Options for deployment of containerised workloads
  – Configuration management tools
    • e.g. Ansible
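As an illustration of provisioning a stack composed of multiple resources, the sketch below builds a small Heat orchestration template containing several servers. Heat templates are normally written in YAML; JSON is used here only so the sketch needs nothing beyond the Python standard library. The image, flavor and network names are hypothetical placeholders, and the helper function is not part of any OpenStack library.

```python
import json

def make_stack_template(server_names, image, flavor, network):
    """Build a Heat (HOT) template dict with one OS::Nova::Server per name."""
    resources = {
        name: {
            "type": "OS::Nova::Server",
            "properties": {
                "name": name,
                "image": image,
                "flavor": flavor,
                "networks": [{"network": network}],
            },
        }
        for name in server_names
    }
    return {
        "heat_template_version": "2016-10-14",
        "description": "Illustrative stack of identical servers",
        "resources": resources,
    }

# All argument values below are hypothetical placeholders.
template = make_stack_template(
    ["worker_0", "worker_1"],
    image="centos7",
    flavor="arm.64g",
    network="prototype-net",
)
template_json = json.dumps(template, indent=2)
print(template_json)
```

The resulting document could then be submitted through any of the interfaces listed above (Horizon, the CLI or the REST API), whereas a single server would more simply be requested from Nova directly.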