Redundant IOC with ATCA(HPI) support

Redundant IOC with ATCA(HPI) support Utilizing modern hardware for better availability Artem Kazakov, KEK/SOKENDAI


Post on 11-Jan-2016



TRANSCRIPT

Page 1: Redundant IOC with ATCA(HPI) support

Redundant IOC with ATCA(HPI) support

Utilizing modern hardware for better availability

Artem Kazakov, KEK/SOKENDAI

Page 2: Redundant IOC with ATCA(HPI) support

Why run RIOC on ATCA?

• ATCA is a modern industry standard for HA applications
  – Supposed to be very reliable (99.999% design availability)

• ATCA is suggested as a platform for the ILC control system
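To put the 99.999% design availability figure in perspective, the short calculation below converts an availability fraction into a yearly downtime budget. It is only a worked arithmetic example; the function name and the use of an average 365.25-day year are assumptions made for illustration, not something stated in the slides.

```python
# Worked arithmetic: yearly downtime allowed by a given availability figure.
# Assumption: an average year of 365.25 days.
MINUTES_PER_YEAR = 365.25 * 24 * 60


def downtime_minutes_per_year(availability: float) -> float:
    """Return the downtime (minutes per year) implied by an availability fraction."""
    if not 0.0 <= availability <= 1.0:
        raise ValueError("availability must be between 0 and 1")
    return (1.0 - availability) * MINUTES_PER_YEAR


five_nines = downtime_minutes_per_year(0.99999)
print(f"{five_nines:.2f} minutes per year")
```

Under these assumptions, "five nines" leaves a budget of a little over five minutes of downtime per year.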

Page 3: Redundant IOC with ATCA(HPI) support

Advanced Telecom Computing Architecture (AdvancedTCA)

• Defined by PCI Industrial Computer Manufacturers Group with 100+ companies participating

• Targeted to requirements for the next generation of carrier grade communications equipment

• Incorporates the latest trends in high speed interconnect technologies, next generation processors and improved reliability, manageability and serviceability

Page 4: Redundant IOC with ATCA(HPI) support

AdvancedTCA chassis and blades

Page 5: Redundant IOC with ATCA(HPI) support

ATCA Features

ATCA provides monitoring and management controls for many parts of the system: fans, network connections, power supplies, BIOS images, boot ROMs, etc.

The key role in this process is played by the Shelf Manager

We want to use these features to make better decisions for fail-over

Page 6: Redundant IOC with ATCA(HPI) support

ATCA Shelf manager

Data is exchanged through the redundant Intelligent Platform Management Bus (IPMB)

Page 7: Redundant IOC with ATCA(HPI) support

Redundant IOC

• Provides redundancy support for EPICS IOCs
• Developed at DESY
• Support is already in BASE since the EPICS 3.14.10 release
  – No need to patch/reconfigure/recompile BASE
  – Just download the RIOC libs and link them to your IOC to make it redundant

Page 8: Redundant IOC with ATCA(HPI) support

What is redundant IOC?

[Diagram. Labels: IOC#1, IOC#2, Private Ethernet, Shared Network, CA clients, Public, Hardware, PV1, PV2, PV3]

Page 9: Redundant IOC with ATCA(HPI) support

“plain” Redundant IOC on ATCA

[Diagram. Labels: IOC#1, IOC#2, Private Ethernet, Shared Network, CA clients, Public, Hardware, PV1, PV2, PV3, ATCA shelf]

Page 10: Redundant IOC with ATCA(HPI) support

“plain” Redundant IOC on ATCA

• Runs “as-is”

• But does not know anything about the “smart” hardware of ATCA

• Basically the same as running on two normal PCs

Page 11: Redundant IOC with ATCA(HPI) support

Possible benefits of “ATCA”-aware RIOC

• Failures can be “predicted”
  – e.g. the temperature starts to rise while the CPU is still working -> we can initiate the fail-over procedure before the actual hardware fails -> fail-over occurs in a more stable and controlled environment
  – Client connections can be gracefully closed
  – Allowing the client to reconnect to the back-up IOC within 1 second
  – In case of a “real” hardware failure, reconnect would occur only after 30 seconds
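The idea of "predicting" a failure can be sketched as a simple decision rule. The example below is a minimal illustration, not the RIOC implementation: the class names, the single warning threshold and its value are all hypothetical, and a real system might look at the temperature trend rather than one reading. Only the 1 second and 30 second reconnect figures come from the slide.

```python
from dataclasses import dataclass
from enum import Enum


class Action(Enum):
    STAY = "stay"
    GRACEFUL_FAILOVER = "graceful fail-over"
    HARD_FAILOVER = "hard fail-over"


@dataclass
class BladeStatus:
    temperature_c: float  # latest reading from a (hypothetical) temperature sensor
    cpu_alive: bool       # whether the IOC on this blade is still running


# Hypothetical warning level; real values depend on the blade and the sensor.
WARNING_TEMP_C = 70.0

# Client reconnect times quoted on the slide, in seconds.
RECONNECT_SECONDS = {Action.GRACEFUL_FAILOVER: 1, Action.HARD_FAILOVER: 30}


def decide(status: BladeStatus) -> Action:
    """Pick a fail-over action from the current blade status."""
    if not status.cpu_alive:
        # The hardware has already failed: only an uncontrolled switch is possible.
        return Action.HARD_FAILOVER
    if status.temperature_c >= WARNING_TEMP_C:
        # Still working but degrading: switch while connections can be closed cleanly.
        return Action.GRACEFUL_FAILOVER
    return Action.STAY


action = decide(BladeStatus(temperature_c=75.0, cpu_alive=True))
print(action.value, RECONNECT_SECONDS[action])
```

The point of the sketch is the ordering of the checks: acting on a warning while the CPU is still alive is what makes the short reconnect path available.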

Page 12: Redundant IOC with ATCA(HPI) support

Redundancy Monitoring Task (RMT) - Key component of RIOC

Page 13: Redundant IOC with ATCA(HPI) support

RMT – Key component of RIOC

• Checks “health” of the drivers
• Controls drivers (start, stop, sync etc…)
• Checks network connectivity
• Checks the partner status
• Decides when to switch (or not to switch) to the partner
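The RMT responsibilities listed above can be pictured as a periodic check followed by a switch decision. The sketch below is an assumption-laden illustration: the slides do not describe the actual RMT decision logic, so the `Health` record and the rule in `should_switch` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Health:
    drivers_ok: bool   # result of the driver "health" checks
    network_ok: bool   # result of the network connectivity check
    partner_ok: bool   # whether the partner IOC reports itself as healthy


def should_switch(local: Health) -> bool:
    """Hypothetical rule: switch only if this side is unhealthy and the partner is not."""
    local_unhealthy = not (local.drivers_ok and local.network_ok)
    # Switching to an unhealthy partner would not improve availability.
    return local_unhealthy and local.partner_ok


print(should_switch(Health(drivers_ok=False, network_ok=True, partner_ok=True)))
```

The "or not to switch" case from the list is the second condition: a local fault alone is not enough if the partner cannot take over.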

Page 14: Redundant IOC with ATCA(HPI) support

ATCA/HPI driver for RMT


HPI (Hardware Platform Interface): a generic, platform-independent specification for monitoring and controlling HA systems

Page 15: Redundant IOC with ATCA(HPI) support

“HPI-aware” RIOC on ATCA

Page 16: Redundant IOC with ATCA(HPI) support

Now RMT can monitor any available sensor on the ATCA shelf and make better fail-over decisions

Configuration via iocSh:

rmtHPIDriverStart "{RACK,0}{ADVANCEDTCA_CHASSIS,0}{PHYSICAL_SLOT,4}{PICMG_FRONT_BLADE,0}" 1

General form: rmtHPIDriverStart "entityPath" "Sensor ID"
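The entity path in the example above is a chain of `{TYPE,index}` elements. The sketch below splits such a string into its parts. It is not the HPI library's own parser: the accepted syntax is inferred only from the single example on the slide, and the function name is hypothetical.

```python
import re
from typing import List, Tuple

# One "{TYPE,index}" element, as seen in the slide's example string.
_ELEMENT = re.compile(r"\{([A-Z0-9_]+),(\d+)\}")


def parse_entity_path(path: str) -> List[Tuple[str, int]]:
    """Split an entity path string into (entity type, index) pairs."""
    elements = []
    pos = 0
    while pos < len(path):
        match = _ELEMENT.match(path, pos)
        if match is None:
            raise ValueError(f"malformed entity path at offset {pos}: {path!r}")
        elements.append((match.group(1), int(match.group(2))))
        pos = match.end()
    return elements


example = "{RACK,0}{ADVANCEDTCA_CHASSIS,0}{PHYSICAL_SLOT,4}{PICMG_FRONT_BLADE,0}"
parsed = parse_entity_path(example)
for entity_type, index in parsed:
    print(entity_type, index)
```

Read this way, the example addresses the front blade in physical slot 4 of chassis 0 in rack 0, and the trailing `1` in the command is the sensor ID.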

Page 17: Redundant IOC with ATCA(HPI) support

Free Bonus

• The same driver can be used on hardware other than ATCA

• What is really needed is an HPI library, which can run on top of
  – IPMI
  – SNMP – e.g. IBM BladeCenter
  – Sysfs
  – …
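The reason the same driver can be reused is that it only talks to a common sensor interface, while the backend underneath varies. The sketch below illustrates that separation with a hypothetical `SensorBackend` interface and an in-memory stand-in; none of these names belong to a real HPI library, and no real IPMI, SNMP or sysfs access is performed.

```python
from typing import Dict, Protocol, Tuple


class SensorBackend(Protocol):
    """Hypothetical common interface that a driver would depend on."""

    def read_sensor(self, entity_path: str, sensor_id: int) -> float: ...


class FakeBackend:
    """In-memory stand-in for a backend such as IPMI, SNMP or sysfs."""

    def __init__(self, name: str, readings: Dict[Tuple[str, int], float]) -> None:
        self.name = name
        self._readings = readings

    def read_sensor(self, entity_path: str, sensor_id: int) -> float:
        return self._readings[(entity_path, sensor_id)]


def is_over_limit(backend: SensorBackend, entity_path: str, sensor_id: int, limit: float) -> bool:
    """Driver-side check that works unchanged with any backend."""
    return backend.read_sensor(entity_path, sensor_id) > limit


path = "{PHYSICAL_SLOT,4}"
ipmi_like = FakeBackend("ipmi-like", {(path, 1): 82.0})
snmp_like = FakeBackend("snmp-like", {(path, 1): 45.0})
print(is_over_limit(ipmi_like, path, 1, 70.0), is_over_limit(snmp_like, path, 1, 70.0))
```

The driver-side function never inspects which backend it was given, which is the property the slide relies on.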