aic ha201-tp (2u 24-bay storage server solution) user manualha201-tp user's manual 7 chapter 2...
TRANSCRIPT
![Page 1: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/1.jpg)
UM_HA201-TP_Final_061615
HA201-TPStorage Server Barebone
Users Manual
contents
CONTENTSPREFACE i
SAFETY INSTRUCTIONS iiChapter 1 Product Introduction 1
11 Box Content 112 Specifications 213 General Information 3
Chapter 2 Hardware Installation 7
21 Removing and Installing top cover 722 Central Processing Unit (CPU) 823 Diagram of the Correct Installation 1624 System Memory 1725 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map 2226 Removing and Installing a Fan Module 2527 Power Supply 2628 Tool-less Blade Slide Installation introduction 27
Chapter 3 Motherborad Overview 31
31 Intelreg Server Board Feature Set 3232 Motherboard Layout 3433 Motherboard block diagram 3534 Motherboard Configuration Jumpers 3635 Motherboard Configuration Jumpers 38
Chapter 4 12GB Expander Borad Overview 39
41 12GB Expander Board 39
Chapter 5 Backplane Overview 44
51 Backplane 44
Chapter 6 Debug amp Firmware Update Introduction 48
61 Expender firmware update through smart console port 4862 Update the expander firmware through in-band 6163 12G expander EDFB setting 65
contents
64 Slot HDD power setting 6765 HDD BP thermal sensor temperature setting 72
Chapter 7 Intelreg Server Board S2600TP Platform Management 75
71 Server Management Function Architecture 7572 Features and Functions 7873 Sensor Monitoring 8474 Intelreg Intelligent Power Node Manager (NM) 11675 Basic and Advanced Server Management Features 120
Chapter 8 Technical Support 129
Copyright copy 2015 AIC Inc All Rights Reserved
This document contains proprietary information about AIC products and is not to be disclosed or used except in accordance with applicable agreements
i
PREFACEbull Copyright
No part of this publication may be reproduced stored in a retrieval system
or transmitted in any form or by any means electronic mechanical photo-
static recording or otherwise without the prior written consent of the
manufacturer
bull Trademarks
All products and trade names used in this document are trademarks or
registered trademarks of their respective holders
bull Changes
The material in this document is for information purposes only and is subject
to change without notice
bull Warning
1 A shielded-type power cord is required in order to meet FCC emission
limits and also to prevent interference to the nearby radio and television
reception It is essential that only the supplied power cord be used
2 Use only shielded cables to connect IO devices to this equipment
3 You are cautioned that changes or modifications not expressly approved
by the party responsible for compliance could void your authority to
operate the equipment
bull Disclaimer
AIC shall not be liable for technical or editorial errors or omissions
contained herein The information provided is provided as is without
warranty of any kind To the extent permitted by law neither AIC or its
affiliates subcontractors or suppliers will be liable for incidental special or
consequential damages including downtime cost lost profits damages
relating to the procurement of substitute products or services or damages
for loss of data or software restoration The information in this document is
subject to change without notice
ii
SAFETY INSTRUCTIONSbull Before getting started please read the following important cautionsbull All cautions and warnings on the equipment or in the manuals should be
notedbull Most electronic components are sensitive to electrical static discharge
Therefore be sure to ground yourself at all times when installing the internal components
bull Use a grounding wrist strap and place all electronic components in static-shielded devices Grounding wrist straps can be purchased in any electronic supply store
bull Be sure to turn off the power and then disconnect the power cords from your system before performing any installation or servicing A sudden surge of power could damage sensitive electronic components
bull Do not open the systemrsquos top cover If opening the cover for maintenance is a must only a trained technician should do so Integrated circuits on computer boards are sensitive to static electricity Before handling a board or integrated circuit touch an unpainted portion of the system unit chassis for a few seconds This will help to discharge any static electricity on your body
bull Place this equipment on a stable surface when install A drop or fall could cause injury
bull Please keep this equipment away from humiditybull Carefully mount the equipment into the rack in such manner that it
wonrsquot be hazardous due to uneven mechanical loadingbull This equipment is to be installed for operation in an environment with
maximum ambient temperature below 35degCbull The openings on the enclosure are for air convection to protect the
equipment from overheating DO NOT COVER THE OPENINGSbull Never pour any liquid into ventilation openings This could cause fire or
electrical shockbull Make sure the voltage of the power source is within the specification
on the label when connecting the equipment to the power outlet The current load and output power of loads shall be within the specification
bull This equipment must be connected to reliable grounding before using Pay special attention to power supplied other than direct connections eg using of power strips
iii
bull Place the power cord out of the way of foot traffic Do not place anything over the power cord The power cord must be rated for the product voltage and current marked on the productrsquos electrical ratings label The voltage and current rating of the cord should be greater than the voltage and current rating marked on the product
bull If the equipment is not used for a long time disconnect the equipment from mains to avoid being damaged by transient over-voltage
bull Never open the equipment For safety reasons only qualified service personnel should open the equipment
bull If one of the following situations arise the equipment should be checked by service personnel1 The power cord or plug is damaged2 Liquid has penetrated the equipment3 The equipment has been exposed to moisture4 The equipment does not work well or will not work according to its user
manual5 The equipment has been dropped andor damaged6 The equipment has obvious signs of breakage7 Please disconnect this equipment from the AC outlet before cleaning
Do not use liquid or detergent for cleaning The use of a moisture sheet or cloth is recommended for cleaning
bull Module and drive bays must not be empty They must have a dummy cover
Product features and specifications are subject to change without notice
CAUTION
risk of explosion if battery is replaced by an incorrect type
dispose of used batteries according to the instructions
After performing any installation or servicing make sure the
enclosure are lock and screw in position turn on the power
1
HA201-TP Users Manual
Chapter 1 Product Introduction
11 Box Content
Chapter 1 Product Introduction
Before removing the subsystem from the shipping carton visually inspect the physical condition of the shipping carton Exterior damage to the shipping carton may indicate that the contents of the carton are damaged If any damage is found do not remove the components contact the dealer where the subsystem was purchased for further introduction Before continuing first unpack the subsystem and verify that the contents of the shipping carton are all there and in good condition
bull Enclosure( Power supply fan 24 HDD tray included)
bull RS232 cablex 1pcs bull Power cord x 2sets bull Screws kit x 1set
bull Slide rail x 1set
If any items are missing please contact your authorized reseller or sales representative
2
HA201-TP Users Manual
Chapter 1 Product Introduction
12 Specifications
Dimensions (W x D x H)(with chassis ears)
mm 4826 x 815 x 88
inches 19 x 32 x 35
Motherboard (per node) Intelreg Server Board S2600TP
Processor (per node)
Processor Support
Two Intelreg Xeonreg Processors E5-2600 v3 Product Family
QPI Speeds 96 GTs 8 GTs 72 GTs
Socket Type Socket R3 (FCLGA2011-3)
Chipset Support(per node) Intelreg C612 Chipset
System Memory (per node)
bull 16 DIMM slots in total across 8 memory channelsbull Registered DDR4 (RDIMM) Load Reduced DDR4 (LRDIMM)bull Memory DDR4 data transfer rates of 160018662133 MTsbull DIMM sizes of 4 GB 8 GB 16 GB or 32 GB depending on ranks and technology
Front Panel Power onoff
LEDs
A bull Power (Secondary)bull Warning
B bull Power (Primary)bull Warning
Drive Bays External 25 hot swap 24
Backplane 1 x 24-port 12Gb SAS dual-loop backplane
Expander Board(per node)
1 x 24-port 12Gb SAS expander board with 3 SFF-8643 connectors
Expansion Slots (per node) PCIe 30 5 x8 (4 LP amp 1 FHHL)
Internal IO Connectors Headers(per node)
bull 1 x internal USB 20 connector (port 67)bull 1 x 2x7pin header for system fan modulebull 1 x 1x12pin control panel headerbull 1 x DH-10 serial Port A connectorbull 1 x SATA 6Gbs port for SATA DOMbull 4 x SATA 6Gbs connectors (port 0123)bull 1 x 2x4 pin header for Intel RMM4 Litebull 1 x 1x4 pin header for Storage Upgrade Keybull 1 x 1x8 pin backup power controler connector
Rear IO(per node)
bull 1 x RJ45 (dedicated port for remote server management) 2 x RJ45 1 x VGA 2 x USB 20 Type A 1 x Mini SAS HD
Ethernet (per node) Dual GbE Intelreg I350 Gigabit Ethernet Controller
Video Support(per node) Integrated Matrox G200 2D Graphics Controller
Server Management(per node)
Onboard Server Engines LLC Pilot III Controller Support for Intelreg Remote Management Module 4 solutions IntelregLight-Guided Diagnostics on field replaceable units Support for Intelreg System Management Software Support for Intelreg Intelligent Power Node Manager (Need PMBus - compliant power supply) BIOS Flash Winbond W25Q64BV
Power Supply 1200W 1+1 redundant power supply
System Cooling (per node)
2 x 6056 fans
EnvironmentalSpecifications
Temperature 0degC - 35degCHumidity 5 - 95 non-condensing
Gross Weight (w PSU amp Rail)kgs 41
lbs 90
PackagingDimensions
(W x D x H)mm 590 x 1150 x 330
inches 232 x 453 x 13
Mounting Standard 28 tool-less slide rail
3
HA201-TP Users Manual
Chapter 1 Product Introduction
13 General Information
HA201-WP a 2U Storage Server Barebone supports two Intelreg Xeonreg processors E5-2600 v3 series HA201-TP has a 24 x 25rdquo HDD bays as system drive bays It is a perfect building block for Storage Servers
bull Front Panel
LED Indicator and Switch
24 x 25 hot-swap SATASAS HDD bays
4
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Rear Panel
1200W 1+1 80+ redundant power supply
2 x low-profile add-on card for external connection
SFF 8644 for JBOD connection
5
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
2 x hot-swappable storage control node
Intelreg Xeonreg E5-2600 v3 Processors
bull TOP View Of Controller Canister
6
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
4 x 60x56mm hot-swap fans
Intel S2600TP
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 2: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/2.jpg)
contents
CONTENTSPREFACE i
SAFETY INSTRUCTIONS iiChapter 1 Product Introduction 1
11 Box Content 112 Specifications 213 General Information 3
Chapter 2 Hardware Installation 7
21 Removing and Installing top cover 722 Central Processing Unit (CPU) 823 Diagram of the Correct Installation 1624 System Memory 1725 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map 2226 Removing and Installing a Fan Module 2527 Power Supply 2628 Tool-less Blade Slide Installation introduction 27
Chapter 3 Motherborad Overview 31
31 Intelreg Server Board Feature Set 3232 Motherboard Layout 3433 Motherboard block diagram 3534 Motherboard Configuration Jumpers 3635 Motherboard Configuration Jumpers 38
Chapter 4 12GB Expander Borad Overview 39
41 12GB Expander Board 39
Chapter 5 Backplane Overview 44
51 Backplane 44
Chapter 6 Debug amp Firmware Update Introduction 48
61 Expender firmware update through smart console port 4862 Update the expander firmware through in-band 6163 12G expander EDFB setting 65
contents
64 Slot HDD power setting 6765 HDD BP thermal sensor temperature setting 72
Chapter 7 Intelreg Server Board S2600TP Platform Management 75
71 Server Management Function Architecture 7572 Features and Functions 7873 Sensor Monitoring 8474 Intelreg Intelligent Power Node Manager (NM) 11675 Basic and Advanced Server Management Features 120
Chapter 8 Technical Support 129
Copyright copy 2015 AIC Inc All Rights Reserved
This document contains proprietary information about AIC products and is not to be disclosed or used except in accordance with applicable agreements
i
PREFACEbull Copyright
No part of this publication may be reproduced stored in a retrieval system
or transmitted in any form or by any means electronic mechanical photo-
static recording or otherwise without the prior written consent of the
manufacturer
bull Trademarks
All products and trade names used in this document are trademarks or
registered trademarks of their respective holders
bull Changes
The material in this document is for information purposes only and is subject
to change without notice
bull Warning
1 A shielded-type power cord is required in order to meet FCC emission
limits and also to prevent interference to the nearby radio and television
reception It is essential that only the supplied power cord be used
2 Use only shielded cables to connect IO devices to this equipment
3 You are cautioned that changes or modifications not expressly approved
by the party responsible for compliance could void your authority to
operate the equipment
bull Disclaimer
AIC shall not be liable for technical or editorial errors or omissions
contained herein The information provided is provided as is without
warranty of any kind To the extent permitted by law neither AIC or its
affiliates subcontractors or suppliers will be liable for incidental special or
consequential damages including downtime cost lost profits damages
relating to the procurement of substitute products or services or damages
for loss of data or software restoration The information in this document is
subject to change without notice
ii
SAFETY INSTRUCTIONSbull Before getting started please read the following important cautionsbull All cautions and warnings on the equipment or in the manuals should be
notedbull Most electronic components are sensitive to electrical static discharge
Therefore be sure to ground yourself at all times when installing the internal components
bull Use a grounding wrist strap and place all electronic components in static-shielded devices Grounding wrist straps can be purchased in any electronic supply store
bull Be sure to turn off the power and then disconnect the power cords from your system before performing any installation or servicing A sudden surge of power could damage sensitive electronic components
bull Do not open the systemrsquos top cover If opening the cover for maintenance is a must only a trained technician should do so Integrated circuits on computer boards are sensitive to static electricity Before handling a board or integrated circuit touch an unpainted portion of the system unit chassis for a few seconds This will help to discharge any static electricity on your body
bull Place this equipment on a stable surface when install A drop or fall could cause injury
bull Please keep this equipment away from humiditybull Carefully mount the equipment into the rack in such manner that it
wonrsquot be hazardous due to uneven mechanical loadingbull This equipment is to be installed for operation in an environment with
maximum ambient temperature below 35degCbull The openings on the enclosure are for air convection to protect the
equipment from overheating DO NOT COVER THE OPENINGSbull Never pour any liquid into ventilation openings This could cause fire or
electrical shockbull Make sure the voltage of the power source is within the specification
on the label when connecting the equipment to the power outlet The current load and output power of loads shall be within the specification
bull This equipment must be connected to reliable grounding before using Pay special attention to power supplied other than direct connections eg using of power strips
iii
bull Place the power cord out of the way of foot traffic Do not place anything over the power cord The power cord must be rated for the product voltage and current marked on the productrsquos electrical ratings label The voltage and current rating of the cord should be greater than the voltage and current rating marked on the product
bull If the equipment is not used for a long time disconnect the equipment from mains to avoid being damaged by transient over-voltage
bull Never open the equipment For safety reasons only qualified service personnel should open the equipment
bull If one of the following situations arise the equipment should be checked by service personnel1 The power cord or plug is damaged2 Liquid has penetrated the equipment3 The equipment has been exposed to moisture4 The equipment does not work well or will not work according to its user
manual5 The equipment has been dropped andor damaged6 The equipment has obvious signs of breakage7 Please disconnect this equipment from the AC outlet before cleaning
Do not use liquid or detergent for cleaning The use of a moisture sheet or cloth is recommended for cleaning
bull Module and drive bays must not be empty They must have a dummy cover
Product features and specifications are subject to change without notice
CAUTION
risk of explosion if battery is replaced by an incorrect type
dispose of used batteries according to the instructions
After performing any installation or servicing make sure the
enclosure are lock and screw in position turn on the power
1
HA201-TP Users Manual
Chapter 1 Product Introduction
11 Box Content
Chapter 1 Product Introduction
Before removing the subsystem from the shipping carton visually inspect the physical condition of the shipping carton Exterior damage to the shipping carton may indicate that the contents of the carton are damaged If any damage is found do not remove the components contact the dealer where the subsystem was purchased for further introduction Before continuing first unpack the subsystem and verify that the contents of the shipping carton are all there and in good condition
bull Enclosure( Power supply fan 24 HDD tray included)
bull RS232 cablex 1pcs bull Power cord x 2sets bull Screws kit x 1set
bull Slide rail x 1set
If any items are missing please contact your authorized reseller or sales representative
2
HA201-TP Users Manual
Chapter 1 Product Introduction
12 Specifications
Dimensions (W x D x H)(with chassis ears)
mm 4826 x 815 x 88
inches 19 x 32 x 35
Motherboard (per node) Intelreg Server Board S2600TP
Processor (per node)
Processor Support
Two Intelreg Xeonreg Processors E5-2600 v3 Product Family
QPI Speeds 96 GTs 8 GTs 72 GTs
Socket Type Socket R3 (FCLGA2011-3)
Chipset Support(per node) Intelreg C612 Chipset
System Memory (per node)
bull 16 DIMM slots in total across 8 memory channelsbull Registered DDR4 (RDIMM) Load Reduced DDR4 (LRDIMM)bull Memory DDR4 data transfer rates of 160018662133 MTsbull DIMM sizes of 4 GB 8 GB 16 GB or 32 GB depending on ranks and technology
Front Panel Power onoff
LEDs
A bull Power (Secondary)bull Warning
B bull Power (Primary)bull Warning
Drive Bays External 25 hot swap 24
Backplane 1 x 24-port 12Gb SAS dual-loop backplane
Expander Board(per node)
1 x 24-port 12Gb SAS expander board with 3 SFF-8643 connectors
Expansion Slots (per node) PCIe 30 5 x8 (4 LP amp 1 FHHL)
Internal IO Connectors Headers(per node)
bull 1 x internal USB 20 connector (port 67)bull 1 x 2x7pin header for system fan modulebull 1 x 1x12pin control panel headerbull 1 x DH-10 serial Port A connectorbull 1 x SATA 6Gbs port for SATA DOMbull 4 x SATA 6Gbs connectors (port 0123)bull 1 x 2x4 pin header for Intel RMM4 Litebull 1 x 1x4 pin header for Storage Upgrade Keybull 1 x 1x8 pin backup power controler connector
Rear IO(per node)
bull 1 x RJ45 (dedicated port for remote server management) 2 x RJ45 1 x VGA 2 x USB 20 Type A 1 x Mini SAS HD
Ethernet (per node) Dual GbE Intelreg I350 Gigabit Ethernet Controller
Video Support(per node) Integrated Matrox G200 2D Graphics Controller
Server Management(per node)
Onboard Server Engines LLC Pilot III Controller Support for Intelreg Remote Management Module 4 solutions IntelregLight-Guided Diagnostics on field replaceable units Support for Intelreg System Management Software Support for Intelreg Intelligent Power Node Manager (Need PMBus - compliant power supply) BIOS Flash Winbond W25Q64BV
Power Supply 1200W 1+1 redundant power supply
System Cooling (per node)
2 x 6056 fans
EnvironmentalSpecifications
Temperature 0degC - 35degCHumidity 5 - 95 non-condensing
Gross Weight (w PSU amp Rail)kgs 41
lbs 90
PackagingDimensions
(W x D x H)mm 590 x 1150 x 330
inches 232 x 453 x 13
Mounting Standard 28 tool-less slide rail
3
HA201-TP Users Manual
Chapter 1 Product Introduction
13 General Information
HA201-WP a 2U Storage Server Barebone supports two Intelreg Xeonreg processors E5-2600 v3 series HA201-TP has a 24 x 25rdquo HDD bays as system drive bays It is a perfect building block for Storage Servers
bull Front Panel
LED Indicator and Switch
24 x 25 hot-swap SATASAS HDD bays
4
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Rear Panel
1200W 1+1 80+ redundant power supply
2 x low-profile add-on card for external connection
SFF 8644 for JBOD connection
5
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
2 x hot-swappable storage control node
Intelreg Xeonreg E5-2600 v3 Processors
bull TOP View Of Controller Canister
6
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
4 x 60x56mm hot-swap fans
Intel S2600TP
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 3: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/3.jpg)
contents
64 Slot HDD power setting 6765 HDD BP thermal sensor temperature setting 72
Chapter 7 Intelreg Server Board S2600TP Platform Management 75
71 Server Management Function Architecture 7572 Features and Functions 7873 Sensor Monitoring 8474 Intelreg Intelligent Power Node Manager (NM) 11675 Basic and Advanced Server Management Features 120
Chapter 8 Technical Support 129
Copyright copy 2015 AIC Inc All Rights Reserved
This document contains proprietary information about AIC products and is not to be disclosed or used except in accordance with applicable agreements
i
PREFACEbull Copyright
No part of this publication may be reproduced stored in a retrieval system
or transmitted in any form or by any means electronic mechanical photo-
static recording or otherwise without the prior written consent of the
manufacturer
bull Trademarks
All products and trade names used in this document are trademarks or
registered trademarks of their respective holders
bull Changes
The material in this document is for information purposes only and is subject
to change without notice
bull Warning
1 A shielded-type power cord is required in order to meet FCC emission
limits and also to prevent interference to the nearby radio and television
reception It is essential that only the supplied power cord be used
2 Use only shielded cables to connect IO devices to this equipment
3 You are cautioned that changes or modifications not expressly approved
by the party responsible for compliance could void your authority to
operate the equipment
bull Disclaimer
AIC shall not be liable for technical or editorial errors or omissions
contained herein The information provided is provided as is without
warranty of any kind To the extent permitted by law neither AIC or its
affiliates subcontractors or suppliers will be liable for incidental special or
consequential damages including downtime cost lost profits damages
relating to the procurement of substitute products or services or damages
for loss of data or software restoration The information in this document is
subject to change without notice
ii
SAFETY INSTRUCTIONSbull Before getting started please read the following important cautionsbull All cautions and warnings on the equipment or in the manuals should be
notedbull Most electronic components are sensitive to electrical static discharge
Therefore be sure to ground yourself at all times when installing the internal components
bull Use a grounding wrist strap and place all electronic components in static-shielded devices Grounding wrist straps can be purchased in any electronic supply store
bull Be sure to turn off the power and then disconnect the power cords from your system before performing any installation or servicing A sudden surge of power could damage sensitive electronic components
bull Do not open the systemrsquos top cover If opening the cover for maintenance is a must only a trained technician should do so Integrated circuits on computer boards are sensitive to static electricity Before handling a board or integrated circuit touch an unpainted portion of the system unit chassis for a few seconds This will help to discharge any static electricity on your body
bull Place this equipment on a stable surface when install A drop or fall could cause injury
bull Please keep this equipment away from humiditybull Carefully mount the equipment into the rack in such manner that it
wonrsquot be hazardous due to uneven mechanical loadingbull This equipment is to be installed for operation in an environment with
maximum ambient temperature below 35degCbull The openings on the enclosure are for air convection to protect the
equipment from overheating DO NOT COVER THE OPENINGSbull Never pour any liquid into ventilation openings This could cause fire or
electrical shockbull Make sure the voltage of the power source is within the specification
on the label when connecting the equipment to the power outlet The current load and output power of loads shall be within the specification
bull This equipment must be connected to reliable grounding before using Pay special attention to power supplied other than direct connections eg using of power strips
iii
bull Place the power cord out of the way of foot traffic Do not place anything over the power cord The power cord must be rated for the product voltage and current marked on the productrsquos electrical ratings label The voltage and current rating of the cord should be greater than the voltage and current rating marked on the product
bull If the equipment is not used for a long time disconnect the equipment from mains to avoid being damaged by transient over-voltage
bull Never open the equipment For safety reasons only qualified service personnel should open the equipment
bull If one of the following situations arise the equipment should be checked by service personnel1 The power cord or plug is damaged2 Liquid has penetrated the equipment3 The equipment has been exposed to moisture4 The equipment does not work well or will not work according to its user
manual5 The equipment has been dropped andor damaged6 The equipment has obvious signs of breakage7 Please disconnect this equipment from the AC outlet before cleaning
Do not use liquid or detergent for cleaning The use of a moisture sheet or cloth is recommended for cleaning
bull Module and drive bays must not be empty They must have a dummy cover
Product features and specifications are subject to change without notice
CAUTION
risk of explosion if battery is replaced by an incorrect type
dispose of used batteries according to the instructions
After performing any installation or servicing make sure the
enclosure are lock and screw in position turn on the power
1
HA201-TP Users Manual
Chapter 1 Product Introduction
11 Box Content
Chapter 1 Product Introduction
Before removing the subsystem from the shipping carton visually inspect the physical condition of the shipping carton Exterior damage to the shipping carton may indicate that the contents of the carton are damaged If any damage is found do not remove the components contact the dealer where the subsystem was purchased for further introduction Before continuing first unpack the subsystem and verify that the contents of the shipping carton are all there and in good condition
bull Enclosure( Power supply fan 24 HDD tray included)
bull RS232 cablex 1pcs bull Power cord x 2sets bull Screws kit x 1set
bull Slide rail x 1set
If any items are missing please contact your authorized reseller or sales representative
2
HA201-TP Users Manual
Chapter 1 Product Introduction
12 Specifications
Dimensions (W x D x H)(with chassis ears)
mm 4826 x 815 x 88
inches 19 x 32 x 35
Motherboard (per node) Intelreg Server Board S2600TP
Processor (per node)
Processor Support
Two Intelreg Xeonreg Processors E5-2600 v3 Product Family
QPI Speeds 96 GTs 8 GTs 72 GTs
Socket Type Socket R3 (FCLGA2011-3)
Chipset Support(per node) Intelreg C612 Chipset
System Memory (per node)
bull 16 DIMM slots in total across 8 memory channelsbull Registered DDR4 (RDIMM) Load Reduced DDR4 (LRDIMM)bull Memory DDR4 data transfer rates of 160018662133 MTsbull DIMM sizes of 4 GB 8 GB 16 GB or 32 GB depending on ranks and technology
Front Panel Power onoff
LEDs
A bull Power (Secondary)bull Warning
B bull Power (Primary)bull Warning
Drive Bays External 25 hot swap 24
Backplane 1 x 24-port 12Gb SAS dual-loop backplane
Expander Board(per node)
1 x 24-port 12Gb SAS expander board with 3 SFF-8643 connectors
Expansion Slots (per node) PCIe 30 5 x8 (4 LP amp 1 FHHL)
Internal IO Connectors Headers(per node)
bull 1 x internal USB 20 connector (port 67)bull 1 x 2x7pin header for system fan modulebull 1 x 1x12pin control panel headerbull 1 x DH-10 serial Port A connectorbull 1 x SATA 6Gbs port for SATA DOMbull 4 x SATA 6Gbs connectors (port 0123)bull 1 x 2x4 pin header for Intel RMM4 Litebull 1 x 1x4 pin header for Storage Upgrade Keybull 1 x 1x8 pin backup power controler connector
Rear IO(per node)
bull 1 x RJ45 (dedicated port for remote server management) 2 x RJ45 1 x VGA 2 x USB 20 Type A 1 x Mini SAS HD
Ethernet (per node) Dual GbE Intelreg I350 Gigabit Ethernet Controller
Video Support(per node) Integrated Matrox G200 2D Graphics Controller
Server Management(per node)
Onboard Server Engines LLC Pilot III Controller Support for Intelreg Remote Management Module 4 solutions IntelregLight-Guided Diagnostics on field replaceable units Support for Intelreg System Management Software Support for Intelreg Intelligent Power Node Manager (Need PMBus - compliant power supply) BIOS Flash Winbond W25Q64BV
Power Supply 1200W 1+1 redundant power supply
System Cooling (per node)
2 x 6056 fans
EnvironmentalSpecifications
Temperature 0degC - 35degCHumidity 5 - 95 non-condensing
Gross Weight (w PSU amp Rail)kgs 41
lbs 90
PackagingDimensions
(W x D x H)mm 590 x 1150 x 330
inches 232 x 453 x 13
Mounting Standard 28 tool-less slide rail
3
HA201-TP Users Manual
Chapter 1 Product Introduction
13 General Information
HA201-WP a 2U Storage Server Barebone supports two Intelreg Xeonreg processors E5-2600 v3 series HA201-TP has a 24 x 25rdquo HDD bays as system drive bays It is a perfect building block for Storage Servers
bull Front Panel
LED Indicator and Switch
24 x 25 hot-swap SATASAS HDD bays
4
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Rear Panel
1200W 1+1 80+ redundant power supply
2 x low-profile add-on card for external connection
SFF 8644 for JBOD connection
5
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
2 x hot-swappable storage control node
Intelreg Xeonreg E5-2600 v3 Processors
bull TOP View Of Controller Canister
6
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
4 x 60x56mm hot-swap fans
Intel S2600TP
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 4: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/4.jpg)
Copyright copy 2015 AIC Inc All Rights Reserved
This document contains proprietary information about AIC products and is not to be disclosed or used except in accordance with applicable agreements
i
PREFACEbull Copyright
No part of this publication may be reproduced stored in a retrieval system
or transmitted in any form or by any means electronic mechanical photo-
static recording or otherwise without the prior written consent of the
manufacturer
bull Trademarks
All products and trade names used in this document are trademarks or
registered trademarks of their respective holders
bull Changes
The material in this document is for information purposes only and is subject
to change without notice
bull Warning
1 A shielded-type power cord is required in order to meet FCC emission
limits and also to prevent interference to the nearby radio and television
reception It is essential that only the supplied power cord be used
2 Use only shielded cables to connect IO devices to this equipment
3 You are cautioned that changes or modifications not expressly approved
by the party responsible for compliance could void your authority to
operate the equipment
bull Disclaimer
AIC shall not be liable for technical or editorial errors or omissions
contained herein The information provided is provided as is without
warranty of any kind To the extent permitted by law neither AIC or its
affiliates subcontractors or suppliers will be liable for incidental special or
consequential damages including downtime cost lost profits damages
relating to the procurement of substitute products or services or damages
for loss of data or software restoration The information in this document is
subject to change without notice
ii
SAFETY INSTRUCTIONSbull Before getting started please read the following important cautionsbull All cautions and warnings on the equipment or in the manuals should be
notedbull Most electronic components are sensitive to electrical static discharge
Therefore be sure to ground yourself at all times when installing the internal components
bull Use a grounding wrist strap and place all electronic components in static-shielded devices Grounding wrist straps can be purchased in any electronic supply store
bull Be sure to turn off the power and then disconnect the power cords from your system before performing any installation or servicing A sudden surge of power could damage sensitive electronic components
bull Do not open the systemrsquos top cover If opening the cover for maintenance is a must only a trained technician should do so Integrated circuits on computer boards are sensitive to static electricity Before handling a board or integrated circuit touch an unpainted portion of the system unit chassis for a few seconds This will help to discharge any static electricity on your body
bull Place this equipment on a stable surface when install A drop or fall could cause injury
bull Please keep this equipment away from humiditybull Carefully mount the equipment into the rack in such manner that it
wonrsquot be hazardous due to uneven mechanical loadingbull This equipment is to be installed for operation in an environment with
maximum ambient temperature below 35degCbull The openings on the enclosure are for air convection to protect the
equipment from overheating DO NOT COVER THE OPENINGSbull Never pour any liquid into ventilation openings This could cause fire or
electrical shockbull Make sure the voltage of the power source is within the specification
on the label when connecting the equipment to the power outlet The current load and output power of loads shall be within the specification
bull This equipment must be connected to reliable grounding before using Pay special attention to power supplied other than direct connections eg using of power strips
iii
bull Place the power cord out of the way of foot traffic Do not place anything over the power cord The power cord must be rated for the product voltage and current marked on the productrsquos electrical ratings label The voltage and current rating of the cord should be greater than the voltage and current rating marked on the product
bull If the equipment is not used for a long time disconnect the equipment from mains to avoid being damaged by transient over-voltage
bull Never open the equipment For safety reasons only qualified service personnel should open the equipment
bull If one of the following situations arise the equipment should be checked by service personnel1 The power cord or plug is damaged2 Liquid has penetrated the equipment3 The equipment has been exposed to moisture4 The equipment does not work well or will not work according to its user
manual5 The equipment has been dropped andor damaged6 The equipment has obvious signs of breakage7 Please disconnect this equipment from the AC outlet before cleaning
Do not use liquid or detergent for cleaning The use of a moisture sheet or cloth is recommended for cleaning
bull Module and drive bays must not be empty They must have a dummy cover
Product features and specifications are subject to change without notice
CAUTION
risk of explosion if battery is replaced by an incorrect type
dispose of used batteries according to the instructions
After performing any installation or servicing make sure the
enclosure are lock and screw in position turn on the power
1
HA201-TP Users Manual
Chapter 1 Product Introduction
11 Box Content
Chapter 1 Product Introduction
Before removing the subsystem from the shipping carton visually inspect the physical condition of the shipping carton Exterior damage to the shipping carton may indicate that the contents of the carton are damaged If any damage is found do not remove the components contact the dealer where the subsystem was purchased for further introduction Before continuing first unpack the subsystem and verify that the contents of the shipping carton are all there and in good condition
bull Enclosure( Power supply fan 24 HDD tray included)
bull RS232 cablex 1pcs bull Power cord x 2sets bull Screws kit x 1set
bull Slide rail x 1set
If any items are missing please contact your authorized reseller or sales representative
2
HA201-TP Users Manual
Chapter 1 Product Introduction
12 Specifications
Dimensions (W x D x H)(with chassis ears)
mm 4826 x 815 x 88
inches 19 x 32 x 35
Motherboard (per node) Intelreg Server Board S2600TP
Processor (per node)
Processor Support
Two Intelreg Xeonreg Processors E5-2600 v3 Product Family
QPI Speeds 96 GTs 8 GTs 72 GTs
Socket Type Socket R3 (FCLGA2011-3)
Chipset Support(per node) Intelreg C612 Chipset
System Memory (per node)
bull 16 DIMM slots in total across 8 memory channelsbull Registered DDR4 (RDIMM) Load Reduced DDR4 (LRDIMM)bull Memory DDR4 data transfer rates of 160018662133 MTsbull DIMM sizes of 4 GB 8 GB 16 GB or 32 GB depending on ranks and technology
Front Panel Power onoff
LEDs
A bull Power (Secondary)bull Warning
B bull Power (Primary)bull Warning
Drive Bays External 25 hot swap 24
Backplane 1 x 24-port 12Gb SAS dual-loop backplane
Expander Board(per node)
1 x 24-port 12Gb SAS expander board with 3 SFF-8643 connectors
Expansion Slots (per node) PCIe 30 5 x8 (4 LP amp 1 FHHL)
Internal IO Connectors Headers(per node)
bull 1 x internal USB 20 connector (port 67)bull 1 x 2x7pin header for system fan modulebull 1 x 1x12pin control panel headerbull 1 x DH-10 serial Port A connectorbull 1 x SATA 6Gbs port for SATA DOMbull 4 x SATA 6Gbs connectors (port 0123)bull 1 x 2x4 pin header for Intel RMM4 Litebull 1 x 1x4 pin header for Storage Upgrade Keybull 1 x 1x8 pin backup power controler connector
Rear IO(per node)
bull 1 x RJ45 (dedicated port for remote server management) 2 x RJ45 1 x VGA 2 x USB 20 Type A 1 x Mini SAS HD
Ethernet (per node) Dual GbE Intelreg I350 Gigabit Ethernet Controller
Video Support(per node) Integrated Matrox G200 2D Graphics Controller
Server Management(per node)
Onboard Server Engines LLC Pilot III Controller Support for Intelreg Remote Management Module 4 solutions IntelregLight-Guided Diagnostics on field replaceable units Support for Intelreg System Management Software Support for Intelreg Intelligent Power Node Manager (Need PMBus - compliant power supply) BIOS Flash Winbond W25Q64BV
Power Supply 1200W 1+1 redundant power supply
System Cooling (per node)
2 x 6056 fans
EnvironmentalSpecifications
Temperature 0degC - 35degCHumidity 5 - 95 non-condensing
Gross Weight (w PSU amp Rail)kgs 41
lbs 90
PackagingDimensions
(W x D x H)mm 590 x 1150 x 330
inches 232 x 453 x 13
Mounting Standard 28 tool-less slide rail
3
HA201-TP Users Manual
Chapter 1 Product Introduction
13 General Information
HA201-WP a 2U Storage Server Barebone supports two Intelreg Xeonreg processors E5-2600 v3 series HA201-TP has a 24 x 25rdquo HDD bays as system drive bays It is a perfect building block for Storage Servers
bull Front Panel
LED Indicator and Switch
24 x 25 hot-swap SATASAS HDD bays
4
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Rear Panel
1200W 1+1 80+ redundant power supply
2 x low-profile add-on card for external connection
SFF 8644 for JBOD connection
5
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
2 x hot-swappable storage control node
Intelreg Xeonreg E5-2600 v3 Processors
bull TOP View Of Controller Canister
6
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
4 x 60x56mm hot-swap fans
Intel S2600TP
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 5: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/5.jpg)
i
PREFACEbull Copyright
No part of this publication may be reproduced stored in a retrieval system
or transmitted in any form or by any means electronic mechanical photo-
static recording or otherwise without the prior written consent of the
manufacturer
bull Trademarks
All products and trade names used in this document are trademarks or
registered trademarks of their respective holders
bull Changes
The material in this document is for information purposes only and is subject
to change without notice
bull Warning
1 A shielded-type power cord is required in order to meet FCC emission
limits and also to prevent interference to the nearby radio and television
reception It is essential that only the supplied power cord be used
2 Use only shielded cables to connect IO devices to this equipment
3 You are cautioned that changes or modifications not expressly approved
by the party responsible for compliance could void your authority to
operate the equipment
bull Disclaimer
AIC shall not be liable for technical or editorial errors or omissions
contained herein The information provided is provided as is without
warranty of any kind To the extent permitted by law neither AIC or its
affiliates subcontractors or suppliers will be liable for incidental special or
consequential damages including downtime cost lost profits damages
relating to the procurement of substitute products or services or damages
for loss of data or software restoration The information in this document is
subject to change without notice
ii
SAFETY INSTRUCTIONSbull Before getting started please read the following important cautionsbull All cautions and warnings on the equipment or in the manuals should be
notedbull Most electronic components are sensitive to electrical static discharge
Therefore be sure to ground yourself at all times when installing the internal components
bull Use a grounding wrist strap and place all electronic components in static-shielded devices Grounding wrist straps can be purchased in any electronic supply store
bull Be sure to turn off the power and then disconnect the power cords from your system before performing any installation or servicing A sudden surge of power could damage sensitive electronic components
bull Do not open the systemrsquos top cover If opening the cover for maintenance is a must only a trained technician should do so Integrated circuits on computer boards are sensitive to static electricity Before handling a board or integrated circuit touch an unpainted portion of the system unit chassis for a few seconds This will help to discharge any static electricity on your body
bull Place this equipment on a stable surface when install A drop or fall could cause injury
bull Please keep this equipment away from humiditybull Carefully mount the equipment into the rack in such manner that it
wonrsquot be hazardous due to uneven mechanical loadingbull This equipment is to be installed for operation in an environment with
maximum ambient temperature below 35degCbull The openings on the enclosure are for air convection to protect the
equipment from overheating DO NOT COVER THE OPENINGSbull Never pour any liquid into ventilation openings This could cause fire or
electrical shockbull Make sure the voltage of the power source is within the specification
on the label when connecting the equipment to the power outlet The current load and output power of loads shall be within the specification
bull This equipment must be connected to reliable grounding before using Pay special attention to power supplied other than direct connections eg using of power strips
iii
bull Place the power cord out of the way of foot traffic Do not place anything over the power cord The power cord must be rated for the product voltage and current marked on the productrsquos electrical ratings label The voltage and current rating of the cord should be greater than the voltage and current rating marked on the product
bull If the equipment is not used for a long time disconnect the equipment from mains to avoid being damaged by transient over-voltage
bull Never open the equipment For safety reasons only qualified service personnel should open the equipment
bull If one of the following situations arise the equipment should be checked by service personnel1 The power cord or plug is damaged2 Liquid has penetrated the equipment3 The equipment has been exposed to moisture4 The equipment does not work well or will not work according to its user
manual5 The equipment has been dropped andor damaged6 The equipment has obvious signs of breakage7 Please disconnect this equipment from the AC outlet before cleaning
Do not use liquid or detergent for cleaning The use of a moisture sheet or cloth is recommended for cleaning
bull Module and drive bays must not be empty They must have a dummy cover
Product features and specifications are subject to change without notice
CAUTION
risk of explosion if battery is replaced by an incorrect type
dispose of used batteries according to the instructions
After performing any installation or servicing make sure the
enclosure are lock and screw in position turn on the power
1
HA201-TP Users Manual
Chapter 1 Product Introduction
11 Box Content
Chapter 1 Product Introduction
Before removing the subsystem from the shipping carton visually inspect the physical condition of the shipping carton Exterior damage to the shipping carton may indicate that the contents of the carton are damaged If any damage is found do not remove the components contact the dealer where the subsystem was purchased for further introduction Before continuing first unpack the subsystem and verify that the contents of the shipping carton are all there and in good condition
bull Enclosure( Power supply fan 24 HDD tray included)
bull RS232 cablex 1pcs bull Power cord x 2sets bull Screws kit x 1set
bull Slide rail x 1set
If any items are missing please contact your authorized reseller or sales representative
2
HA201-TP Users Manual
Chapter 1 Product Introduction
12 Specifications
Dimensions (W x D x H)(with chassis ears)
mm 4826 x 815 x 88
inches 19 x 32 x 35
Motherboard (per node) Intelreg Server Board S2600TP
Processor (per node)
Processor Support
Two Intelreg Xeonreg Processors E5-2600 v3 Product Family
QPI Speeds 96 GTs 8 GTs 72 GTs
Socket Type Socket R3 (FCLGA2011-3)
Chipset Support(per node) Intelreg C612 Chipset
System Memory (per node)
bull 16 DIMM slots in total across 8 memory channelsbull Registered DDR4 (RDIMM) Load Reduced DDR4 (LRDIMM)bull Memory DDR4 data transfer rates of 160018662133 MTsbull DIMM sizes of 4 GB 8 GB 16 GB or 32 GB depending on ranks and technology
Front Panel Power onoff
LEDs
A bull Power (Secondary)bull Warning
B bull Power (Primary)bull Warning
Drive Bays External 25 hot swap 24
Backplane 1 x 24-port 12Gb SAS dual-loop backplane
Expander Board(per node)
1 x 24-port 12Gb SAS expander board with 3 SFF-8643 connectors
Expansion Slots (per node) PCIe 30 5 x8 (4 LP amp 1 FHHL)
Internal IO Connectors Headers(per node)
bull 1 x internal USB 20 connector (port 67)bull 1 x 2x7pin header for system fan modulebull 1 x 1x12pin control panel headerbull 1 x DH-10 serial Port A connectorbull 1 x SATA 6Gbs port for SATA DOMbull 4 x SATA 6Gbs connectors (port 0123)bull 1 x 2x4 pin header for Intel RMM4 Litebull 1 x 1x4 pin header for Storage Upgrade Keybull 1 x 1x8 pin backup power controler connector
Rear IO(per node)
bull 1 x RJ45 (dedicated port for remote server management) 2 x RJ45 1 x VGA 2 x USB 20 Type A 1 x Mini SAS HD
Ethernet (per node) Dual GbE Intelreg I350 Gigabit Ethernet Controller
Video Support(per node) Integrated Matrox G200 2D Graphics Controller
Server Management(per node)
Onboard Server Engines LLC Pilot III Controller Support for Intelreg Remote Management Module 4 solutions IntelregLight-Guided Diagnostics on field replaceable units Support for Intelreg System Management Software Support for Intelreg Intelligent Power Node Manager (Need PMBus - compliant power supply) BIOS Flash Winbond W25Q64BV
Power Supply 1200W 1+1 redundant power supply
System Cooling (per node)
2 x 6056 fans
EnvironmentalSpecifications
Temperature 0degC - 35degCHumidity 5 - 95 non-condensing
Gross Weight (w PSU amp Rail)kgs 41
lbs 90
PackagingDimensions
(W x D x H)mm 590 x 1150 x 330
inches 232 x 453 x 13
Mounting Standard 28 tool-less slide rail
3
HA201-TP Users Manual
Chapter 1 Product Introduction
13 General Information
HA201-WP a 2U Storage Server Barebone supports two Intelreg Xeonreg processors E5-2600 v3 series HA201-TP has a 24 x 25rdquo HDD bays as system drive bays It is a perfect building block for Storage Servers
bull Front Panel
LED Indicator and Switch
24 x 25 hot-swap SATASAS HDD bays
4
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Rear Panel
1200W 1+1 80+ redundant power supply
2 x low-profile add-on card for external connection
SFF 8644 for JBOD connection
5
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
2 x hot-swappable storage control node
Intelreg Xeonreg E5-2600 v3 Processors
bull TOP View Of Controller Canister
6
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
4 x 60x56mm hot-swap fans
Intel S2600TP
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 6: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/6.jpg)
ii
SAFETY INSTRUCTIONSbull Before getting started please read the following important cautionsbull All cautions and warnings on the equipment or in the manuals should be
notedbull Most electronic components are sensitive to electrical static discharge
Therefore be sure to ground yourself at all times when installing the internal components
bull Use a grounding wrist strap and place all electronic components in static-shielded devices Grounding wrist straps can be purchased in any electronic supply store
bull Be sure to turn off the power and then disconnect the power cords from your system before performing any installation or servicing A sudden surge of power could damage sensitive electronic components
bull Do not open the systemrsquos top cover If opening the cover for maintenance is a must only a trained technician should do so Integrated circuits on computer boards are sensitive to static electricity Before handling a board or integrated circuit touch an unpainted portion of the system unit chassis for a few seconds This will help to discharge any static electricity on your body
bull Place this equipment on a stable surface when install A drop or fall could cause injury
bull Please keep this equipment away from humiditybull Carefully mount the equipment into the rack in such manner that it
wonrsquot be hazardous due to uneven mechanical loadingbull This equipment is to be installed for operation in an environment with
maximum ambient temperature below 35degCbull The openings on the enclosure are for air convection to protect the
equipment from overheating DO NOT COVER THE OPENINGSbull Never pour any liquid into ventilation openings This could cause fire or
electrical shockbull Make sure the voltage of the power source is within the specification
on the label when connecting the equipment to the power outlet The current load and output power of loads shall be within the specification
bull This equipment must be connected to reliable grounding before using Pay special attention to power supplied other than direct connections eg using of power strips
iii
bull Place the power cord out of the way of foot traffic Do not place anything over the power cord The power cord must be rated for the product voltage and current marked on the productrsquos electrical ratings label The voltage and current rating of the cord should be greater than the voltage and current rating marked on the product
bull If the equipment is not used for a long time disconnect the equipment from mains to avoid being damaged by transient over-voltage
bull Never open the equipment For safety reasons only qualified service personnel should open the equipment
bull If one of the following situations arise the equipment should be checked by service personnel1 The power cord or plug is damaged2 Liquid has penetrated the equipment3 The equipment has been exposed to moisture4 The equipment does not work well or will not work according to its user
manual5 The equipment has been dropped andor damaged6 The equipment has obvious signs of breakage7 Please disconnect this equipment from the AC outlet before cleaning
Do not use liquid or detergent for cleaning The use of a moisture sheet or cloth is recommended for cleaning
bull Module and drive bays must not be empty They must have a dummy cover
Product features and specifications are subject to change without notice
CAUTION
risk of explosion if battery is replaced by an incorrect type
dispose of used batteries according to the instructions
After performing any installation or servicing make sure the
enclosure are lock and screw in position turn on the power
1
HA201-TP Users Manual
Chapter 1 Product Introduction
11 Box Content
Chapter 1 Product Introduction
Before removing the subsystem from the shipping carton visually inspect the physical condition of the shipping carton Exterior damage to the shipping carton may indicate that the contents of the carton are damaged If any damage is found do not remove the components contact the dealer where the subsystem was purchased for further introduction Before continuing first unpack the subsystem and verify that the contents of the shipping carton are all there and in good condition
bull Enclosure( Power supply fan 24 HDD tray included)
bull RS232 cablex 1pcs bull Power cord x 2sets bull Screws kit x 1set
bull Slide rail x 1set
If any items are missing please contact your authorized reseller or sales representative
2
HA201-TP Users Manual
Chapter 1 Product Introduction
12 Specifications
Dimensions (W x D x H)(with chassis ears)
mm 4826 x 815 x 88
inches 19 x 32 x 35
Motherboard (per node) Intelreg Server Board S2600TP
Processor (per node)
Processor Support
Two Intelreg Xeonreg Processors E5-2600 v3 Product Family
QPI Speeds 96 GTs 8 GTs 72 GTs
Socket Type Socket R3 (FCLGA2011-3)
Chipset Support(per node) Intelreg C612 Chipset
System Memory (per node)
bull 16 DIMM slots in total across 8 memory channelsbull Registered DDR4 (RDIMM) Load Reduced DDR4 (LRDIMM)bull Memory DDR4 data transfer rates of 160018662133 MTsbull DIMM sizes of 4 GB 8 GB 16 GB or 32 GB depending on ranks and technology
Front Panel Power onoff
LEDs
A bull Power (Secondary)bull Warning
B bull Power (Primary)bull Warning
Drive Bays External 25 hot swap 24
Backplane 1 x 24-port 12Gb SAS dual-loop backplane
Expander Board(per node)
1 x 24-port 12Gb SAS expander board with 3 SFF-8643 connectors
Expansion Slots (per node) PCIe 30 5 x8 (4 LP amp 1 FHHL)
Internal IO Connectors Headers(per node)
bull 1 x internal USB 20 connector (port 67)bull 1 x 2x7pin header for system fan modulebull 1 x 1x12pin control panel headerbull 1 x DH-10 serial Port A connectorbull 1 x SATA 6Gbs port for SATA DOMbull 4 x SATA 6Gbs connectors (port 0123)bull 1 x 2x4 pin header for Intel RMM4 Litebull 1 x 1x4 pin header for Storage Upgrade Keybull 1 x 1x8 pin backup power controler connector
Rear IO(per node)
bull 1 x RJ45 (dedicated port for remote server management) 2 x RJ45 1 x VGA 2 x USB 20 Type A 1 x Mini SAS HD
Ethernet (per node) Dual GbE Intelreg I350 Gigabit Ethernet Controller
Video Support(per node) Integrated Matrox G200 2D Graphics Controller
Server Management(per node)
Onboard Server Engines LLC Pilot III Controller Support for Intelreg Remote Management Module 4 solutions IntelregLight-Guided Diagnostics on field replaceable units Support for Intelreg System Management Software Support for Intelreg Intelligent Power Node Manager (Need PMBus - compliant power supply) BIOS Flash Winbond W25Q64BV
Power Supply 1200W 1+1 redundant power supply
System Cooling (per node)
2 x 6056 fans
EnvironmentalSpecifications
Temperature 0degC - 35degCHumidity 5 - 95 non-condensing
Gross Weight (w PSU amp Rail)kgs 41
lbs 90
PackagingDimensions
(W x D x H)mm 590 x 1150 x 330
inches 232 x 453 x 13
Mounting Standard 28 tool-less slide rail
3
HA201-TP Users Manual
Chapter 1 Product Introduction
13 General Information
HA201-WP a 2U Storage Server Barebone supports two Intelreg Xeonreg processors E5-2600 v3 series HA201-TP has a 24 x 25rdquo HDD bays as system drive bays It is a perfect building block for Storage Servers
bull Front Panel
LED Indicator and Switch
24 x 25 hot-swap SATASAS HDD bays
4
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Rear Panel
1200W 1+1 80+ redundant power supply
2 x low-profile add-on card for external connection
SFF 8644 for JBOD connection
5
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
2 x hot-swappable storage control node
Intelreg Xeonreg E5-2600 v3 Processors
bull TOP View Of Controller Canister
6
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
4 x 60x56mm hot-swap fans
Intel S2600TP
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 7: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/7.jpg)
iii
bull Place the power cord out of the way of foot traffic Do not place anything over the power cord The power cord must be rated for the product voltage and current marked on the productrsquos electrical ratings label The voltage and current rating of the cord should be greater than the voltage and current rating marked on the product
bull If the equipment is not used for a long time disconnect the equipment from mains to avoid being damaged by transient over-voltage
bull Never open the equipment For safety reasons only qualified service personnel should open the equipment
bull If one of the following situations arise the equipment should be checked by service personnel1 The power cord or plug is damaged2 Liquid has penetrated the equipment3 The equipment has been exposed to moisture4 The equipment does not work well or will not work according to its user
manual5 The equipment has been dropped andor damaged6 The equipment has obvious signs of breakage7 Please disconnect this equipment from the AC outlet before cleaning
Do not use liquid or detergent for cleaning The use of a moisture sheet or cloth is recommended for cleaning
bull Module and drive bays must not be empty They must have a dummy cover
Product features and specifications are subject to change without notice
CAUTION
risk of explosion if battery is replaced by an incorrect type
dispose of used batteries according to the instructions
After performing any installation or servicing make sure the
enclosure are lock and screw in position turn on the power
1
HA201-TP Users Manual
Chapter 1 Product Introduction
11 Box Content
Chapter 1 Product Introduction
Before removing the subsystem from the shipping carton visually inspect the physical condition of the shipping carton Exterior damage to the shipping carton may indicate that the contents of the carton are damaged If any damage is found do not remove the components contact the dealer where the subsystem was purchased for further introduction Before continuing first unpack the subsystem and verify that the contents of the shipping carton are all there and in good condition
bull Enclosure( Power supply fan 24 HDD tray included)
bull RS232 cablex 1pcs bull Power cord x 2sets bull Screws kit x 1set
bull Slide rail x 1set
If any items are missing please contact your authorized reseller or sales representative
2
HA201-TP Users Manual
Chapter 1 Product Introduction
12 Specifications
Dimensions (W x D x H)(with chassis ears)
mm 4826 x 815 x 88
inches 19 x 32 x 35
Motherboard (per node) Intelreg Server Board S2600TP
Processor (per node)
Processor Support
Two Intelreg Xeonreg Processors E5-2600 v3 Product Family
QPI Speeds 96 GTs 8 GTs 72 GTs
Socket Type Socket R3 (FCLGA2011-3)
Chipset Support(per node) Intelreg C612 Chipset
System Memory (per node)
bull 16 DIMM slots in total across 8 memory channelsbull Registered DDR4 (RDIMM) Load Reduced DDR4 (LRDIMM)bull Memory DDR4 data transfer rates of 160018662133 MTsbull DIMM sizes of 4 GB 8 GB 16 GB or 32 GB depending on ranks and technology
Front Panel Power onoff
LEDs
A bull Power (Secondary)bull Warning
B bull Power (Primary)bull Warning
Drive Bays External 25 hot swap 24
Backplane 1 x 24-port 12Gb SAS dual-loop backplane
Expander Board(per node)
1 x 24-port 12Gb SAS expander board with 3 SFF-8643 connectors
Expansion Slots (per node) PCIe 30 5 x8 (4 LP amp 1 FHHL)
Internal IO Connectors Headers(per node)
bull 1 x internal USB 20 connector (port 67)bull 1 x 2x7pin header for system fan modulebull 1 x 1x12pin control panel headerbull 1 x DH-10 serial Port A connectorbull 1 x SATA 6Gbs port for SATA DOMbull 4 x SATA 6Gbs connectors (port 0123)bull 1 x 2x4 pin header for Intel RMM4 Litebull 1 x 1x4 pin header for Storage Upgrade Keybull 1 x 1x8 pin backup power controler connector
Rear IO(per node)
bull 1 x RJ45 (dedicated port for remote server management) 2 x RJ45 1 x VGA 2 x USB 20 Type A 1 x Mini SAS HD
Ethernet (per node) Dual GbE Intelreg I350 Gigabit Ethernet Controller
Video Support(per node) Integrated Matrox G200 2D Graphics Controller
Server Management(per node)
Onboard Server Engines LLC Pilot III Controller Support for Intelreg Remote Management Module 4 solutions IntelregLight-Guided Diagnostics on field replaceable units Support for Intelreg System Management Software Support for Intelreg Intelligent Power Node Manager (Need PMBus - compliant power supply) BIOS Flash Winbond W25Q64BV
Power Supply 1200W 1+1 redundant power supply
System Cooling (per node)
2 x 6056 fans
EnvironmentalSpecifications
Temperature 0degC - 35degCHumidity 5 - 95 non-condensing
Gross Weight (w PSU amp Rail)kgs 41
lbs 90
PackagingDimensions
(W x D x H)mm 590 x 1150 x 330
inches 232 x 453 x 13
Mounting Standard 28 tool-less slide rail
3
HA201-TP Users Manual
Chapter 1 Product Introduction
13 General Information
HA201-WP a 2U Storage Server Barebone supports two Intelreg Xeonreg processors E5-2600 v3 series HA201-TP has a 24 x 25rdquo HDD bays as system drive bays It is a perfect building block for Storage Servers
bull Front Panel
LED Indicator and Switch
24 x 25 hot-swap SATASAS HDD bays
4
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Rear Panel
1200W 1+1 80+ redundant power supply
2 x low-profile add-on card for external connection
SFF 8644 for JBOD connection
5
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
2 x hot-swappable storage control node
Intelreg Xeonreg E5-2600 v3 Processors
bull TOP View Of Controller Canister
6
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
4 x 60x56mm hot-swap fans
Intel S2600TP
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 8: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/8.jpg)
1
HA201-TP Users Manual
Chapter 1 Product Introduction
11 Box Content
Chapter 1 Product Introduction
Before removing the subsystem from the shipping carton visually inspect the physical condition of the shipping carton Exterior damage to the shipping carton may indicate that the contents of the carton are damaged If any damage is found do not remove the components contact the dealer where the subsystem was purchased for further introduction Before continuing first unpack the subsystem and verify that the contents of the shipping carton are all there and in good condition
bull Enclosure( Power supply fan 24 HDD tray included)
bull RS232 cablex 1pcs bull Power cord x 2sets bull Screws kit x 1set
bull Slide rail x 1set
If any items are missing please contact your authorized reseller or sales representative
2
HA201-TP Users Manual
Chapter 1 Product Introduction
12 Specifications
Dimensions (W x D x H)(with chassis ears)
mm 4826 x 815 x 88
inches 19 x 32 x 35
Motherboard (per node) Intelreg Server Board S2600TP
Processor (per node)
Processor Support
Two Intelreg Xeonreg Processors E5-2600 v3 Product Family
QPI Speeds 96 GTs 8 GTs 72 GTs
Socket Type Socket R3 (FCLGA2011-3)
Chipset Support(per node) Intelreg C612 Chipset
System Memory (per node)
bull 16 DIMM slots in total across 8 memory channelsbull Registered DDR4 (RDIMM) Load Reduced DDR4 (LRDIMM)bull Memory DDR4 data transfer rates of 160018662133 MTsbull DIMM sizes of 4 GB 8 GB 16 GB or 32 GB depending on ranks and technology
Front Panel Power onoff
LEDs
A bull Power (Secondary)bull Warning
B bull Power (Primary)bull Warning
Drive Bays External 25 hot swap 24
Backplane 1 x 24-port 12Gb SAS dual-loop backplane
Expander Board(per node)
1 x 24-port 12Gb SAS expander board with 3 SFF-8643 connectors
Expansion Slots (per node) PCIe 30 5 x8 (4 LP amp 1 FHHL)
Internal IO Connectors Headers(per node)
bull 1 x internal USB 20 connector (port 67)bull 1 x 2x7pin header for system fan modulebull 1 x 1x12pin control panel headerbull 1 x DH-10 serial Port A connectorbull 1 x SATA 6Gbs port for SATA DOMbull 4 x SATA 6Gbs connectors (port 0123)bull 1 x 2x4 pin header for Intel RMM4 Litebull 1 x 1x4 pin header for Storage Upgrade Keybull 1 x 1x8 pin backup power controler connector
Rear IO(per node)
bull 1 x RJ45 (dedicated port for remote server management) 2 x RJ45 1 x VGA 2 x USB 20 Type A 1 x Mini SAS HD
Ethernet (per node) Dual GbE Intelreg I350 Gigabit Ethernet Controller
Video Support(per node) Integrated Matrox G200 2D Graphics Controller
Server Management(per node)
Onboard Server Engines LLC Pilot III Controller Support for Intelreg Remote Management Module 4 solutions IntelregLight-Guided Diagnostics on field replaceable units Support for Intelreg System Management Software Support for Intelreg Intelligent Power Node Manager (Need PMBus - compliant power supply) BIOS Flash Winbond W25Q64BV
Power Supply 1200W 1+1 redundant power supply
System Cooling (per node)
2 x 6056 fans
EnvironmentalSpecifications
Temperature 0degC - 35degCHumidity 5 - 95 non-condensing
Gross Weight (w PSU amp Rail)kgs 41
lbs 90
PackagingDimensions
(W x D x H)mm 590 x 1150 x 330
inches 232 x 453 x 13
Mounting Standard 28 tool-less slide rail
3
HA201-TP Users Manual
Chapter 1 Product Introduction
13 General Information
HA201-WP a 2U Storage Server Barebone supports two Intelreg Xeonreg processors E5-2600 v3 series HA201-TP has a 24 x 25rdquo HDD bays as system drive bays It is a perfect building block for Storage Servers
bull Front Panel
LED Indicator and Switch
24 x 25 hot-swap SATASAS HDD bays
4
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Rear Panel
1200W 1+1 80+ redundant power supply
2 x low-profile add-on card for external connection
SFF 8644 for JBOD connection
5
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
2 x hot-swappable storage control node
Intelreg Xeonreg E5-2600 v3 Processors
bull TOP View Of Controller Canister
6
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
4 x 60x56mm hot-swap fans
Intel S2600TP
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 9: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/9.jpg)
2
HA201-TP Users Manual
Chapter 1 Product Introduction
12 Specifications
Dimensions (W x D x H)(with chassis ears)
mm 4826 x 815 x 88
inches 19 x 32 x 35
Motherboard (per node) Intelreg Server Board S2600TP
Processor (per node)
Processor Support
Two Intelreg Xeonreg Processors E5-2600 v3 Product Family
QPI Speeds 96 GTs 8 GTs 72 GTs
Socket Type Socket R3 (FCLGA2011-3)
Chipset Support(per node) Intelreg C612 Chipset
System Memory (per node)
bull 16 DIMM slots in total across 8 memory channelsbull Registered DDR4 (RDIMM) Load Reduced DDR4 (LRDIMM)bull Memory DDR4 data transfer rates of 160018662133 MTsbull DIMM sizes of 4 GB 8 GB 16 GB or 32 GB depending on ranks and technology
Front Panel Power onoff
LEDs
A bull Power (Secondary)bull Warning
B bull Power (Primary)bull Warning
Drive Bays External 25 hot swap 24
Backplane 1 x 24-port 12Gb SAS dual-loop backplane
Expander Board(per node)
1 x 24-port 12Gb SAS expander board with 3 SFF-8643 connectors
Expansion Slots (per node) PCIe 30 5 x8 (4 LP amp 1 FHHL)
Internal IO Connectors Headers(per node)
bull 1 x internal USB 20 connector (port 67)bull 1 x 2x7pin header for system fan modulebull 1 x 1x12pin control panel headerbull 1 x DH-10 serial Port A connectorbull 1 x SATA 6Gbs port for SATA DOMbull 4 x SATA 6Gbs connectors (port 0123)bull 1 x 2x4 pin header for Intel RMM4 Litebull 1 x 1x4 pin header for Storage Upgrade Keybull 1 x 1x8 pin backup power controler connector
Rear IO(per node)
bull 1 x RJ45 (dedicated port for remote server management) 2 x RJ45 1 x VGA 2 x USB 20 Type A 1 x Mini SAS HD
Ethernet (per node) Dual GbE Intelreg I350 Gigabit Ethernet Controller
Video Support(per node) Integrated Matrox G200 2D Graphics Controller
Server Management(per node)
Onboard Server Engines LLC Pilot III Controller Support for Intelreg Remote Management Module 4 solutions IntelregLight-Guided Diagnostics on field replaceable units Support for Intelreg System Management Software Support for Intelreg Intelligent Power Node Manager (Need PMBus - compliant power supply) BIOS Flash Winbond W25Q64BV
Power Supply 1200W 1+1 redundant power supply
System Cooling (per node)
2 x 6056 fans
EnvironmentalSpecifications
Temperature 0degC - 35degCHumidity 5 - 95 non-condensing
Gross Weight (w PSU amp Rail)kgs 41
lbs 90
PackagingDimensions
(W x D x H)mm 590 x 1150 x 330
inches 232 x 453 x 13
Mounting Standard 28 tool-less slide rail
3
HA201-TP Users Manual
Chapter 1 Product Introduction
13 General Information
HA201-WP a 2U Storage Server Barebone supports two Intelreg Xeonreg processors E5-2600 v3 series HA201-TP has a 24 x 25rdquo HDD bays as system drive bays It is a perfect building block for Storage Servers
bull Front Panel
LED Indicator and Switch
24 x 25 hot-swap SATASAS HDD bays
4
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Rear Panel
1200W 1+1 80+ redundant power supply
2 x low-profile add-on card for external connection
SFF 8644 for JBOD connection
5
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
2 x hot-swappable storage control node
Intelreg Xeonreg E5-2600 v3 Processors
bull TOP View Of Controller Canister
6
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
4 x 60x56mm hot-swap fans
Intel S2600TP
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 10: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/10.jpg)
3
HA201-TP Users Manual
Chapter 1 Product Introduction
13 General Information
HA201-WP a 2U Storage Server Barebone supports two Intelreg Xeonreg processors E5-2600 v3 series HA201-TP has a 24 x 25rdquo HDD bays as system drive bays It is a perfect building block for Storage Servers
bull Front Panel
LED Indicator and Switch
24 x 25 hot-swap SATASAS HDD bays
4
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Rear Panel
1200W 1+1 80+ redundant power supply
2 x low-profile add-on card for external connection
SFF 8644 for JBOD connection
5
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
2 x hot-swappable storage control node
Intelreg Xeonreg E5-2600 v3 Processors
bull TOP View Of Controller Canister
6
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
4 x 60x56mm hot-swap fans
Intel S2600TP
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 11: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/11.jpg)
4
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Rear Panel
1200W 1+1 80+ redundant power supply
2 x low-profile add-on card for external connection
SFF 8644 for JBOD connection
5
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
2 x hot-swappable storage control node
Intelreg Xeonreg E5-2600 v3 Processors
bull TOP View Of Controller Canister
6
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
4 x 60x56mm hot-swap fans
Intel S2600TP
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 12: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/12.jpg)
5
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
2 x hot-swappable storage control node
Intelreg Xeonreg E5-2600 v3 Processors
bull TOP View Of Controller Canister
6
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
4 x 60x56mm hot-swap fans
Intel S2600TP
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 13: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/13.jpg)
6
HA201-TP Users Manual
Chapter 1 Product Introduction
bull Top View of HA201-TP system
4 x 60x56mm hot-swap fans
Intel S2600TP
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 14: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/14.jpg)
HA201-TP Users Manual
7
Chapter 2 Hardware Installation
7
21 Removing and Installing top coverUnscrew the screws(2pcs for the back and 4pcs for the both right and left side)Slide the cover backwards to remove top cover form the chassis
Chapter 2 Hardware Installation
This section demonstrates maintenance procedures in replacing a defective part once the HA201-TP UM appliance is installed and is operational
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 15: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/15.jpg)
Chapter 2 Hardware Installation
HA201-TP Users Manual
8
22 Central Processing Unit (CPU)221 Removing the Processor HeatsinkThe heatsink is attached to the server board or processor socket with captive fasteners Using a 2 Phillips screwdriver loosen the four screws located on the heatsink corners in a diagonal manner using the following procedurebull Using a 2 Phillips screwdriver start with screw 1 and loosen it by
giving it two rotations and stop (see letter A) (IMPORTANT Do not fully loosen)
bull Proceed to screw 2 and loosen it by giving it two rotations and stop (see letter B) Similarly loosen screws 3 and 4 Repeat steps A and B by giving each screw two rotations each time until all screws are loosened
bull Lift the heatsink straight up (see letter C)
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
Processor
Socket
2
3
1
4
A
B
C
HA201-TP Users Manual
9
Chapter 2 Hardware Installation
222 Installing the Processor
2221 Unlatch the CPU Load Platebull Push the lever handle labeled ldquoOPEN 1strdquo (see letter A) down and
toward the CPU socket Rotate the lever handle upbull Repeat the steps for the second lever handle (see letter B)
CAUTION
Processor must be appropriate You may damage the server board if
you install a processor that is inappropriate for your server
CAUTION
ESD and handling processors Reduce the risk of electrostatic
discharge (ESD) damage to the processor by doing the following
(1) Touch the metal chassis before touching the processor or
server board Keep part of your body in contact with the metal
chassis to dissipate the static charge while handling the processor
(2) Avoid moving around unnecessarily
REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A REMOVE
REMOVE LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
B
Chapter 2 Hardware Installation
HA201-TP Users Manual
10
2222 Lift open the Load Platebull Rotate the right lever handle down until it releases the Load Plate
(see letter A)bull While holding down the lever handle with your other hand lift open
the Load Plate (see letter B)
BREMOVE
REMOVE
LGA2011-3 SUPPLIER IDENTIFICATION HERE
INSTALL PROCESSOR BEFORE
REMOVING COVER
SAVE AND REPLACE COVER
IF PROCESSOR IS REMOVED
A
HA201-TP Users Manual
11
Chapter 2 Hardware Installation
2223 Install the Processorbull Remove the processor from its packagebull Carefully remove the protective cover from the bottom side of the
CPU taking care not to touch any CPU contacts (see letter A)bull Orient the processor with the socket so that the processor cutouts
match the four orientation posts on the socket (see letter B) Note the location of a gold key at the corner of the processor (see letter C) Carefully place (Do NOT drop) the CPU into the socket
CAUTION
The pins inside the CPU socket are extremely sensitive Other than
the CPU no object should make contact with the pins inside the
CPU socket A damaged CPU socket pin may render the socket inoperable
and will produce erroneous CPU or other system errors if used
NOTE
The underside of the processor has components that may damage the
socket pins if installed improperly The processor must align correctly
with the socket opening before installation DO NOT DROP the
processor into the socket
NOTE
When possible a CPU insertion tool should be used when installing the
CPU
A
B
C
Chapter 2 Hardware Installation
HA201-TP Users Manual
12
2224 Remove the Socket Cover Remove the socket cover from the load plate by pressing it out
2225 Close the Load Plate Carefully lower the load plate down over the processor
2226 Lock down the Load Platebull Push down on the locking lever on the CLOSE 1st side (see letter A)
Slide the tip of the lever under the notch in the load plate (see letter B) Make sure the load plate tab engages under the socket lever when fully closed
bull bRepeat the steps to latch the locking lever on the other side (see letter C) Latch the levers in the order as shown
NOTE
The socket cover should be saved and re-used should the processor
need to be removed at anytime in the future
Save theprotectivecover
A
BC
HA201-TP Users Manual
13
Chapter 2 Hardware Installation
223 Installing the Processor Heatsink
bull If present remove the protective film covering the Thermal Interface Material on the bottom side of the heatsink (see letter A)
bull Align the heatsink fins to the front and back of the chassis for correct airflow The airflow goes from front-to-back of the chassis (see letter B)
bull Each heatsink has four captive fasteners and should be tightened in a diagonal manner using the following procedure
bull Using a 2 Phillips screwdriver start with screw 1 and engage screw threads by giving it two rotations and stop (see letter C) (Do not fully tighten)
bull Proceed to screw 2 and engage screw threads by giving it two rotations and stop (see letter D) Similarly engage screws 3 and 4
bull Repeat steps C and D by giving each screw two rotations each time until each screw is lightly tightened up to a maximum of 8 inch-lbs torque (see letter E)
NOTE
The processor heatsink for CPU1 and CPU2 is different FXXCA91X91HS is
for CPU1 while FXXEA91X91HS2 is for CPU2 Mislocating the heatsink
will cause serious thermal damage
C
Processor
Socket
AIRFLOW
Chassis Front
2
3
1
4
A
B
TIM
D
CAUTIONDo not over-tighten fasteners
E
Chapter 2 Hardware Installation
HA201-TP Users Manual
14
224 Removing the Processor
NOTE
Remove the processor by carefully lifting it out of the socket taking care
NOT to drop the processor and not touching any pins inside the socket
Install the socket cover if a replacement processor is not going to be installed
HA201-TP Users Manual
15
Chapter 2 Hardware Installation
225 Different heatsink for each of its CPUsCarefully position heatsink over the CPU0 and CPU1 and align the heatsink screws with the screw holes in the motherboard
Caution - Possible thermal damage Avoid moving the heatsink after
it has contacted the top of the CPU Too much movement could
disturb the layer of thermal compound causing voids and leading
to ineffective heat dissipation and component damage
CPU0
CPU0
CPU1
CPU1
Chapter 2 Hardware Installation
HA201-TP Users Manual
16
23 Diagram of the Correct InstallationNOTE The heat sinkrsquos fans should be blowing toward the rear end
of the chassis If one of the fans is facing the wrong direction
please remove the heat sink and reinstall it so that it is facing
the correct direction
AIRFLOW
AIRFLOW
FAN FAN
HA201-TP Users Manual
17
Chapter 2 Hardware Installation
24 System Memory241 Supported Memory
Chapter 2 Hardware Installation
HA201-TP Users Manual
18
242 Memory Population Rules
bull Each installed processor provides four channels of memory On the Intelreg Server Board S2600TP product family each memory channel supports two memory slots for a total possible 16 DIMMs installed
bull The memory channels from processor socket 1 are identified as Channel A B C and D The memory channels from processor socket 2 are identified as Channel E F G and H
bull The silk screened DIMM slot identifiers on the board provide information about the channel and therefore the processor to which they belong For example DIMM_A1 is the first slot on Channel A on processor 1 DIMM_E1 is the first DIMM socket on Channel E on processor 2
bull The memory slots associated with a given processor are unavailable if the corresponding processor socket is not populated
bull A processor may be installed without populating the associated memory slots provided a second processor is installed with associated memory In this case the memory is shared by the processors However the platform suffers performance degradation and latency due to the remote memory
bull Processor sockets are self-contained and autonomous However all memory subsystem support (such as Memory RAS and Error Management) in the BIOS setup is applied commonly across processor sockets
bull All DIMMs must be DDR4 DIMMsbull Mixing of LRDIMM with any other DIMM type is not allowed per
platformbull Mixing of DDR4 operating frequencies is not validated within a socket
or across sockets by Intel If DIMMs with different frequencies are mixed all DIMMs run at the common lowest frequency
bull A maximum of eight logical ranks (ranks seen by the host) per channel is allowed
bull The BLUE memory slots on the server board identify the first memory slot for a given memory channel
NOTE
Although mixed DIMM configurations are supported Intel only performs platform
validation on systems that are configured with identical DIMMs installed
HA201-TP Users Manual
19
Chapter 2 Hardware Installation
bull DIMM population rules require that DIMMs within a channel be populated starting with the BLUE DIMM slot or DIMM farthest from the processor in a ldquofill-farthestrdquo approach In addition when populating a Quad-rank DIMM with a Single- or Dual-rank DIMM in the same channel the Quad-rank DIMM must be populated farthest from the processor Intelreg MRC will check for correct DIMM placement
Chapter 2 Hardware Installation
HA201-TP Users Manual
20
On the Intelreg Server Board S2600TP product a total of sixteen DIMM slots are provided (two CPUs ndash four channels per CPU and two DIMMs per channel) The nomenclature for DIMM sockets is detailed in the following table
HA201-TP Users Manual
21
Chapter 2 Hardware Installation
243 Publishing System MemoryThere are a number of different situations in which the memory size andor configuration are displayed Most of these displays differ in one way or another so the same memory configuration may appear to display differently depending on when and where the display occurs
bull The BIOS displays the ldquoTotal Memoryrdquo of the system during POST if Quiet Boot is disabled in BIOS setup This is the total size of memory discovered by the BIOS during POST and is the sum of the individual sizes of installed DDR4 DIMMs in the system
bull The BIOS displays the ldquoEffective Memoryrdquo of the system in the BIOS Setup The term Effective Memory refers to the total size of all DDR4 DIMMs that are active (not disabled) and not used as redundant units (see Note below)
bull The BIOS provides the total memory of the system in the main page of BIOS setup This total is the same as the amount described by the first bullet above
bull If Quiet Boot is disabled the BIOS displays the total system memory on the diagnostic screen at the end of POST This total is the same as the amount described by the first bullet above
bull The BIOS provides the total amount of memory in the system by supporting the EFI Boot Service function GetMemoryMap()
bull The BIOS provides the total amount of memory in the system by supporting the INT 15h E820h function For details see the Advanced Configuration and Power Interface Specification
Chapter 2 Hardware Installation
HA201-TP Users Manual
22
25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
251 Removing a Disk DriveRelease a drive tray by pressing the unlock button and pinching the lock lever slightly and pulling out the drive tray
252 Installing a Disk Drive
Adjust and place the 25rdquo HDD on the HDD tray Choose to the proper position screw hole then secure it with screws on the bottomMounting location 1 HDD with the interposer board attached
HA201-TP Users Manual
23
Chapter 2 Hardware Installation
253 Installing a Hard Disk Drive Tray
Insert the drive carrier into its bay Push the tray lever until it clicks Make sure the drive tray is correctly secured in place when its front edge aligns with the bay edge
254 Drive Slot Map
The drive slot map follows
HBA Card
MegaRaid Card
Chapter 2 Hardware Installation
HA201-TP Users Manual
24
26 Removing and Installing a Fan Module261 Removing a FAN module
Unscrew the thumb screw on the cover to open the top cover from NODE Unpluging the FAN cable Remove the fan by lifting it and pulling it out
262 Installing a FAN module To installing a fan module follow the reverse order
HA201-TP Users Manual
25
Chapter 2 Hardware Installation
27 Power Supply
271 Removing or Replacing the Power Supply Module
Slide the small levers (see red arrow ) to the right Hold the PSU lever and firmly pull the PSU out of the server chassis
Chapter 2 Hardware Installation
HA201-TP Users Manual
26
28 Tool-less Blade Slide Installation introduction
1 Pull on the lsquorsquoFront-Releasersquorsquoto unlock the inner channel from the Slide Assembly
2 Release the Detent-Lock and push Middle Channel inwards to retract Middle Channel
HA201-TP Users Manual
27
Chapter 2 Hardware Installation
Metal Spacer
OptionalRemove Metal Spacer for Aluminium Racks
3 Aligning the Front Bracket with the Mounting Hole
4 Push in to assembly the Front Bracket onto the Rack
Chapter 2 Hardware Installation
HA201-TP Users Manual
28
5 Now the bracket is fixed onto the Rack(Optional M6x10L screws are to secure the rails with posts if needed)
6 Refer to Diagram 3 amp 4 to assemble the End Bracket onto the Rack
HA201-TP Users Manual
29
Chapter 2 Hardware Installation
7 Assemble the inner channel onto the chassis using thescrews provided
8 Push the chassis with inner channels into Slide to complete Rack Installation
Chapter 3 Motherborad Overview
HA201-TP Users Manual
30
Chapter 3 Motherborad Overview
The Intelreg Server Board S2600TP is a monolithic printed circuit board (PCB) with features designed to support the high-performance and high-density computing markets This server board is designed to support the Intelreg Xeonreg processor E5-2600 v3 product family Previous generation Intelreg Xeonreg processors are not supported Many of the features and functions of the server board family are common A board will be identified by name when a described feature or function is unique to it
Chapter 3 Motherborad Overview
HA201-TP Users Manual
31
31 Intelreg Server Board Feature Set
Chapter 3 Motherborad Overview
HA201-TP Users Manual
32
WARNING The riser slot 1 on the server board is designed for
plugging in ONLY the riser card Plugging in the PCIe card may
cause permanent server board and PCIe card damage
Chapter 3 Motherborad Overview
HA201-TP Users Manual
33
32 Motherboard Layout
DiagnosticLED
RMM4LiteRiser Slot 2
SATASGPIO
BridgeBoard
SATA 3USB
Riser Slot 4
Riser Slot 4
HDD Activity
Riser Slot 3ControlPanel
SystemFan 1
CPU 2Socket
CPU 1Socket
SystemFan 3
SystemFan 2
MainPower 1
FanConnector
MainPower 2
InfiniBandPort(QSFP+)DedicatedManagementPortUSBStatus LEDID LED
VGA
NIC 2
IPMB
NIC 1
SerialPort A
CR 2032 3W
Backup Power Control
SATA 2
SATA 0
SATA RAID 5
SATA
Upgrade Key
DOM
SATA 1
Chapter 3 Motherborad Overview
HA201-TP Users Manual
34
33 Motherboard block diagram
Chapter 3 Motherborad Overview
HA201-TP Users Manual
35
34 Motherboard Configuration JumpersThe following table provides a summary and description of configuration test and debug jumpers The server board has several 3-pin jumper blocks that can be used Pin 1 on each jumper block can be identified by the following symbol on the silkscreen
Chapter 3 Motherborad Overview
HA201-TP Users Manual
36
Chapter 3 Motherborad Overview
HA201-TP Users Manual
37
35 Motherboard Configuration JumpersThe Intelreg Server Board S2600TP has the following board rear connector placement
HA201-TP Users Manual
38
Chapter 4 12G Expander Board Overview
This chapter provides detailed introduction guide on hardware introduction
41 12GB Expander Board411 Placement amp Connectors location
J1
JPIC_DBGU8
Chip 3x36RExpander
U14Flash U16
PIC
SW1
SW2
JEXP_UART
J7
JPIC_SMT
J2 J3 J4 CN1 JUSB_MB
JVBATT_12C
JVBATT CN3 CN2 JIPMI_LOC
CN5
CN8
JPWR1
JPWR2
JHDDPWRJPSON_OK
JIPMB
JUSB_PIC
J8
CN7CN6
CN4
JPWR_SW
Chapter 4 12GB Expander Borad Overview
HA201-TP Users Manual
39
Chapter 4 12G Expander Board Overview
412 PIN-OUT
Power Connector ndash JPWR1JPWR2
OS HDD Power Connector ndash JHDDPWR
Battery Connector ndash JVBATT
FAN Connector ndash J7J8
Console for Expander ndash JEXP_UART
HA201-TP Users Manual
40
Chapter 4 12G Expander Board Overview
PIC Smart portndash JPIC_SMT
PIC Debug port(For MPLAB I2C tool) ndash JPIC_DBG
PIC USB ndash JUSB_PIC
Battery Function ndash JVBATT_I2C
IPMB ndash JIPMB
HA201-TP Users Manual
41
Chapter 4 12G Expander Board Overview
BP PIC USB ndash JUSB_MB
IPMB for Device ndash JIPMI_LOC
Power SW ndash JPWR_SW
MB power on frunction ndash JPSON_OK
HA201-TP Users Manual
42
Chapter 4 12G Expander Board Overview
413 LEDs
LED_CN6
LED2
LED3
HA201-TP Users Manual
43
Chapter 5 Backplane OverviewChapter 5 Backplane Overview
This chapter provides detailed introduction guide on backplane introduction
51 Backplane511 Placement amp Connectors location
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964
J2
JP1J11
For Secondary Canister Node
J4 J1 SW1
SW2
J3
For Primary Canister Node
HA201-TP Users Manual
44
Chapter 5 Backplane Overview
512 PIN-OUT
Power Connector ndash J2
Power Connector ndash J11
PMBUS Connector ndash JP1
FAN Connector ndash J4
HA201-TP Users Manual
45
Chapter 5 Backplane Overview
Console for MCU ndash J1
Front IO ndash J3
HA201-TP Users Manual
46
Chapter 5 Backplane Overview
513 LEDs
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14
P1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
S1
S4
S7
S2S3S5S6S8
S14P
1
P15
P2
P14
18
9
10
1
31
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
B
C
E
3 4
21
3 4
21
4
1
16
2
91
101
11
10
20
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ABCDEF
GHI
6 5 34 2 1
PRESS FIT
ActFail
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
S1
S4
S7
P1
P15
P2
P14
9
18
10
1
3
G
D
SG
D
S
C
B
C
E
B
C
E
GA
O
I
2
43
12
43
1
14
1
3
45
134
5
134
5
134
5
16
2
9
10
1
1
1120
10
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
26 5 4 3 1
IHG
FEDCBA
PRESS FIT
16
17
32
33
484964LED1
HA201-TP Users Manual
47
Chapter 6 HDD Backplane Introduction
61 Expender firmware update through smart console port611 Update Expander firmware revision
Step 1 Set up HA201-TP console serial cable Insert console serial cable into console port shown below also
the other side inert serial port into motherboard
PLEASE FIND THE CONSOLE SERIAL CABLE IN THE PACKAGE BOX
Chapter 6 Debug amp Firmware Update Introduction
HA201-TP Users Manual
48
Chapter 6 HDD Backplane Introduction
Step 2 Set up HA201-TP RS232 connectionSet up RS232 connection application into your HA201-TP as shown in the example process below
For exampleOS Microsoft WindowsRS232 connection application Hyperterminal
Step 2 Install HyperTrmexe
HA201-TP Users Manual
49
Chapter 6 HDD Backplane Introduction
Step 3 Enter a new name for the icon in the field below and click OK
Step 4 Connecting by using selecting an option in the drop down menu circled in red below (we selected COM2 in this example) and click OK
HA201-TP Users Manual
50
Chapter 6 HDD Backplane Introduction
Properties
Port Setting
Bits per second
Data bits
Parity
Stop bits
Flow control None
Restore Defaults
OK ApplyCancel
None
Step 6 Set up is complete The diagram below depicts what screen should displayed
Step 5 For ldquoBits per secondrdquo select 38400 For ldquoFlow controlrdquo select None Click OK when you have finished your selections
cmdgt_
HA201-TP Users Manual
51
Chapter 6 HDD Backplane Introduction
Step 7To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne website
httpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
52
Chapter 6 HDD Backplane Introduction
Step 8Comand line for show current firmware revisioncmdgtrev
HA201-TP Users Manual
53
Chapter 6 HDD Backplane Introduction
Step 9Start to update expander firmwarecmdgtfdl 0 0_
Step 10Select the tool bar Transfer -gt Send File
HA201-TP Users Manual
54
Chapter 6 HDD Backplane Introduction
Step 11bull Choose new firmware path file fw 3A1_v11221bull Protocol have to choose Xmodem
Step 12Firmware download complete
HA201-TP Users Manual
55
Chapter 6 HDD Backplane Introduction
Step 13Reset computer for success update firmwarecmdgtreset
reset
HA201-TP Users Manual
56
Chapter 6 HDD Backplane Introduction
612 Update expander configuration MFG
Step 1Comand line for show current configuration MFGcmdgt showmfg
HA201-TP Users Manual
57
Chapter 6 HDD Backplane Introduction
Step 2Start to update expander configuration MFGcmdgtfdl 83 0_
Step 3Select the tool bar Transfer -gt Send File
83 0
HA201-TP Users Manual
58
Chapter 6 HDD Backplane Introduction
Step 4bull Choose new MFG path file mfg 3A10_eob_1201binbull Protocol have to choose Xmodem
Step 5MFG download complete
HA201-TP Users Manual
59
Chapter 6 HDD Backplane Introduction
Step 6Reset computer for success update MFGcmdgtreset
reset
HA201-TP Users Manual
60
Chapter 6 HDD Backplane Introduction
62 Update the expander firmware through in-band
FOR EXAMPLEStep 1
Download and install SG3_utilsexe which compatible with Linux OSFrom website httpsgdannyczsgsg3_utilshtml website Reference version sg3_utils-140tgz
Step 2To get firmware image amp MFG Configuration Image version information from AIC SAS Related Firmware Downloadne websitehttpppmsaicipccomtw8888downloadexpandermcu
HA201-TP Users Manual
61
Chapter 6 HDD Backplane Introduction
Step 3Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
Step 4Typing sudo -s to into administrator mode
HA201-TP Users Manual
62
Chapter 6 HDD Backplane Introduction
Step 5 Find expander location$ sg_map -i
Step 6Update Expander firmware$ sg_write_buffer --id=0x0 --in=fw3A1_v11221 --mode=0x2 --offset=0 devsg0
HA201-TP Users Manual
63
Chapter 6 HDD Backplane Introduction
Step 7 Update Expander MFG$ sg_write_buffer --id=0x83 --in=mfg3A10_eob_1201bin --mode=0x2 --offset=0 devsg0
Step 8Reboot computer for success update firmware amp MFGrootubuntu~ reboot
HA201-TP Users Manual
64
Chapter 6 HDD Backplane Introduction
63 12G expander EDFB settingStep 1
For Install HyperTerminalexe refer to section 41
Step 2 Get EDFB statuscmdgt edfb
HA201-TP Users Manual
65
Chapter 6 HDD Backplane Introduction
Step 3Change EDFB setting
Enable EDFB functioncmdgtedfb on
Disable EDFB functioncmdgtedfb off
cmd gtedfbEDFB is OFF
cmd gtedfb onSucceeded to set EDFB
cmd gtedfb offEDFB is OFF
HA201-TP Users Manual
66
Chapter 6 HDD Backplane Introduction
64 Slot HDD power setting(Only for system cooling Fan controled by expander)
Step 1 For Install sg3exe tool and get new firmware from website refer to section 42
Step 2Execute terminal under the same new firmware folderexample Setting a new firmware folder on Home page Open Terminal by click to the right button of mouse in the same window Home
HA201-TP Users Manual
67
Chapter 6 HDD Backplane Introduction
Step 3 Typing sudo -s to into administrator mode
Step 4 Find expander location$ sg_map -i
HA201-TP Users Manual
68
Chapter 6 HDD Backplane Introduction
Step 5For example
If would like to turn the Disk004 power off under the HBA card Need to check Disk004 power status $ sg_ses --page=7 devsg0
Under HBA card the Element 3 = Disk004
HA201-TP Users Manual
69
Chapter 6 HDD Backplane Introduction
Step 6To check Disk004 (element 3) power status is ok$ sg_ses --page=2 devsg0
Status shows belowThe status of Element 3 is OK
HA201-TP Users Manual
70
Chapter 6 HDD Backplane Introduction
Step 7Turn off a HDD power$ sg_ses --descriptor=Disk004 --set=341 devsg0
Step 8Turn on a HDD power$ sg_ses --descriptor=Disk004 --clear=341 devsg0
HA201-TP Users Manual
71
Chapter 6 HDD Backplane Introduction
65 HDD BP thermal sensor temperature setting(Only for system cooling Fan controled by expander)
Step 1 For Install HyperTerminalexe refer to section 41
Step 2Get the current temperature settingscmdgt temperature
HA201-TP Users Manual
72
Chapter 6 HDD Backplane Introduction
Step 3For example
Set new temperatureT1=20 C 4 18 CT2=50 C 4 52 CWarning threshold=50 C 448 CAlarm threshold=55 C 454 C
The new setting will take effect after resetcmdgt temperature 18 52 48 54cmdgt reset
HA201-TP Users Manual
73
Chapter 6 HDD Backplane Introduction
Step 4Check fan speed amp temperature informationcmdgt sensor
cmd gtsensor==ENCLOSURE STATUS========================================================== Total fan number 2 System Fan-0 speed 10888 RPM System Fan-1 speed 10971 RPM System PWN-0 82 Expander Temperature 76 Celsius degree Sytem Temperature-0 33 Celsius degree T1 20 Celsius degree T2 50 Celsius degree TC 55 Celsius degree Voltage Sensor 09V 093V Voltage Sensor 18V 180V
cmd gt_
HA201-TP Users Manual
74
Chapter 7 Intelreg Server Board S2600TP Platform Management
71 Server Management Function Architecture711 IPMI Feature7111 IPMI 20 Featuresbull Baseboard management controller (BMC)bull IPMI Watchdog timerbull Messaging support including command bridging and usersession
supportbull Chassis device functionality including powerreset control and BIOS boot
flags supportbull Event receiver device The BMC receives and processes events from
other platform subsystemsbull Field Replaceable Unit (FRU) inventory device functionality The BMC
supports access to system FRU devices using IPMI FRU commandsbull System Event Log (SEL) device functionality The BMC supports and
provides access to a SELbull Sensor Data Record (SDR) repository device functionality The BMC
supports storage and access of system SDRsbull Sensor device and sensor scanningmonitoring The BMC provides IPMI
management of sensors It polls sensors to monitor and report system health
bull IPMI interfaceso Host interfaces include system management software (SMS) with receive message queue support and server management mode (SMM)o IPMB interfaceo LAN interface that supports the IPMI-over-LAN protocol (RMCP RMCP+)
bull Serial-over-LAN (SOL)bull ACPI state synchronization The BMC tracks ACPI state changes that are
provided by the BIOSbull BMC self-test The BMC performs initialization and run-time self-tests and
makes results available to external entitiesbull See also the Intelligent Platform Management Interface Specification
Second Generation v20
Chapter 7 Intelreg Server Board S2600TP Platform Management
HA201-TP Users Manual
75
Chapter 7 Intelreg Server Board S2600TP Platform Management
7112 Non IPMI FeaturesThe BMC supports the following non-IPMI featuresbull In-circuit BMC firmware updatebull Fault resilient booting (FRB) FRB2 is supported by the watchdog timer
functionalitybull Chassis intrusion detectionbull Basic fan control using Control version 2 SDRsbull Fan redundancy monitoring and supportbull Enhancements to fan speed controlbull Power supply redundancy monitoring and supportbull Hot-swap fan supportbull Acoustic management Support for multiple fan profilesbull Signal testing support The BMC provides test commands for setting
and getting platform signal statesbull The BMC generates diagnostic beep codes for fault conditionsbull System GUID storage and retrievalbull Front panel management The BMC controls the system status LED
and chassis ID LED It supports secure lockout of certain front panel functionality and monitors button presses The chassis ID LED is turned on using a front panel button or a command
bull Power state retentionbull Power fault analysisbull Intelreg Light-Guided Diagnosticsbull Power unit management Support for power unit sensor The BMC
handles power-good dropout conditionsbull DIMM temperature monitoring New sensors and improved acoustic
management using closed-loop fan control algorithm taking into account DIMM temperature readings
bull Address Resolution Protocol (ARP) The BMC sends and responds to ARPs (supported on embedded NICs)
bull Dynamic Host Configuration Protocol (DHCP) The BMC performs DHCP (supported on embedded NICs)
bull Platform environment control interface (PECI) thermal management support
bull E-mail alertingbull Support for embedded web server UI in Basic Manageability feature
set
HA201-TP Users Manual
76
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Enhancements to embedded web servero Human-readable SELo Additional system configurabilityo Additional system monitoring capabilityo Enhanced on-line help
bull Integrated KVMbull Enhancements to KVM redirection
o Support for higher resolutionbull Integrated Remote Media Redirectionbull Lightweight Directory Access Protocol (LDAP) supportbull Intelreg Intelligent Power Node Manager supportbull Embedded platform debug feature which allows capture of detailed
data for later analysisbull Provisioning and inventory enhancements
o Inventory datasystem information export (partial SMBIOS table)bull DCMI 15 compliance (product-specific)bull Management support for PMBus rev12 compliant power suppliesbull BMC Data Repository (Managed Data Region Feature)bull Support for an Intelreg Local Control Display Panelbull System Airflow Monitoringbull Exit Air Temperature Monitoringbull Ethernet Controller Thermal Monitoringbull Global Aggregate Temperature Margin Sensorbull Memory Thermal Managementbull Power Supply Fan Sensorsbull Energy Star Server Supportbull Smart Ride Through (SmaRT) Closed Loop System Throttling (CLST)bull Power Supply Cold Redundancybull Power Supply FW Updatebull Power Supply Compatibility Checkbull BMC FW reliability enhancements
o Redundant BMC boot blocks to avoid possibility of a corrupted boot block resulting in a scenario that prevents a user from updating the BMC o BMC System Management Health Monitoring
HA201-TP Users Manual
77
Chapter 7 Intelreg Server Board S2600TP Platform Management
72 Features and Functions721 Power SubsystemThe server board supports several power control sources which can initiate power-up orpower-down activity
HA201-TP Users Manual
78
Chapter 7 Intelreg Server Board S2600TP Platform Management
722 Advanced Configuration and Power Interface (ACPI)The server board has support for the following ACPI states
HA201-TP Users Manual
79
Chapter 7 Intelreg Server Board S2600TP Platform Management
723 System InitializationDuring system initialization both the BIOS and the BMC initialize the following items
7231 Processor Tcontrol SettingProcessors used with this chipset implement a feature called Tcontrol which provides a processor-specific value that can be used to adjust the fan-control behavior to achieve optimum cooling and acoustics The BMC reads these from the CPU through PECI Proxy mechanism provided by Manageability Engine (ME) The BMC uses these values as part of thefan-speed-control algorithm
HA201-TP Users Manual
80
Chapter 7 Intelreg Server Board S2600TP Platform Management
7232 Fault Resilient Booting (FRB)Fault resilient booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a multiprocessor system to boot even if the bootstrap processor (BSP) fails Only FRB2 is supported using watchdog timer commandsFRB2 refers to the FRB algorithm that detects system failures during POST The BIOS uses the BMC watchdog timer to back up its operation during POST The BIOS configures the watchdog timer to indicate that the BIOS is using the timer for the FRB2 phase of the boot operationAfter the BIOS has identified and saved the BSP information it sets the FRB2 timer use bit and loads the watchdog timer with the new timeout intervalIf the watchdog timer expires while the watchdog use bit is set to FRB2 the BMC (if so configured) logs a watchdog expiration event showing the FRB2 timeout in the event data bytes The BMC then hard resets the system assuming the BIOS-selected reset as the watchdog timeout actionThe BIOS is responsible for disabling the FRB2 timeout before initiating the option ROM scan and before displaying a request for a boot password If the processor fails and causes an FRB2 timeout the BMC resets the systemThe BIOS gets the watchdog expiration status from the BMC If the status shows an expired FRB2 timer the BIOS enters the failure in the system event log (SEL) In the OEM bytes entry in the SEL the last POST code generated during the previous boot attempt is written FRB2failure is not reflected in the processor status sensor valueThe FRB2 failure does not affect the front panel LEDs
HA201-TP Users Manual
81
Chapter 7 Intelreg Server Board S2600TP Platform Management
7233 Post Code DisplayThe BMC upon receiving standby power initializes internal hardware to monitor port 80h (POST code) writes Data written to port 80h is output to the system POST LEDsThe BMC deactivates POST LEDs after POST had completed
HA201-TP Users Manual
82
Chapter 7 Intelreg Server Board S2600TP Platform Management
724 System Event Log (SEL)The BMC implements the system event log as specified in the Intelligent Platform Management Interface Specification Version 20 The SEL is accessible regardless of the system power state through the BMCs in-band and out-of-band interfacesThe BMC allocates 95231bytes (approx 93 KB) of non-volatile storage space to store system events The SEL timestamps may not be in order Up to 3639 SEL records can be stored at a time Because the SEL is circular any command that results in an overflow of the SEL beyond the allocated space will overwrite the oldest entries in the SEL while setting the overflow flag
HA201-TP Users Manual
83
Chapter 7 Intelreg Server Board S2600TP Platform Management
73 Sensor MonitoringThe BMC monitors system hardware and reports system health The information gathered from physical sensors is translated into IPMI sensors as part of the ldquoIPMI Sensor Modelrdquo The BMC also reports various system state changes by maintaining virtual sensors that are not specifically tied to physical hardware This section describes general aspects of BMC sensor management as well as describing how specific sensor types are modeled Unless otherwise specified the term ldquosensorrdquo refers to the IPMI sensor-model definition of a sensor
531 Sensor ScanningThe value of many of the BMCrsquos sensors is derived by the BMC FW periodically polling physical sensors in the system to read temperature voltages and so on Some of these physical sensors are built-in to the BMC component itself and some are physically separated from the BMC Polling of physical sensors for support of IPMI sensor monitoring does not occur until the BMCrsquos operational code is running and the IPMI FW subsystem has completed initializationIPMI sensor monitoring is not supported in the BMC boot code Additionally the BMC selectively polls physical sensors based on the current power and reset state of the system and the availability of the physical sensor when in that state For example non-standby voltages are not monitored when the system is in S4 or S5 power state
HA201-TP Users Manual
84
Chapter 7 Intelreg Server Board S2600TP Platform Management
732 Sensor Rearm Behavior7321 Manual versus Re-arm SensorsSensors can be either manual or automatic re-arm An automatic re-arm sensor will re-arm (clear) the assertion event state for a threshold or offset it if that threshold or offset is deasserted after having been asserted This allows a subsequent assertion of the threshold or an offset to generate a new event and associated side-effect An example side-effect would be boosting fans due to an upper critical threshold crossing of a temperature sensor The event state and the input state (value) of the sensor track each other Most sensors are auto-rearmA manual re-arm sensor does not clear the assertion state even when the threshold or offset becomes de-asserted In this case the event state and the input state (value) of the sensor do not track each other The event assertion state is sticky The following methods can be used to re-arm a sensorbull Automatic re-arm ndash Only applies to sensors that are designated as
ldquoauto-rearmrdquobull IPMI command Re-arm Sensor Eventbull BMC internal method ndash The BMC may re-arm certain sensors due to a
trigger condition For example some sensors may be re-armed due to a system reset A BMC reset will re-arm all sensorsbull System reset or DC power cycle will re-arm all system fan sensors
HA201-TP Users Manual
85
Chapter 7 Intelreg Server Board S2600TP Platform Management
7322 Re-arm and Event GenerationAll BMC-owned sensors that show an asserted event status generate a de-assertion SEL event when the sensor is re-armed provided that the associated SDR is configured to enable a deassertion event for that condition This applies regardless of whether the sensor is a thresholdanalog sensor or a discrete sensor
To manually re-arm the sensors the sequence is outlined below1 A failure condition occurs and the BMC logs an assertion event2 If this failure condition disappears the BMC logs a de-assertion event (if
so configured)3 The sensor is re-armed by one of the methods described in the
previous section4 The BMC clears the sensor status5 The sensor is put into reading-state-unavailable state until it is polled
again or otherwise updated6 The sensor is updated and the ldquoreading-state-unavailablerdquo state is
cleared A new assertion event will be logged if the fault state is once again detected
All auto-rearm sensors that show an asserted event status generate a de-assertion SEL event at the time the BMC detects that the condition causing the original assertion is no longer present and the associated SDR is configured to enable a de-assertion event for that condition
HA201-TP Users Manual
86
Chapter 7 Intelreg Server Board S2600TP Platform Management
733 BIOS Event-Only SensorsBIOS-owned discrete sensors are used for event generation only and are not accessible through IPMI sensor commands like the Get Sensor Reading command Note that in this case the sensor owner designated in the SDR is not the BMCAn example of this usage would be the SELs logged by the BIOS for uncorrectable memory errors Such SEL entries would identify a BIOS-owned sensor ID
734 Margin SensorsThere is sometimes a need for an IPMI sensor to report the difference (margin) from a nonzero reference offset For the purposes of this document these type sensors are referred to as margin sensors For instance for the case of a temperature margin sensor if the referencevalue is 90 degrees and the actual temperature of the device being monitored is 85 degreesthe margin value would be -5
735 IPMI Watchdog SensorThe BMC supports a Watchdog Sensor as a means to log SEL events due to expirations of the IPMI 20 compliant Watchdog Timer
736 BMC Watchdog SensorThe BMC supports an IPMI sensor to report that a BMC reset has occurred due to action taken by the BMC Watchdog feature A SEL event will be logged whenever either the BMC FW stack is reset or the BMC CPU itself is reset
737 BMC System Management Health MonitoringThe BMC tracks the health of each of its IPMI sensors and report failures by providing a ldquoBMC FW Healthrdquo sensor of the IPMI 20 sensor type Management Subsystem Health with support for the Sensor Failure offset Only assertions should be logged into the SEL for the Sensor Failure offset The BMC Firmware Health sensor asserts for any sensor when 10 consecutive sensor errors are read These are not standard sensor events (that is threshold crossings or discrete assertions) These are BMC Hardware Access Layer (HAL) errors If a successful sensor read is completed the counter resets to zero
HA201-TP Users Manual
87
Chapter 7 Intelreg Server Board S2600TP Platform Management
738 VR Watchdog TimerThe BMC FW monitors that the power sequence for the board VR controllers is completed when a DC power-on is initiated Incompletion of the sequence indicates a board problem in which case the FW powers down the systemThe BMC FW supports a discrete IPMI sensor for reporting and logging this fault condition
739 System Airflow MonitoringThe BMC provides an IPMI sensor to report the volumetric system airflow in CFM (cubic feet per minute) The air flow in CFM is calculated based on the system fan PWM values The specific Pulse Width Modulation (PWM or PWMs) used to determine the CFM is SDR configurable The relationship between PWM and CFM is based on a lookup table in an OEM SDRThe airflow data is used in the calculation for exit air temperature monitoring It is exposed as an IPMI sensor to allow a datacenter management application to access this data for use in rack-level thermal management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includes
HA201-TP Users Manual
88
Chapter 7 Intelreg Server Board S2600TP Platform Management
7310 Thermal MonitoringThe BMC provides monitoring of component and board temperature sensing devices This monitoring capability is instantiated in the form of IPMI analogthreshold or discrete sensors depending on the nature of the measurementFor analogthreshold sensors with the exception of Processor Temperature sensors critical and non-critical thresholds (upper and lower) are set through SDRs and event generation enabled for both assertion and de-assertion eventsFor discrete sensors both assertion and de-assertion event generation are enabledMandatory monitoring of platform thermal sensors includesbull Inlet temperature (physical sensor is typically on system front panel or
HDD back plane)bull Board ambient thermal sensorsbull Processor temperaturebull Memory (DIMM) temperaturebull CPU VRD Hot monitoringbull Power supply (only supported for PMBus-compliant PSUs)Additionally the BMC FW may create ldquovirtualrdquo sensors that are based on a combination of aggregation of multiple physical thermal sensors and application of a mathematical formula to thermal or power sensor readings
73101 Absolute Value versus Margin SensorsThermal monitoring sensors fall into three basic categoriesbull Absolute temperature sensors ndash These are analogthreshold sensors
that provide a value that corresponds to an absolute temperature value
bull Thermal margin sensors ndash These are analogthreshold sensors that provide a value that is relative to some reference value
bull Thermal fault indication sensors ndash These are discrete sensors that indicate a specific thermal fault condition
HA201-TP Users Manual
89
Chapter 7 Intelreg Server Board S2600TP Platform Management
73102 Processor DTS-Spec Margin Sensor(s)Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family incorporate a DTS based thermal spec This allows a much more accurate control of the thermal solution and will enable lower fan speeds and lower fan power consumption The main usage of this sensor is as an input to the BMCrsquos fan control algorithms The BMC implements this as a threshold sensor There is one DTS sensor for each installed physical processor package Thresholds are not set and alert generation is not enabled for these sensors
73103 Processor Thermal Margin Sensor(s)Each processor supports a physical thermal margin sensor per core that is readable through the PECI interface This provides a relative value representing a thermal margin from the corersquos throttling thermal trip point Assuming that temp controlled throttling is enabled the physical core temp sensor reads lsquo0rsquo which indicates the processor core is being throttledThe BMC supports one IPMI processor (margin) temperature sensor per physical processor package This sensor aggregates the readings of the individual core temperatures in a package to provide the hottest core temperature reading When the sensor reads lsquo0rsquo it indicates that the hottest processor core is throttlingDue to the fact that the readings are capped at the corersquos thermal throttling trip point (reading = 0) thresholds are not set and alert generation is not enabled for these sensors
73104 Processor Thermal Control Monitoring (Prochot)The BMC FW monitors the percentage of time that a processor has been operationally constrained over a given time window (nominally six seconds) due to internal thermal management algorithms engaging to reduce the temperature of the device When any processor core temperature reaches its maximum operating temperature the processorpackage PROCHOT (processor hot) signal is asserted and these management algorithms known as the Thermal Control Circuit (TCC) engage to reduce the temperature provided TCC is enabled TCC is enabled by BIOS during system boot This monitoring is instantiated as one IPMI analogthreshold sensor per processor package The BMC implements this as a threshold sensor on a per-processor basis
HA201-TP Users Manual
90
Chapter 7 Intelreg Server Board S2600TP Platform Management
Under normal operation this sensor is expected to read lsquo0rsquo indicating that no processor throttling has occurredThe processor provides PECI-accessible counters one for the total processor time elapsed and one for the total thermally constrained time which are used to calculate the percentage assertion over the given time window
73105 Processor Voltage Regulator (VRD) Over-Temperature SensorThe BMC monitors processor VRD_HOT signals The processor VRD_HOT signals are routed to the respective processor PROCHOT input in order initiate throttling to reduce processor power draw therefore indirectly lowering the VRD temperatureThere is one processor VRD_HOT signal per CPU slot The BMC instantiates one discrete IPMI sensor for each VRD_HOT signal This sensor monitors a digital signal that indicates whether a processor VRD is running in an over-temperature condition When the BMC detects that this signal is asserted it will cause a sensor assertion which will result in an event being written into the sensor event log (SEL)
73106 Inlet Temperature SensorEach platform supports a thermal sensor for monitoring the inlet temperature There are four potential sources for inlet temperature reading
73107 Baseboard Ambient Temperature Sensor(s)The server baseboard provides one or more physical thermal sensors for monitoring the ambient temperature of a board location This is typically to provide rudimentary thermal monitoring of components that lack internal thermal sensors
73108 Server South Bridge (SSB) Thermal MonitoringThe BMC monitors the SSB temperature This is instantiated as an analog (threshold) IPMI thermal sensor
73109 Exit Air Temperature Monitoring The BMC synthesizes a virtual sensor to approximate system exit air temperature for use in fan control This is calculated based on the total power being consumed by the system and the total volumetric air flow provided by the system fans
HA201-TP Users Manual
91
Chapter 7 Intelreg Server Board S2600TP Platform Management
Each system shall be characterized in tabular format to understand total volumetric flow versus fan speed The BMC calculates an average exit air temperature based on the total system power front panel temperature the volumetric system air flow (cubic feet per meter or CFM) and altitude rangeThis sensor is only available on systems in an Intelreg chassis The Exit Air temp sensor is only available when PMBus power supplies are installed
731010 Ethernet Controller Thermal MonitoringThe Intelreg Ethernet Controller I350-AM4 and Intelreg Ethernet Controller 10 Gigabit X540 support an on-die thermal sensor For baseboard Ethernet controllers that use these devices the BMC will monitor the sensors and use this data as input to the fan speed control The BMC will instantiate an IPMI temperature sensor for each device on the baseboard
731011 Memory VRD-Hot Sensor(s)The BMC monitors memory VRD_HOT signals The memory VRD_HOT signals are routed to the respective processor MEMHOT inputs in order to throttle the associated memory to effectively lower the temperature of the VRD feeding that memoryFor Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family there are 2 memory VRD_HOT signals per CPU slot The BMC instantiates one discrete IPMI sensor for each memory VRD_HOT signal
731012 Add-in Module Thermal MonitoringSome boards have dedicated slots for an IO module andor a SAS module For boards that support these slots the BMC will instantiate an IPMI temperature sensor for each slot The modules themselves may or may not provide a physical thermal sensor (a TMP75 device) If the BMC detects that a module is installed it will attempt to access the physical thermal sensor and if found enable the associated IPMI temperature sensor
731013 Processor ThermTripWhen a Processor ThermTrip occurs the system hardware will automatically power down the server If the BMC detects that a ThermTrip occurred then it will set the ThermTrip offset for the applicable
HA201-TP Users Manual
92
Chapter 7 Intelreg Server Board S2600TP Platform Management
processor status sensor
731014 Server South Bridge (SSB) ThermTrip Monitoring The BMC supports SSB ThermTrip monitoring that is instantiated as an IPMI discrete sensor When a SSB ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an event
731015 DIMM ThermTrip MonitoringThe BMC supports DIMM ThermTrip monitoring that is instantiated as one aggregate IPMI discrete sensor per CPU When a DIMM ThermTrip occurs the system hardware will automatically power down the server and the BMC will assert the sensor offset and log an eventThis is a manual re-arm sensor that is rearmed on system resets and power-on (AC or DC power on transitions)
HA201-TP Users Manual
93
Chapter 7 Intelreg Server Board S2600TP Platform Management
7311 Processor SensorsThe BMC provides IPMI sensors for processors and associated components such as voltage regulators and fans The sensors are implemented on a per-processor basis
HA201-TP Users Manual
94
Chapter 7 Intelreg Server Board S2600TP Platform Management
73111 Processor Status SensorsThe BMC provides an IPMI sensor of type processor for monitoring status information for each processor slot If an event state (sensor offset) has been asserted it remains asserted until one of the following happens1 A Rearm Sensor Events command is executed for the processor status
sensor2 AC or DC power cycle system reset or system boot occurs
The BMC provides system status indication to the front panel LEDs for processor fault conditions shown belowCPU Presence status is not saved across AC power cycles and therefore will not generate a deassertion after cycling AC power
HA201-TP Users Manual
95
Chapter 7 Intelreg Server Board S2600TP Platform Management
73112 Processor Population Fault (CPU Missing) SensorThe BMC supports a Processor Population Fault sensor This is used to monitor for the condition in which processor slots are not populated as required by the platform hardware to allow power-on of the systemAt BMC startup the BMC will check for the fault condition and set the sensor state accordinglyThe BMC also checks for this fault condition at each attempt to DC power-on the system At each DC power-on attempt a beep code is generated if this fault is detectedThe following steps are used to correct the fault condition and clear the sensor state1 AC power down the server2 Install the missing processor into the correct slot3 AC power on the server
73113 ERR2 Timeout MonitoringThe BMC supports an ERR2 Timeout Sensor (1 per CPU) that asserts if a CPUrsquos ERR2 signal has been asserted for longer than a fixed time period (gt 90 seconds) ERR[2] is a processor signal that indicates when the IIO (Integrated IO module in the processor) has a fatal error which could not be communicated to the core to trigger SMI ERR[2] events are fatal error conditions where the BIOS and OS will attempt to gracefully handle error but may not be always do so reliably A continuously asserted ERR2 signal is an indication that the BIOS cannot service the condition that caused the error This is usually because that condition prevents the BIOS from runningWhen an ERR2 timeout occurs the BMC assertsde-asserts the ERR2 Timeout Sensor and logs a SEL event for that sensor The default behavior for BMC core firmware is to initiate a system reset upon detection of an ERR2 timeout The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this condition
HA201-TP Users Manual
96
Chapter 7 Intelreg Server Board S2600TP Platform Management
73114 CATERR SensorThe BMC supports a CATERR sensor for monitoring the system CATERR signalThe CATERR signal is defined as having 3 statesbull high (no event)bull pulsed low (possibly fatal may be able to recover)bull low (fatal)All processors in a system have their CATERR pins tied together The pin is used as a communication path to signal a catastrophic system event to all CPUs The BMC has direct access to this aggregate CATERR signalThe BMC only monitors for the ldquoCATERR held lowrdquo condition A pulsed low condition is ignored by the BMC If a CATERR-low condition is detected the BMC logs an error message to the SEL against the CATERR sensor and the default action after logging the SEL entry is to reset the system The BIOS setup utility provides an option to disable or enable system reset by the BMC for detection of this conditionThe sensor is rearmed on power-on (AC or DC power on transitions) It is not rearmed on system resets in order to avoid multiple SEL events that could occur due to a potential reset loop if the CATERR keeps recurring which would be the case if the CATERR was due to an MSID mismatch conditionWhen the BMC detects that this aggregate CATERR signal has asserted it can then go through PECI to query each CPU to determine which one was the source of the error and write an OEM code identifying the CPU slot into an event data byte in the SEL entry If PECI is non-functional (it isnrsquot guaranteed in this situation) then the OEM code should indicate that the source is unknownEvent data byte 2 and byte 3 for CATERR sensor SEL events
ED1 ndash 0xA1ED2 - CATERR type0 Unknown1 CATERR2 CPU Core Error (not supported on Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family)3 MSID Mismatch4 CATERR due to CPU 3-strike timeout
HA201-TP Users Manual
97
Chapter 7 Intelreg Server Board S2600TP Platform Management
ED3 - CPU bitmap that causes the system CATERR[0] CPU1[1] CPU2[2] CPU3[3] CPU4When a CATERR Timeout event is determined to be a CPU 3-strike timeout The BMC shall log the logical FRU information (eg busdevfunc for a PCIe device CPU or DIMM) that identifies the FRU that caused the error in the extended SEL data bytes In this case Ext-ED0 will be set to 0x70 and the remaining ED1-ED7 will be set according to the device type and info available
73115 MSID Mismatch SensorThe BMC supports a MSID Mismatch sensor for monitoring for the fault condition that will occur if there is a power rating incompatibility between a baseboard and a processor The sensor is rearmed on power-on (AC or DC power on transitions)
7312 Voltage MonitoringThe BMC provides voltage monitoring capability for voltage sources on the main board and processors such that all major areas of the system are covered This monitoring capability is instantiated in the form of IPMI analogthreshold sensors
73121 DIMM Voltage SensorsSome systems support either LVDDR (Low Voltage DDR) memory or regular (non-LVDDR) memory During POST the system BIOS detects which type of memory is installed and configures the hardware to deliver the correct voltageSince the nominal voltage range is different this necessitates the ability to set different thresholds for any associated IPMI voltage sensors The BMC FW supports this by implementing separate sensors (that is separate IPMI sensor numbers) for each nominal voltage range supported for a single physical sensor and it enablesdisables the correct IPMI sensor based on which type memory is installed The sensor data records for both these DIMM voltage sensor types have scanning disabled by default Once the BIOS has completed its POST routine it is responsible for communicating the DIMM voltage type to the BMC which will then
HA201-TP Users Manual
98
Chapter 7 Intelreg Server Board S2600TP Platform Management
enable sensor scanning of the correct DIMM voltage sensor
7313 Fan MonitoringBMC fan monitoring support includes monitoring of fan speed (RPM) and fan presence
73131 Fan Tach SensorsFan Tach sensors are used for fan failure detection The reported sensor reading is proportional to the fanrsquos RPM This monitoring capability is instantiated in the form of IPMI analogthreshold sensorsMost fan implementations provide for a variable speed fan so the variations in fan speed can be large Therefore the threshold values must be set sufficiently low as to not result in inappropriate threshold crossingsFan tach sensors are implemented as manual re-arm sensors because a lower-critical threshold crossing can result in full boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the threshold and can result in fan oscillationsAs a result fan tach sensors do not auto-rearm when the fault condition goes away but rather are rearmed for either of the following occurrences1 The system is reset or power-cycled2 The fan is removed and either replaced with another fan or re-
inserted This applies to hot-swappable fans only This re-arm is triggered by change in the state of the associated fan presence sensor
After the sensor is rearmed if the fan speed is detected to be in a normal range the failure conditions shall be cleared and a de-assertion event shall be logged
HA201-TP Users Manual
99
Chapter 7 Intelreg Server Board S2600TP Platform Management
73132 Fan Presence SensorsSome chassis and server boards provide support for hot-swap fans These fans can be removed and replaced while the system is powered on and operating normally The BMC implements fan presence sensors for each hot swappable fan These are instantiated as IPMI discrete sensorsEvents are only logged for fan presence upon changes in the presence state after AC power is applied (no events logged for initial state)
73133 Fan Redundancy SensorThe BMC supports redundant fan monitoring and implements fan redundancy sensors for products that have redundant fans Support for redundant fans is chassis-specificA fan redundancy sensor generates events when its associated set of fans transition between redundant and non-redundant states as determined by the number and health of the component fans The definition of fan redundancy is configuration dependent The BMCallows redundancy to be configured on a per fan-redundancy sensor basis through OEM SDR recordsThere is a fan redundancy sensor implemented for each redundant group of fans in the systemAssertion and de-assertion event generation is enabled for each redundancy state
73134 Power Supply Fan SensorsMonitoring is implemented through IPMI discrete sensors one for each power supply fan The BMC polls each installed power supply using the PMBus fan status commands to check for failure conditions for the power supply fans The BMC asserts the ldquoperformance lagsrdquo offset of the IPMI sensor if a fan failure is detectedPower supply fan sensors are implemented as manual re-arm sensors because a failure condition can result in boosting of the fans This in turn may cause a failing fanrsquos speed to rise above the ldquofaultrdquo threshold and can result in fan oscillations As a result these sensors do not auto-rearm when the fault condition goes away but rather are rearmed only when the system is reset or power-cycled or the PSU is removed and replaced with the same or another PSUAfter the sensor is rearmed if the fan is no longer showing a failed state the failure condition in the IPMI sensor shall be cleared and a de-
HA201-TP Users Manual
100
Chapter 7 Intelreg Server Board S2600TP Platform Management
assertion event shall be logged
73135 Monitoring for ldquoFans Offrdquo ScenarioOn Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family it is likely that there will be situations where specific fans are turned off based on current system conditions BMC Fan monitoring will comprehend this scenario and not log false failure eventsThe recommended method is for the BMC FW to halt updates to the value of the associated fan tach sensor and set that sensorrsquos IPMI sensor state to ldquoreading-state-unavailablerdquo when this mode is active Management software must comprehend this state for fan tach sensors and not report these as failure conditionsThe scenario for which this occurs is that the BMC Fan Speed Control (FSC) code turns off the fans by setting the PWM for the domain to 0 This is done when based on one or more global aggregate thermal margin sensor readings dropping below a specified thresholdBy default the fans-off feature will be disabled There is a BMC command and BIOS setup option to enabledisable this featureThe SmaRTCLST system feature will also momentarily gate power to all the system fans to reduce overall system power consumption in response to a power supply event (for example to ride out an AC power glitch) However for this scenario the fan power is gated by hardware for only 100ms which should not be long enough to result in triggering a fan fault SEL event
7314 Standard Fan ManagementThe BMC controls and monitors the system fans Each fan is associated with a fan speed sensor that detects fan failure and may also be associated with a fan presence sensor for hotswap support For redundant fan configurations the fan failure and presence status determines the fan redundancy sensor stateThe system fans are divided into fan domains each of which has a separate fan speed control signal and a separate configurable fan control policy A fan domain can have a set of temperature and fan sensors associated with it These are used to determine the current fan domain state
HA201-TP Users Manual
101
Chapter 7 Intelreg Server Board S2600TP Platform Management
A fan domain has three statesbull The sleep and boost states have fixed (but configurable through OEM
SDRs) fan speeds associated with thembull The nominal state has a variable speed determined by the fan
domain policy An OEM SDR record is used to configure the fan domain policy
The fan domain state is controlled by several factors They are listed below in order of precedence high to lowbull Boost
o Associated fan is in a critical state or missing The SDR describes which fan domains are boosted in response to a fan failure or removal in each domain If a fan is removed when the system is in lsquoFans-offrsquo mode it will not be detected and there will not be any fan boost till system comes out of lsquoFans-off modeo Any associated temperature sensor is in a critical state The SDR describes which temperature-threshold violations cause fan boost for each fan domaino The BMC is in firmware update mode or the operational firmware is corruptedo If any of the above conditions apply the fans are set to a fixed boost state speed
bull Nominalo A fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorso See section 43144 for more details
73141 Hot-Swap FansHot-swap fans are supported These fans can be removed and replaced while the system is powered on and operating The BMC implements fan presence sensors for each hotswappable fanWhen a fan is not present the associated fan speed sensor is put into the readingunavailable state and any associated fan domains are put into the boost state The fans may already be boosted due to a previous fan failure or fan removalWhen a removed fan is inserted the associated fan speed sensor is rearmed If there are no other critical conditions causing a fan boost condition the fan speed returns to the nominal state Power cycling or
HA201-TP Users Manual
102
Chapter 7 Intelreg Server Board S2600TP Platform Management
resetting the system re-arms the fan speed sensors and clears fanfailure conditions If the failure condition is still present the boost state returns once the sensor has re-initialized and the threshold violation is detected again
73142 Fan Redundancy DetectionThe BMC supports redundant fan monitoring and implements a fan redundancy sensor A fan redundancy sensor generates events when its associated set of fans transitions between redundant and non-redundant states as determined by the number and health of the fans The definition of fan redundancy is configuration dependent The BMC allows redundancy to be configured on a per fan redundancy sensor basis through OEM SDR recordsA fan failure or removal of hot-swap fans up to the number of redundant fans specified in the SDR in a fan configuration is a non-critical failure and is reflected in the front panel status A fan failure or removal that exceeds the number of redundant fans is a non-fatal insufficientresources condition and is reflected in the front panel status as a non-fatal errorRedundancy is checked only when the system is in the DC-on state Fan redundancy changes that occur when the system is DC-off or when AC is removed will not be logged until the system is turned on
73143 Fan DomainsSystem fan speeds are controlled through pulse width modulation (PWM) signals which are driven separately for each domain by integrated PWM hardware Fan speed is changed by adjusting the duty cycle which is the percentage of time the signal is driven high in each pulseThe BMC controls the average duty cycle of each PWM signal through direct manipulation of the integrated PWM control registersThe same device may drive multiple PWM signals
HA201-TP Users Manual
103
Chapter 7 Intelreg Server Board S2600TP Platform Management
73144 Nominal Fan SpeedA fan domainrsquos nominal fan speed can be configured as static (fixed value) or controlled by the state of one or more associated temperature sensorsOEM SDR records are used to configure which temperature sensors are associated with which fan control domains and the algorithmic relationship between the temperature and fan speed Multiple OEM SDRs can reference or control the same fan control domain and multiple OEM SDRs can reference the same temperature sensorsThe PWM duty-cycle value for a domain is computed as a percentage using one or more instances of a stepwise linear algorithm and a clamp algorithm The transition from one computed nominal fan speed (PWM value) to another is ramped over time to minimize audible transitions The ramp rate is configurable by means of the OEM SDRMultiple stepwise linear and clamp controls can be defined for each fan domain and used simultaneously For each domain the BMC uses the maximum of the domainrsquos stepwise linear control contributions and the sum of the domainrsquos clamp control contributions to compute the domainrsquos PWM value except that a stepwise linear instance can be configured to provide the domain maximumHysteresis can be specified to minimize fan speed oscillation and to smooth fan speed transitions If a Tcontrol SDR record does not contain a hysteresis definition for example an SDR adhering to a legacy format the BMC assumes a hysteresis value of zero
73145 Thermal and Acoustic ManagementThis feature refers to enhanced fan management to keep the system optimally cooled while reducing the amount of noise generated by the system fans Aggressive acoustics standards might require a trade-off between fan speed and system performance parameters that contribute to the cooling requirements primarily memory bandwidth The BIOS BMC and SDRs work together to provide control over how this trade-off is determinedThis capability requires the BMC to access temperature sensors on the individual memory DIMMs Additionally closed-loop thermal throttling is only supported with buffered DIMMs
HA201-TP Users Manual
104
Chapter 7 Intelreg Server Board S2600TP Platform Management
73146 Thermal Sensor Input to Fan Speed ControlThe BMC uses various IPMI sensors as input to the fan speed control Some of the sensors are IPMI models of actual physical sensors whereas some are ldquovirtualrdquo sensors whose values are derived from physical sensors using calculations andor tabular informationThe following IPMI thermal sensors are used as input to the fan speed controlbull Front panel temperature sensorbull Baseboard temperature sensorsbull CPU DTS-Spec margin sensorsbull DIMM thermal margin sensorsbull Exit air temperature sensorbull Global aggregate thermal margin sensorsbull SSB (Intelreg C610 Series Chipset) temperature sensorbull On-board Ethernet controller temperature sensors (support for this is
specific to the Ethernet controller being used)bull Add-in Intelreg SASIO module temperature sensor(s) (if present)bull Power supply thermal sensors (only available on PMBus-compliant
power supplies)A simple model is shown in the following figure which gives a high level graphic of the fan speed control structure creates the resulting fan speeds
HA201-TP Users Manual
105
Chapter 7 Intelreg Server Board S2600TP Platform Management
731461 Processor Thermal ManagementProcessor thermal management utilizes clamp algorithms for which the Processor DTS-Spec margin sensor is a controlling input This replaces the use of the (legacy) raw DTS sensor reading that was utilized on previous generation platforms The legacy DTS sensor is retained only for monitoring purposes and is not used as an input to the fan speed control
731462 Memory Thermal ManagementThe system memory is the most complex subsystem to thermally manage as it requires substantial interactions between the BMC BIOS and the embedded memory controller This section provides an overview of this management capability from a BMC perspective
7314621 Memory Thermal ThrottlingThe system shall support thermal management through open loop throttling (OLTT) and closed loop throttling (CLTT) of system memory based on the platform as well as availability of valid temperature sensors on the installed memory DIMMs Throttling levels are changed dynamically to cap throttling based on memory and system thermal conditions as determined by the system and DIMM power and thermal parameters Support for CLTT on mixed-mode DIMM populations (that is some installed DIMMs have valid temp sensors and some do not) is not supported The BMC fan speed control functionality is related to the memory throttling mechanism usedThe following terminology is used for the various memory throttling optionsbull Static Open Loop Thermal Throttling (Static-OLTT) OLTT control registers
are configured by BIOS MRC remain fixed after post The system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Static Closed Loop Thermal Throttling (Static-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Otherwise the system does not change any of the throttling control registers in the embedded memory controller during runtime
bull Dynamic Open Loop Thermal Throttling (Dynamic-OLTT) OLTT control registers are configured by BIOS MRC during POST Adjustments are
HA201-TP Users Manual
106
Chapter 7 Intelreg Server Board S2600TP Platform Management
made to the throttling during runtime based on changes in system cooling (fan speed)
bull Dynamic Closed Loop Thermal Throttling (Dynamic-CLTT) CLTT control registers are configured by BIOS MRC during POST The memory throttling is run as a closed-loop system with the DIMM temperature sensors as the control input Adjustments are made to the throttling during runtime based on changes in system cooling (fan speed)
Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce a new type of CLTT which is referred to as Hybrid CLTT for which the Integrated Memory Controller estimates the DRAM temperature in between actual reads of the TSODsHybrid CLTTT shall be used on all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family that have DIMMs with thermal sensors Therefore the terms Dynamic-CLTT and Static-CLTT are really referring to this lsquohybridrsquo mode Note that if the IMCrsquos polling of the TSODs is interrupted the temperature readings that the BMC gets from the IMC shall be these estimated values 731463 DIMM Temperature Sensor Input to Fan Speed ControlA clamp algorithm s used for controlling fan speed based on DIMM temperatures Aggregate DIMM temperature margin sensors are used as the control input to the algorithm
731464 Dynamic (Hybrid) CLTTThe system will support dynamic (memory) CLTT for which the BMC FW dynamically modifies thermal offset registers in the IMC during runtime based on changes in system cooling (fan speed) For static CLTT a fixed offset value is applied to the TSOD reading to get the die temperature however this is does not provide as accurate results as when the offset takes into account the current airflow over the DIMM as is done with dynamic CLTTIn order to support this feature the BMC FW will derive the air velocity for each fan domain based on the PWM value being driven for the domain Since this relationship is dependent on the chassis configuration a method must be used which support this dependency (for example through OEM SDR) that establishes a lookup table providing this relationshipBIOS will have an embedded lookup table that provides thermal offset
HA201-TP Users Manual
107
Chapter 7 Intelreg Server Board S2600TP Platform Management
values for each DIMM type altitude setting and air velocity range (3 ranges of air velocity are supported) During system boot BIOS will provide 3 offset values (corresponding to the 3 air velocity ranges) tothe BMC for each enabled DIMM Using this data the BMC FW constructs a table that maps the offset value corresponding to a given air velocity range for each DIMM During runtime the BMC applies an averaging algorithm to determine the target offset value corresponding to thecurrent air velocity and then the BMC writes this new offset value into the IMC thermal offset register for the DIMM
731465 Fan ProfilesThe server system supports multiple fan control profiles to support acoustic targets and American Society of Heating Refrigerating and Air Conditioning Engineers (ASHRAE) compliance The BIOS Setup utility can be used to choose between meeting the target acoustic level or enhanced system performance This is accomplished through fan profilesThe BMC supports eight fan profiles numbered from 0 to 7 Fan management policy is dictated by the Tcontrol SDRs These SDRs provide a way to associate fan control behavior with one or more fan profilesEach group of profiles allows for varying fan control policies based on the altitude For a given altitude the Tcontrol SDRs associated with an acoustics-optimized profile generate less noise than the equivalent performance-optimized profile by driving lower fan speeds and the BIOSreduces thermal management requirements by configuring more aggressive memory throttling See Table 16 for more informationThe BMC provides commands that query for fan profile support and it provides a way to enable a fan profile Enabling a fan profile determines which Tcontrol SDRs are used for fan management The BMC only supports enabling a fan profile through the command if that profile is supported on all fan domains defined for the system It is important to configure the SDRs so that all desired fan profiles are supported on each fan domain If no single profile is supported across all domains the BMC by default uses profile 0 and does not allow it to be changedAt system boot the BIOS can use the Get Fan Control Configuration command to query the BMC about which fan profiles are supported The BIOS uses this information to display options in the BIOS Setup utility The BIOS indicates the fan profile to the BMC as dictated by the BIOS Setup Utility options for fan mode and altitude using the Set Fan Control
HA201-TP Users Manual
108
Chapter 7 Intelreg Server Board S2600TP Platform Management
Configuration commandThe BMC uses this information as an input to its fan-control algorithm as supported by the Tcontrol OEM SDR The BMC only allows enabling of fan profiles that the BMC indicates are supported using the Get Fan Control Configuration command For example if the Get Fan Control Configuration command indicates that only profile 1 is supported then using the Set Fan Control Configuration command to enable profile 2 will result in the return of an error completion codeThe BMC requires the BIOS to send the Set Fan Control Configuration command to the BMC on every system boot This must be done after the BIOS has completed any throttling-related chipset configuration
HA201-TP Users Manual
109
Chapter 7 Intelreg Server Board S2600TP Platform Management
731466 Open-Loop Thermal Throttling FallbackNormal system operation uses closed-loop thermal throttling (CLTT) and DIMM temperature monitoring as major factors in overall thermal and acoustics management In the event that BIOS is unable to configure the system for CLTT it defaults to open-loop thermal throttling (OLTT) In the OLTT mode it is assumed that the DIMM temperature sensors are not available for fan speed control The BIOS communicates the throttling mode to the BMC along with the fan profile number when it sends the Set Fan Control Configuration commandWhen OLTT mode is specified the BMC internally blocks access to the DIMM temperatures causing the DIMM aggregate margin sensors to be marked as ReadingState Unavailable The BMC then uses the failure-control values for these sensors if specified in the Tcontrol SDRs as their fan speed contributions
731467 ASHRAE ComplianceSystem requirements for ASHRAE compliance is defined in the Common Fan Speed Control amp Thermal Management Platform Architecture Specification Altitude-related changes in fan speed control are handled through profiles for different altitude ranges
73147 Power Supply Fan Speed ControlThis section describes the system level control of the fans internal to the power supply over the PMBus Some but not all Intelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family will require that the power supplies be included in the system level fan speed control For any system that requires either of these capabilities the power supply must be PMBus-compliant
731471 System Control of Power Supply FansSome products require that the BMC control the speed of the power supply fans as is done with normal system (chassis) fans except that the BMC cannot reduce the power supply fan any lower than the internal power supply control is driving it For these products the BMC FW must have the ability to control and monitor the power supply fans through PMBus commands The power supply fans are treated as a system fan domain for which fan control policies are mapped just as for chassis system fans with system thermal sensors (rather than internal power
HA201-TP Users Manual
110
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
111
Chapter 7 Intelreg Server Board S2600TP Platform Management
supply thermal sensors) used as the input to a clamp algorithm for the power supply fan control This domain has both piecewise clipping curves and clamped sensors mapped into the power supply fan domain All the power supplies can be defined as a single fan domain
731472 Use of Power Supply Thermal Sensors as Input to System (Chassis) Fan Control Some products require that the power supply internal thermal sensors are used as control inputs to the system (chassis) fans in the same manner as other system thermal sensors are used for this purpose The power supply thermal sensors are included as clamped sensors into one or more system fan domains which may include the power supply fan domain
73148 Fan Boosting due to Fan FailuresIntelreg Server Systems supporting the Intelreg Xeonreg processor E5-2600 v3 product family introduce additional capabilities for handling fan failure or removal as described in this sectionEach fan failure shall be able to define a unique response from all other fan domains An OEM SDR table defines the response of each fan domain based on a failure of any fan including both system and power supply fans (for PMBus-compliant power supplies only) This meansthat if a system has six fans then there will be six different fan fail reactions
73149 Programmable Fan PWM OffsetThe system provides a BIOS Setup option to boost the system fan speed by a programmable positive offset or a ldquoMaxrdquo setting Setting the programmable offset causes the BMC to add the offset to the fan speeds that it would otherwise be driving the fans to The Max setting causes the BMC to replace the domain minimum speed with alternate domain minimums that also are programmable through SDRsThis capability is offered to provide system administrators the option to manually configure fans speeds in instances where the fan speed optimized for a given platform may not be sufficient when a high end add-in is configured into the system This enables easier usage of the fan speed control to support Intelreg as well as third party chassis and better support of ambient temperatures higher than 35C
HA201-TP Users Manual
112
Chapter 7 Intelreg Server Board S2600TP Platform Management
7315 Power Management Bus (PMBus)The Power Management Bus (ldquoPMBusrdquo) is an open standard protocol that is built upon the SMBus 20 transport It defines a means of communicating with power conversion and other devices using SMBus-based commands A system must have PMBus-compliant power supplies installed in order for the BMC or ME to monitor them for status andor power metering purposesFor more information on PMBus see the System Management Interface Forum Web sitehttpwwwpowersigorg
7316 Power Supply Dynamic Redundancy SensorThe BMC supports redundant power subsystems and implements a Power Unit Redundancy sensor per platform A Power Unit Redundancy sensor is of sensor type Power Unit (09h) and reading type Availability Status (0Bh) This sensor generates events when a power subsystem transitions between redundant and non-redundant states as determined by the number and health of the power subsystemrsquos component power supplies The BMC implements Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacity This status is independent of the Cold Redundancy status This prevents the BMC from reporting Fully Redundant Power supplies when the load required by the system exceeds half the power capability of all power supplies installed and operationalDynamic Redundancy detects this condition and generates the appropriate SEL event to notify the user of the condition Power supplies of different power ratings may be swapped in and out to adjust the power capacity and the BMC will adjust the Redundancy status accordinglyThe definition of redundancy is power subsystem dependent and sometimes even configuration dependent See the appropriate Platform Specific Information for power unit redundancy supportThis sensor is configured as manual-rearm sensor in order to avoid the possibility of extraneous SEL events that could occur under certain system configuration and workload conditions The sensor shall rearm for the following conditionsbull PSU hot-addbull System reset
HA201-TP Users Manual
113
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull AC power cyclebull DC power cycleSystem AC power is applied but on standby ndash Power unit redundancy is based on OEM SDR power unit record and number of PSU presentSystem is (DC) powered on - The BMC calculates Dynamic Power Supply Redundancy status based upon current system load requirements as well as total Power Supply capacityThe BMC allows redundancy to be configured on a per power-unit-redundancy sensor basis by means of the OEM SDR records
7317 Component Fault LED ControlSeveral sets of component fault LEDs are supported on the server board Some LEDs are owned by the BMC and some by the BIOSThe BMC owns control of the following FRUfault LEDsbull Fan fault LEDs ndash A fan fault LED is associated with each fan The BMC
lights a fan fault LED if the associated fan-tach sensor has a lower critical threshold event status asserted Fan-tach sensors are manual re-arm sensors Once the lower critical threshold is crossed the LED remains lit until the sensor is rearmed These sensors are rearmed at system DC power-on and system reset
bull DIMM fault LEDs ndash The BMC owns the hardware control for these LEDs The LEDs reflect the state of BIOS-owned event-only sensors When the BIOS detects a DIMM fault condition it sends an IPMI OEM command (Set Fault Indication) to the BMC to instruct the BMC to turn on the associated DIMM Fault LED These LEDs are only active when the system is in the lsquoonrsquo state The BMC will not activate or change the state of the LEDs unless instructed by the BIOS
bull Hard Disk Drive Status LEDs ndash The HSBP PSoC owns the hardware control for these LEDs and detection of the faultstatus conditions that the LEDs reflect
bull CPU Fault LEDs The BMC owns control for these LEDs An LED is lit if there is an MSID mismatch (that is CPU power rating is incompatible with the board)
HA201-TP Users Manual
114
Chapter 7 Intelreg Server Board S2600TP Platform Management
7318 CMOS Battery MonitoringThe BMC monitors the voltage level from the CMOS battery which provides battery backup to the chipset RTC This is monitored as an auto-rearm threshold sensorUnlike monitoring of other voltage sources for which the Emulex Pilot III component continuously cycles through each input the voltage channel used for the battery monitoring provides a software enable bit to allow the BMC FW to poll the battery voltage at a relativelyslow rate in order to conserve battery power
HA201-TP Users Manual
115
Chapter 7 Intelreg Server Board S2600TP Platform Management
74 Intelreg Intelligent Power Node Manager (NM)Power management deals with requirements to manage processor power consumption and manage power at the platform level to meet critical business needs Node Manager (NM) is a platform resident technology that enforces power capping and thermal-triggered powercapping policies for the platform These policies are applied by exploiting subsystem settings (such as processor P and T states) that can be used to control power consumption NM enables data center power management by exposing an external interface to management software through which platform policies can be specified It also implements specific data center power management usage models such as power limiting and thermal monitoringThe NM feature is implemented by a complementary architecture utilizing the ME BMC BIOS and an ACPI-compliant OS The ME provides the NM policy engine and power controllimiting functions (referred to as Node Manager or NM) while the BMC provides the external LAN link by which external management software can interact with the feature The BIOS provides system power information utilized by the NM algorithms and also exports ACPI Source Language (ASL) code used by OS-Directed Power Management (OSPM) for negotiating processor P and T state changes for power limiting PMBus-compliant power supplies provide the capability to monitor input power consumption which is necessary to support NMThe NM architecture applicable to this generation of servers is defined by the NPTM Architecture Specification v20 NPTM is an evolving technology that is expected to continue to add new capabilities that will be defined in subsequent versions of the specification The ME NM implements the NPTM policy engine and controlmonitoring algorithms defined in the Node Power and Thermal Manager (NPTM) specification
741 Hardware RequirementsNM is supported only on platforms that have the NM FW functionality loaded and enabled on the Management Engine (ME) in the SSB and that have a BMC present to support the external LAN interface to the ME NM power limiting features requires a means for the ME to monitorinput power consumption for the platform This capability is generally provided by means of PMBus-compliant power supplies although an alternative model using a simpler SMBus power monitoring device is possible (there is potential loss in accuracy and responsiveness using
HA201-TP Users Manual
116
Chapter 7 Intelreg Server Board S2600TP Platform Management
non-PMBus devices) The NM SmaRTCLST feature does specifically require PMBus-compliant power supplies as well as additional hardware on the server board
742 FeaturesNM provides feature support for policy management monitoring and querying alerts and notifications and an external interface protocol The policy management features implement specific IT goals that can be specified as policy directives for NM Monitoring and querying features enable tracking of power consumption Alerts and notifications provide the foundation for automation of power management in the data center management stack The external interface specifies the protocols that must be supported in this version of NM
743 ME System Management Bus (SMBus) Interfacebull The ME uses the SMLink0 on the SSB in multi-master mode as a
dedicated bus for communication with the BMC using the IPMB protocol The BMC FW considers this a secondary IPMB bus and runs at 400 kHz
bull The ME uses the SMLink1 on the SSB in multi-master mode bus for communication with PMBus devices in the power supplies for support of various NM-related features This bus is shared with the BMC which polls these PMBus power supplies for sensor monitoring purposes (for example power supply status input power and so on) This bus runs at 100 KHz
bull The Management Engine has access to the ldquoHost SMBusrdquo
744 PECI 30bull The BMC owns the PECI bus for all Intel server implementations and
acts as a proxy for the ME when necessary
745 NM ldquoDiscoveryrdquo OEM SDRAn NM ldquodiscoveryrdquo OEM SDR must be loaded into the BMCrsquos SDR repository if and only if the NM feature is supported on that product This OEM SDR is used by management software to detect if NM is supported and to understand how to communicate with itSince PMBus compliant power supplies are required in order to support NM the system should be probed when the SDRs are loaded into the
HA201-TP Users Manual
117
Chapter 7 Intelreg Server Board S2600TP Platform Management
BMCrsquos SDR repository in order to determine whether or not the installed power supplies do in fact support PMBus If the installed power supplies are not PMBus compliant then the NM ldquodiscoveryrdquo OEM SDR should not be loadedPlease refer to the Intelreg Intelligent Power Node Manager 20 External Architecture Specification using IPMI for details of this interface
746 SmaRTCLSTThe power supply optimization provided by SmaRTCLST relies on a platform HW capability as well as ME FW support When a PMBus-compliant power supply detects insufficient input voltage an overcurrent condition or an over-temperature condition it will assert theSMBAlert signal on the power supply SMBus (such as the PMBus) Through the use of external gates this results in a momentary assertion of the PROCHOT and MEMHOT signals to the processors thereby throttling the processors and memory The ME FW also sees the SMBAlert assertion queries the power supplies to determine the condition causing the assertion and applies an algorithm to either release or prolong the throttling based on the situationSystem power control modes include1 SmaRT Low AC in put voltage event results in a one-time momentary
throttle for each event to the maximum throttle state2 Electrical Protection CLST High output energy event results in a
throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
3 Thermal Protection CLST High power supply thermal event results in a throttling hiccup mode with fixed maximum throttle time and a fix throttle release ramp time
When the SMBAlert signal is asserted the fans will be gated by HW for a short period (~100ms) to reduce overall power consumption It is expected that the interruption to the fans will be of short enough duration to avoid false lower threshold crossings for the fan tachsensors however this may need to be comprehended by the fan monitoring FW if it does have this side-effectME FW will log an event into the SEL to indicate when the system has been throttled by the SmaRTCLST power management feature This is dependent on ME FW support for this sensor Please refer ME FW EPS for SEL log details
HA201-TP Users Manual
118
Chapter 7 Intelreg Server Board S2600TP Platform Management
74611 Dependencies on PMBus-compliant Power Supply SupportThe SmaRTCLST system feature depends on functionality present in the ME NM SKU This feature requires power supplies that are compliant with the PMBus
note For additional information on Intelreg Intelligent Power Node
Manager usage and support please visit the following Intel Website
httpwwwintelcomcontentwwwusendata-centerdata-center-managementnode-manager-generalhtmlwapkw=node+manager
HA201-TP Users Manual
119
Chapter 7 Intelreg Server Board S2600TP Platform Management
75 Basic and Advanced Server Management FeaturesThe integrated BMC has support for basic and advanced server management features Basic management features are available by default Advanced management features are enabled with the addition of an optionally installed Remote Management Module 4 Lite (RMM4 Lite) key
When the BMC FW initializes it attempts to access the Intelreg RMM4 lite If the attempt to access Intelreg RMM4 lite is successful then the BMC activates the Advanced featuresThe following table identifies both Basic and Advanced server management features
HA201-TP Users Manual
120
Chapter 7 Intelreg Server Board S2600TP Platform Management
751 Dedicated Management PortThe server board includes a dedicated 1GbE RJ45 Management Port The management port is active with or without the RMM4 Lite key installed
752 Embedded Web ServerBMC Base manageability provides an embedded web server and an OEM-customizable web GUI which exposes the manageability features of the BMC base feature set It is supported over all on-board NICs that have management connectivity to the BMC as well as an optionaldedicated add-in management NIC At least two concurrent web sessions from up to two different users is supported The embedded web user interface shall support the following client web browsersbull Microsoft Internet Explorer 90bull Microsoft Internet Explorer 100bull Mozilla Firefox 24bull Mozilla Firefox 25The embedded web user interface supports strong security (authentication encryption and firewall support) since it enables remote server configuration and control The user interface presented by the embedded web user interface shall authenticate the user before allowing a web session to be initiated Encryption using 128-bit SSL is supported User authentication is based on user id and passwordThe GUI presented by the embedded web server authenticates the user before allowing a web session to be initiated It presents all functions to all users but grays-out those functions that the user does not have privilege to execute For example if a user does not have privilege to power control then the item shall be displayed in grey-out font in that userrsquos UI display The web GUI also provides a launch point for some of the advanced features such as KVM and media redirection These features are grayed out in the GUI unless the system has been updated to support these advanced features The embedded web server only displays US English or Chinese language outputAdditional features supported by the web GUI includesbull Presents all the Basic features to the usersbull Power onoffreset the server and view current power statebull Displays BIOS BMC ME and SDR version informationbull Display overall system health
HA201-TP Users Manual
121
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull Configuration of various IPMI over LAN parameters for both IPV4 and IPV6
bull Configuration of alerting (SNMP and SMTP)bull Display system asset information for the product board and
chassisbull Display of BMC-owned sensors (name status current reading
enabled thresholds) including color-code status of sensorsbull Provides ability to filter sensors based on sensor type (Voltage
Temperature Fan and Power supply related)bull Automatic refresh of sensor data with a configurable refresh ratebull On-line helpbull Displayclear SEL (display is in easily understandable human
readable format)bull Supports major industry-standard browsers (Microsoft Internet
Explorer and Mozilla Firefox)bull The GUI session automatically times-out after a user-configurable
inactivity period By default this inactivity period is 30 minutesbull Embedded Platform Debug feature - Allow the user to initiate
a ldquodebug dumprdquo to a file that can be sent to Intelreg for debug purposes
bull Virtual Front Panel The Virtual Front Panel provides the same functionality as the local front panel The displayed LEDs match the current state of the local panel LEDs The displayed buttons (for example power button) can be used in the same manner as the local buttons
bull Display of ME sensor data Only sensors that have associated SDRs loaded will be displayed
bull Ability to save the SEL to a filebull Ability to force HTTPS connectivity for greater security This is
provided through a configuration option in the UIbull Display of processor and memory information as is available over
IPMI over LANbull Ability to get and set Node Manager (NM) power policiesbull Display of power consumed by the serverbull Ability to view and configure VLAN settingsbull Warn user the reconfiguration of IP address will cause disconnectbull Capability to block logins for a period of time after several
HA201-TP Users Manual
122
Chapter 7 Intelreg Server Board S2600TP Platform Management
consecutive failed login attempts The lock-out period and the number of failed logins that initiates the lockout period are configurable by the user
bull Server Power Control - Ability to force into Setup on a resetbull System POST results ndash The web server provides the systemrsquos Power-
On Self Test (POST) sequence for the previous two boot cycles including timestamps The timestamps may be viewed in relative to the start of POST or the previous POST code
bull Customizable ports - The web server provides the ability to customize the port numbers used for SMASH http https KVM secure KVM remote media and secure remote media
For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
753 Advanced Management Feature Support (RMM4 Lite)The integrated baseboard management controller has support for advanced management features which are enabled when an optional Intelreg Remote Management Module 4 Lite (RMM4 Lite) is installed The Intel RMM4 add-on offers convenient remote KVM access and control through LAN and internet It captures digitizes and compresses video and transmits it with keyboard and mouse signals to and from a remote computer Remote access and controlsoftware runs in the integrated baseboard management controller utilizing expanded capabilities enabled by the Intel RMM4 hardwareKey Features of the RMM4 add-on arebull KVM redirection from either the dedicated management NIC or
the server board NICs used for management traffic upto to two KVM sessions
bull Media Redirection ndash The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CDROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS or boot the server from this device
bull KVM ndash Automatically senses video resolution for best possible screen capture high performance mouse tracking and
HA201-TP Users Manual
123
Chapter 7 Intelreg Server Board S2600TP Platform Management
synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup
7531 Keyboard Video Mouse (KVM) RedirectionThe BMC firmware supports keyboard video and mouse redirection (KVM) over LAN This feature is available remotely from the embedded web server as a Java applet This feature is only enabled when the Intelreg RMM4 lite is present The client system must have a Java Runtime Environment (JRE) version 60 or later to run the KVM or media redirection appletsThe BMC supports an embedded KVM application (Remote Console) that can be launched from the embedded web server from a remote console USB11 or USB 20 based mouse and keyboard redirection are supported It is also possible to use the KVM-redirection (KVM-r) session concurrently with media-redirection (media-r) This feature allows a user to interactively use the keyboard video and mouse (KVM) functions of the remote server as if the user were physically at the managed server KVM redirection console supports the following keyboard layouts English Dutch French German Italian Russian and SpanishKVM redirection includes a ldquosoft keyboardrdquo function The ldquosoft keyboardrdquo is used to simulate an entire keyboard that is connected to the remote system The ldquosoft keyboardrdquo functionality supports the following layouts English Dutch French German Italian Russian and SpanishThe KVM-redirection feature automatically senses video resolution for best possible screen capture and provides high-performance mouse tracking and synchronization It allows remote viewing and configuration in pre-boot POST and BIOS setup once BIOS has initialized videoOther attributes of this feature includebull Encryption of the redirected screen keyboard and mousebull Compression of the redirected screenbull Ability to select a mouse configuration based on the OS typebull Supports user definable keyboard macrosKVM redirection feature supports the following resolutions and refresh ratesbull 640x480 at 60Hz 72Hz 75Hz 85Hz 100Hzbull 800x600 at 60Hz 72Hz 75Hz 85Hzbull 1024x768 at 60Hx 72Hz 75Hz 85Hzbull 1280x960 at 60Hz
HA201-TP Users Manual
124
Chapter 7 Intelreg Server Board S2600TP Platform Management
bull 1280x1024 at 60Hzbull 1600x1200 at 60Hzbull 1920x1080 (1080p)bull 1920x1200 (WUXGA)bull 1650x1080 (WSXGA+)
7532 Remote ConsoleThe Remote Console is the redirected screen keyboard and mouse of the remote host system To use the Remote Console window of your managed host system the browser must include a Java Runtime Environment plug-in If the browser has no Java support such as with a small handheld device the user can maintain the remote host system using the administration forms displayed by the browserThe Remote Console window is a Java Applet that establishes TCP connections to the BMCThe protocol that is run over these connections is a unique KVM protocol and not HTTP or HTTPS This protocol uses ports 7578 for KVM 5120 for CDROM media redirection and 5123 for FloppyUSB media redirection When encryption is enabled the protocol uses ports 7582 for KVM 5124 for CDROM media redirection and 5127 for FloppyUSB media redirection The local network environment must permit these connections to be made that is the firewall and in case of a private internal network the NAT (Network Address Translation) settings have to be configured accordingly
7533 PerformanceThe remote display accurately represents the local display The feature adapts to changes to the video resolution of the local display and continues to work smoothly when the system transitions from graphics to text or vice-versa The responsiveness may be slightly delayed depending on the bandwidth and latency of the networkEnabling KVM andor media encryption will degrade performance Enabling video compression provides the fastest response while disabling compression provides better video qualityFor the best possible KVM performance a 2Mbsec link or higher is recommendedThe redirection of KVM over IP is performed in parallel with the local KVM without affecting the local KVM operation
HA201-TP Users Manual
125
Chapter 7 Intelreg Server Board S2600TP Platform Management
7534 SecurityThe KVM redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
7535 AvailabilityThe remote KVM session is available even when the server is powered-off (in stand-by mode) No re-start of the remote KVM session shall be required during a server reset or power onoff A BMC reset (for example due to an BMC Watchdog initiated reset or BMC reset after BMC FWupdate) will require the session to be re-established KVM sessions persist across system reset but not across an AC power loss
7536 UsageAs the server is powered up the remote KVM session displays the complete BIOS boot process The user is able interact with BIOS setup change and save settings as well as enter and interact with option ROM configuration screensAt least two concurrent remote KVM sessions are supported It is possible for at least two different users to connect to same server and start remote KVM sessions
7537 Force-enter BIOS SetupKVM redirection can present an option to force-enter BIOS Setup This enables the system to enter F2 setup while booting which is often missed by the time the remote console redirects the video
7538 Media RedirectionThe embedded web server provides a Java applet to enable remote media redirection This may be used in conjunction with the remote KVM feature or as a standalone applet The media redirection feature is intended to allow system administrators or users to mount a remote IDE or USB CD-ROM floppy drive or a USB flash disk as a remote device to the server Once mounted the remote device appears just like a local device to the server allowing system administrators or users to install software (including operating systems) copy files update BIOS and so on or boot the server from this deviceThe following capabilities are supported
HA201-TP Users Manual
126
Chapter 7 Intelreg Server Board S2600TP Platform Management
The operation of remotely mounted devices is independent of the local devices on the server Both remote and local devices are useable in parallelbull Either IDE (CD-ROM floppy) or USB devices can be mounted as a
remote device to the serverbull It is possible to boot all supported operating systems from the remotely
mounted device and to boot from disk IMAGE (IMG) and CD-ROM or DVD-ROM ISO files See the Testedsupported Operating System List for more information
bull Media redirection supports redirection for both a virtual CD device and a virtual FloppyUSB device concurrently The CD device may be either a local CD drive or else an ISO image file the FloppyUSB device may be either a local Floppy drive a local USB device or else a disk image file
bull The media redirection feature supports multiple encryption algorithms including RC4 and AES The actual algorithm that is used is negotiated with the client based on the clientrsquos capabilities
bull A remote media session is maintained even when the server is powered-off (in standby mode) No restart of the remote media session is required during a server reset or power onoff An BMC reset (for example due to an BMC reset after BMC FW update) will require the session to be re-established
bull The mounted device is visible to (and useable by) managed systemrsquos OS and BIOS in both pre-boot and post-boot states
bull The mounted device shows up in the BIOS boot order and it is possible to change the BIOS boot order to boot from this remote device
bull It is possible to install an operating system on a bare metal server (no OS present) using the remotely mounted device This may also require the use of KVM-r to configure the OS during install
USB storage devices will appear as floppy disks over media redirection This allows for the installation of device drivers during OS installationIf either a virtual IDE or virtual floppy device is remotely attached during system boot both the virtual IDE and virtual floppy are presented as bootable devices It is not possible to present only a single-mounted device type to the system BIOS
HA201-TP Users Manual
127
Chapter 7 Intelreg Server Board S2600TP Platform Management
75381 AvailabilityThe default inactivity timeout is 30 minutes and is not user-configurable Media redirection sessions persist across system reset but not across an AC power loss or BMC reset
75382 Network Port UsageThe KVM and media redirection features use the following portsbull 5120 ndash CD Redirectionbull 5123 ndash FD Redirectionbull 5124 ndash CD Redirection (Secure)bull 5127 ndash FD Redirection (Secure)bull 7578 ndash Video Redirectionbull 7582 ndash Video Redirection (Secure)For additional information reference the Intelreg Remote Management Module 4 and Integrated BMC Web Console Users Guide
Chapter 1 Product IntroductionChapter 1 Product IntroductionChapter 8 Technical Support
wwwaicipccom
bull TAIWANTel +886 3 433 9188Fax +886 3 287 1818Email salesaicipccomtw
bull CHINA Tel +862154961421 +862154961422Fax Extension 608Email Technical Support supportaicipccom
bull AMERICA - West coastTel +19098958989Fax +19098958999Email salesaicipccom
bull AMERICA - East coastTel +19738848886Fax +19738844794Email njsalesaicipccom
bull EUROPETel +31306386789Fax +31306360638Emailsalesaicipcnl
Email Technical Support supportaicipccom
- PREFACE
-
- SAFETY INSTRUCTIONS
-
- Chapter 1 Product Introduction
-
- 11 Box Content
- 12 Specifications
- 13 General Information
-
- Chapter 2 Hardware Installation
-
- 21 Removing and Installing top cover
- 22 Central Processing Unit (CPU)
- 23 Diagram of the Correct Installation
- 24 System Memory
- 25 RemovingInstalling a Drive Tray Hard Drive Drive Slot Map
- 26 Removing and Installing a Fan Module
- 27 Power Supply
- 28 Tool-less Blade Slide Installation introduction
-
- Chapter 3 Motherborad Overview
-
- 31 Intelreg Server Board Feature Set
- 32 Motherboard Layout
- 33 Motherboard block diagram
- 34 Motherboard Configuration Jumpers
- 35 Motherboard Configuration Jumpers
-
- Chapter 4 12GB Expander Borad Overview
-
- 41 12GB Expander Board
-
- Chapter 5 Backplane Overview
-
- 51 Backplane
-
- Chapter 6 Debug amp Firmware Update Introduction
-
- 61 Expender firmware update through smart console port
- 62 Update the expander firmware through in-band
- 63 12G expander EDFB setting
- 64 Slot HDD power setting
- 65 HDD BP thermal sensor temperature setting
-
- Chapter 7 Intelreg Server Board S2600TP Platform Management
-
- 71 Server Management Function Architecture
- 72 Features and Functions
- 73 Sensor Monitoring
- 74 Intelreg Intelligent Power Node Manager (NM)
- 75 Basic and Advanced Server Management Features
-
- Chapter 8 Technical Support
-
- 按鈕11
![Page 16: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/16.jpg)
![Page 17: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/17.jpg)
![Page 18: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/18.jpg)
![Page 19: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/19.jpg)
![Page 20: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/20.jpg)
![Page 21: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/21.jpg)
![Page 22: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/22.jpg)
![Page 23: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/23.jpg)
![Page 24: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/24.jpg)
![Page 25: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/25.jpg)
![Page 26: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/26.jpg)
![Page 27: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/27.jpg)
![Page 28: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/28.jpg)
![Page 29: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/29.jpg)
![Page 30: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/30.jpg)
![Page 31: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/31.jpg)
![Page 32: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/32.jpg)
![Page 33: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/33.jpg)
![Page 34: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/34.jpg)
![Page 35: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/35.jpg)
![Page 36: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/36.jpg)
![Page 37: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/37.jpg)
![Page 38: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/38.jpg)
![Page 39: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/39.jpg)
![Page 40: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/40.jpg)
![Page 41: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/41.jpg)
![Page 42: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/42.jpg)
![Page 43: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/43.jpg)
![Page 44: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/44.jpg)
![Page 45: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/45.jpg)
![Page 46: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/46.jpg)
![Page 47: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/47.jpg)
![Page 48: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/48.jpg)
![Page 49: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/49.jpg)
![Page 50: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/50.jpg)
![Page 51: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/51.jpg)
![Page 52: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/52.jpg)
![Page 53: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/53.jpg)
![Page 54: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/54.jpg)
![Page 55: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/55.jpg)
![Page 56: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/56.jpg)
![Page 57: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/57.jpg)
![Page 58: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/58.jpg)
![Page 59: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/59.jpg)
![Page 60: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/60.jpg)
![Page 61: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/61.jpg)
![Page 62: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/62.jpg)
![Page 63: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/63.jpg)
![Page 64: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/64.jpg)
![Page 65: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/65.jpg)
![Page 66: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/66.jpg)
![Page 67: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/67.jpg)
![Page 68: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/68.jpg)
![Page 69: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/69.jpg)
![Page 70: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/70.jpg)
![Page 71: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/71.jpg)
![Page 72: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/72.jpg)
![Page 73: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/73.jpg)
![Page 74: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/74.jpg)
![Page 75: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/75.jpg)
![Page 76: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/76.jpg)
![Page 77: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/77.jpg)
![Page 78: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/78.jpg)
![Page 79: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/79.jpg)
![Page 80: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/80.jpg)
![Page 81: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/81.jpg)
![Page 82: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/82.jpg)
![Page 83: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/83.jpg)
![Page 84: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/84.jpg)
![Page 85: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/85.jpg)
![Page 86: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/86.jpg)
![Page 87: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/87.jpg)
![Page 88: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/88.jpg)
![Page 89: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/89.jpg)
![Page 90: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/90.jpg)
![Page 91: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/91.jpg)
![Page 92: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/92.jpg)
![Page 93: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/93.jpg)
![Page 94: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/94.jpg)
![Page 95: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/95.jpg)
![Page 96: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/96.jpg)
![Page 97: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/97.jpg)
![Page 98: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/98.jpg)
![Page 99: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/99.jpg)
![Page 100: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/100.jpg)
![Page 101: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/101.jpg)
![Page 102: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/102.jpg)
![Page 103: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/103.jpg)
![Page 104: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/104.jpg)
![Page 105: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/105.jpg)
![Page 106: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/106.jpg)
![Page 107: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/107.jpg)
![Page 108: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/108.jpg)
![Page 109: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/109.jpg)
![Page 110: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/110.jpg)
![Page 111: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/111.jpg)
![Page 112: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/112.jpg)
![Page 113: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/113.jpg)
![Page 114: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/114.jpg)
![Page 115: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/115.jpg)
![Page 116: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/116.jpg)
![Page 117: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/117.jpg)
![Page 118: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/118.jpg)
![Page 119: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/119.jpg)
![Page 120: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/120.jpg)
![Page 121: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/121.jpg)
![Page 122: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/122.jpg)
![Page 123: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/123.jpg)
![Page 124: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/124.jpg)
![Page 125: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/125.jpg)
![Page 126: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/126.jpg)
![Page 127: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/127.jpg)
![Page 128: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/128.jpg)
![Page 129: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/129.jpg)
![Page 130: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/130.jpg)
![Page 131: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/131.jpg)
![Page 132: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/132.jpg)
![Page 133: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/133.jpg)
![Page 134: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/134.jpg)
![Page 135: AIC HA201-TP (2U 24-Bay Storage Server Solution) User ManualHA201-TP User's Manual 7 Chapter 2 Hardware Installation 2 1 Removing and Installing top cover Unscrew the screws.(2pcs](https://reader036.vdocuments.net/reader036/viewer/2022071405/60fb120a8cdb8f0fc425db2e/html5/thumbnails/135.jpg)