information gathering 2

Download Information Gathering 2

If you can't read please download the document

Upload: aero-plane

Post on 16-Apr-2017

5.656 views

Category:

Documents


0 download

TRANSCRIPT

Information Gathering

Michael Johnson

SYS-TSC-VSP

Information Gathering

BIOS / DMI / POST events - overview

IPMI - Generic overview

SEL, SP, IPMI, DMI and register data

How to view gathered data and analysis of events

Types of events available and were to look

Normal / expected output

Explorer, siga, sysreport, mpsreport, msinfo32

Working with non-service processor platforms

A Briefer on BIOS Execution Stages

BIOS = Basic Input Output System

BIOS has certain execution phases, namely checkpoints, defined and reported throughout it's execution life

These checkpoints are both written out onto on a specific port (80,81), and reported at the bottom of POST screen if possible, via LCD display or SP (sp get port80 vX0z) progress is also available through sundiag - current-port80 on the Galaxy SP

Bios is a collection of APIs Its also consists of configratiopn information User selectable varablesAPCI memory information SEL DMI logssets resiters to determine behaviuor powernow and varioius enable/diable options.

BIOS Output (HT Sync Flood Example)

BIOS Stages

Initial jump to Reset Vector

At This stage,first CPU (also called Boot Strap Processor, or

BSP) has a hard-coded jump instruction to a globally_defined

address of F000:FFF0

Boot block execution from Flash ROM

Consists of Initial and extremely efficient code for very early

bring-up of a minimalist system. During this phase:

HyperTransport Devices which are interconnected to CPU0 are

detected and linked to with minimum speed/bandwidth

Available and working DIMM memory which is attached to each North bridge, is arranged in local memory map registers

BIOS Stages cont

Potential Sync Flood is detected and flagged in CMOS

Compressed BIOS binary checksum is verified

Motherboard initialization:

All general chips are initialized

Hyper Transport, North Bridges

South Bridge, Tunnels, etc

Video initialization: Graphic is initialized

USB resource configuration:

South Bridge USB controller is initialized

Attached USB devices are recognized

General Execution Stages of BIOS cont

Option ROM initialization:

All ROM binaries of available PCI devices are sequentially executed

(like SAS, GB LAN, etc.)

System boot initiation:

Execution is handed over to boot loader of an OS on certain Boot device

(Solaris on Disk0)

User-initiated system setup:

Alternate entry into BIOS Setup Page has happened

By pressing F2 during BIOS POST

IPMI

Ipmitool is a utility for interfacing with devices that support the Intelligent Platform Management Interface specification. IPMI is an open standard for machine health, inventory, and remote power control.

Ipmitool can communicate with IPMI-enabled devices through either a kernel driver such as OpenIPMI or over the RMCP (Remote Management Control Protocol) LAN protocol defined in the IPMI specification.

IPMIv2 adds support for encrypted LAN communications and remote Serial-over-LAN functionality.

Ipmitool is found in /usr/sfw/bin on Solaris operating systems.

Ipmitool has to be installed as an option on Linux and Windows.

IPMI v2.0 Architecture

BaseboardSystem Bus

BridgeController

ICMB

Aux. IPMB

RemoteMgmt. Card

SMBus/PCI Mgmt. Bus

BaseboardMgmt.Controller(BMC)I2C/SMBus

SDR, SEL, FRUNV Store

MgmtNetwkCtrlr

LAN

PCIRS-232

MODEM/ Serial

IPMB (I2C)Chassis

FRU SEEPROM

side-bandSystem Interface

SENSORs& controlcircuitry

I2C / SMBus

IPMI Messages

sensors& controlcircuitrySatelliteMgmt.Controller

In BandOut of Band

IPMI Block Diagram V20z:

IPMI

SDR - Sensor Data Repository displays sensor values via i2c

SEL - System Event Log displays the SM BIOS log events

FRU - Field Replaceable Units displays the contents of a FRU PROM chip built onto the component itself. If the device does not have a FRU PROM chip, its values cannot be displayed

URL: http://ipmitool.sourceforge.net/

URL: http://ipmiutil.sourceforge.net/

IPMI Specification Second Generation v2.0

ftp://download.intel.com/design/servers/ipmi/IPMIv2_0rev1_0.pdf

Or of course, man ipmitool

Ipmitool Command Options:

Commands:

rawSend a RAW request and print responsei2cSend I2C master write-read command and print response

lanConfigure LAN channelschassisGet chassis status and set power state

powerShortcut to chassis power commandseventSend pre-defined events to MC

mcManagement controller status and global enablessdrPrint sensor data repository entries and readings

sensorPrint detailed sensor informationfruPrint built-in FRU and scan SDR for FRU locators

selPrint system event log (SEL)pefConfigure platform event filtering (PEF)

solConfigure and connect IPMIv2.0 Serial-over-LANtsolConfigure and connect with Tyan IPMIv1.5 Serial-over-LAN

isolConfigure IPMIv1.5 Serial-over-LANuserConfigure management controller users

channelConfigure management controller channelssessionPrint session information

sunoemOEM commands for sun serverskontronoemOEM Commands for Kontron devices

picmgRun a PICMG/ATCA extended cmdfwumUpdate IPMC using Kontron OEM Firmware Update Mgr

execRun list of commands from filesetSet runtime variable for shell and exec

IPMI Command Options

ipmitool [-chvV] -I lan -H hostname [-p ] [-U ]

[-f ] [-S ]

ipmitool [-chvV] -I lanplus -H hostname [-p ] [-U

] [-f ] [-S ]

ipmitool [-chvV] [-S ] -I bmc

-c Present output in CSV (comma separated variable) format.

-f

-v Increase verbose output level. Each instance increases verbosity

-V Display version information.

IPMI Shell Command

An alternative to this is to use the provided shell interface to issue repeated commands that will all use the same automatically generated cache. The shell interface is not available with all platforms and sometimes it is more advantageous to use the cache method (SDR), but if you are going to be analyzing the SEL chances are you will want to issue multiple commands and the shell interface makes this much easier with command history and editing.

ipmitool -I lanplus -H IPADDR -U root -P changeme shell

ipmitool> sel elist

100 | Pre-Init Time-stamp | Entity Presence ps1.prsnt | Device Absent

200 | Pre-Init Time-stamp | Entity Presence io.f0.prsnt | Device Absent

300 | Pre-Init Time-stamp | Power Supply ps0.vinok | State Asserted

NOTE: In order to improve readability and avoid repeating useless command line arguments all further examples will assume that the shell interface is being used and that an appropriate session is already established either over LAN interface or using KCS interface and an OS kernel driver.

IPMI Sel Elist for X4450

Typically and uncorrectable or correctable memory error is reported as the following:

0 | Pre-Init Time-stamp | Memory | Correctable Error | Asserted | CPU 0 DIMM(PAIR) 0

0 | Pre-Init Time-stamp | Memory | Uncorrectable Error | Asserted | CPU 0 DIMM(PAIR) 0

Note the updates in firmware to include the term (PAIR) to aid misconception in FRU part.

The BIOS is responsible for DIMM ECC handling. When a CE/UE error occurs, the chipset will generate a SMI, the BIOS will detect it and send a SEL event to the BMC. The IPMI spec. defines the format of SEL for DIMM ECC events. In addition, since the UE may cause the system hang or reset, the BIOS checks the UE status bit during early post stage and fires a SEL event to BMC as appropriate.

Make sure the system is running the latest firmware. Earlier firmware reported as follows:

19 | 02/14/2008 | 19:53:51 | Memory #0x7a | Correctable ECC | Asserted

Entities

An important foundation for sensors and events and FRUs is the concept of entities. In IPMI every sensor is assigned an entity ID and instance, which at its most basic level is a classification system that helps define what device type and number a sensor monitors. These are closely related to the concept of a Field Replaceable Unit in that a particular entity ID and instance can be used to describe a FRU.

An entity ID is mapped to a physical device through a table defined in the IPMI specification. For our purposes the following entities are used on the Sun Fire X4000 platform:

Their primary usefulness is for grouping and querying sensors, and in fact they come invery handy with ipmitool to do specific sensor queries.

Entities cont

For example to see all power supply related sensors:

ipmitool> sdr entity 10

ps0.prsnt | 1Ch | ok | 10.0 | Device Present

ps0.pwrok | 1Dh | ok | 10.0 | State Deasserted

ps0.vinok | 1Eh | ok | 10.0 | State Asserted

ps1.prsnt | 1Fh | ok | 10.1 | Device Present

ps1.pwrok | 20h | ok | 10.1 | State Deasserted

ps1.vinok | 21h | ok | 10.1 | State Deasserted

As you can see above the Entity ID and Instance is provided in the sdr elist command output in the form:

..

Entities cont

Entities can also be used for entity lookups whereby you can associate a sensor with other sensors by finding all other sensors with the same entity ID and instance as the one you are looking up. This allows you to associate a sensor (and event).

Sensor ID : ft0.fm2.f1.speed (0x49)

Entity ID : 29.2

Sensor Type (Analog) : Fan

Sensor Reading : 7000 (+/- 0) RPM

Status : ok

Lower Non-Recoverable : 3000.000

Lower Critical : na

Lower Non-Critical : na

Upper Non-Critical : na

Upper Critical : na

Upper Non-Recoverable : 22000.000

Assertions Enabled : lnr- unr+

Deassertions Enabled : lnr- unr+

SEL Record ID : 1002

Record Type : 02

Timestamp : 03/21/2007 06:08:09

Generator ID : 0020

EvM Revision : 04

Sensor Type : Fan

Sensor Number : 49

Event Type : Threshold

Event Direction : Assertion Event

Event Data (RAW) : 541a1e

Trigger Reading : 2600.000 RPM

Trigger Threshold : 3000.000 RPM

Description : Lower Non-recoverable going low

Threshold

Threshold (or analog) sensors are used for inputs like temperature, voltages, and fan speeds. These are connected to a sensor chip and read by the service processor using I2C. They can have both upper and lower thresholds set and multiple different events can be configured around each of them.

There can be zero or more of the following thresholds configured as upper or lower bounds for each threshold sensor:

* Non-Critical* Critical* Non-Recoverable

These are commonly represented in short form with ipmitool, here is how the short names map to the longer names

Each threshold can also have both a direction and a status flag attached to it. The use of a "+" or "-" is used to indicate whether that particular threshold applies to a reading that is going high or going low, respectively. The status flag indicates whether the event is generated is an assertion or a deassertion event. Each of these can be configured independently by setting various bits in the sensor data record.

Threshold cont

Example Thresholds:

Use eample of evnt on 129.156.212.177

f02 | 03/21/2007 | 06:08:09 | System Firmware Progress | Motherboard initialization | Asserted1002 | 03/21/2007 | 06:08:09 | Fan ft0.fm2.f1.speed | Lower Non-recoverable going low | Reading 2600 < Threshold 3000 RPM1102 | 03/21/2007 | 06:08:10 | System Firmware Progress | Video initialization | Asserted1202 | 03/21/2007 | 06:08:11 | Fan ft0.fm1.f0.speed | Lower Non-recoverable going low | Reading 2600 < Threshold 3000 RPM1302 | 03/21/2007 | 06:08:14 | Fan ft0.fm2.fail | Predictive Failure Asserted1402 | 03/21/2007 | 06:08:15 | Fan ft0.fm2.f1.speed | Lower Non-recoverable going high | Reading 7200 > Threshold 3000 RPM1502 | 03/21/2007 | 06:08:15 | Fan ft0.fm1.fail | Predictive Failure Asserted1602 | 03/21/2007 | 06:08:16 | Fan ft0.fm1.f0.speed | Lower Non-recoverable going high | Reading 7100 > Threshold 3000 RPM

Discrete Sensors

While threshold sensors represent analog readings, discrete sensors can be considered to represent digital readings. There are different types of discrete sensors as well:

Generic Discrete: these sensors can have various flags from a generic pre-defined table. None of these sensors are used on this platform.

Digital Discrete: similar to the above but are for sensors that only represent one of two possible states. Think of a GPIO pin where the state is either 1 or 0. There are many sensors of this type used on this platform, everything from presence detection sensors to LED state sensors.

Sensor-Specific Discrete: similar to the first type these sensor types have any number of flags defined depending on the sensor type. This makes it possible to represent a large number of sensors, each with different states. There is only one of these types of sensor on this platform and it belongs to the Chassis Intrusion sensor.

BIOS Error Handling and Reporting : Correctable Errors DMI Log

Sync Flooding is a HyperTransport method used to stop data propagation in the case of a serious error

Device that detects the error initiates sync flood packets

All other HT devices cease operation

They transmit sync flood packets out of all HT links

Packets finally reach the platform South Bridge (AMD8111/NF2200/3400)

BIOS has Pre-programmed SB to trigger system RESET signal, when sync flood is detected. System reboots

During Boot Block and POST, BIOS analyzes related error bits in all HT Nodes, reports of Sync Flood reasons

BIOS detects some of the error sources, but not all of them

BIOS Error Handling & Reporting Overview

Error Handling and Reporting Uncorrectable ECC Errors

Rev.F BIOS Error Handling Framework

General Conditions:

1- Two lines of SEL Log will be reported for each Node with valid fault and MCA address

2- One line of SEL Log will be reported for each node with evidence of a generic sync flood and no valid address

3- If a node has detected CRC errors on incoming HT links...

Link# will be appended after "FEDBADF00D" (indicator of a CRC fed to HT link)...

"000201" comes in four least significant nibbles (00[future link4, zero for now][link2][link1][link0])

each nibble content represents:

0 = no CRC Error was received on this link

1 = incoming Byte0 Link received the CRC error

2 = incoming Byte1 Link received the CRC Error

3 = incoming Bytes0+1 Links both saw CRC Errors on incoming links

Data Gathering Tools

Windows MPSreports

Solaris - Explorer

Redhat -sysreport

Suse Siga, supportconfig

Windows MPSreports

Microsoft Premier Field Engineering (PFE) Reporting Utility

Supported Operating Systems:

Windows NT 4.0

Windows 2000

Windows XP

Windows Server 2003 x86

Windows Server 2003 x64 (AMD64)

Windows Server 2003 for Itanium

Windows MPSreports cont

The PFE version of MPS_REPORTS gathers a wide range of diagnostic information from Windows and limited information for server applications installed such as SQL or Exchange.

The user running the utility must have administrative privileges on the system.

During runtime there are uncompressed text files that could possibly consume 100 MB or more or more of disk space, depending on the size of the event logs. Most of the space is freed when the reporting tool finishes, leaving behind only the tools folder and the .cab file, which can be deleted if no longer needed.

Windows MPSreports

Microsoft Product Support Reporting Tool

Supported Operating Systems:

Windows 2000

Windows NT

Windows Server 2003

Windows XP

There are 8 specialty versions, one for each of the following support scenario categories:

Alliance, Directory Services (not for Windows NT 4.0), Networking, Clustering, SQL, Software Update Services, MDAC and Base/Setup/Storage/Print/Performance.

Each version gathers some of the same basic information but there are specific reports unique to each of the support scenario categories.

Please read the readme.txt files for more details about each version.

Windows MPSreports cont

The Microsoft Product Support Reporting Tool facilitates the gathering of critical system and logging information used in troubleshooting support issues. The reporting tool DOES NOT make any registry changes or modifications to the operating system.

Files created are zipped CAB files cabextract can

be used on Solaris to extract.

Microsoft Product Support Reporting Tool

Cabextract website for Solaris and other packages.

/usr/local/gnu/bin/cabextract - - help

Windows Utilities msconfig

The System.ini file

The Win.ini file

The Boot.ini file

Programs that are set to load during the startup process

Environment settings

International settings

Description of Windows XP System Information (Msconfig.exe) Tool

Windows Utilities msinfo32

Hardware Resources

Components

Software Environment

Applications

Internet Explorer

Description of Windows XP System Information (Msinfo32.exe) Tool

Windows Event Viewer

The Event Logs for Windows include:

System Event Log

Security Event Log

Application Event Log

Directory Service

File Replication Service

Other Windows Utilities

System Monitor - perfmon.exe

Ipconfig Dosutil network information

Device Manager / Control Panel

Network Sniffers Ethereal Wireshark

Dumps - three memory dump file types:

Complete memory dump

Kernel memory dump

Small memory dump (64 KB)

Overview of memory dump file options for Windows Server 2003 and XP

Windows DRAM Limitations

http://msdn.microsoft.com/en-us/library/aa366778(VS.85).aspx

Distribution32-bit64-bit

Windows Server 2003 SP2, Datacenter128GB4TB

Windows Server 2003 SP2, Enterprise Edition 8GB2TB

Windows Storage Server 2003, Enterprise Edition8GB-

Windows Storage Server 20034GB-

Windows Server 2003 R2 Datacenter Edition128GB1TB

Windows Server 2003 SP1, Datacenter Edition128GB1TB

Windows Server 2003 R2 Enterprise Edition64GB1TB

Windows Server 2003 SP1, Enterprise Edition64GB1TB

Windows Server 2003, Standard Edition4GB16GB

Solaris Explorer

A new version has been released 5.8 it is available

from the sun download site

Sun Explorer Data Collector version 5.8

Supported Platforms

Solaris 7/8/9/10

SPARC/Solaris 9/10

X86 Platform

Solaris Explorer

The following describes the changes made in Sun Explorer 5.8:

New Features

Explorer 5.8 now collects /usr/sbin/smbios output. (6412712)

Modified Features

Now supports transport target with HTTP protocol in addition to HTTPS protocol.

Because some companies have problems using HTTPS internally, such as in the DMZ net, Explorer supports the use of HTTP from the DMZ to the company's own HTTP endpoint. (6471251)

Explorer 5.8 restores the ability to gather system controller data via telnet with the scextended and 1280extended modules.

A defect in Explorer 5.7 caused these telnet connections to be blocked unless an ssh connection was made to accept the host key, before running Explorer. (6521236, 6528445)

Explorer Modules ilomextended

Explorer -if -w

Collects remote Integrated Lights Out Manager

(ILOM) Intelligent Platform Management

Interface (IPMI) data from Galaxy systems.

Commands Collected

/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} mc info/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} mc getenables

/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} chassis poh/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} chassis power status

/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} fru print/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} pef status

/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} pef list/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} sdr list full

/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} sel info/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} sel elist

/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} sensor list/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} user summary

/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} user list/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} sunoem led get

/usr/sfw/bin/ipmitool -H {host} -p {port} -U {user} -f {pwfile} chassis restart_cause

Explorer Modules ipmi

Collects local Intelligent Platform Management Interface (IPMI) data on x86 platform.

The following commands are collected:

/usr/sfw/bin/ipmitool chassis status/usr/sfw/bin/ipmitool chassis poh

/user/sfw/bin/ipmitool chassis power status/user/sfw/bin/ipmitool chassis restart_cause

/usr/sfw/bin/ipmitool fru/user/sfw/bin/ipmitool fru print

/user/sfw/bin/ipmitool mc getenables/user/sfw/bin/ipmitool mc info

/usr/sfw/bin/ipmitool pef status/usr/sfw/bin/ipmitool pef list

usr/sfw/bin/ipmitool sel info /usr/sfw/bin/ipmitool sel elist

/user/sfw/bin/ipmitool sdr enlist full /usr/sfw/bin/ipmitool sdr list all info

/user/sfw/bin/ipmitool sensor list /user/sfw/bin/ipmitool sunoem led get

Griffon Knowledge Engine

Sun Alerts and FINS checked against

Recommended Patches for Solaris 7, 8, 9 and 10 (sparc) (x86) checked against

Firmware Patches checked against

ASP Group Patches checked against

BADPATCH/WITHDRAWN Patches checked against

Griffin Knowledge Engine Home page Griffon System Analysis Job Submission Form

Features of the Griffon System Analysis Job Submission Form

Explorer files may be referenced via a network path.

Provides a high-level matrix to highlight critical problem areas.

Individual System Analysis Reports

Container files of hostids to group results

Hostid references utilize the current Explorer file Proactive servers

Patch references are linked to external SunSolve.

Sun Gathering Debug Data (Sun GDD)

Tools provide the right approach to problem resolution by leveraging proactive actions and best practices to help you gather the required debug data needed for further analysis.

For each product covered, GDD tools provide documentation and scripts which detail the relevant data the Sun Technical Support Center requires for analyzing your problem

Sun Gathering Debug Data (Sun GDD)

The tools gather 90% of the debug data frequently requested by the Sun Technical Support Center - including data for more common critical situations including memory, start/stop, installation, hang, and crash issues. By collecting this data before you initiate a Service Request, you can substantially reduce the time needed to analyze and resolve the problem.

Sun Java System Calendar Server

Sun Java System Directory Editor

Sun Java System Directory Proxy Server

Sun Java System Directory Server

Sun Java System Messaging Server

Sun Java System Portal Server

Sun Java System Web Proxy Server 3.6 Service Pack 9

Sun Java System Web Server

External GDD sun website

Sun Gathering Debug Data (GDD) (WZT-0253) - Web based

SUSE supportconfig

Detailed system information and logs are collected and organized in a manner that helps reduce service request resolution times. Private system information can be disclosed when using this tool. If this is a concern, please prune private data from the log files. Several startup options are available to exclude more sensitive information. Refer to the man page to see these options.

Output format is better the SIGA scripts, however needs to be installed from the SUSE web site.

Supportconfig Installation

The script has been renamed from config.sh to supportconfig. It has been repackaged as an RPM for easy installation and updating. There is now a supportconfig man page.

1. Download supportconfig-2.13-0.4.noarch.rpm

2. Install the RPM rpm -Uvh supportconfig-2.13-0.4.noarch.rpm

3. Run the supportconfig command as root

4. To view the information files, assuming the tarball filename is:

/var/log/nts_host1_070406_1300.tar.bz2

cd /var/log

tar jxf nts_host1_070406_1300.tar.bz2

cd nts_host1_070406_1300

ls -l

VMware

ESX Server includes a script called vm-support, which collects information that VMware support might request and packages that information into one file.

To make sure you have version 1.14 or later of the script Version 1.14 (and later) of the vm-support script produces more diagnostic information than previous versions of the script.

ESX Server 3.x includes a version of vm-support newer than 1.14. You don't need to check the version of this script or update it.

ESX Server 2.x might include a version earlier than 1.14. To see which version is installed on your system, run the command with no options. For example:

[user@esx2host]$ cd /tmp

[user@esx2host]$ /usr/bin/vm-support

VMware ESX Server Support Script 0.94

Preparing Files: [Ctrl+C to cancel]

If you have a version earlier than 1.14, follow the instructions later in this presentation.

VMware cont

To collect diagnostic information using the script:

1. Log on to the service console as root

2. Confirm what version of VMware ESX Server you are running:

[root@esxhost]# vmware -v

3. Change to the directory where you want the output to appear. For example:

[root@esxhost]# cd /tmp

If you run the script in /usr/bin, the output will appear in that directory and remain there until you delete it

4. Run the script:

[root@esxhost]# /usr/bin/vm-support

You don't need to power off your virtual machines before running this script

For ESX Server performance issues, VMware Technical Support might ask you to collect performance snapshots using the the -s and -S switches. Please refer to:

http://kb.vmware.com/kb/1967 for more information

5. When the script finishes, it informs you of the output filename and location

VMware Upgrading vm-support

Download vm-support script and move it to the /tmp directory on the service console of the ESX Server system

Make a backup copy of your existing script:

[root@esxhost]# cp /usr/bin/vm-support /usr/bin/vm-support.old

Extract the archive to /usr/bin/, replacing the original vm-support script:

[root@esxhost]# cd /usr/bin

[root@esxhost]# tar xvzf /tmp/653_fvm-support_114.tgz

The archive places the vm-support script in the current directory.

Note: When running on an older version of ESX Server, the updated script might report errors about missing commands. This is normal.

Supports Versions:

VMware ESX Server 2.0.x VMware ESX Server 2.1.x VMware ESX Server 2.5.x VMware ESX Server 3.0.x

Various Other Tools Available

Dmidecode Available Solaris Linux later versions

Biosdecode Available Linux part of dmidecode 2.9

Smbios Solaris

nvidia-bug-report.sh Graphics 'X' data gather

Kstat Solaris Various information gathered

Scanpci Linux Solaris

Pcitweak Linux (SuSe) Solaris

Setpci Linux Redhat

Working With Non-SP Machines

You have no IPMI data

You are likely to have no DMI logs either

Some systems will pause on boot to indicate syncflood and other serious events

Platform messages

PCI.exe could gather northbridge registers

POST codes POST LEDs

What does occur LEDS, fans Graphics

Lab

Connecting To Remote Platforms

Ultra 24vi64-u24a-kvm-gmp03.uk

X2100/X2200va64-x2100g-sp-gmp03.uk

X4100/X4200Not available

X4150Should be available by 12pm

X4600va64-x4600a-sp-gmp03.uk

Collect ipmi SEL, SDR and FRU data in/out of band

Thank you

Michael Johnson

[email protected]

Sun Microsystems, Inc.

Page

Click to edit the title text format

Click to edit the outline text format

Second Outline Level

Sun Confidential: Internal Only

Click to edit the notes format

Page

Click to edit the title text format

Presenters Name

Presenters Title

Presenters Company

Click to edit the notes format

Page

Entity ID Description Entity ID Description 3Processor4Disk or Disk Bay6ServiceProcessor7System Board10Power Supply12Front Panel 13Back Panel 15Disk Drive Backplane19Power Distribution Board23System Chassis29Fan Device31Cable/Interconnect32Memory Device 34BIOS

???Page ??? (???)11/09/2007, 10:14:39Page / Short ThersholdShort ThresholdUnrUpper Non-RecoverableLncLower Non-CriticalUcrUpper CriticalLcrLower CriticalUncUpper Non-CriticalLnrLower Non-Recoverable

???Page ??? (???)10/09/2007, 22:54:41Page /