Home
Dell OpenManage Server Administrator Version 5.4 Messages Reference Guide
Contents
1. 50 Intrusion Events 51 BIOS Generated System Events 52 R2 Generated System Events 55 Cable Interconnect Events 55 Battery Events 55 Entity Presence Events 56 4 Storage Management Message Reference 57 Alert Monitoring and Logging 57 Alert Message Format with Substitution Variables 57 Alert Message Change History 60 Alert Descriptions and Corrective Actions 64 Index 135 Introduction 5 Introduction Dell OpenManage Server Administrator produces event messages stored primarily in the operating system or Server Administrator event logs and sometimes in SNMP traps This document describes the event messages created by Server Administrator version 5 3 or later and displayed in the Server Administrator Alert log Server Administrator creates events in response to sensor status changes a
2. 5 Understanding Event Messages 6 Sample Event Message Text 7 Viewing Alerts and Event Messages 7 Viewing Events in Windows 2000 Advanced Server and Windows Server 2003 8 Viewing Events in Red Hat Enterprise Linux and SUSE Linux Enterprise Server 8 Viewing the Event Information 9 Understanding the Event Description 10 2 Event Message Reference 13 Miscellaneous Messages 13 Temperature Sensor Messages 15 Cooling Device Messages 18 Voltage Sensor Messages 19 Current Sensor Messages 22 Chassis Intrusion Messages 25 Redundancy Unit Messages 26 Power Supply Messages 29 Memory
3. Clear Alert Number None Related Alert Number 2266 LRA Number 2060 753 2328 The NVRAM has corrupt data Warning Non critical Cause The NVRAM has corrupt data The controller is unable to correct the situation Action Replace the controller Clear Alert Number None Related Alert Number None LRA Number 2060 753 2329 SAS port report 1 Warning Non critical Cause The text for this alert is generated by the controller and can vary depending on the situation The 1 indicates a substitution variable The text for this substitution variable is generated by the controller and is displayed with the alert in the Alert Log This text can vary depending on the situation Action Make sure the cables are attached securely If the problem persists replace the cable with a valid cable according to SAS specifications If the problem still persists you may need to replace some devices such as the controller or EMM See the hardware documentation for more information Clear Alert Number None Related Alert Number None LRA Number 2060 753 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 125 2330 SAS port report 1 Ok Normal Cause This alert is for informational purposes The 1 indicates a substitution variable The text for this subst
4. Related Alert Number None LRA Number None 1151 2213 Recharge count maximum exceeded Warning Non critical Cause The battery has been recharged more times than the battery recharge limit allows Action Replace the battery pack Clear Alert Number None Related Alert Number None LRA Number 2100 1153 2214 Battery charge in progress OK Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1151 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 104 Storage Management Message Reference 2215 Battery charge process interrupted OK Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1151 2232 The controller alarm is silenced Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2233 The background initialization BGI rate has changed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2234 The Patrol Read rate has changed Ok Normal Cause This alert is
5. Cause You are attempting to rebuild data that resides on a defective disk Action Replace the source disk and restore from backup Clear Alert Number None Related Alert Number 2195 2346 LRA Number 2071 904 2348 The rebuild failed due to errors on the target physical disk Critical Failure Error Cause You are attempting to rebuild data on a disk that is defective Action Replace the target disk If a rebuild does not automatically start after replacing the disk initiate the Rebuild task You may need to assign the new disk as a hot spare to initiate the rebuild Clear Alert Number None Related Alert Number 2195 2346 LRA Number 2071 904 2349 A bad disk block could not be reassigned during a write operation Critical Failure Error Cause A write operation could not complete because the disk contains bad disk blocks that could not be reassigned Data loss may have occurred and data redundancy may also be lost Action Replace the disk Clear Alert Number None Related Alert Number 2346 LRA Number 2071 904 2350 There was an unrecoverable disk media error during the rebuild Critical Failure Error Cause The rebuild encountered an unrecoverable disk media error Action Replace the disk Clear Alert Number None Related Alert Number 2095 2273 LRA Number 2071 904 2351 A physical disk is marked as missing Ok No
6. Virtual disk format completed 73 Virtual disk format started 68 Virtual disk has inconsistent data 112 Virtual disk initialization 87 Virtual disk initialization cancelled 70 Virtual disk initialization completed 73 Virtual disk initialization failed 71 Virtual disk initialization started 69 Virtual disk rebuild completed 73 Virtual disk rebuild failed 72 Virtual disk rebuild started 69 Virtual disk reconfiguration completed 73 Virtual disk reconfiguration failed 72 Virtual disk reconfiguration started 69 Virtual Disk Redundancy has been degraded 133 Virtual disk renamed 92 voltage sensor 6 Voltage sensor detected a failure value 21 45 Voltage sensor detected a non recoverable value 21 Voltage sensor detected a warning value 20 Voltage Sensor Events 44 Voltage sensor has failed 19 45 voltage sensor messages 19 44 Voltage sensor returned to a normal value 20 Voltage sensor value unknown 20 45 148 Index 148 Index
7. Error A fan enclosure sensor in the specified system detected an error from which it cannot recover The sensor location and chassis location are provided Table 2 11 AC Power Cord Messages Event ID Description Severity Cause 1500 AC power cord sensor has failed Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Information An AC power cord sensor in the specified system failed The AC power cord status cannot be monitored The sensor location and chassis location information are provided 1501 AC power cord is not being monitored Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Information The AC power cord status is not being monitored This occurs when a system s expected AC power configuration is set to nonredundant The sensor location and chassis location information are provided Table 2 10 Fan Enclosure Messages continued Event ID Description Severity Cause Event Message Reference 35 Hardware Log Sensor Messages Hardware logs provide hardware status messages to systems management software On certain systems the hardware log is implemented as a circular queue When the log becomes full the oldest status messages are overwritten when new status messages are logged On some systems the log is not circular On these systems when the log becomes full subsequent hardware status messages are lost Hardware log sen
8. Failure Error Cause A physical disk included in the virtual disk has failed or a user has cancelled the initialization Action If a physical disk has failed then replace the physical disk Clear Alert Number None Related Alert Number None LRA Number 2081 1204 2080 Physical disk initialize failed Critical Failure Error Cause The physical disk has failed or is corrupt Action Replace the failed or corrupt disk You can identify a disk that has failed by locating the disk that has a red X for its status Restart the initialization Clear Alert Number None Related Alert Number None LRA Number 2071 904 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 72 Storage Management Message Reference 2081 Virtual disk reconfiguratio n failed Critical Failure Error Cause A physical disk included in the virtual disk has failed or is corrupt A user may also have cancelled the reconfiguration Action Replace the failed or corrupt disk You can identify a disk that has failed by locating the disk that has a red X for its status If the physical disk is part of a redundant array then rebuild the physical disk When finished restart the reconfiguration Clear Alert Number None Related Alert Number None LRA Number 2081 1204 2082 Virtual disk re
9. Warning Non critical Cause The battery may be recharging the room temperature may be too hot or the fan in the system may be degraded or failed Action If this alert was generated due to a battery recharge the situation will correct when the recharge is complete You should also check if the room temperature is normal and that the system components are functioning properly Clear Alert Number 2172 Related Alert Number None LRA Number 2100 1153 2172 The controller battery temperature is normal Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2172 is a clear alert for alert 2171 Related Alert Number None LRA Number None 1151 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 96 Storage Management Message Reference 2173 Unsupported configuration detected The SCSI rate of the enclosure management modules EMMs is not the same EMM0 1 EMM1 2 Warning Non critical Cause The EMMs in the enclosure have a different SCSI rate This is an unsupported configuration All EMMs in the enclosure should have the same SCSI rate The percent sign indicates a substitution variable The text for this substitution variable is displayed with the alert in the Alert Log and can vary depending on the situation Action The EMMs in
10. 2090 2100 753 803 853 903 953 1003 1053 1103 1153 1203 2266 Controller log file entry 1 Ok Normal Cause This alert is for informational purposes The 1 indicates a substitution variable The text for this substitution variable is generated by the controller and is displayed with the alert in the Alert Log This text can vary depending on the situation Action None Clear Alert Number None Related Alert Number None LRA Number None 751 801 851 901 951 1001 1051 1101 1151 1201 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 109 2267 The controller reconstruct rate has changed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2268 1 Storage Management has lost communicatio n with the controller An immediate reboot is strongly recommended to avoid further problems If the reboot does not restore communicatio n then contact technical support for more information Critical Failure Error Cause Storage Management has lost communication with a controller This may occur if the controller driver or firmware is experiencing a problem The 1 indicates a substitution variable The text for this substi
11. 7 Hardware Log Sensor Events 49 hardware log sensor messages 49 Hot spare SMART polling failed 113 I Intrusion Events 51 intrusion messages 51 L Log backup created 13 Log monitoring has been disabled 36 51 Log size is near or at capacity 36 Log size returned to a normal level 36 Log status is unknown 36 51 Log was cleared 13 M Maximum temperature probe warning threshold value changed 91 Memory device ECC Correctable error count crossed a warning threshold 32 Memory device ECC Correctable error count sensor crossed a failure threshold 32 memory device messages 32 Memory device monitoring has been disabled 32 Memory ECC Events 48 memory ecc messages 48 Memory Events 49 memory modules messages 49 memory prefailure sensor 6 messages AC power cord 34 50 battery 55 battery sensor 40 bios generated system 52 BMC watchdog 48 cable interconnect 55 chassis intrusion 25 cooling device 18 current sensor 22 drives 50 entity presence 56 fan enclosure 33 fan sensor 45 hardware log sensor 49 intrusion 51 memory device 32 memory ecc 48 memory modules 49 miscellaneous 13 pluggable device 39 52 power supply 29 47 processor sensor 37 processor status 46 r2 generated system 55 redundancy unit 26 storage management 64 temperature sensor 15 43 voltage sensor 19 44 Minimum temperature probe warning threshold value cha
12. Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 80 Storage Management Message Reference 2110 SMART warning degraded Warning Non critical Cause A disk is degraded and has received a SMART alert predictive failure The disk is likely to fail in the near future Action Replace the disk that has received the SMART alert If the physical disk is a member of a non redundant virtual disk then back up the data before replacing the disk NOTICE Removing a physical disk that is included in a non redundant virtual disk will cause the virtual disk to fail and may cause data loss Clear Alert Number None Related Alert Number None LRA Number 2070 903 2111 Failure prediction threshold exceeded due to test No action needed Warning Non critical Cause A disk has received a SMART alert predictive failure due to test conditions Action None Clear Alert Number None Related Alert Number None LRA Number 2070 903 2112 Enclosure was shut down Critical Failure Error Cause The physical disk enclosure is either hotter or cooler than the maximum or minimum allowable temperature range Action Check for factors that may cause overheating or excessive cooling For example verify that the enclosure fan is working You should also check the thermostat settings and examine whether the enclosure is located near a heat source Make sure the
13. Non critical Cause The controller is not able to communicate with a disk that is assigned as a dedicated hot spare The disk may have been removed There may also be a bad or loose cable Action Check if the disk is healthy and that it has not been removed Check the cables If necessary replace the disk and reassign the hot spare Clear Alert Number None Related Alert Number 2048 LRA Number 2070 903 2202 A global hot spare has been removed Ok Normal Cause The controller is unable to communicate with a disk that is assigned as a global hot spare The disk may have been removed There may also be a bad or loose cable Action Check if the disk is healthy and that it has not been removed Check the cables If necessary replace the disk and reassign the hot spare Clear Alert Number None Related Alert Number None LRA Number None 901 2203 A dedicated hot spare failed Warning Non critical Cause The controller is unable to communicate with a disk that is assigned as a dedicated hot spare The disk may have failed or been removed There may also be a bad or loose cable Action Check if the disk is healthy and that it has not been removed Check the cables If necessary replace the disk and reassign the hot spare Clear Alert Number None Related Alert Number 2048 LRA Number 2070 903 Table 4 4 Storage Management Messages continued Event ID Description Seve
14. None Clear Alert Number None Related Alert Number None LRA Number None 951 2297 An EMM has been removed Critical Failure Error Cause An EMM has been removed Action Replace the EMM See the hardware documentation for information on replacing the EMM Clear Alert Number None Related Alert Number None LRA Number 2091 954 2298 There is a bad sensor on an enclosure Warning Non critical Cause The enclosure has a bad sensor The enclosure sensors monitor the fan speeds temperature probes etc Action See the hardware documentation for more information Clear Alert Number None Related Alert Number None LRA Number 2090 853 2299 Bad PHY 1 Critical Failure Error Cause There is a problem with a physical connection or PHY The 1 indicates a substitution variable The text for this substitution variable is displayed with the alert in the Alert Log and can vary depending on the situation Action Contact Dell technical support Clear Alert Number None Related Alert Number None LRA Number 2091 854 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 117 2300 The enclosure is unstable Critical Failure Error Cause The controller is not receiving a consistent response from the enclosure Th
15. Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 127 2337 The controller is unable to recover cached data from the battery backup unit BBU Critical Failure Error Cause The controller was unable to recover data from the cache Action Check if the battery is charged and in good health When the battery charge is unacceptably low it cannot maintain cached data Check if the battery has reached its recharge limit The battery may need to be recharged or replaced Clear Alert Number None Related Alert Number None LRA Number 2101 1154 2338 The controller has recovered cached data from the BBU Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1151 2339 The factory default settings have been restored Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2340 The BGI completed with uncorrectable errors Critical Failure Error Cause The BGI task encountered errors that cannot be corrected The virtual disk contains physical disks that have unusable disk space or disk errors that cannot be corrected Action Replace the physical di
16. Table 2 12 Hardware Log Sensor Messages Event ID Description Severity Cause 1550 Log monitoring has been disabled Log type lt Log type gt Information A hardware log sensor in the specified system is disabled The log type information is provided 1551 Log status is unknown Log type lt Log type gt Information A hardware log sensor in the specified system could not obtain a reading The log type information is provided 1552 Log size is no longer near or at capacity Log type lt Log type gt Information The hardware log on the specified system is no longer near or at its capacity usually as the result of clearing the log The log type information is provided 1553 Log size is near or at capacity Log type lt Log type gt Warning The size of a hardware log on the specified system is near or at the capacity of the hardware log The log type information is provided 1554 Log size is full Log type lt Log type gt Error The size of a hardware log on the specified system is full The log type information is provided 1555 Log sensor has failed Log type lt Log type gt Error A hardware log sensor in the specified system failed The hardware log status cannot be monitored The log type information is provided Event Message Reference 37 Processor Sensor Messages Processor sensors monitor how well a processor is functioning Processor messages listed in Table 2 13 provide
17. enclosure has enough ventilation and that the room temperature is not too hot or too cold See the enclosure documentation for more diagnostic information Clear Alert Number None Related Alert Number None LRA Number 2091 854 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 81 2114 A consistency check on a virtual disk has been paused suspended Ok Normal Cause The check consistency operation on a virtual disk was paused by a user Action To resume the check consistency operation right click the virtual disk in the tree view and select Resume Check Consistency Clear Alert Number 2115 Related Alert Number None LRA Number None 1201 2115 A consistency check on a virtual disk has been resumed Ok Normal Cause This alert is for informational purposes The check consistency operation on a virtual disk has resumed processing after being paused by a user Action None Clear Alert Status Alert 2115 is a clear alert for alert 2114 Related Alert Number None LRA Number None 1201 2116 A virtual disk and its mirror have been split Ok Normal Cause This alert is for informational purposes A user has caused a mirrored virtual disk to be split When a virtual disk is mirrored its data is copied to another virtual disk in order
18. full redundancy lt Number gt Specifies the number of power supply or cooling devices required to achieve full redundancy for example Number of devices required for full redundancy 4 Possible memory module event cause lt list of causes gt Specifies a list of possible causes for the memory module event for example Possible memory module event cause Single bit warning error rate exceeded Single bit error logging disabled Power Supply type lt type of power supply gt Specifies the type of power supply for example Power Supply type VRM Previous redundancy state was lt State gt Specifies the status of the previous redundancy message for example Previous redundancy state was Lost Previous state was lt State gt Specifies the previous state of the sensor for example Previous state was OK Normal Processor sensor status lt status gt Specifies the status of the processor sensor for example Processor sensor status Configuration error Table 1 2 Event Description Reference continued Description Line Item Explanation 12 Introduction Redundancy unit lt Redundancy location in chassis gt Specifies the location of the redundant power supply or cooling unit in the chassis for example Redundancy unit Fan Enclosure Sensor location lt Location in chassis gt Specifies the location of the sensor in the specified chassis for example Sensor location CPU1 Temperature sen
19. identified with a red X on the enclosure s Health subtab Alternatively you can select the Storage object and click the Health subtab The controller status displayed on the Health subtab indicates whether a controller has a failed or degraded component See the enclosure documentation for information on replacing enclosure components and for other diagnostic information Clear Alert Status 2124 Related Alert Number 2048 LRA Number 2090 1305 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 84 Storage Management Message Reference 2123 Redundancy lost Warning Non critical Cause A virtual disk or an enclosure has lost data redundancy In the case of a virtual disk one or more physical disks included in the virtual disk have failed Due to the failed physical disk or disks the virtual disk is no longer maintaining redundant mirrored or parity data The failure of an additional physical disk will result in lost data In the case of an enclosure more than one enclosure component has failed For example the enclosure may have suffered the loss of all fans or all power supplies Action Identify and replace the failed components To identify the failed component select the Storage object and click the Health subtab The controller status displayed on the Health subtab indicates whether a controller h
20. 1009 Systems Management Data Manager Stopped Information Systems Management Data Manager services were stopped 1011 RCI table is corrupt Warning This message is generated when the BIOS Remote Configuration Interface RCI table is corrupted or cannot be read by the systems management software 1012 IPMI Status Interface lt the IPMI interface being used gt lt additional information if available and applicable gt Information This message is generated to indicate the Intelligent Platform Management Interface IPMI status of the system Additional information when available includes Baseboard Management Controller BMC not present BMC not responding System Event Log SEL not present and SEL Data Record SDR not present Table 2 1 Miscellaneous Messages continued Event ID Description Severity Cause Event Message Reference 15 Temperature Sensor Messages Temperature sensors listed in Table 2 2 help protect critical components by alerting the systems management console when temperatures become too high inside a chassis The temperature sensor messages use additional variables sensor location chassis location previous state and temperature sensor value or state Table 2 2 Temperature Sensor Messages Event ID Description Severity Cause 1050 Temperature sensor has failed Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was l
21. 21 1200 22 1201 22 1202 23 1203 23 1204 24 1205 24 1250 25 1251 25 1252 25 1253 26 1254 26 1255 26 1300 27 1301 27 1302 27 1303 27 1304 28 1305 28 1306 28 1350 29 1351 29 1352 30 1353 30 1354 31 1355 31 1403 32 1404 32 1450 33 1451 33 1452 33 1453 33 1454 34 1455 34 1500 34 1501 34 1502 35 136 Index 136 Index 1503 35 1504 35 1505 35 1550 36 1551 36 1552 36 1553 36 1554 36 1555 36 1600 37 1601 37 1602 37 1603 38 1604 38 1605 38 1650 39 1651 39 1652 39 1653 39 1700 40 1701 40 1702 40 1703 40 1704 41 1705 41 2048 64 2049 65 2050 65 2051 66 2052 66 2053 66 2054 66 2055 66 2056 67 2057 68 2058 68 2059 68 2061 69 2062 69 2063 69 2064 69 2065 69 2067 70 2070 70 2074 70 2076 71 2077 71 2079 71 2080 71 2081 72 2082 72 2083 72 2085 72 2086 73 2088 73 2089 73 2090 73 2091 73 2092 73 2094 74 2095 74 2098 74 2099 75 2100 75 2101 75 2102 76 2103 76 2104 76 2105 76 2106 77 2107 77 2108 78 2109 79 2110 80 2111 80 2112 80 2114 81 2115 81 2116 81 2117 81 2118 82 2120 82 2121 82 2122 83 2123 84 2124 85 2
22. Controller and enclosure names type of communication problem return code and SCSI status Action Check for problems with the cables See the online help for more information on checking the cables You should also check to see if the enclosure has degraded or failed components To do so select the enclosure object in the tree view and click the Health subtab The Health subtab displays the status of the enclosure components Verify that the controller has supported driver and firmware versions installed and that the EMMs are each running the same version of supported firmware Clear Alert Number 2162 Related Alert Number None LRA Number 2090 853 2138 Enclosure alarm enabled Ok Normal Cause This alert is for informational purposes A user has enabled the enclosure alarm Action None Clear Alert Number None Related Alert Number None LRA Number None 851 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 89 2139 Enclosure alarm disabled Ok Normal Cause A user has disabled the enclosure alarm Action None Clear Alert Number None Related Alert Number None LRA Number None 851 2140 Dead disk segments restored Ok Normal Cause This alert is for informational purposes Disk space that was formerly dead or inaccessible t
23. Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 118 Storage Management Message Reference 2303 The enclosure cannot support both SAS and SATA physical disks Physical disks may be disabled Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 851 2304 An attempt to hot plug an EMM has been detected This type of hot plug is not supported Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number 2211 LRA Number None 751 2305 The physical disk is too small to be used for a rebuild Warning Non critical Cause The physical disk is too small to rebuild the data Action Remove the physical disk and insert a new physical disk that is the same size or larger than the disk that is being rebuilt The new physical disk must also use the same technology for example SAS or SATA as the disk being rebuilt If the rebuild does not start automatically after you have inserted a suitable physical disk then run the Rebuild task See the Replacing a Failed Disk section in the Dell OpenManage Server Administrator Storage Management User s Guide for more information Clear Alert Number None Related Alert Number 2326 LRA Number 2070 903 Table 4 4 Storage Management Messages continue
24. Device Messages 32 Fan Enclosure Messages 33 AC Power Cord Messages 34 Hardware Log Sensor Messages 35 Processor Sensor Messages 37 4 Contents Pluggable Device Messages 39 Battery Sensor Messages 40 3 System Event Log Messages for IPMI Systems 43 Temperature Sensor Events 43 Voltage Sensor Events 44 Fan Sensor Events 45 Processor Status Events 46 Power Supply Events 47 Memory ECC Events 48 BMC Watchdog Events 48 Memory Events 49 Hardware Log Sensor Events 49 Drive Events
25. If sensor type is not discrete Current sensor value in Amps lt Reading gt OR Current sensor value in Watts lt Reading gt If sensor type is discrete Discrete current state lt State gt Information A current sensor in the specified system returned to a valid range after crossing a failure threshold The sensor location chassis location previous state and current sensor value are provided 1203 Current sensor detected a warning value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Current sensor value in Amps lt Reading gt OR Current sensor value in Watts lt Reading gt If sensor type is discrete Discrete current state lt State gt Warning A current sensor in the specified system exceeded its warning threshold The sensor location chassis location previous state and current sensor value are provided Table 2 5 Current Sensor Messages continued Event ID Description Severity Cause 24 Event Message Reference 1204 Current sensor detected a failure value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Current sensor value in Amps lt Reading gt OR Current sensor value in Watts lt Reading gt If sensor type is discrete Disc
26. Introduction The location of the event log file depends on the operating system you are using In the Microsoft Windows 2000 Advanced Server and Windows Server 2003 operating systems messages are logged to the system event log and optionally to a unicode text file dcsys32 log viewable using Notepad that is located in the install_path omsa log directory The default install_path is C Program Files Dell SysMgt In the Red Hat Enterprise Linux and SUSE Linux Enterprise Server operating system messages are logged to the system log file The default name of the system log file is var log messages You can view the messages file using a text editor such as vi or emacs NOTE Logging messages to a unicode text file is optional By default the feature is disabled To enable this feature modify the Event Manager section of the dcemdy32 ini file as follows In Windows locate the file at lt install_path gt dataeng ini and set UnitextLog enabled True The default install_path is C Program Files Dell SysMgt Restart the DSM SA Event Manager service In Red Hat Enterprise Linux and SUSE Linux Enterprise Server locate the file at lt install_path gt dataeng ini and set UnitextLog enabled True The default install_path is opt dell srvadmin Issue the etc init d dataeng restart command to restart the Server Administrator event manager service This will also restart the Server Administrator data manager and SNMP s
27. Number None LRA Number 2070 903 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 103 2207 The only hot spare available is a SAS disk SAS disks cannot replace SATA disks Warning Non critical Cause The only physical disk available to be assigned as a hot spare is using SAS technology The physical disks in the virtual disk are using SATA technology Because of this difference in technology the hot spare cannot rebuild data if one of the physical disks in the virtual disk fails Action Add a SATA disk that is large enough to be used as the hot spare and assign the new disk as a hot spare Clear Alert Number None Related Alert Number None LRA Number 2070 903 2211 The physical disk is not supported Warning Non critical Cause The physical disk may not have a supported version of the firmware or the disk may not be supported by Dell Action If the disk is supported by Dell update the firmware to a supported version If the disk is not supported by Dell replace the disk with one that is supported Clear Alert Number None Related Alert Number None LRA Number 2070 903 2212 The controller battery temperature is above normal OK Normal Cause This alert is for informational purposes Action None Clear Alert Number None
28. Number None Related Alert Number 2341 2343 LRA Number 2080 1203 2343 The Check Consistency logging of inconsistent parity data is disabled Warning Non critical Cause The Check Consistency can no longer report errors in the parity data Action See the hardware documentation for more information Clear Alert Number None Related Alert Number None LRA Number 2080 1203 2346 Error occurred 1 Warning Non critical Cause A physical device may have an error The 1 indicates a substitution variable The text for this substitution variable is generated by the firmware and is displayed with the alert in the Alert Log This text can vary depending on the situation Action Verify the health of attached devices Review the Alert Log for significant events Run the PHY integrity diagnostic tests You may need to replace faulty hardware Make sure the cables are attached securely See the hardware documentation for more information Clear Alert Number None Related Alert Number 2048 2050 2056 2057 2076 2079 2081 2083 2095 2129 2201 2203 2270 2282 2369 LRA Number 2070 903 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 129 2347 The rebuild failed due to errors on the source physical disk Critical Failure Error
29. Redundancy unit lt Redundancy location in chassis gt Chassis location lt Name of chassis gt Previous redundancy state was lt State gt Warning A redundancy sensor in the specified system detected that one of the components of the redundancy unit has failed but the unit is still redundant The redundancy unit location chassis location previous redundancy state and the number of devices required for full redundancy are provided 1306 Redundancy lost Redundancy unit lt Redundancy location in chassis gt Chassis location lt Name of chassis gt Previous redundancy state was lt State gt Error A redundancy sensor in the specified system detected that one of the components in the redundant unit has been disconnected has failed or is not present The redundancy unit location chassis location previous redundancy state and the number of devices required for full redundancy are provided Table 2 7 Redundancy Unit Messages continued Event ID Description Severity Cause Event Message Reference 29 Power Supply Messages Power supply sensors monitor how well a power supply is functioning Power supply messages listed in Table 2 8 provide status and warning information for power supplies present in a particular chassis Table 2 8 Power Supply Messages Event ID Description Severity Cause 1350 Power supply sensor has failed Sensor location lt Location in chassis gt Chassis location
30. The rebuild failed due to errors on the source physical disk 129 The rebuild failed due to errors on the target physical disk 129 The SCSI Enclosure Processor SEP has been rebooted as part of the firmware download operation and will be unavailable until the operation completes 133 The virtual disk cache policy has changed 101 The virtual disk Check Consistency has made corrections and completed 100 The virtual disk Read policy has changed 100 The virtual disk reconfiguration has resumed 100 There is a bad sensor on an enclosure 116 There was an unrecoverable disk media error during the rebuild 129 Thermal shutdown protection has been initiated 13 Index 147 U understanding event description 10 Unsupported configuration detected The SCSI rate of the enclosure management modules EMMs is not the same EMM0 1 EMM1 2 96 User initiated host system reset 14 V viewing event information 9 event messages 7 events in Red Hat Linux 8 events in SUSE Linux Enterprise Server 8 events in Windows 2000 8 Virtual disk check consistency cancelled 70 Virtual disk check consistency completed 72 Virtual disk check consistency failed 71 Virtual disk check consistency started 68 Virtual disk configuration changed 66 Virtual disk created 66 Virtual disk degraded 68 Virtual disk deleted 66 Virtual disk failed 67 Virtual disk format changed 71
31. for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2235 The Check Consistency rate has changed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2237 A controller rescan has been initiated Ok Normal Cause This alert is for informational purposes Action None None 751 2238 The controller debug log file has been exported Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2239 A foreign configuration has been cleared Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 105 2240 A foreign configuration has been imported Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2241 The Patrol Read mode has changed Ok Normal Cause This alert is for informational purposes Action N
32. fully initialize the disk and then restore from back up Clear Alert Number None Related Alert Number None LRA Number 2071 904 2274 The physical disk rebuild has resumed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 901 2276 The dedicated hot spare is too small Warning Non critical Cause The dedicated hot spare is not large enough to protect all virtual disks that reside on the disk group Action Assign a larger disk as the dedicated hot spare Clear Alert Number None Related Alert Number None LRA Number 2070 903 2277 The global hot spare is too small Warning Non critical Cause The global hot spare is not large enough to protect all virtual disks that reside on the controller Action Assign a larger disk as the global hot spare Clear Alert Number None Related Alert Number None LRA Number 2070 903 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 112 Storage Management Message Reference 2278 The controller battery charge level is below a normal threshold Ok Normal Cause The battery is discharging A battery discharge is a normal activity during the battery Learn cycle Before completing the battery Learn cycle recharges the batte
33. initiated 104 A dedicated hot spare failed 101 A dedicated hot spare has been automatically unassigned 102 A dedicated hot spare has been removed 102 A device has been inserted 115 A device has been removed 115 A device is in an unknown state 108 A device is missing 108 A disk media error has been corrected 112 A disk media error was corrected during recovery 113 A foreign configuration has been cleared 104 A foreign configuration has been detected 123 A foreign configuration has been imported 105 A global hot spare failed 101 A global hot spare has been removed 101 A global rescan has initiated 107 A Learn cycle start is pending while the battery charges 114 A mirrored virtual disk has been unmirrored 81 A physical disk is incompatible 119 A physical disk is marked as missing 129 A physical disk that was marked as missing has been replaced 129 A power supply in the enclosure has a DC failure 120 A power supply in the enclosure has an AC failure 120 A previously scheduled system BIOS update has been canceled 13 A redundant path has been restored 113 A redundant path is broken 113 A system BIOS update has been scheduled for the next reboot 13 A user has discarded data from the controller cache 131 A virtual disk and its mirror have been split 81 A virtual disk blink has been initiated 105 A virtua
34. message occurs when a physical disk included in a redundant virtual disk fails Because the virtual disk is redundant uses mirrored or parity information and only one physical disk has failed the virtual disk can be rebuilt Action 1 Configure a hot spare for the virtual disk if one is not already configured Rebuild the virtual disk When using an Expandable RAID Controller PERC PERC 3 SC 3 DCL 3 DC 3 QC 4 SC 4 DC 4e DC 4 Di CERC ATA100 4ch PERC 5 E PERC 5 i or a Serial Attache SCSI SAS 5 iR controller rebuild the virtual disk by first configuring a hot spare for the disk and then initiating a write operation to the disk The write operation will initiate a rebuild of the disk Cause 2 A physical disk in the disk group has been removed Action 2 If a physical disk was removed from the disk group either replace the disk or restore the original disk You can identify which disk has been removed by locating the disk that has a red X for its status Perform a rescan after replacing the disk Clear Alert Number None Related Alert Number 2048 2049 2050 2076 2079 2081 2123 2129 2346 LRA Number 2080 1203 2058 Virtual disk check consistency started Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number 2085 Related Alert Number None LRA Number None 1201 2059 Virtual disk format started Ok Normal Cause This alert is
35. or an invalid cabling configuration See the hardware documentation for information on correct cabling configurations Check if the firmware is a supported version Clear Alert Number None Related Alert Number None LRA Number 2061 754 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 131 2357 SAS expander error 1 Critical Failure Error Cause The 1 indicates a substitution variable The text for this substitution variable is generated by the firmware and is displayed with the alert in the Alert Log This text can vary depending on the situation Action There may be a problem with the enclosure Check the health of the enclosure and its components by selecting the enclosure object in the tree view The Health subtab displays a red X or yellow exclamation point for enclosure components that are failed or degraded See the enclosure documentation for more information Clear Alert Number None Related Alert Number None LRA Number 2061 754 2358 The battery charge cycle is complete Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1151 2359 The physical disk is not certified Warning Non critical Cause The physical disk does n
36. previous versions of Storage Management an unexpected system shutdown may have caused the controller to repost a large number of alerts to the Alert Log when restarting the system Modified Alerts 2095 Severity changed to Informational SNMP trap changed to 901 2153 Severity changed to Informational SNMP trap changed to 851 2188 Severity changed to Informational SNMP trap changed to 1151 2192 Changed documentation for cause and corrective action 2202 Severity changed to Informational SNMP trap changed to 901 2204 Severity changed to Informational SNMP trap changed to 901 2205 Severity changed to Informational SNMP trap changed to 901 Table 4 3 Alert Message Change History Alert Message Change History 62 Storage Management Message Reference 2266 SNMP traps changed to 751 801 851 901 951 1001 1051 1101 1151 1201 2272 Severity changed to Critical SNMP trap changed to 904 Changed corrective action information in the documentation 2273 Changed alert message text and documentation for cause and corrective action 2279 Changed alert message text 2299 Changed corrective action information in the documentation 2305 Changed severity to Warning Changed SNMP trap number to 903 2331 Changed severity to Informational Changed SNMP trap number to 901 2367 Changed severity to Warning Changed SNMP trap number to 903 Obsolete Alerts 2333 2354 2354 replaced by 236
37. state The sensor location chassis location previous state and battery sensor status are provided 1703 Battery sensor detected a warning value Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Battery sensor status lt status gt Warning A battery sensor in the specified system detected that a battery is in a predictive failure state The sensor location chassis location previous state and battery sensor status are provided Event Message Reference 41 1704 Battery sensor detected a failure value Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Battery sensor status lt status gt Error A battery sensor in the specified system detected that a battery has failed The sensor location chassis location previous state and battery sensor status are provided 1705 Battery sensor detected a non recoverable value Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Battery sensor status lt status gt Error A battery sensor in the specified system detected that a battery has failed The sensor location chassis location previous state and battery sensor status are provided Table 2 15 Battery Sensor Messages continued Event ID Description Severity Cause 42 Ev
38. status and warning information for current sensors in a particular chassis Table 2 5 Current Sensor Messages Event ID Description Severity Cause 1200 Current sensor has failed Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Current sensor value in Amps lt Reading gt OR Current sensor value in Watts lt Reading gt If sensor type is discrete Discrete current state lt State gt Information A current sensor in the specified system failed The sensor location chassis location previous state and current sensor value are provided 1201 Current sensor value unknown Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Current sensor value in Amps lt Reading gt OR Current sensor value in Watts lt Reading gt If sensor type is discrete Discrete current state lt State gt Information A current sensor in the specified system could not obtain a reading The sensor location chassis location previous state and a nominal current sensor value are provided Event Message Reference 23 1202 Current sensor returned to a normal value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt
39. status and warning information for processors in a particular chassis Table 2 13 Processor Sensor Messages Event ID Description Severity Cause 1600 Processor sensor has failed Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Processor sensor status lt status gt Information A processor sensor in the specified system is not functioning The sensor location chassis location previous state and processor sensor status are provided 1601 Processor sensor value unknown Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Processor sensor status lt status gt Information A processor sensor in the specified system could not obtain a reading The sensor location chassis location previous state and processor sensor status are provided 1602 Processor sensor returned to a normal value Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Processor sensor status lt status gt Information A processor sensor in the specified system transitioned back to a normal state The sensor location chassis location previous state and processor sensor status are provided 38 Event Message Reference 1603 Processor sensor detected a warning value Sensor Location lt Location in chassis
40. to maintain redundancy After being split both virtual disks retain a copy of the data although because the mirror is no longer intact updates to the data are no longer copied to the mirror Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 2117 A mirrored virtual disk has been unmirrored Ok Normal Cause This alert is for informational purposes A user has caused a mirrored virtual disk to be unmirrored When a virtual disk is mirrored its data is copied to another virtual disk in order to maintain redundancy After being unmirrored the disk formerly used as the mirror returns to being a physical disk and becomes available for inclusion in another virtual disk Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 82 Storage Management Message Reference 2118 Change write policy Ok Normal Cause This alert is for informational purposes A user has changed the write policy for a virtual disk Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 2120 Enclosure firmware mismatch Warning Non critical Cause The firmware on the EMM is not the same version It is required that both modules have the same version
41. value in amps for example Current sensor value in Amps 7 853 Date and time of action lt Date and time gt Specifies the date and time the action was performed for example Date and time of action Sat Jun 12 16 20 33 2004 Device location lt Location in chassis gt Specifies the location of the device in the specified chassis for example Device location Memory Card A Discrete current state lt State gt Specifies the state of the current sensor for example Discrete current state Good Discrete temperature state lt State gt Specifies the state of the temperature sensor for example Discrete temperature state Good Introduction 11 Discrete voltage state lt State gt Specifies the state of the voltage sensor for example Discrete voltage state Good Fan sensor value lt Reading gt Specifies the fan speed in revolutions per minute RPM or On Off for example Fan sensor value in RPM 2600 Fan sensor value Off Log type lt Log type gt Specifies the type of hardware log for example Log type ESM Memory device bank location lt Bank name in chassis gt Specifies the name of the memory bank in the system that generated the message for example Memory device bank location Bank_1 Memory device location lt Device name in chassis gt Specifies the location of the memory module in the chassis for example Memory device location DIMM_A Number of devices required for
42. 107 2254 The Clear operation has cancelled Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 901 2255 The physical disk has been started Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number 2048 2050 2065 2099 2121 2196 2201 2203 LRA Number None 901 2259 An enclosure blink operation has initiated Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number 2260 Related Alert Number None LRA Number None 851 2260 An enclosure blink has ceased OK Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 851 2261 A global rescan has initiated Ok Normal Cause This alert is for informational purposes Action None None 101 2262 SMART thermal shutdown is enabled Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 101 2263 SMART thermal shutdown is disabled Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 101 Table 4 4 Storage M
43. 1102 1152 and 1202 Added SNMP trap 851 2295 Removed SNMP traps 754 804 904 954 1004 1054 1104 1154 and 1204 Remaining SNMP trap is 854 Obsolete Alerts 2317 2363 Documentation Changes Documentation updated to indicate related alerts and Local Response Agent LRA alerts 2095 Changed documentation for cause Table 4 2 Message Format with Variables for Each Storage Object continued Storage Object Message Variables A B C and X Y Z in the following examples are variables representing the storage object name or number Storage Management Message Reference 61 2305 Changed documentation for cause and corrective action Changed SNMP trap number to 903 This change only made in the Dell OpenManage Server Administrator Messages Reference Guide to reflect existing Storage Management online help 2312 Changed documentation for corrective action in the Storage Management online help The Dell OpenManage Server Administrator Messages Reference Guide already has updated corrective action 2367 Changed documentation for cause and corrective action Storage Management 2 2 Comments Product Versions to which Changes Apply Storage Management 2 2 Server Administrator 3 2 Dell OpenManage 5 2 Reduction of unnecessary alert generation Enhancements to Storage Management avoid numerous redundant or inappropriate alerts posted to the Alert Log after an unexpected system shutdown In
44. 126 85 2127 85 2128 86 2129 86 Index 137 2130 86 2131 86 2132 87 2135 87 2136 87 2137 88 2138 88 2139 89 2140 89 2141 89 2142 89 2143 89 2144 89 2145 90 2146 90 2147 90 2148 90 2149 90 2150 90 2151 91 2152 91 2153 91 2154 91 2155 91 2156 91 2157 92 2158 92 2159 92 2162 92 2163 93 2164 93 2165 93 2166 94 2167 94 2168 94 2169 95 2170 95 2171 95 2173 96 2174 96 2175 96 2176 97 2177 97 2178 97 2179 97 2180 98 2181 98 2182 98 2186 98 2187 98 2188 99 2189 99 2191 99 2192 100 2193 100 2194 100 2195 100 2196 100 2199 101 2201 101 2202 101 2203 101 2204 102 2205 102 2206 102 2207 103 2211 103 2212 103 2213 103 2214 103 2215 104 2232 104 2233 104 2234 104 2235 104 2237 104 2238 104 2239 104 2240 105 2241 105 2242 105 2243 105 2244 105 2245 105 2246 106 2247 106 2248 106 2249 106 2251 106 2252 106 138 Index 138 Index 2254 107 2255 107 2259 107 2260 107 2261 107 2262 107 2263 107 2264 108 2265 108 2266 108 2267 109 2268 109 2269 109 2270 110 2271 110 2272 110 2273 111 2274 111 2276 111 2277 111 2278 112 2279 112 2280 11
45. 2 2281 112 2282 113 2283 113 2284 113 2285 113 2286 114 2287 114 2288 114 2289 114 2290 115 2291 115 2292 115 2293 115 2294 115 2295 115 2296 116 2297 116 2298 116 2299 116 2300 117 2301 117 2302 117 2303 117 118 2304 118 2305 118 2306 119 2307 119 2309 119 2310 120 2311 120 2312 120 2313 120 2314 121 2315 121 2316 121 2318 121 2319 122 2320 122 2321 122 2322 122 2323 123 2324 123 2325 123 2326 123 2327 124 2328 124 2329 124 2330 125 2331 125 2332 125 2334 125 2335 126 2336 126 2337 127 2338 127 2339 127 2340 127 2341 128 2342 128 2343 128 2346 128 2347 129 2348 129 2349 129 2350 129 2351 129 2352 129 Index 139 2353 130 2356 130 2357 131 2358 131 2359 131 2360 131 2361 132 2362 132 2364 132 2366 132 2367 133 2368 133 2369 133 2371 133 A A bad disk block could not be reassigned during a write operation 129 A bad disk block has been reassigned 125 A block on the physical disk has been punctured by the controller 111 A consistency check on a virtual disk has been paused suspended 81 A consistency check on a virtual disk has been resumed 81 A controller hot plug has been detected 125 A controller rescan has been
46. 2091 1004 2325 The power supply cable has been inserted Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2325 is a clear alert for alerts 2324 and 2312 Related Alert Number None LRA Number None 1001 2326 A foreign configuration has been detected Ok Normal Cause This alert is for informational purposes The controller has physical disks that were moved from another controller These physical disks contain virtual disks that were created on the other controller See the Import Foreign Configuration and Clear Foreign Configuration section in the Dell OpenManage Server Administrator Storage Management User s Guide for more information Action None Clear Alert Number None Related Alert Number None LRA Number None 751 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 124 Storage Management Message Reference 2327 The NVRAM has corrupted data The controller is reinitializing the NVRAM Warning Non critical Cause The NVRAM has corrupted data This may occur after a power surge a battery failure or for other reasons The controller is reinitializing the NVRAM Action None The controller is taking the required corrective action If this alert is generated often such as during each reboot replace the controller
47. 8 2355 2365 2370 Documentation Changes Severity for alert 2163 changed from Ok Normal to Critical Failure Error Documentation change only made in the Dell OpenManage Server Administrator Messages Reference Guide to reflect the severity displayed in the Server Administrator Alert Log and documented in the Storage Management online help Severity for alert 2318 changed from Critical Failure Error to Warning Non critical Documentation change only made in the Dell OpenManage Server Administrator Messages Reference Guide to reflect the severity displayed in the Server Administrator Alert Log and documented in the Storage Management online help Removed alert 2344 Replaced by alert 2070 Documentation change only made in the Dell OpenManage Server Administrator Messages Reference Guide to reflect existing Storage Management online help Table 4 3 Alert Message Change History Alert Message Change History Storage Management Message Reference 63 Removed alert 2345 Replaced by alert 2079 Documentation change only made in the Dell OpenManage Server Administrator Messages Reference Guide to reflect existing Storage Management online help Storage Management 2 1 Comments Product Versions to which Changes Apply Storage Management 2 1 Server Administrator 2 4 Dell OpenManage 5 1 New Alerts 2062 see note 2173 2195 2196 2212 2213 2214 2215 2260 see note 2370 2371 T
48. Alert Information SNMP Trap Numbers Storage Management Message Reference 79 2109 SMART warning temperature Warning Non critical Cause A disk has reached an unacceptable temperature and received a SMART alert predictive failure The disk is likely to fail in the near future Action 1 Determine why the physical disk has reached an unacceptable temperature A variety of factors can cause the excessive temperature For example a fan may have failed the thermostat may be set too high or the room temperature may be too hot or cold Verify that the fans in the server or enclosure are working If the physical disk is in an enclosure you should check the thermostat settings and examine whether the enclosure is located near a heat source Make sure the enclosure has enough ventilation and that the room temperature is not too hot See the physical disk enclosure documentation for more diagnostic information Action 2 If you cannot identify why the disk has reached an unacceptable temperature then replace the disk If the physical disk is a member of a non redundant virtual disk then back up the data before replacing the disk NOTICE Removing a physical disk that is included in a non redundant virtual disk will cause the virtual disk to fail and may cause data loss Clear Alert Number None Related Alert Number None LRA Number 2070 903 Table 4 4 Storage Management Messages continued Event ID
49. Cause and Action Related Alert Information SNMP Trap Numbers 74 Storage Management Message Reference 2094 Predictive Failure reported Warning Non critical Cause The physical disk is predicted to fail Many physical disks contain Self Monitoring Analysis and Reporting Technology SMART When enabled SMART monitors the health of the disk based on indications such as the number of write operations that have been performed on the disk Action Replace the physical disk Even though the disk may not have failed yet it is strongly recommended that you replace the disk If this disk is part of a redundant virtual disk perform the Offline task on the disk replace the disk and then assign a hot spare and the rebuild will start automatically If this disk is a hot spare then unassign the hot spare perform the Prepare to Remove task on the disk replace the disk and assign the new disk as a hot spare NOTICE If this disk is part of a nonredundant disk back up your data immediately If the disk fails you will not be able to recover the data Clear Alert Number None Related Alert Number None LRA Number 2070 903 2095 SCSI sense data Ok Normal Cause A SCSI device experienced an error but may have recovered Action None Clear Alert Number None Related Alert Number 2273 LRA Number None 751 851 901 2098 Global hot spare assigned Ok Normal Cause A user has assigne
50. None LRA Number 2060 753 2135 Array Manager is installed on the system Warning Non critical Cause Storage Management has been installed on a system that has an Array Manager installation Action Installing Storage Management and Array Manager on the same system is not a supported configuration Uninstall either Storage Management or Array Manager Clear Alert Number None Related Alert Number None LRA Number 2050 103 2136 Virtual disk initialization Ok Normal Cause This alert is for informational purposes Virtual disk initialization is in progress Action None Clear Alert Number 2088 Related Alert Number None LRA Number None 1201 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 88 Storage Management Message Reference 2137 Communicatio n timeout Warning Non critical Cause The controller is unable to communicate with an enclosure There are several reasons why communication may be lost For example there may be a bad or loose cable An unusual amount of I O may also interrupt communication with the enclosure In addition communication loss may be caused by software hardware or firmware problems bad or failed power supplies and enclosure shutdown When viewed in the Alert Log the description for this event displays several variables These variables are
51. Reading gt Warning Voltage of the monitored entity lt Sensor Name Location gt exceeded the warning threshold lt Sensor Name Location gt voltage sensor returned to normal lt Reading gt Information The voltage of a previously reported lt Sensor Name Location gt is returned to normal state System Event Log Messages for IPMI Systems 45 Fan Sensor Events The cooling device sensors monitor how well a fan is functioning These messages provide status warning and failure messages for fans for a particular chassis Table 3 3 Fan Sensor Events Event Message Severity Cause lt Sensor Name Location gt Fan sensor detected a failure lt Reading gt where lt Sensor Name Location gt is the entity that this sensor is monitoring For example BMC Back Fan or BMC Front Fan Reading is specified in RPM For example 100 RPM Critical The speed of the specified lt Sensor Name Location gt fan is not sufficient to provide enough cooling to the system lt Sensor Name Location gt Fan sensor returned to normal state lt Reading gt Information The fan specified by lt Sensor Name Location gt has returned to its normal operating speed lt Sensor Name Location gt Fan sensor detected a warning lt Reading gt Warning The speed of the specified lt Sensor Name Location gt fan may not be sufficient to provide enough cooling to the system lt Sensor Name Location gt Fan Redundancy sensor redundanc
52. The controller cache has been discarded 98 The controller debug log file has been exported 104 The controller has recovered cached data from the BBU 127 The controller is unable to recover cached data from the battery backup unit BBU 127 The controller reconstruct rate has changed 109 The controller write policy has been changed to Write Back 99 The controller write policy has been changed to Write Through 99 The current kernel version and the non RAID SCSI driver version are older than the minimum required levels See the Readme file for a list of validated kernel and driver versions 94 The DC power supply is switched off 122 The dedicated hot spare is too small 111 The EMM has failed 115 The enclosure cannot support both SAS and SATA physical disks Physical disks may be disabled 118 The enclosure has a hardware error 117 The enclosure is not responding 117 The enclosure is unstable 117 The enclosure temperature has returned to normal 130 The factory default settings have been restored 127 The firmware on the EMMs is not the same version EMM0 1 EMM1 2 120 The global hot spare is too small 111 The initialization sequence of SAS components failed during system startup 146 Index 146 Index SAS management and monitoring is not possible 121 The non RAID SCSI driver version is older than the minimum required level See the Readm
53. This event is generated when the intrusion sensor detects an intrusion lt Intrusion sensor Name gt sensor returned to normal state Information This event is generated when the earlier intrusion has been corrected lt Intrusion sensor Name gt sensor intrusion was asserted while system was ON Critical This event is generated when the intrusion sensor detects an intrusion while the system is on lt Intrusion sensor Name gt sensor intrusion was asserted while system was OFF Critical This event is generated when the intrusion sensor detects an intrusion while the system is off Table 3 10 Drive Events continued Event Message Severity Cause 52 System Event Log Messages for IPMI Systems BIOS Generated System Events The BIOS generated messages monitor the health and functionality of the chipsets I O channels and other BIOS related functions These system events are generated by the BIOS Table 3 12 BIOS Generated System Events Event Message Severity Cause System Event I O channel chk Critical This event is generated when a critical interrupt is generated in the I O Channel System Event PCI Parity Err Critical This event is generated when a parity error is detected on the PCI bus System Event Chipset Err Critical This event is generated when a chip error is detected System Event PCI System Err Information This event indicates historical data and is generated when the system has crashed
54. absent from the chassis This sensor monitors the chassis and any attached systems AC Power Cord Sensor Monitors the presence of AC power for an AC power cord Hardware Log Sensor Monitors the size of a hardware log Processor Sensor Monitors the processor status in the system Pluggable Device Sensor Monitors the addition removal or configuration errors for some pluggable devices such as memory cards Battery Sensor Monitors the status of one or more batteries in the system Sample Event Message Text The following example shows the format of the event messages logged by Server Administrator EventID 1000 Source Server Administrator Category Instrumentation Service Type Information Date and Time Mon Oct 21 10 38 00 2002 Computer lt computer name gt Description Server Administrator starting Data Bytes in Hex Viewing Alerts and Event Messages An event log is used to record information about important events Server Administrator generates alerts that are added to the operating system event log and to the Server Administrator Alert log To view these alerts in Server Administrator 1 Select the System object in the tree view 2 Select the Logs tab 3 Select the Alert subtab You can also view the event log using your operating system s event viewer Each operating system s event viewer accesses the applicable operating system event log 8
55. an an existing module The 1 and 2 indicate a substitution variable The text for these substitution variables is displayed with the alert in the Alert Log and can vary depending on the situation Action Upgrade to the same version of the firmware on both EMM modules Clear Alert Number None Related Alert Number None LRA Number 2090 853 2312 A power supply in the enclosure has an AC failure Warning Non critical Cause The power supply has an AC failure Action Replace the power supply Clear Alert Number 2325 Related Alert Number 2122 2324 LRA Number 2090 1003 2313 A power supply in the enclosure has a DC failure Warning Non critical Cause The power supply has a DC failure Action Replace the power supply Clear Alert Number 2323 Related Alert Number 2122 2322 LRA Number 2090 1003 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 121 2314 The initialization sequence of SAS components failed during system startup SAS management and monitoring is not possible Critical Failure Error Cause Storage Management is unable to monitor or manage SAS devices Action Reboot the system If problem persists make sure you have supported versions of the drivers and firmware Also you may need to rei
56. anagement Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 108 Storage Management Message Reference 2264 A device is missing Warning Non critical Cause The controller cannot communicate with a device The device may be removed There may also be a bad or loose cable Action Check if the device is in and not removed If it is in check the cables You should also check the connection to the controller battery and the battery health A battery with a weak or depleted charge may cause this alert Clear Alert Number None Related Alert Number None LRA Number 2050 2060 2070 2080 2090 2100 753 803 853 903 953 1003 1053 1103 1153 1203 2265 A device is in an unknown state Warning Non critical Cause The controller cannot communicate with a device The state of the device cannot be determined There may be a bad or loose cable The system may also be experiencing problems with the application programming interface API There could also be a problem with the driver or firmware Action Check the cables Check if the controller has a supported version of the driver and firmware You can download the most current version of the driver and firmware from support dell com Rebooting the system may also resolve this problem Clear Alert Number None Related Alert Number 2048 2050 LRA Number 2050 2060 2070 2080
57. and recovered System Event PCI Fatal Err Critical This error is generated when a fatal error is detected on the PCI bus System Event PCIE Fatal Err Critical This error is generated when a fatal error is detected on the PCIE bus POST Err POST fatal error lt number gt Critical This event is generated when an error accrues during system boot See the system documentation for more information on the error code Memory Spared redundancy lost Critical This event is generated when memory spare is no longer redundant Memory Mirrored redundancy lost Critical This event is generated when memory mirroring is no longer redundant Memory RAID redundancy lost Critical This event is generated when memory RAID is no longer redundant Err Reg Pointer OEM Diagnostic data event was asserted Information This event is generated when an OEM event accrues System Board PFault Fail Safe state asserted Critical This event is generated when the system board voltages are not at normal levels System Board PFault Fail Safe state deasserted Information This event is generated when earlier PFault Fail Safe system voltages returns to a normal level Memory Add BANK DIMM presence was asserted Information This event is generated when memory is added to the system System Event Log Messages for IPMI Systems 53 Memory Removed BANK DIMM presence was asserted Information This event is gen
58. ange Critical Failure Error Cause A disk has received a SMART alert predictive failure after a configuration change The disk is likely to fail in the near future Action Replace the disk that has received the SMART alert If the physical disk is a member of a non redundant virtual disk then back up the data before replacing the disk NOTICE Removing a physical disk that is included in a non redundant virtual disk will cause the virtual disk to fail and may cause data loss Clear Alert Number None Related Alert Number None LRA Number 2071 904 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 78 Storage Management Message Reference 2108 Smart warning Warning Non critical Cause A disk has received a SMART alert predictive failure The disk is likely to fail in the near future Action Replace the disk that has received the SMART alert If the physical disk is a member of a non redundant virtual disk then back up the data before replacing the disk NOTICE Removing a physical disk that is included in a non redundant virtual disk will cause the virtual disk to fail and may cause data loss Clear Alert Number None Related Alert Number None LRA Number 2070 903 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related
59. as a failed or degraded component Click the controller that displays a Warning or Failed status This action displays the controller Health subtab which displays the status of the individual controller components Continue clicking the components with a Warning or Health status until you identify the failed component See the online help for more information See the enclosure documentation for information on replacing enclosure components and for other diagnostic information Clear Alert Number 2124 Related Alert Number 2048 2049 2057 LRA Number 2080 2090 1306 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 85 2124 Redundancy normal Ok Normal Cause This alert is for informational purposes Data redundancy has been restored to a virtual disk or an enclosure that previously suffered a loss of redundancy Action None Clear Alert Number Alert 2124 is a clear alert for alerts 2122 and 2123 Related Alert Number None LRA Number None 1304 2126 SCSI sense sector reassign Warning Non critical Cause A sector of the physical disk is corrupted and data cannot be maintained on this portion of the disk This alert is for informational purposes NOTICE Any data residing on the corrupt portion of the disk may be lost and you may need to restore your d
60. ata from backup Action If the physical disk is part of a nonredundant virtual disk then back up the data and replace the physical disk NOTICE Removing a physical disk that is included in a nonredundant virtual disk will cause the virtual disk to fail and may cause data loss If the disk is part of a redundant virtual disk then any data residing on the corrupt portion of the disk will be reallocated elsewhere in the virtual disk Clear Alert Number None Related Alert Number None LRA Number None 903 2127 Background initialization BGI started Ok Normal Cause BGI of a virtual disk has started This alert is for informational purposes Action None Clear Alert Status 2130 Related Alert Number None LRA Number None 1201 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 86 Storage Management Message Reference 2128 BGI cancelled Ok Normal Cause BGI of a virtual disk has been cancelled A user or the firmware may have stopped BGI Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 2129 BGI failed Critical Failure Error Cause BGI of a virtual disk has failed Action None Clear Alert Number None Related Alert Number 2340 LRA Number 2081 1204 2130 BGI completed Ok Normal Cause BGI of a virtual di
61. attery health is poor 121 processor sensor 7 Processor sensor detected a failure value 38 52 Processor sensor detected a non recoverable value 38 Processor sensor detected a warning value 38 52 Processor sensor has failed 37 52 Processor sensor returned to a normal state 37 52 Processor sensor value unknown 37 52 Processor Status Events 46 processor status messages 46 R r2 generated system messages 55 Rebuild completed with errors 93 Rebuild not possible as SAS SATA is not supported in the same virtual disk 133 Recharge count maximum exceeded 103 Redundancy degraded 28 83 Redundancy is offline 27 Redundancy lost 28 84 Redundancy normal 85 Redundancy not applicable 27 48 Redundancy regained 28 Redundancy sensor has failed 27 Redundancy sensor value unknown 27 48 redundancy unit messages 26 redundancy unit sensor 6 S SAS expander error 1 131 SAS port report 1 124 125 SAS SMP communications error 1 130 144 Index 144 Index SCSI sense data 74 SCSI sense sector reassign 85 See the Readme file for a list of validated controller driver versions 93 sensor AC power cord 7 chassis intrusion 6 current 6 fan 6 fan enclosure 7 hardware log 7 memory prefailure 6 power supply 6 processor 7 37 redundancy unit 6 temperature 6 voltage 6 Server Administrator starting 13 Server Administrator start
62. atures let you monitor the health of storage resources such as controllers enclosures physical disks and virtual disks Alert Monitoring and Logging The Storage Management Service performs alert monitoring and logging By default the Storage Management Service starts when the managed system starts up If you stop the Storage Management Service the alert monitoring and logging stops Alert monitoring does the following Updates the status of the storage object that generated the alert Propagates the storage object s status to all the related higher objects in the storage hierarchy For example the status of a lower level object will be propagated up to the status displayed on the Health tab for the top level storage object Logs an alert in the Alert log and the operating system OS application log Sends an SNMP trap if the operating system s SNMP service is installed and enabled NOTE Dell OpenManage Server Administrator Storage Management does not log alerts regarding the data I O path These alerts are logged by the respective RAID drivers in the system alert log See the Storage Management Online Help and the Dell OpenManage Server Administrator Storage Management User s Guide for updated information Alert Message Format with Substitution Variables When you view an alert in the Server Administrator alert log the alert identifies the specific components such as the controller name or the virtual dis
63. ause You have attempted to replace a disk with another disk that is using an incompatible technology For example you may have replaced one side of a mirror with a SAS disk when the other side of the mirror is using SATA technology Action See the hardware documentation for information on replacing disks Clear Alert Number None Related Alert Number None LRA Number 2070 903 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 120 Storage Management Message Reference 2310 A virtual disk is permanently degraded Critical Failure Error Cause A redundant virtual disk has lost redundancy This may occur when the virtual disk suffers the failure of multiple physical disks In this case both the source physical disk and the target disk with redundant data have failed A rebuild is not possible because there is no redundancy Action Replace the failed disks and restore from backup Clear Alert Number None Related Alert Number None LRA Number 2081 1204 2311 The firmware on the EMMs is not the same version EMM0 1 EMM1 2 Warning Non critical Cause The firmware on the EMM modules is not the same version It is required that both modules have the same version of the firmware This alert may be caused if you attempt to insert an EMM module that has a different firmware version th
64. ay be possible for the error correction algorithm to correct the error and maintain parity data An error involving multiple bits however usually indicates data loss In some cases if the multi bit error occurs during a read operation the data on the disk may be correct valid If the multi bit error occurs during a write operation data loss has occurred Action Replace the dual in line memory module DIMM The DIMM is a part of the controller battery pack See your hardware documentation for information on replacing the DIMM You may need to restore data from backup Clear Alert Number None Related Alert Number None LRA Number 2061 754 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 115 2290 Single bit ECC error Warning Non critical Cause An error involving a single bit has been encountered during a read or write operation The error correction algorithm has corrected this error Action None Clear Alert Number None Related Alert Number None LRA Number 2060 753 2291 An EMM has been discovered Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 851 2292 Communicatio n with the enclosure has been lost Critical Failure E
65. battery messages 55 BIOS Generated System Events 52 bios generated system messages 52 BMC Watchdog Events 48 BMC watchdog messages 48 C cable interconnect messages 55 Change write policy 82 Chassis intrusion detected 26 47 Chassis intrusion in progress 26 47 chassis intrusion messages 25 Chassis intrusion returned to normal 25 chassis intrusion sensor 6 Chassis intrusion sensor detected a non recoverable value 26 47 Chassis intrusion sensor has failed 25 Chassis intrusion sensor value unknown 25 47 Communication regained 92 Communication timeout 88 Communication with the enclosure has been lost 115 Controller alarm disabled 89 Controller alarm enabled 89 Controller alarm has been tested 91 Index 141 Controller battery is reconditioning 76 Controller battery low 90 Controller battery recondition is completed 76 Controller configuration has been reset 92 Controller event log 1 125 126 Controller log file entry 1 108 Controller rebuild rate has changed 89 cooling device messages 18 current sensor 6 Current sensor detected a failure value 24 Current sensor detected a non recoverable value 24 Current sensor detected a warning value 23 Current sensor has failed 22 46 current sensor messages 22 Current sensor returned to a normal value 23 46 Current sensor value unknown 22 D Dead disk segment
66. build failed Critical Failure Error Cause A physical disk included in the virtual disk has failed or is corrupt A user may also have cancelled the rebuild Action Replace the failed or corrupt disk You can identify a disk that has failed by locating the disk that has a red X for its status Restart the virtual disk rebuild Clear Alert Number None Related Alert Number 2048 LRA Number 2081 1204 2083 Physical disk rebuild failed Critical Failure Error Cause A physical disk included in the virtual disk has failed or is corrupt A user may also have cancelled the rebuild Action Replace the failed or corrupt disk You can identify a disk that has failed by locating the disk that has a red X for its status Rebuild the virtual disk rebuild Clear Alert Number None Related Alert Number None LRA Number 2071 904 2085 Virtual disk check consistency completed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2085 is a clear alert for alert 2058 Related Alert Number None LRA Number None 1201 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 73 2086 Virtual disk format completed Ok Normal Cause This alert is for informational purposes Action None C
67. cation in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Fan sensor value lt Reading gt Information A fan sensor in the specified system could not obtain a reading The sensor location chassis location previous state and a nominal fan sensor value are provided 1102 Fan sensor returned to a normal value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Fan sensor value lt Reading gt Information A fan sensor reading on the specified system returned to a valid range after crossing a warning threshold The sensor location chassis location previous state and fan sensor value are provided 1103 Fan sensor detected a warning value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Fan sensor value lt Reading gt Warning A fan sensor reading in the specified system exceeded a warning threshold The sensor location chassis location previous state and fan sensor value are provided Event Message Reference 19 Voltage Sensor Messages Voltage sensors listed in Table 2 4 monitor the number of volts across critical components Voltage sensor messages provide status and warning information for voltage sensors in a particular chassis 1104 Fan sensor detected a failure value Sensor location lt Location i
68. coverable value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Power Supply type lt type of power supply gt lt Additional power supply status information gt If in configuration error state Configuration error type lt type of configuration error gt Error A power supply sensor in the specified system detected an error from which it cannot recover The sensor location chassis location previous state and additional power supply status information are provided Table 2 8 Power Supply Messages continued Event ID Description Severity Cause 32 Event Message Reference Memory Device Messages Memory device messages listed in Table 2 9 provide status and warning information for memory modules present in a particular system Memory devices determine health status by monitoring the ECC memory correction rate and the type of memory events that have occurred NOTE A critical status does not always indicate a system failure or loss of data In some instances the system has exceeded the ECC correction rate Although the system continues to function you should perform system maintenance as described in Table 2 9 NOTE In Table 2 9 lt status gt can be either critical or non critical Table 2 9 Memory Device Messages Event ID Description Severity Cause 1403 Memory device status is lt status gt Memory device location lt
69. d Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 119 2306 Bad block table is 80 full Warning Non critical Cause The bad block table is used for remapping bad disk blocks This table fills as bad disk blocks are remapped When the table is full bad disk blocks can no longer be remapped and disk errors can no longer be corrected At this point data loss can occur The bad block table is now 80 full Action Back up your data Replace the disk generating this alert and restore from back up Clear Alert Number None Related Alert Number 2307 LRA Number 2070 903 2307 Bad block table is full Unable to log block 1 Critical Failure Error Cause The bad block table is used for remapping bad disk blocks This table fills as bad disk blocks are remapped When the table is full bad disk blocks can no longer be remapped and disk errors can no longer be corrected At this point data loss can occur The 1 indicates a substitution variable The text for this substitution variable is displayed with the alert in the Alert Log and can vary depending on the situation Action Replace the disk generating this alert If necessary restore your data from backup Clear Alert Number None Related Alert Number 2048 LRA Number 2071 904 2309 A physical disk is incompatible Warning Non critical C
70. d a physical disk as a global hot spare This alert is for informational purposes Action None Clear Alert Number None Related Alert Number 2277 LRA Number None 901 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 75 2099 Global hot spare unassigned Ok Normal Cause A user has unassigned a physical disk as a global hot spare This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 901 2100 Temperature exceeded the maximum warning threshold Warning Non critical Cause The physical disk enclosure is too hot A variety of factors can cause the excessive temperature For example a fan may have failed the thermostat may be set too high or the room temperature may be too hot Action Check for factors that may cause overheating For example verify that the enclosure fan is working You should also check the thermostat settings and examine whether the enclosure is located near a heat source Make sure the enclosure has enough ventilation and that the room temperature is not too hot See the physical disk enclosure documentation for more diagnostic information Clear Alert Number 2353 Related Alert Number 2112 LRA Number 2090 1053 2101 Temperature dropped bel
71. des to cancel the flash BIOS update or an error occurs during the flash 1004 Thermal shutdown protection has been initiated Error This message is generated when a system is configured for thermal shutdown due to an error event If a temperature sensor reading exceeds the error threshold for which the system is configured the operating system shuts down and the system powers off This event may also be initiated on certain systems when a fan enclosure is removed from the system for an extended period of time 14 Event Message Reference 1005 SMBIOS data is absent Warning The system does not contain the required systems management BIOS version 2 2 or higher or the BIOS is corrupted 1006 Automatic System Recovery ASR action was performed Action performed was lt Action gt Date and time of action lt Date and time gt Error This message is generated when an automatic system recovery action is performed due to a hung operating system The action performed and the time of action are provided 1007 User initiated host system control action Action requested was lt Action gt Information User requested a host system control action to reboot power off or power cycle the system Alternatively the user had indicated protective measures to be initiated in the event of a thermal shutdown 1008 Systems Management Data Manager Started Information Systems Management Data Manager services were started
72. e chassis components at a safe temperature when the primary fan has failed Redundancy is normal when the intended number of critical components are operating Redundancy is degraded when a component fails but others are still operating Redundancy is lost when the number of components functioning falls below the redundancy threshold Table 2 7 lists the redundancy unit messages 1253 Chassis intrusion in progress Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Chassis intrusion state lt Intrusion state gt Warning A chassis intrusion sensor in the specified system detected that a system cover is currently being opened and the system is operating The sensor location chassis location previous state and chassis intrusion state are provided 1254 Chassis intrusion detected Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Chassis intrusion state lt Intrusion state gt Error A chassis intrusion sensor in the specified system detected that the system cover was opened while the system was operating The sensor location chassis location previous state and chassis intrusion state are provided 1255 Chassis intrusion sensor detected a non recoverable value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state wa
73. e file for the validated driver version 94 The NVRAM has corrupt data 124 The NVRAM has corrupted data The controller is reinitializing the NVRAM 124 The only hot spare available is a SAS disk SAS disks cannot replace SATA disks 103 The only hot spare available is a SATA disk SATA disks cannot replace SAS disks 102 The Patrol Read corrected a media error 110 The patrol read has resumed 114 The Patrol Read has started 105 The Patrol Read has stopped 105 The Patrol Read is paused 114 The Patrol Read mode has changed 105 The Patrol Read rate has changed 104 The physical disk blink has ceased 106 The physical disk blink has initiated 106 The physical disk Clear operation failed 110 The physical disk Clear operation has completed 109 The physical disk Clear operation has started 106 The physical disk has been started 107 The physical disk is not certified 131 The physical disk is not supported 103 The physical disk is too small to be used for a rebuild 118 The physical disk rebuild has resumed 111 The power supply cable has been inserted 123 The power supply is switched on 123 The RAID controller firmware and driver validation was not performed The configuration file cannot be opened 93 The RAID controller firmware and driver validation was not performed The configuration file is out of date or corrupted 94
74. e physical disk offline Action Perform a rescan You can also select the offline disk and perform a Make Online operation Clear Alert Number 2158 Related Alert Number 2099 2196 LRA Number 2070 903 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 66 Storage Management Message Reference 2051 Physical disk degraded Warning Non critical Cause A physical disk has reported an error condition and may be degraded The physical disk may have reported the error condition in response to a consistency check or other operation Action Replace the degraded physical disk You can identify which disk is degraded by locating the disk that has a red X for its status Perform a rescan after replacing the disk Clear Alert Number None Related Alert Number 2070 LRA Number None 903 2052 Physical disk inserted Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number 2065 2305 2367 LRA Number None 901 2053 Virtual disk created Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 2054 Virtual disk deleted Warning Non critical Cause A virtual disk has been deleted Performing a Reset Configuration ma
75. ed Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 126 Storage Management Message Reference 2335 Controller event log 1 Warning Non critical Cause The 1 indicates a substitution variable The text for this substitution variable is generated by the controller and is displayed with the alert in the Alert Log This text is from events in the controller event log that were generated while Storage Management was not running This text can vary depending on the situation Action If there is a problem review the controller event log and the Server Administrator Alert Log for significant events or alerts that may assist in diagnosing the problem Check the health of the storage components See the hardware documentation for more information Clear Alert Number None Related Alert Number None LRA Number 2060 753 2336 Controller event log 1 Critical Failure Error Cause The 1 indicates a substitution variable The text for this substitution variable is generated by the controller and is displayed with the alert in the Alert Log This text is from events in the controller event log that were generated while Storage Management was not running This text can vary depending on the situation Action See the hardware documentation for more information Clear Alert Number None Related Alert Number None LRA Number 2061 754 Table 4 4
76. ed Table 2 4 Voltage Sensor Messages continued Event ID Description Severity Cause Event Message Reference 21 1154 Voltage sensor detected a failure value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Voltage sensor value in Volts lt Reading gt If sensor type is discrete Discrete voltage state lt State gt Error A voltage sensor in the specified system exceeded its failure threshold The sensor location chassis location previous state and voltage sensor value are provided 1155 Voltage sensor detected a non recoverable value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Voltage sensor value in Volts lt Reading gt If sensor type is discrete Discrete voltage state lt State gt Error A voltage sensor in the specified system detected an error from which it cannot recover The sensor location chassis location previous state and voltage sensor value are provided Table 2 4 Voltage Sensor Messages continued Event ID Description Severity Cause 22 Event Message Reference Current Sensor Messages Current sensors listed in Table 2 5 measure the amount of current in amperes that is traversing critical components Current sensor messages provide
77. elated Alert Number None LRA Number None 1151 2177 The controller battery Learn cycle has completed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2177 is a clear alert for alert 2176 Related Alert Number None LRA Number None 1151 2178 The controller battery Learn cycle has timed out Warning Non critical Cause The controller battery must be fully charged before the Learn cycle can begin The battery may be unable to maintain a full charge causing the Learn cycle to timeout Additionally the battery must be able to maintain cached data for a specified period of time in the event of a power loss For example some batteries maintain cached data for 24 hours If the battery is unable to maintain cached data for the required period of time then the Learn cycle will timeout Action Replace the battery pack as the battery is unable to maintain a full charge Clear Alert Number None Related Alert Number None LRA Number 2100 1153 2179 The controller battery Learn cycle has been postponed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1151 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 98 Storage Manage
78. elled Ok Normal Cause A user has cancelled the rebuild operation Action Restart the rebuild operation Clear Alert Number None Related Alert Number None LRA Number None 901 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 71 2076 Virtual disk check consistency failed Critical Failure Error Cause A physical disk included in the virtual disk failed or there is an error in the parity information A failed physical disk can cause errors in parity information Action Replace the failed physical disk You can identify which disk has failed by locating the disk that has a red X for its status Rebuild the physical disk When finished restart the check consistency operation Clear Alert Number None Related Alert Number None LRA Number 2081 1204 2077 Virtual disk format failed Critical Failure Error Cause A physical disk included in the virtual disk failed Action Replace the failed physical disk You can identify which physical disk has failed by locating the disk that has a red X for its status Rebuild the physical disk When finished restart the virtual disk format operation Clear Alert Number None Related Alert Number None LRA Number 2081 1204 2079 Virtual disk initialization failed Critical
79. enerated when the power supply is unplugged lt Power Supply Sensor Name gt predictive failure was deasserted Information This event is generated when the power supply has recovered from an earlier predictive failure event lt Power Supply Sensor Name gt input lost was deasserted Information This event is generated when the power supply is plugged in 48 System Event Log Messages for IPMI Systems Memory ECC Events The memory ECC event messages monitor the memory modules in a system These messages monitor the ECC memory correction rate and the type of memory events that occurred BMC Watchdog Events The BMC watchdog operations are performed when the system hangs or crashes These messages monitor the status and occurrence of these events in a system Table 3 6 Memory ECC Events Event Message Severity Cause ECC error correction detected on Bank DIMM A B Information This event is generated when there is a memory error correction on a particular Dual Inline Memory Module DIMM ECC uncorrectable error detected on Bank DIMM Critical This event is generated when the chipset is unable to correct the memory errors Usually a bank number is provided and DIMM may or may not be identifiable depending on the error Correctable memory error logging disabled Critical This event is generated when the chipset in the ECC error correction rate exceeds a predefined limit Table 3 7 BMC Watchdog Events E
80. ent Message Reference System Event Log Messages for IPMI Systems 43 System Event Log Messages for IPMI Systems The following tables list the system event log SEL messages their severity and cause NOTE For corrective actions see the appropriate documentation Temperature Sensor Events The temperature sensor event messages help protect critical components by alerting the systems management console when the temperature rises inside the chassis These event messages use additional variables such as sensor location chassis location previous state and temperature sensor value or state Table 3 1 Temperature Sensor Events Event Message Severity Cause lt Sensor Name Location gt temperature sensor detected a failure lt Reading gt where lt Sensor Name Location gt is the entity that this sensor is monitoring For example PROC Temp or Planar Temp Reading is specified in degree Celsius For example 100 C Critical Temperature of the backplane board system board or the carrier in the specified system lt Sensor Name Location gt exceeded the critical threshold lt Sensor Name Location gt temperature sensor detected a warning lt Reading gt Warning Temperature of the backplane board system board or the carrier in the specified system lt Sensor Name Location gt exceeded the non critical threshold lt Sensor Name Location gt temperature sensor returned to warning state lt Reading gt Wa
81. erated when memory is removed from the system Memory Cfg Err configuration error BANK DIMM was asserted Critical This event is generated when memory configuration is incorrect for the system Mem Redun Gain redundancy regained Information This event is generated when memory redundancy is regained Mem ECC Warning transition to non critical from OK Warning This event is generated when correctable ECC errors have increased from a normal rate Mem ECC Warning transition to critical from less severe Critical This event is generated when correctable ECC errors reach a critical rate Mem CRC Err transition to non recoverable Critical This event is generated when CRC errors enter a non recoverable state Mem Fatal SB CRC uncorrectable ECC was asserted Critical This event is generated when CRC errors occur while storing to memory Mem Fatal NB CRC uncorrectable ECC was asserted Critical This event is generated when CRC errors occur while removing from memory Mem Overtemp critical over temperature was asserted Critical This event is generated when system memory reaches critical temperature USB Over current transition to non recoverable Critical This event is generated when the USB exceeds a predefined current level Hdwr version err hardware incompatibility BMC Firmware and CPU mismatch was asserted Critical This event is generated when there is a mismatch be
82. ere could be a firmware problem or an invalid cabling configuration If the cables are too long they will degrade the signal Action Power down all enclosures attached to the system and reboot the system If the problem persists upgrade the firmware to the latest supported version You can download the most current version of the driver and firmware from support dell com Make sure the cable configuration is valid See the hardware documentation for valid cabling configurations Clear Alert Number None Related Alert Number None LRA Number 2091 854 2301 The enclosure has a hardware error Critical Failure Error Cause The enclosure or an enclosure component is in a Failed or Degraded state Action Check the health of the enclosure and its components Replace any hardware that is in a Failed state See the hardware documentation for more information Clear Alert Number None Related Alert Number None LRA Number 2091 854 2302 The enclosure is not responding Critical Failure Error Cause The enclosure or an enclosure component is in a Failed or Degraded state Action Check the health of the enclosure and its components Replace any hardware that is in a Failed state See the hardware documentation for more information Clear Alert Number None Related Alert Number None LRA Number 2091 854 Table 4 4 Storage Management Messages continued Event ID
83. erver Administrator starting Feb 6 14 20 51 server01 Server Administrator Instrumentation Service EventID 1001 Server Administrator startup complete Feb 6 14 21 21 server01 Server Administrator Instrumentation Service EventID 1254 Chassis intrusion detected Sensor location Main chassis intrusion Chassis location Main System Chassis Previous state was OK Normal Chassis intrusion state Open Feb 6 14 21 51 server01 Server Administrator Instrumentation Service EventID 1252 Chassis intrusion returned to normal Sensor location Main chassis intrusion Chassis location Main System Chassis Previous state was Critical Failed Chassis intrusion state Closed Viewing the Event Information The event log for each operating system contains some or all of the following information Date The date the event occurred Time The local time the event occurred Type A classification of the event severity Information Warning or Error User The name of the user on whose behalf the event occurred Computer The name of the system where the event occurred Source The software that logged the event Category The classification of the event by the event source Event ID The number identifying the particular event type Description A description of the event The format and contents of the event description vary depending on the event type 10 Introducti
84. ervices The following subsections explain how to open the Windows 2000 Advanced Server Windows Server 2003 and the Red Hat Enterprise Linux and SUSE Linux Enterprise Server event viewers Viewing Events in Windows 2000 Advanced Server and Windows Server 2003 1 Click the Start button point to Settings and click Control Panel 2 Double click Administrative Tools and then double click Event Viewer 3 In the Event Viewer window click the Tree tab and then click System Log The System Log window displays a list of recently logged events 4 To view the details of an event double click one of the event items NOTE You can also look up the dcsys32 log file in the install_path omsa log directory to view the separate event log file The default install_path is C Program Files Dell SysMgt Viewing Events in Red Hat Enterprise Linux and SUSE Linux Enterprise Server 1 Log in as root 2 Use a text editor such as vi or emacs to view the file named var log messages The following example shows the Red Hat Enterprise Linux and SUSE Linux Enterprise Server message log var log messages The text in boldface type indicates the message text NOTE These messages are typically displayed as one long line In the following example the message is displayed using line breaks to help you see the message text more clearly Introduction 9 Feb 6 14 20 51 server01 Server Administrator Instrumentation Service EventID 1000 S
85. es An offline physical disk has been made online Action None Clear Alert Status Alert 2158 is a clear alert for alert 2050 Related Alert Number 2048 2050 2065 2099 2121 2196 2201 2203 LRA Number None 901 2159 Virtual disk renamed Ok Normal Cause This alert is for informational purposes A user has renamed a virtual disk When renaming a virtual disk on a PERC 3 SC 3 DCL 3 DC 3 QC 4 SC 4 DC 4e DC 4 Di CERC ATA100 4ch PERC 5 E PERC 5 i or SAS 5 iR controller this alert displays the new virtual disk name On the PERC 3 SC 3 DCL 3 DC 3 QC 4 SC 4 DC 4e DC 4 Di 4 IM 4e Si 4e Di and CERC ATA 100 4ch controllers this alert displays the original virtual disk name Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 2162 Communicatio n regained Ok Normal Cause This alert is for informational purposes Communication with an enclosure has been restored Action None Clear Alert Status Alert 2162 is a clear alert for alerts 2137 and 2292 Related Alert Number None LRA Number None 851 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 93 2163 Rebuild completed with errors Critical Failure Error Cause This alert is documented in the Storage Management online hel
86. essages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 113 2282 Hot spare SMART polling failed Critical Failure Error Cause The controller firmware attempted a SMART polling on the hot spare but was unable to complete it The controller has lost communication with the hot spare Action Check the health of the disk assigned as a hot spare You may need to replace the disk and reassign the hot spare Make sure the cables are attached securely See the Cables Attached Correctly section in the Dell OpenManage Server Administrator Storage Management User s Guide for more information on checking the cables Clear Alert Number None Related Alert Number None LRA Number 2071 904 2283 A redundant path is broken Warning Non critical Cause The controller has two connectors that are connected to the same enclosure The communication path on one connector has lost connection with the enclosure The communication path on the other connector is reporting this loss Action Make sure the cables are attached securely Make sure both EMMs are healthy Clear Alert Number 2284 Related Alert Number None LRA Number 2070 903 2284 A redundant path has been restored Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2284 is a clear ale
87. for information on replacing the DIMM Clear Alert Number None Related Alert Number None LRA Number 2061 754 2322 The DC power supply is switched off Critical Failure Error Cause The power supply unit is switched off Either a user switched off the power supply unit or it is defective Action Check if the power switch is turned off If it is turned off turn it on If the problem persists check if the power cord is attached and functional If the problem is still not corrected or if the power switch is already turned on replace the power supply unit Clear Alert Number 2323 Related Alert Number None LRA Number 2091 1004 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 123 2323 The power supply is switched on Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2323 is a clear alert for alerts 2313 and 2322 Related Alert Number None LRA Number None 1001 2324 The AC power supply cable has been removed Critical Failure Error Cause The power cable may be pulled out or removed The power cable may also have overheated and become warped and nonfunctional Action Replace the power cable Clear Alert Number 2325 Related Alert Number None LRA Number
88. for informational purposes Action None Clear Alert Number 2086 Related Alert Number None LRA Number None 1201 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 69 2061 Virtual disk initialization started Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number 2088 Related Alert Number None LRA Number None 1201 2062 Physical disk initialization started Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number 2089 Related Alert Number None LRA Number None 901 2063 Virtual disk reconfiguratio n started Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number 2090 Related Alert Number None LRA Number None 1201 2064 Virtual disk rebuild started Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number 2091 Related Alert Number None LRA Number None 1201 2065 Physical disk rebuild started Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number 2092 Related Alert Number 2099 2121 2196 LRA Number None 901 Table 4 4 Storage Management Messages continued Event ID Description Sever
89. gt Chassis Location lt Name of chassis gt Previous state was lt State gt Processor sensor status lt status gt Warning A processor sensor in the specified system is in a throttled state The sensor location chassis location previous state and processor sensor status are provided 1604 Processor sensor detected a failure value Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Processor sensor status lt status gt Error A processor sensor in the specified system is disabled has a configuration error or experienced a thermal trip The sensor location chassis location previous state and processor sensor status are provided 1605 Processor sensor detected a non recoverable value Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Processor sensor status lt status gt Error A processor sensor in the specified system has failed The sensor location chassis location previous state and processor sensor status are provided Table 2 13 Processor Sensor Messages continued Event ID Description Severity Cause Event Message Reference 39 Pluggable Device Messages The pluggable device messages listed in Table 2 14 provide status and error information when some devices such as memory cards are added or removed Table 2 14 Pluggable Dev
90. hanged 104 The battery charge cycle is complete 131 The BGI completed with uncorrectable errors 127 The Check Consistency found inconsistent parity data Data redundancy may be lost 128 The Check Consistency logging of inconsistent parity data is disabled 128 Index 145 The Check Consistency made corrections and completed 128 The Check Consistency rate has changed 104 The Clear operation has cancelled 107 The controller alarm is silenced 104 The controller battery charge level is below a normal threshold 112 The controller battery charge level is normal 95 The controller battery charge level is operating within normal limits 112 The controller battery has been removed 96 The controller battery has been replaced 96 The controller battery is charging 106 The controller battery is degraded 106 The controller battery is executing a Learn cycle 106 The controller battery Learn cycle has been postponed 97 The controller battery Learn cycle has completed 97 The controller battery Learn cycle has started 97 The controller battery Learn cycle has timed out 97 The controller battery Learn cycle will start in days 98 The controller battery needs to be replaced 95 The controller battery temperature is above normal 95 The controller battery temperature is above normal 103 The controller battery temperature is normal 95
91. has been reconnected or replaced The sensor location chassis location previous state and additional power supply status information are provided 1353 Power supply detected a warning Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Power Supply type lt type of power supply gt lt Additional power supply status information gt If in configuration error state Configuration error type lt type of configuration error gt Warning A power supply sensor reading in the specified system exceeded a user definable warning threshold The sensor location chassis location previous state and additional power supply status information are provided Table 2 8 Power Supply Messages continued Event ID Description Severity Cause Event Message Reference 31 1354 Power supply detected a failure Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Power Supply type lt type of power supply gt lt Additional power supply status information gt If in configuration error state Configuration error type lt type of configuration error gt Error A power supply has been disconnected or has failed The sensor location chassis location previous state and additional power supply status information are provided 1355 Power supply sensor detected a non re
92. he Patrol Read corrected a media error Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 901 2272 Patrol Read found an uncorrectable media error Critical Failure Error Cause The Patrol Read task has encountered an error that cannot be corrected There may be a bad disk block that cannot be remapped Action Back up your data If you are able to back up the data successfully then fully initialize the disk and then restore from back up Clear Alert Number None Related Alert Number None LRA Number 2071 904 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 111 2273 A block on the physical disk has been punctured by the controller Critical Failure Error Cause The controller encountered an unrecoverable medium error when attempting to read a block on the physical disk and marked that block as invalid If the controller encountered the unrecoverable medium error on a source physical disk during a rebuild or reconfigure operation it will also puncture the corresponding block on the target physical disk The invalid block will be cleared on a write operation Action Back up your data If you are able to back up the data successfully then
93. he alert numbers for the new alerts 2062 2260 were previously unassigned Alert numbers 2370 and 2371 are new NOTE Alerts 2062 and 2260 were previously undocumented in the Storage Management online help Dell OpenManage Server Administrator Storage Management User s Guide and the Dell OpenManage Server Administrator Messages Reference Guide Modified Alerts 2049 2050 2051 2052 2065 2074 2080 2083 2089 2092 2141 2158 2249 2251 2252 2255 2269 2270 2274 2303 2305 2309 2361 2362 2363 The term array disk has been changed to physical disk throughout Storage Management This change affects the message text of the modified alerts Obsolete Alerts 2160 2161 2160 replaced by 2195 2161 replaced by 2196 Documentation Changes Documentation updated to indicate clear alert status Reference to SNMP trap variables removed Corresponding Array Manager event numbers removed see comments Starting with Dell OpenManage 5 0 Array Manager is no longer an installable option If you have an Array Manager installation and wish to see how the Array Manager events correspond to the Storage Management alerts refer to the product documentation prior to Storage Management 2 1 or Dell OpenManage 5 1 Table 4 3 Alert Message Change History Alert Message Change History 64 Storage Management Message Reference Alert Descriptions and Corrective Actions The following sections describe ale
94. ice Messages Event ID Description Severity Cause 1650 lt Device plug event type unknown gt Device location lt Location in chassis if available gt Chassis location lt Name of chassis if available gt Additional details lt Additional details for the events if available gt Information A pluggable device event message of unknown type was received The device location chassis location and additional event details if available are provided 1651 Device added to system Device location lt Location in chassis gt Chassis location lt Name of chassis gt Additional details lt Additional details for the events gt Information A device was added in the specified system The device location chassis location and additional event details if available are provided 1652 Device removed from system Device location lt Location in chassis gt Chassis location lt Name of chassis gt Additional details lt Additional details for the events gt Information A device was removed from the specified system The device location chassis location and additional event details if available are provided 1653 Device configuration error detected Device location lt Location in chassis gt Chassis location lt Name of chassis gt Additional details lt Additional details for the events gt Error A configuration error was detected for a pluggable device in the specified s
95. itution variable is generated by the controller and is displayed with the alert in the Alert Log This text can vary depending on the situation Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2331 A bad disk block has been reassigned Ok Normal Cause The disk has a bad block Data has been readdressed to another disk block and no data loss has occurred Action Monitor the disk for other alerts or indications of poor health For example you may receive alert 2306 Replace the disk if you suspect there is a problem Clear Alert Number None Related Alert Number None LRA Number None 901 2332 A controller hot plug has been detected Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2334 Controller event log 1 Ok Normal Cause This alert is for informational purposes The 1 indicates a substitution variable The text for this substitution variable is generated by the controller and is displayed with the alert in the Alert Log This text is from events in the controller event log that were generated while Storage Management was not running This text can vary depending on the situation Action None Clear Alert Number None Related Alert Number None LRA Number None 751 Table 4 4 Storage Management Messages continu
96. ity Cause and Action Related Alert Information SNMP Trap Numbers 70 Storage Management Message Reference 2067 Virtual disk check consistency cancelled Ok Normal Cause The check consistency operation cancelled because a physical disk in the array has failed or because a user cancelled the check consistency operation Action If the physical disk failed then replace the physical disk You can identify which disk failed by locating the disk that has a red X for its status Perform a rescan after replacing the disk When performing a consistency check be aware that the consistency check can take a long time The time it takes depends on the size of the physical disk or the virtual disk Clear Alert Number None Related Alert Number None LRA Number None 1201 2070 Virtual disk initialization cancelled Ok Normal Cause The virtual disk initialization cancelled because a physical disk included in the virtual disk has failed or because a user cancelled the virtual disk initialization Action If a physical disk failed then replace the physical disk You can identify which disk has failed by locating the disk that has a red X for its status Perform a rescan after replacing the disk Restart the format physical disk operation Restart the virtual disk initialization Clear Alert Number None Related Alert Number None LRA Number None 1201 2074 Physical disk rebuild canc
97. k name to which the alert applies In an actual operating environment a storage system can have many combinations of controllers and disks as well as user defined names for virtual disks and other components Because each environment is unique in its storage configuration and user defined names an accurate alert message requires that the Storage Management Service be able to insert the environment specific names of storage components into an alert message This environment specific information is inserted after the alert message text as shown for alert 2127 in Table 4 1 58 Storage Management Message Reference For other alerts the alert message text is constructed from information passed directly from the controller or another storage component to the Alert Log In these cases the variable information is represented with a percent sign in the Storage Management documentation An example of such an alert is shown for alert 2334 in Table 4 1 The variables required to complete the message vary depending on the type of storage object and whether the storage object is in a SCSI or SAS configuration The following table identifies the possible variables used to identify each storage object NOTE Some alert messages relating to an enclosure or an enclosure component such as a fan or EMM are generated by the controller when the enclosure or enclosure component ID cannot be determined Table 4 1 Alert Message Format Alert ID Message Te
98. ke sure that it recharges successfully If the battery does not recharge replace the battery pack Clear Alert Number None Related Alert Number None LRA Number 2100 1153 2247 The controller battery is charging Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number 2358 Related Alert Number None LRA Number None 1151 2248 The controller battery is executing a Learn cycle Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1151 2249 The physical disk Clear operation has started Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 901 2251 The physical disk blink has initiated Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 901 2252 The physical disk blink has ceased Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 901 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference
99. l disk blink has ceased 105 A virtual disk is permanently degraded 120 140 Index 140 Index AC power cord is not being monitored 34 AC power cord messages 34 AC power cord sensor 7 AC power cord sensor has failed 34 50 AC power has been lost 35 AC power has been restored 35 All virtual disks are missing from the controller This situation was discovered during system start up 132 An attempt to hot plug an EMM has been detected This type of hot plug is not supported 118 An EMM has been discovered 115 An EMM has been inserted 116 An EMM has been removed 116 An enclosure blink has ceased 107 An enclosure blink operation has initiated 107 An invalid SAS configuration has been detected 98 Array Manager is installed on the system 87 Asset name changed 91 Asset tag changed 91 Automatic System Recovery ASR action was performed 14 B Background initialization cancelled 86 Background initialization completed 86 Background initialization failed 86 Background initialization started 85 Bad block extended medium error 90 Bad block extended sense error 90 Bad block medium error 90 Bad block replacement error 90 Bad block sense error 90 Bad block table is 80 full 119 Bad block table is full Unable to log block 1 119 Bad PHY 1 116 Battery charge in progress 103 Battery charge process interrupted 104
100. lear Alert Status Alert 2086 is a clear alert for alert 2059 Related Alert Number None LRA Number None 1201 2088 Virtual disk initialization completed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2088 is a clear alert for alerts 2061 and 2136 Related Alert Number None LRA Number None 1201 2089 Physical disk initialize completed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2089 is a clear alert for alert 2062 Related Alert Number None LRA Number None 901 2090 Virtual disk reconfiguratio n completed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2090 is a clear alert for alert 2063 Related Alert Number None LRA Number None 1201 2091 Virtual disk rebuild completed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2091 is a clear alert for alert 2064 Related Alert Number None LRA Number None 1201 2092 Physical disk rebuild completed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2092 is a clear alert for alert 2065 Related Alert Number None LRA Number None 901 Table 4 4 Storage Management Messages continued Event ID Description Severity
101. location in chassis gt Possible memory module event cause lt list of causes gt Warning A memory device correction rate exceeded an acceptable value The memory device status and location are provided 1404 Memory device status is lt status gt Memory device location lt location in chassis gt Possible memory module event cause lt list of causes gt Error A memory device correction rate exceeded an acceptable value a memory spare bank was activated or a multibit ECC error occurred The system continues to function normally except for a multibit error Replace the memory module identified in the message during the system s next scheduled maintenance Clear the memory error on multibit ECC error The memory device status and location are provided Event Message Reference 33 Fan Enclosure Messages Some systems are equipped with a protective enclosure for fans Fan enclosure messages listed in Table 2 10 monitor whether foreign objects are present in an enclosure and how long a fan enclosure is missing from a chassis Table 2 10 Fan Enclosure Messages Event ID Description Severity Cause 1450 Fan enclosure sensor has failed Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Information The fan enclosure sensor in the specified system failed The sensor location and chassis location are provided 1451 Fan enclosure sensor value unknown Sensor locatio
102. lt Name of chassis gt Previous state was lt State gt Power Supply type lt type of power supply gt lt Additional power supply status information gt If in configuration error state Configuration error type lt type of configuration error gt Information A power supply sensor in the specified system failed The sensor location chassis location previous state and additional power supply status information are provided 1351 Power supply sensor value unknown Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Power Supply type lt type of power supply gt lt Additional power supply status information gt If in configuration error state Configuration error type lt type of configuration error gt Information A power supply sensor in the specified system could not obtain a reading The sensor location chassis location previous state and additional power supply status information are provided 30 Event Message Reference 1352 Power supply returned to normal Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Power Supply type lt type of power supply gt lt Additional power supply status information gt If in configuration error state Configuration error type lt type of configuration error gt Information A power supply
103. ly connected lt Cable sensor Name Location gt Connection was asserted Information This event is generated when the earlier cable connection error was corrected Table 3 15 Battery Events Description Severity Cause lt Battery sensor Name Location gt Failed was asserted Critical This event is generated when the sensor detects a failed or missing battery lt Battery sensor Name Location gt Failed was deasserted Information This event is generated when the earlier failed battery was corrected lt Battery sensor Name Location gt is low was asserted Warning This event is generated when the sensor detects a low battery condition lt Battery sensor Name Location gt is low was deasserted Information This event is generated when the earlier low battery condition was corrected 56 System Event Log Messages for IPMI Systems Entity Presence Events The entity presence messages are used for detecting different hardware devices Table 3 16 Entity Presence Events Description Severity Cause lt Device Name gt presence was asserted Information This event is generated when the device was detected lt Device Name gt absent was asserted Critical This event is generated when the device was not detected Storage Management Message Reference 57 Storage Management Message Reference The Dell OpenManage Server Administrator Storage Management s alert or event management fe
104. mber 2049 2052 2162 2292 LRA Number None 851 2369 Virtual Disk Redundancy has been degraded Ok Normal Cause A physical disk in a RAID 6 virtual disk has either failed or been removed Action Replace the missing or failed physical disk Clear Alert Number 2121 Related Alert Number 2048 2049 2050 2076 2346 LRA Number None 1201 2371 Attempted import of Unsupported Virtual Disk type RAID 1 Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 134 Storage Management Message Reference Index 135 Index Symbols 1 Storage Management has lost communication with this RAID controller and attached storage An immediate reboot is strongly recommended to avoid further problems If the reboot does not restore communication there may be a hardware failure 109 Numerics 0001 13 1000 13 1001 13 1002 13 1003 13 1004 13 1005 14 1006 14 1007 14 1008 14 1009 14 1011 14 1012 14 1050 15 1051 15 1052 16 1053 16 1054 17 1055 17 1100 18 1101 18 1102 18 1103 18 1104 19 1105 19 1150 19 1151 20 1152 20 1153 20 1154 21 1155
105. mber None Related Alert Number None LRA Number None 1203 2193 The virtual disk reconfiguratio n has resumed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 2194 The virtual disk Read policy has changed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 2195 Dedicated hot spare assigned Physical disk 1 Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number 2196 Related Alert Number None LRA Number None 1201 2196 Dedicated hot spare unassigned Physical disk 1 Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2196 is a clear alert for alert 2195 Related Alert Number None LRA Number None 1201 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 101 2199 The virtual disk cache policy has changed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 2201 A global hot spare failed Warning
106. ment Message Reference 2145 Controller battery low Warning Non critical Cause The controller battery charge is low Action Recondition the battery See the online help for more information Clear Alert Number None Related Alert Number None LRA Number 2100 1153 2146 Bad block replacement error Warning Non critical Cause A portion of a physical disk is damaged Action See the Dell OpenManage Server Administrator Storage Management online help or the Dell OpenManage Server Administrator Storage Management User s Guide for more information Clear Alert Number None Related Alert Number None LRA Number 2060 753 2147 Bad block sense error Warning Non critical Cause A portion of a physical disk is damaged Action See the Dell OpenManage Server Administrator Storage Management online help for more information Clear Alert Number None Related Alert Number None LRA Number 2060 753 2148 Bad block medium error Warning Non critical Cause A portion of a physical disk is damaged Action See the Dell OpenManage Server Administrator Storage Management online help for more information Clear Alert Number None Related Alert Number None LRA Number 2060 753 2149 Bad block extended sense error Warning Non critical Cause A portion of a physical disk is damaged Action See the Dell OpenManage Server Administrator Storage Management o
107. ment Message Reference 2180 The controller battery Learn cycle will start in 1 days Ok Normal Cause This alert is for informational purposes The 1 indicates a substitution variable The text for this substitution variable is displayed with the alert in the Alert Log and can vary depending on the situation Action None Clear Alert Number None Related Alert Number None LRA Number None 1151 2181 The controller battery Learn cycle will start in 1 hours Ok Normal Cause This alert is for informational purposes The 1 indicates a substitution variable The text for this substitution variable is displayed with the alert in the Alert Log and can vary depending on the situation Action None Clear Alert Number None Related Alert Number None LRA Number None 1151 2182 An invalid SAS configuration has been detected Critical Failure Error Cause The controller and attached enclosures are not cabled correctly Action See the hardware documentation for information on correct cabling configurations Clear Alert Number None Related Alert Number None LRA Number 2061 754 2186 The controller cache has been discarded Warning Non critical Cause The controller has flushed the cache and any data in the cache has been lost This may happen if the system has memory or battery problems that cause the controller to distrust the cache Although user data ma
108. minimum required level See readme txt for the validated driver version Warning Non critical Cause The version of the driver does not meet the minimum requirements Storage Management may not be able to display the storage or perform storage management functions until you have updated the system to meet the minimum requirements Action See the Readme file for the validated driver version Update the system to meet the minimum requirements and then reinstall Storage Management Clear Alert Number None Related Alert Number None LRA Number 2050 103 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 95 2169 The controller battery needs to be replaced Critical Failure Error Cause The controller battery cannot recharge The battery may be old or it may have been already recharged the maximum number of times In addition the battery charger may not be working Action Replace the battery pack Clear Alert Number None Related Alert Number 2118 LRA Number 2101 1154 2170 The controller battery charge level is normal Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1151 2171 The controller battery temperature is above normal
109. ms 47 Power Supply Events The power supply sensors monitor the functionality of the power supplies These messages provide status and warning information for power supplies for a particular system Table 3 5 Power Supply Events Event Message Severity Cause lt Power Supply Sensor Name gt power supply sensor removed Critical This event is generated when the power supply sensor is removed lt Power Supply Sensor Name gt power supply sensor AC recovered Information This event is generated when the power supply has been replaced lt Power Supply Sensor Name gt power supply sensor returned to normal state Information This event is generated when the power supply that failed or removed was replaced and the state has returned to normal lt Entity Name gt PS Redundancy sensor redundancy degraded Information Power supply redundancy is degraded if one of the power supply sources is removed or failed lt Entity Name gt PS Redundancy sensor redundancy lost Critical Power supply redundancy is lost if only one power supply is functional lt Entity Name gt PS Redundancy sensor redundancy regained Information This event is generated if the power supply has been reconnected or replaced lt Power Supply Sensor Name gt predictive failure was asserted Warning This event is generated when the power supply is about to fail lt Power Supply Sensor Name gt input lost was asserted Critical This event is g
110. mum temperature probe warning threshold value changed Ok Normal Cause This alert is for informational purposes A user has changed the value for the maximum temperature probe warning threshold Action None Clear Alert Number None Related Alert Number None LRA Number None 1051 2155 Minimum temperature probe warning threshold value changed Ok Normal Cause This alert is for informational purposes A user has changed the value for the minimum temperature probe warning threshold Action None Clear Alert Number None Related Alert Number None LRA Number None 1051 2156 Controller alarm has been tested Ok Normal Cause This alert is for informational purposes The controller alarm test has run successfully Action None Clear Alert Number None Related Alert Number None LRA Number None 751 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 92 Storage Management Message Reference 2157 Controller configuration has been reset Ok Normal Cause This alert is for informational purposes A user has reset the controller configuration See the online help for more information Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2158 Physical disk online Ok Normal Cause This alert is for informational purpos
111. n lt Location in chassis gt Chassis location lt Name of chassis gt Information The fan enclosure sensor in the specified system could not obtain a reading The sensor location and chassis location are provided 1452 Fan enclosure inserted into system Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Information A fan enclosure has been inserted into the specified system The sensor location and chassis location are provided 1453 Fan enclosure removed from system Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Warning A fan enclosure has been removed from the specified system The sensor location and chassis location are provided 34 Event Message Reference AC Power Cord Messages AC power cord messages listed in Table 2 11 provide status and warning information for power cords that are part of an AC power switch if your system supports AC switching 1454 Fan enclosure removed from system for an extended amount of time Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Error A fan enclosure has been removed from the specified system for a user definable length of time The sensor location and chassis location are provided 1455 Fan enclosure sensor detected a non recoverable value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt
112. n lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Voltage sensor value in Volts lt Reading gt If sensor type is discrete Discrete voltage state lt State gt Information A voltage sensor in the specified system could not obtain a reading The sensor location chassis location previous state and a nominal voltage sensor value are provided 1152 Voltage sensor returned to a normal value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Voltage sensor value in Volts lt Reading gt If sensor type is discrete Discrete voltage state lt State gt Information A voltage sensor in the specified system returned to a valid range after crossing a failure threshold The sensor location chassis location previous state and voltage sensor value are provided 1153 Voltage sensor detected a warning value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Voltage sensor value in Volts lt Reading gt If sensor type is discrete Discrete voltage state lt State gt Warning A voltage sensor in the specified system exceeded its warning threshold The sensor location chassis location previous state and voltage sensor value are provid
113. n chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Fan sensor value lt Reading gt Error A fan sensor in the specified system detected the failure of one or more fans The sensor location chassis location previous state and fan sensor value are provided 1105 Fan sensor detected a non recoverable value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Fan sensor value lt Reading gt Error A fan sensor detected an error from which it cannot recover The sensor location chassis location previous state and fan sensor value are provided Table 2 4 Voltage Sensor Messages Event ID Description Severity Cause 1150 Voltage sensor has failed Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Voltage sensor value in Volts lt Reading gt If sensor type is discrete Discrete voltage state lt State gt Information A voltage sensor in the specified system failed The sensor location chassis location previous state and voltage sensor value are provided Table 2 3 Cooling Device Messages continued Event ID Description Severity Cause 20 Event Message Reference 1151 Voltage sensor value unknown Sensor location lt Location in chassis gt Chassis locatio
114. nd other monitored parameters The Server Administrator event monitor uses these status change events to add descriptive messages to the operating system event log or the Server Administrator Alert log Each event message that Server Administrator adds to the Alert log consists of a unique identifier called the event ID for a specific event source category and a descriptive message The event message includes the severity cause of the event and other relevant information such as the event location and the monitored item s previous state Tables provided in this guide list all Server Administrator event IDs in numeric order Each entry includes the event ID s corresponding description severity level and cause Message text in angle brackets for example lt State gt describes the event specific information provided by the Server Administrator What s New in this Release Modifications have been made to the Storage Management Service events For more information see Alert Message Change History Messages Not Described in This Guide This guide describes only event messages created by Server Administrator and displayed in the Server Administrator Alert log For information on other messages produced by your system consult one of the following sources Your system s Installation and Troubleshooting Guide Other system documentation Operating system documentation Application program documentation 6 I
115. nged 91 Multi bit ECC error 114 Multiple enclosures are attached to the controller This is an unsupported configuration 99 P Patrol Read found an uncorrectable media error 110 Physical disk dead segments recovered 89 Physical disk degraded 66 Physical disk initialization started 69 Physical disk initialize completed 73 Index 143 Physical disk initialize failed 71 Physical disk inserted 66 Physical disk offline 65 Physical disk online 92 Physical disk rebuild cancelled 70 Physical disk rebuild completed 73 Physical disk rebuild failed 72 Physical disk rebuild started 69 Physical disk removed 65 Physical disk s have been removed from a virtual disk The virtual disk will be in Failed state during the next system reboot 132 Physical disk s that are part of a virtual disk have been removed while the system was shut down This removal was discovered during system start up 132 pluggable device sensor 7 Power supply detected a failure 31 Power supply detected a warning 30 48 Power Supply Events 47 power supply messages 29 47 Power supply returned to normal 30 48 power supply sensor 6 Power supply sensor detected a non recoverable value 31 Power supply sensor has failed 29 Power supply sensor value unknown 29 Predictive Failure reported 74 Problems with the battery or the battery charger have been detected The b
116. nline help for more information Clear Alert Number None Related Alert Number None LRA Number 2060 753 2150 Bad block extended medium error Warning Non critical Cause A portion of a physical disk is damaged Action See the Dell OpenManage Server Administrator Storage Management online help for more information Clear Alert Number None Related Alert Number None LRA Number 2060 753 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 91 2151 Asset tag changed Ok Normal Cause This alert is for informational purposes A user has changed the enclosure asset tag Action None Clear Alert Number None Related Alert Number None LRA Number None 851 2152 Asset name changed Ok Normal Cause This alert is for informational purposes A user has changed the enclosure asset name Action None Clear Alert Number None Related Alert Number None LRA Number None 851 2153 Service tag changed Ok Normal Cause An enclosure service tag was changed In most circumstances this service tag should only be changed by Dell support or your service provider Action Ensure that the tag was changed under authorized circumstances Clear Alert Number None Related Alert Number None LRA Number None 851 2154 Maxi
117. nstall Storage Management or Server Administrator because of some missing installation components Clear Alert Number None Related Alert Number None LRA Number 2051 104 2315 Diagnostic message 1 Ok Normal Cause This alert is for informational purposes The 1 indicates a substitution variable The text for this substitution variable is generated by the utility that ran the diagnostics and is displayed with the alert in the Alert Log This text can vary depending on the situation Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2316 Diagnostic message 1 Critical Failure Error Cause A diagnostics test failed The 1 indicates a substitution variable The text for this substitution variable is generated by the utility that ran the diagnostics and is displayed with the alert in the Alert Log This text can vary depending on the situation Action See the documentation for the utility that ran the diagnostics for more information Clear Alert Number None Related Alert Number None LRA Number 2061 754 2318 Problems with the battery or the battery charger have been detected The battery health is poor Warning Non critical Cause The battery or the battery charger is not functioning properly Action Replace the battery pack Clear Alert Number None Related Alert Number 2188 LRA Number 2100 1154 Table 4 4 St
118. ntroduction Understanding Event Messages This section describes the various types of event messages generated by the Server Administrator When an event occurs on your system the Server Administrator sends information about one of the following event types to the systems management console Server Administrator generates events based on status changes in the following sensors Temperature Sensor Helps protect critical components by alerting the systems management console when temperatures become too high inside a chassis also monitors a variety of locations in the chassis and in any attached systems Fan Sensor Monitors fans in various locations in the chassis and in any attached systems Voltage Sensor Monitors voltages across critical components in various chassis locations and in any attached systems Current Sensor Monitors the current or amperage output from the power supply or supplies in the chassis and in any attached systems Chassis Intrusion Sensor Monitors intrusion into the chassis and any attached systems Redundancy Unit Sensor Monitors redundant units critical units such as fans AC power cords or power supplies within the chassis also monitors the chassis and any attached systems For example redundancy allows a second or nth fan to keep the chassis components at a safe temperature when another fan has failed Redundancy is normal when the intended number of c
119. ntroller A Example 2057 Virtual disk degraded Virtual Disk 11 Virtual Disk 11 Controller 1 PERC 5 E Adapter NOTE The virtual disk and controller names are not always displayed Enclosure Message Format Enclosure X Y Controller A Connector B Example 2112 Enclosure shutdown Enclosure 0 2 Controller 1 Connector 0 SCSI Power Supply Message Format Power Supply X Controller A Connector B Target ID C where C is the SCSI ID number of the enclosure management module EMM managing the power supply Example 2122 Redundancy degraded Power Supply 1 Controller 1 Connector 0 Target ID 6 SAS Power Supply Message Format Power Supply X Controller A Connector B Enclosure C Example 2312 A power supply in the enclosure has an AC failure Power Supply 1 Controller 1 Connector 0 Enclosure 2 SCSI Temperature Probe Message Format Temperature Probe X Controller A Connector B Target ID C where C is the SCSI ID number of the EMM managing the temperature probe Example 2101 Temperature dropped below the minimum warning threshold Temperature Probe 1 Controller 1 Connector 0 Target ID 6 SAS Temperature Probe Message Format Temperature Probe X Controller A Connector B Enclosure C Example 2101 Temperature dropped below the minimum warning threshold Temperature Probe 1 Controller 1 Connector 0 Enclosure 2 SCSI Fan Message Format Fan X Controller A Connector B Target ID C where C is the SCSI ID
120. number of devices required for full redundancy are provided 1302 Redundancy not applicable Redundancy unit lt Redundancy location in chassis gt Chassis location lt Name of chassis gt Previous redundancy state was lt State gt Information A redundancy sensor in the specified system detected that a unit was not redundant The redundancy location chassis location previous redundancy state and the number of devices required for full redundancy are provided 1303 Redundancy is offline Redundancy unit lt Redundancy location in chassis gt Chassis location lt Name of chassis gt Previous redundancy state was lt State gt Information A redundancy sensor in the specified system detected that a redundant unit is offline The redundancy unit location chassis location previous redundancy state and the number of devices required for full redundancy are provided 28 Event Message Reference 1304 Redundancy regained Redundancy unit lt Redundancy location in chassis gt Chassis location lt Name of chassis gt Previous redundancy state was lt State gt Information A redundancy sensor in the specified system detected that a lost redundancy device has been reconnected or replaced full redundancy is in effect The redundancy unit location chassis location previous redundancy state and the number of devices required for full redundancy are provided 1305 Redundancy degraded
121. number of the EMM managing the fan Example 2121 Device returned to normal Fan 1 Controller 1 Connector 0 Target ID 6 SAS Fan Message Format Fan X Controller A Connector B Enclosure C Example 2121 Device returned to normal Fan 1 Controller 1 Connector 0 Enclosure 2 SCSI EMM Message Format EMM X Controller A Connector B Target ID C where C is the SCSI ID number of the EMM Example 2121 Device returned to normal EMM 1 Controller 1 Connector 0 Target ID 6 Table 4 2 Message Format with Variables for Each Storage Object continued Storage Object Message Variables A B C and X Y Z in the following examples are variables representing the storage object name or number 60 Storage Management Message Reference Alert Message Change History The following table describes changes made to the Storage Management alerts from the previous release of Storage Management to the current release SAS EMM Message Format EMM X Controller A Connector B Enclosure C Example 2121 Device returned to normal EMM 1 Controller 1 Connector 0 Enclosure 2 Table 4 3 Alert Message Change History Alert Message Change History Storage Management 2 3 Comments Product Versions to which Changes Apply Storage Management 2 3 Server Administrator 3 2 Dell OpenManage 5 3 New Alerts 2369 Modified Alerts 2095 Added SNMP traps 751 and 851 2294 Removed SNMP traps 752 802 852 902 952 1002 1052
122. o a redundant virtual disk has been restored Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 2141 Physical disk dead segments recovered Ok Normal Cause This alert is for informational purposes Portions of the physical disk were formerly inaccessible The disk space from these dead segments has been recovered and is now usable Any data residing on these dead segments has been lost Action None Clear Alert Number None Related Alert Number None LRA Number None 901 2142 Controller rebuild rate has changed Ok Normal Cause This alert is for informational purposes A user has changed the controller rebuild rate Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2143 Controller alarm enabled Ok Normal Cause This alert is for informational purposes A user has enabled the controller alarm Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2144 Controller alarm disabled Ok Normal Cause This alert is for informational purposes A user has disabled the controller alarm Action None Clear Alert Number None Related Alert Number None LRA Number None 751 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 90 Storage Manage
123. of the firmware This alert may be caused when a user attempts to insert an EMM module that has a different firmware version than an existing module Action Download the same version of the firmware to both EMM modules Clear Alert Number None Related Alert Number None LRA Number 2090 853 2121 Device returned to normal Ok Normal Cause This alert is for informational purposes A device that was previously in an error state has returned to a normal state For example if an enclosure became too hot and subsequently cooled down then you may receive this alert Action None Clear Alert Status Alert 2121 is a clear alert for alert 2048 Related Alert Number 2050 2065 2158 LRA Number None 752 802 852 902 952 1002 1052 1102 1152 1202 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 83 2122 Redundancy degraded Warning Non critical Cause One or more of the enclosure components has failed For example a fan or power supply may have failed Although the enclosure is currently operational the failure of additional components could cause the enclosure to fail Action Identify and replace the failed component To identify the failed component select the enclosure in the tree view and click the Health subtab Any failed component will be
124. on Understanding the Event Description Table 1 2 lists in alphabetical order each line item that may appear in the event description Table 1 2 Event Description Reference Description Line Item Explanation Action performed was lt Action gt Specifies the action that was performed for example Action performed was Power cycle Action requested was lt Action gt Specifies the action that was requested for example Action requested was Reboot shutdown OS first Additional Details lt Additional details for the event gt Specifies additional details available for the hot plug event for example Memory device DIMM1_A Serial number FFFF30B1 lt Additional power supply status information gt Specifies information pertaining to the event for example Power supply input AC is off Power supply POK power OK signal is not normal Power supply is turned off Chassis intrusion state lt Intrusion state gt Specifies the chassis intrusion state open or closed for example Chassis intrusion state Open Chassis location lt Name of chassis gt Specifies name of the chassis that generated the message for example Chassis location Main System Chassis Configuration error type lt type of configuration error gt Specifies the type of configuration error that occurred for example Configuration error type Revision mismatch Current sensor value in Amps lt Reading gt Specifies the current sensor
125. one Clear Alert Number None Related Alert Number None LRA Number None 751 2242 The Patrol Read has started Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number 2243 Related Alert Number None LRA Number None 751 2243 The Patrol Read has stopped Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2243 is a clear alert for alert 2242 Related Alert Number None LRA Number None 751 2244 A virtual disk blink has been initiated Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 2245 A virtual disk blink has ceased Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 106 Storage Management Message Reference 2246 The controller battery is degraded Warning Non critical Cause The controller battery charge is weak Action As the charge weakens the charger should automatically recharge the battery If the battery has reached its recharge limit replace the battery pack Monitor the battery to ma
126. or value are provided 1055 Temperature sensor detected a non recoverable value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Temperature sensor value in degrees Celsius lt Reading gt If sensor type is discrete Discrete temperature state lt State gt Error A temperature sensor on the backplane board system board or drive carrier in the specified system detected an error from which it cannot recover The sensor location chassis location previous state and temperature sensor value are provided Table 2 2 Temperature Sensor Messages continued Event ID Description Severity Cause 18 Event Message Reference Cooling Device Messages Cooling device sensors listed in Table 2 3 monitor how well a fan is functioning Cooling device messages provide status and warning information for fans in a particular chassis Table 2 3 Cooling Device Messages Event ID Description Severity Cause 1100 Fan sensor has failed Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Fan sensor value lt Reading gt Information A fan sensor in the specified system is not functioning The sensor location chassis location previous state and fan sensor value are provided 1101 Fan sensor value unknown Sensor location lt Lo
127. orage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 122 Storage Management Message Reference 2319 Single bit ECC error The DIMM is degrading Warning Non critical Cause The DIMM is beginning to malfunction Action Replace the DIMM to avoid data loss or data corruption The DIMM is a part of the controller battery pack See your hardware documentation for information on replacing the DIMM Clear Alert Number None Related Alert Number 2320 LRA Number 2060 753 2320 Single bit ECC error The DIMM is critically degraded Critical Failure Error Cause The DIMM is malfunctioning Data loss or data corruption may be imminent Action Replace the DIMM immediately to avoid data loss or data corruption The DIMM is a part of the controller battery pack See your hardware documentation for information on replacing the DIMM Clear Alert Number None Related Alert Number 2321 LRA Number 2061 754 2321 Single bit ECC error The DIMM is critically degraded There will be no further reporting Critical Failure Error Cause The DIMM is malfunctioning Data loss or data corruption is imminent The DIMM must be replaced immediately No further alerts will be generated Action Replace the DIMM immediately The DIMM is a part of the controller battery pack See your hardware documentation
128. ot comply with the standards set by Dell and is not supported Action Replace the physical disk with a physical disk that is supported Clear Alert Number None Related Alert Number None LRA Number 2070 903 2360 A user has discarded data from the controller cache Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 132 Storage Management Message Reference 2361 Physical disk s that are part of a virtual disk have been removed while the system was shut down This removal was discovered during system start up Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2362 Physical disk s have been removed from a virtual disk The virtual disk will be in Failed state during the next system reboot Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2364 All virtual disks are missing from the controller This situation was discovered during system start up Ok Normal Cause This alert is for informational p
129. ow the minimum warning threshold Warning Non critical Cause The physical disk enclosure is too cool Action Check if the thermostat setting is too low and if the room temperature is too cool Clear Alert Number 2353 Related Alert Number None LRA Number 2090 1053 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 76 Storage Management Message Reference 2102 Temperature exceeded the maximum failure threshold Critical Failure Error Cause The physical disk enclosure is too hot A variety of factors can cause the excessive temperature For example a fan may have failed the thermostat may be set too high or the room temperature may be too hot Action Check for factors that may cause overheating For example verify that the enclosure fan is working You should also check the thermostat settings and examine whether the enclosure is located near a heat source Make sure the enclosure has enough ventilation and that the room temperature is not too hot See the physical disk enclosure documentation for more diagnostic information Clear Alert Number None Related Alert Number None LRA Number 2091 1054 2103 Temperature dropped below the minimum failure threshold Critical Failure Error Cause The physical disk enclosure is too cool Action Check if the thermosta
130. p Action See the online help for more information Clear Alert Number None Related Alert Number None LRA Number 2071 904 2164 See the Readme file for a list of validated controller driver versions Ok Normal Cause This alert is for informational purposes Storage Management is unable to determine whether the system has the minimum required versions of the RAID controller drivers Action See the Readme file for driver and firmware requirements In particular if Storage Management experiences performance problems you should verify that you have the minimum supported versions of the drivers and firmware installed Clear Alert Number None Related Alert Number None LRA Number None 101 2165 The RAID controller firmware and driver validation was not performed The configuration file cannot be opened Warning Non critical Cause Storage Management is unable to determine whether the system has the minimum required versions of the RAID controller firmware and drivers This situation may occur for a variety of reasons For example the installation directory path to the configuration file may not be correct The configuration file may also have been removed or renamed Action Reinstall Storage Management Clear Alert Number None Related Alert Number None LRA Number 2060 753 Table 4 4 Storage Management Messages continued Event ID Description Severity Cau
131. rable Critical This event is generated when the processor machine check enters a non recoverable state Logging Disabled all event logging disabled was asserted Critical This event is generated when all event logging is disabled Unknown system event sensor unknown system hardware failure was asserted Critical This event is generated when an unknown hardware failure is detected Table 3 12 BIOS Generated System Events continued Event Message Severity Cause System Event Log Messages for IPMI Systems 55 R2 Generated System Events Cable Interconnect Events The cable interconnect messages are used for detecting errors in the hardware cabling Battery Events Table 3 13 R2 Generated Events Description Severity Cause System Event OS stop event OS graceful shutdown detected Information The OS was shutdown restarted normally OEM Event data record after OS graceful shutdown restart event Information Comment string accompanying an OS shutdown restart System Event OS stop event runtime critical stop Critical The OS encountered a critical error and was stopped abnormally OEM Event data record after OS bugcheck event Information OS bugcheck code and paremeters Table 3 14 Cable Interconnect Events Description Severity Cause lt Cable sensor Name Location gt Configuration error was asserted Critical This event is generated when the cable is not connected or is incorrect
132. rete current state lt State gt Error A current sensor in the specified system exceeded its failure threshold The sensor location chassis location previous state and current sensor value are provided 1205 Current sensor detected a non recoverable value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Current sensor value in Amps lt Reading gt OR Current sensor value in Watts lt Reading gt If sensor type is discrete Discrete current state lt State gt Error A current sensor in the specified system detected an error from which it cannot recover The sensor location chassis location previous state and current sensor value are provided Table 2 5 Current Sensor Messages continued Event ID Description Severity Cause Event Message Reference 25 Chassis Intrusion Messages Chassis intrusion messages listed in Table 2 6 are a security measure Chassis intrusion means that someone is opening the cover to a system s chassis Alerts are sent to prevent unauthorized removal of parts from a chassis Table 2 6 Chassis Intrusion Messages Event ID Description Severity Cause 1250 Chassis intrusion sensor has failed Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Chassis intrusion state lt In
133. ritical components are operating Redundancy is degraded when a component fails but others are still operating Redundancy is lost when there is one less critical redundancy device than required Power Supply Sensor Monitors power supplies in the chassis and in any attached systems Memory Prefailure Sensor Monitors memory modules by counting the number of Error Correction Code ECC memory corrections Table 1 1 Understanding Event Messages Icon Alert Severity Component Status OK Normal An event that describes the successful operation of a unit The alert is provided for informational purposes and does not indicate an error condition For example the alert may indicate the normal start or stop of an operation such as power supply or a sensor reading returning to normal Warning Non critical An event that is not necessarily significant but may indicate a possible future problem For example a Warning Non critical alert may indicate that a component such as a temperature probe in an enclosure has crossed a warning threshold Critical Failure Error A significant event that indicates actual or imminent loss of data or loss of function For example crossing a failure threshold or a hardware failure such as an array disk Introduction 7 Fan Enclosure Sensor Monitors protective fan enclosures by detecting their removal from and insertion into the system and by measuring how long a fan enclosure is
134. rity Cause and Action Related Alert Information SNMP Trap Numbers 102 Storage Management Message Reference 2204 A dedicated hot spare has been removed Ok Normal Cause The controller is unable to communicate with a disk that is assigned as a dedicated hot spare The disk may have been removed There may also be a bad or loose cable Action Check if the disk is healthy and that it has not been removed Check the cables If necessary replace the disk and reassign the hot spare Clear Alert Number None Related Alert Number None LRA Number None 901 2205 A dedicated hot spare has been automatically unassigned Ok Normal Cause The hot spare is no longer required because the virtual disk it was assigned to has been deleted Action None Clear Alert Number None Related Alert Number 2098 2161 2196 LRA Number None 901 2206 The only hot spare available is a SATA disk SATA disks cannot replace SAS disks Warning Non critical Cause The only physical disk available to be assigned as a hot spare is using SATA technology The physical disks in the virtual disk are using SAS technology Because of this difference in technology the hot spare cannot rebuild data if one of the physical disks in the virtual disk fails Action Add a SAS disk that is large enough to be used as the hot spare and assign the new disk as a hot spare Clear Alert Number None Related Alert
135. rmal Cause This alert is for informational purposes Action None Clear Alert Number 2352 Related Alert Number None LRA Number None 901 2352 A physical disk that was marked as missing has been replaced Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2352 is a clear alert for alert 2351 Related Alert Number None LRA Number None 901 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 130 Storage Management Message Reference 2353 The enclosure temperature has returned to normal Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2353 is a clear alert for alerts 2100 and 2101 Related Alert Number None LRA Number None 851 2356 SAS SMP communicatio ns error 1 Critical Failure Error Cause The 1 indicates a substitution variable The text for this substitution variable is generated by the firmware and is displayed with the alert in the Alert Log This text can vary depending on the situation The reference to SMP in this text refers to SAS Management Protocol Action There may be a SAS topology error See the hardware documentation for information on correct SAS topology configurations There may be problems with the cables such as a loose connection
136. rmation This event is generated when a processor recovers from the internal error lt Processor Entity gt status processor sensor disabled Warning This event is generated for all processors that are disabled lt Processor Entity gt status processor sensor terminator not present Information This event is generated if the terminator is missing on an empty processor slot lt Processor Entity gt presence was deasserted Critical This event is generated when the system could not detect the processor lt Processor Entity gt presence was asserted Information This event is generated when the earlier processor detection error was corrected lt Processor Entity gt thermal tripped was deasserted Information This event is generated when the processor has recovered from an earlier thermal condition lt Processor Entity gt configuration error was asserted Critical This event is generated when the processor configuration is incorrect lt Processor Entity gt configuration error was deasserted Information This event is generated when the earlier processor configuration error was corrected lt Processor Entity gt throttled was asserted Warning This event is generated when the processor slows down to prevent over heating lt Processor Entity gt throttled was deasserted Information This event is generated when the earlier processor throttled event was corrected System Event Log Messages for IPMI Syste
137. rning Temperature of the backplane board system board or the carrier in the specified system lt Sensor Name Location gt returned from critical state to non critical state lt Sensor Name Location gt temperature sensor returned to normal state lt Reading gt Information Temperature of the backplane board system board or the carrier in the specified system lt Sensor Name Location gt returned to normal operating range 44 System Event Log Messages for IPMI Systems Voltage Sensor Events The voltage sensor event messages monitor the number of volts across critical components These messages provide status and warning information for voltage sensors for a particular chassis Table 3 2 Voltage Sensor Events Event Message Severity Cause lt Sensor Name Location gt voltage sensor detected a failure lt Reading gt where lt Sensor Name Location gt is the entity that this sensor is monitoring Reading is specified in volts For example 3 860 V Critical The voltage of the monitored device has exceeded the critical threshold lt Sensor Name Location gt voltage sensor state asserted Critical The voltage specified by lt Sensor Name Location gt is in critical state lt Sensor Name Location gt voltage sensor state de asserted Information The voltage of a previously reported lt Sensor Name Location gt is returned to normal state lt Sensor Name Location gt voltage sensor detected a warning lt
138. rror Cause The controller has lost communication with an EMM The cables may be loose or defective Action Make sure the cables are attached securely Reboot the system Clear Alert Number 2162 Related Alert Number None LRA Number 2091 854 2293 The EMM has failed Critical Failure Error Cause The failure may be caused by a loss of power to the EMM The EMM self test may also have identified a failure There could also be a firmware problem or a multi bit error Action Replace the EMM See the hardware documentation for information on replacing the EMM Clear Alert Number None Related Alert Number None LRA Number 2091 854 2294 A device has been inserted Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 851 2295 A device has been removed Critical Failure Error Cause A device has been removed and the system is no longer functioning in optimal condition Action Replace the device Clear Alert Number None Related Alert Number None LRA Number 2091 854 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 116 Storage Management Message Reference 2296 An EMM has been inserted Ok Normal Cause This alert is for informational purposes Action
139. rt for alert 2283 Related Alert Number None LRA Number None 901 2285 A disk media error was corrected during recovery Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 901 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 114 Storage Management Message Reference 2286 A Learn cycle start is pending while the battery charges Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1151 2287 The Patrol Read is paused Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number 2288 Related Alert Number None LRA Number None 751 2288 The patrol read has resumed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2288 is a clear alert for alert 2287 Related Alert Number None LRA Number None 751 2289 Multi bit ECC error Critical Failure Error Cause An error involving multiple bits has been encountered during a read or write operation The error correction algorithm recalculates parity data during read and write operations If an error involves only a single bit it m
140. rts generated by the RAID or SCSI controllers supported by Storage Management The alerts are displayed in the Server Administrator Alert subtab or through Windows Event Viewer These alerts can also be forwarded as SNMP traps to other applications SNMP traps are generated for the alerts listed in the following sections These traps are included in the Dell OpenManage Server Administrator Storage Management management information base MIB The SNMP traps for these alerts use all of the SNMP trap variables For more information on SNMP support and the MIB see the SNMP Reference Guide To locate an alert scroll through the following table to find the alert number displayed on the Server Administrator Alert tab or search this file for the alert message text or number See Understanding Event Messages for more information on severity levels For more information regarding alert descriptions and the appropriate corrective actions see the online help Table 4 4 Storage Management Messages Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 2048 Device failed Critical Failure Error Cause A storage component such as a physical disk or an enclosure has failed The failed component may have been identified by the controller while performing a task such as a rescan or a check consistency Action Replace the failed component You can identify which disk has failed by locating the di
141. ry You should receive alert 2179 when the recharge occurs Action Check if the battery Learn cycle is in progress Alert 2176 indicates that the battery Learn cycle has initiated The battery also displays the Learn state while the Learn cycle is in progress If a Learn cycle is not in progress replace the battery pack Clear Alert Number None Related Alert Number 2199 LRA Number None 1154 2279 The controller battery charge level is operating within normal limits Ok Normal Cause This alert is provided for informational purposes This alert indicates that the battery is recharging during the battery Learn cycle Action None Clear Alert Number None Related Alert Number None LRA Number None 1151 2280 A disk media error has been corrected Ok Normal Cause A disk media error was detected while the controller was completing a background task A bad disk block was identified The disk block has been remapped Action Consider replacing the disk If you receive this alert frequently be sure to replace the disk You should also routinely back up your data Clear Alert Number None Related Alert Number None LRA Number None 1201 2281 Virtual disk has inconsistent data Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number 2127 LRA Number None 1201 Table 4 4 Storage Management M
142. ry RAID redundancy degraded Information This event is generated when there is a memory failure in a RAID configured memory configuration Memory RAID redundancy lost Critical This event is generated when redundancy is lost in a RAID configured memory configuration Memory RAID redundancy regained Information This event is generated when the redundancy lost or degraded earlier is regained in a RAID configured memory configuration Memory Mirrored redundancy degraded Information This event is generated when there is a memory failure in a mirrored memory configuration Memory Mirrored redundancy lost Critical This event is generated when redundancy is lost in a mirrored memory configuration Memory Mirrored redundancy regained Information This event is generated when the redundancy lost or degraded earlier is regained in a mirrored memory configuration Memory Spared redundancy degraded Information This event is generated when there is a memory failure in a spared memory configuration Memory Spared redundancy lost Critical This event is generated when redundancy is lost in a spared memory configuration Memory Spared redundancy regained Information This event is generated when the redundancy lost or degraded earlier is regained in a spared memory configuration Table 3 9 Hardware Log Sensor Events Event Message Severity Cause Log full detected Critical This event is generated when the SEL device detec
143. s lt State gt Chassis intrusion state lt Intrusion state gt Error A chassis intrusion sensor in the specified system detected an error from which it cannot recover The sensor location chassis location previous state and chassis intrusion state are provided Table 2 6 Chassis Intrusion Messages continued Event ID Description Severity Cause Event Message Reference 27 The number of devices required for full redundancy is provided as part of the message when applicable for the redundancy unit and the platform For details on redundancy computation see the respective platform documentation Table 2 7 Redundancy Unit Messages Event ID Description Severity Cause 1300 Redundancy sensor has failed Redundancy unit lt Redundancy location in chassis gt Chassis location lt Name of chassis gt Previous redundancy state was lt State gt Information A redundancy sensor in the specified system failed The redundancy unit location chassis location previous redundancy state and the number of devices required for full redundancy are provided 1301 Redundancy sensor value unknown Redundancy unit lt Redundancy location in chassis gt Chassis location lt Name of chassis gt Previous redundancy state was lt State gt Information A redundancy sensor in the specified system could not obtain a reading The redundancy unit location chassis location previous redundancy state and the
144. s restored 89 Dedicated hot spare assigned Physical disk 1 100 Dedicated hot spare unassigned Physical disk 1 100 Dedicated spare imported as global due to missing arrays 132 Device failed 64 Device returned to normal 82 Diagnostic message 1 121 Drive Events 50 Driver version mismatch 87 drives messages 50 E Enclosure alarm disabled 89 Enclosure alarm enabled 88 Enclosure firmware mismatch 82 Enclosure was shut down 80 entity presence messages 56 Error occurred 1 128 event description reference 10 F Failure prediction threshold exceeded due to test 80 Fan enclosure inserted into system 33 fan enclosure messages 33 Fan enclosure removed from system 33 Fan enclosure removed from system for an extended amount of time 34 fan enclosure sensor 7 Fan enclosure sensor detected a non recoverable value 34 Fan enclosure sensor has failed 33 Fan enclosure sensor value unknown 33 fan sensor 6 Fan sensor detected a failure value 19 Fan sensor detected a non recoverable value 19 Fan sensor detected a warning value 18 Fan Sensor Events 45 Fan sensor has failed 18 44 fan sensor messages 45 Fan sensor returned to a normal value 18 Fan sensor value unknown 18 44 Firmware version mismatch 86 G Global hot spare assigned 74 142 Index 142 Index Global hot spare unassigned 75 H hardware log sensor
145. s for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1151 2191 Multiple enclosures are attached to the controller This is an unsupported configuration Critical Failure Error Cause Many enclosures are attached to the controller port When the enclosure limit is exceeded the controller loses contact with all enclosures attached to the port Action Remove the last enclosure You must remove the enclosure that has been added last and is causing the enclosure limit to exceed Clear Alert Number None Related Alert Number 2211 LRA Number 2091 854 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 100 Storage Management Message Reference 2192 The virtual disk Check Consistency has made corrections and completed Ok Normal Cause This alert is for informational purposes The virtual disk Check Consistency has identified errors and made corrections For example the Check Consistency may have encountered a bad disk block and remapped the disk block to restore data consistency Action This alert is for informational purposes only and no additional action is required As a precaution monitor the Alert Log for other errors related to this virtual disk If problems persist contact Dell Technical Support Clear Alert Nu
146. se and Action Related Alert Information SNMP Trap Numbers 94 Storage Management Message Reference 2166 The RAID controller firmware and driver validation was not performed The configuration file is out of date or corrupted Warning Non critical Cause Storage Management is unable to determine whether the system has the minimum required versions of the RAID controller firmware and drivers This situation has occurred because a configuration file is unreadable or missing data The configuration file may be corrupted Action Reinstall Storage Management Clear Alert Number None Related Alert Number None LRA Number 2060 753 2167 The current kernel version and the non RAID SCSI driver version are older than the minimum required levels See readme txt for a list of validated kernel and driver versions Warning Non critical Cause The version of the kernel and the driver do not meet the minimum requirements Storage Management may not be able to display the storage or perform storage management functions until you have updated the system to meet the minimum requirements Action See the Readme file for a list of validated kernel and driver versions Update the system to meet the minimum requirements and then reinstall Storage Management Clear Alert Number None Related Alert Number None LRA Number 2050 103 2168 The non RAID SCSI driver version is older than the
147. sk that has a red X for its status Perform a rescan after replacing the disk Clear Alert Number 2121 Related Alert Number 2095 2201 2203 LRA Number 2051 2061 2071 2081 2091 2101 754 804 854 904 954 1004 1054 1104 1154 1204 Storage Management Message Reference 65 2049 Physical disk removed Warning Non critical Cause A physical disk has been removed from the disk group This alert can also be caused by loose or defective cables or by problems with the enclosure Action If a physical disk was removed from the disk group either replace the disk or restore the original disk On some controllers a removed disk has a red X for its status On other controllers a removed disk may have an Offline status or is not displayed on the user interface Perform a rescan after replacing or restoring the disk If a disk has not been removed from the disk group then check for problems with the cables See the online help for more information on checking the cables Make sure that the enclosure is powered on If the problem persists check the enclosure documentation for further diagnostic information Clear Alert Number 2052 Related Alert Number 2054 2057 2056 2076 2079 2081 2083 2129 2202 2204 2270 2292 2299 2369 LRA Number 2070 903 2050 Physical disk offline Warning Non critical Cause A physical disk in the disk group is offline A user may have manually put th
148. sk has completed This alert is for informational purposes Action None Clear Alert Number Alert 2130 is a clear alert for alert 2127 Related Alert Number None LRA Number None 1201 2131 Firmware version mismatch Warning Non critical Cause The firmware on the controller is not a supported version Action Install a supported version of the firmware If you do not have a supported version of the firmware available it can be downloaded from the Dell support site at support dell com If you do not have a supported version of the firmware available check with your support provider for information on how to obtain the most current firmware Clear Alert Number None Related Alert Number None LRA Number 2060 753 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 87 2132 Driver version mismatch Warning Non critical Cause The controller driver is not a supported version Action Install a supported version of the driver If you do not have a supported driver version available it can be downloaded from the Dell support site at support dell com If you do not have a supported version of the driver available check with your support provider for information on how to obtain the most current driver Clear Alert Number None Related Alert Number
149. sk that contains the disk errors Review other alert messages to identify the physical disk that has errors If the virtual disk is redundant you can replace the physical disk and continue using the virtual disk If the virtual disk is non redundant you may need to recreate the virtual disk after replacing the physical disk After replacing the physical disk run Check Consistency to check the data Clear Alert Number None Related Alert Number None LRA Number 2081 1204 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 128 Storage Management Message Reference 2341 The Check Consistency made corrections and completed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 2342 The Check Consistency found inconsistent parity data Data redundancy may be lost Warning Non critical Cause The data on a source disk and the redundant data on a target disk is inconsistent Action Restart the Check Consistency task If you receive this alert again check the health of the physical disks included in the virtual disk Review the alert messages for significant alerts related to the physical disks If you suspect that a physical disk has a problem replace it and restore from backup Clear Alert
150. sor messages listed in Table 2 12 provide status and warning information about the noncircular logs that may fill up resulting in lost status messages 1502 AC power has been restored Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Information An AC power cord that did not have AC power has had the power restored The sensor location and chassis location information are provided 1503 AC power has been lost Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Warning An AC power cord has lost its power but there is sufficient redundancy to classify this as a warning The sensor location and chassis location information are provided 1504 AC power has been lost Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Error An AC power cord has lost its power and lack of redundancy requires this to be classified as an error The sensor location and chassis location information are provided 1505 AC power has been lost Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Error An AC power cord sensor in the specified system failed The AC power cord status cannot be monitored The sensor location and chassis location information are provided Table 2 11 AC Power Cord Messages continued Event ID Description Severity Cause 36 Event Message Reference
151. sor value lt Reading gt Specifies the temperature in degrees Celsius for example Temperature sensor value in degrees Celsius 30 Voltage sensor value in Volts lt Reading gt Specifies the voltage sensor value in volts for example Voltage sensor value in Volts 1 693 Table 1 2 Event Description Reference continued Description Line Item Explanation Event Message Reference 13 Event Message Reference The following tables lists in numerical order each event ID and its corresponding description along with its severity and cause NOTE For corrective actions see the appropriate documentation Miscellaneous Messages Miscellaneous messages in Table 2 1 indicate that certain alert systems are up and working Table 2 1 Miscellaneous Messages Event ID Description Severity Cause 0000 Log was cleared Information User cleared the log from Server Administrator 0001 Log backup created Information The log was full copied to backup and cleared 1000 Server Administrator starting Information Server Administrator is beginning to initialize 1001 Server Administrator startup complete Information Server Administrator completed its initialization 1002 A system BIOS update has been scheduled for the next reboot Information The user has chosen to update the flash basic input output system BIOS 1003 A previously scheduled system BIOS update has been canceled Information The user deci
152. stem returned to a valid range after crossing a failure threshold The sensor location chassis location previous state and temperature sensor value are provided 1053 Temperature sensor detected a warning value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Temperature sensor value in degrees Celsius lt Reading gt If sensor type is discrete Discrete temperature state lt State gt Warning A temperature sensor on the backplane board system board CPU or drive carrier in the specified system exceeded its warning threshold The sensor location chassis location previous state and temperature sensor value are provided Table 2 2 Temperature Sensor Messages continued Event ID Description Severity Cause Event Message Reference 17 1054 Temperature sensor detected a failure value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Temperature sensor value in degrees Celsius lt Reading gt If sensor type is discrete Discrete temperature state lt State gt Error A temperature sensor on the backplane board system board or drive carrier in the specified system exceeded its failure threshold The sensor location chassis location previous state and temperature sens
153. t setting is too low and if the room temperature is too cool Clear Alert Number None Related Alert Number 2112 LRA Number 2091 1054 2104 Controller battery is reconditioning Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number 2105 Related Alert Number None LRA Number None 1151 2105 Controller battery recondition is completed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Status Alert 2105 is a clear alert for alert 2104 Related Alert Number None LRA Number None 1151 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 77 2106 Smart FPT exceeded Warning Non critical Cause A disk on the specified controller has received a SMART alert predictive failure indicating that the disk is likely to fail in the near future Action Replace the disk that has received the SMART alert If the physical disk is a member of a non redundant virtual disk then back up the data before replacing the disk NOTICE Removing a physical disk that is included in a non redundant virtual disk will cause the virtual disk to fail and may cause data loss Clear Alert Number None Related Alert Number None LRA Number 2070 903 2107 Smart configuration ch
154. t State gt If sensor type is not discrete Temperature sensor value in degrees Celsius lt Reading gt If sensor type is discrete Discrete temperature state lt State gt Information A temperature sensor on the backplane board system board or the carrier in the specified system failed The sensor location chassis location previous state and temperature sensor value are provided 1051 Temperature sensor value unknown Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt If sensor type is not discrete Temperature sensor value in degrees Celsius lt Reading gt If sensor type is discrete Discrete temperature state lt State gt Information A temperature sensor on the backplane board system board or drive carrier in the specified system could not obtain a reading The sensor location chassis location previous state and a nominal temperature sensor value are provided 16 Event Message Reference 1052 Temperature sensor returned to a normal value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Temperature sensor value in degrees Celsius lt Reading gt If sensor type is discrete Discrete temperature state lt State gt Information A temperature sensor on the backplane board system board or drive carrier in the specified sy
155. ted when the drive is placed in consistency check Drive lt Drive gt consistency check in progress was deasserted Informational This event is generated when the consistency check of the drive is completed Drive lt Drive gt in critical array was asserted Critical This event is generated when the drive is placed in critical array Drive lt Drive gt in critical array was deasserted Informational This event is generated when the drive is removed from critical array Drive lt Drive gt in failed array was asserted Critical This event is generated when the drive is placed in the fail array System Event Log Messages for IPMI Systems 51 Intrusion Events The chassis intrusion messages are a security measure Chassis intrusion alerts are generated when the system s chassis is opened Alerts are sent to prevent unauthorized removal of parts from the chassis Drive lt Drive gt in failed array was deasserted Informational This event is generated when the drive is removed from the fail array Drive lt Drive gt rebuild in progress was asserted Informational This event is generated when the drive is rebuilding Drive lt Drive gt rebuild aborted was asserted Warning This event is generated when the drive rebuilding process is aborted Table 3 11 Intrusion Events Event Message Severity Cause lt Intrusion sensor Name gt sensor detected an intrusion Critical
156. the enclosure have a different SCSI rate This is an unsupported configuration All EMMs in the enclosure should have the same SCSI rate Clear Alert Number None Related Alert Number None LRA Number 2090 853 2174 The controller battery has been removed Warning Non critical Cause The controller cannot communicate with the battery the battery may be removed or the contact point between the controller and the battery may be burnt or corroded Action Replace the battery if it has been removed If the contact point between the battery and the controller is burnt or corroded you will need to replace either the battery or the controller or both See the hardware documentation for information on how to safely access remove and replace the battery Clear Alert Number None Related Alert Number 2188 2318 LRA Number 2100 1153 2175 The controller battery has been replaced Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1151 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 97 2176 The controller battery Learn cycle has started Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number 2177 R
157. trusion state gt Information A chassis intrusion sensor in the specified system failed The sensor location chassis location previous state and chassis intrusion state are provided 1251 Chassis intrusion sensor value unknown Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Chassis intrusion state lt Intrusion state gt Information A chassis intrusion sensor in the specified system could not obtain a reading The sensor location chassis location previous state and chassis intrusion state are provided 1252 Chassis intrusion returned to normal Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Chassis intrusion state lt Intrusion state gt Information A chassis intrusion sensor in the specified system detected that a cover was opened while the system was operating but has since been replaced The sensor location chassis location previous state and chassis intrusion state are provided 26 Event Message Reference Redundancy Unit Messages Redundancy means that a system chassis has more than one of certain critical components Fans and power supplies for example are so important for preventing damage or disruption of a computer system that a chassis may have extra fans or power supplies installed Redundancy allows a second or nth fan to keep th
158. ts that only one entry can be added to the SEL before it is full Log cleared Information This event is generated when the SEL is cleared 50 System Event Log Messages for IPMI Systems Drive Events The drive event messages monitor the health of the drives in a system These events are generated when there is a fault in the drives indicated Table 3 10 Drive Events Event Message Severity Cause Drive lt Drive gt asserted fault state Critical This event is generated when the specified drive in the array is faulty Drive lt Drive gt de asserted fault state Information This event is generated when the specified drive recovers from a faulty condition Drive lt Drive gt drive presence was asserted Informational This event is generated when the drive is installed Drive lt Drive gt predictive failure was asserted Warning This event is generated when the drive is about to fail Drive lt Drive gt predictive failure was deasserted Informational This event is generated when the drive from earlier predictive failure is corrected Drive lt Drive gt hot spare was asserted Warning This event is generated when the drive is placed in a hot spare Drive lt Drive gt hot spare was deasserted Informational This event is generated when the drive is taken out of hot spare Drive lt Drive gt consistency check in progress was asserted Warning This event is genera
159. tution variable is displayed with the alert in the Alert Log and can vary depending on the situation Action Reboot the system If the problem is not resolved contact technical support See your system documentation for information about contacting technical support by using telephone fax and Internet services Clear Alert Number None Related Alert Number None LRA Number 2051 104 2269 The physical disk Clear operation has completed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 901 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 110 Storage Management Message Reference 2270 The physical disk Clear operation failed Critical Failure Error Cause A Clear task was being performed on a physical disk but the task was interrupted and did not complete successfully The controller may have lost communication with the disk The disk may have been removed or the cables may be loose or defective Action Verify that the disk is present and not in a Failed state Make sure the cables are attached securely See the online help for more information on checking the cables Restart the Clear task Clear Alert Number None Related Alert Number None LRA Number 2071 904 2271 T
160. tween the BMC firmware and the processor in use or vice versa Table 3 12 BIOS Generated System Events continued Event Message Severity Cause 54 System Event Log Messages for IPMI Systems Hdwr version err hardware incompatibility BMC Firmware and CPU mismatch was deasserted Information This event is generated when the earlier mismatch between the BMC firmware and the processor is corrected Hdwr version err hardware incompatibility BMC Firmware and other mismatch was asserted Critical This event is generated when there is a mismatch between the BMC firmware and the processor in use or vice versa Hdwr version err hardware incompatibility BMC Firmware and CPU mismatch was deasserted Information This event is generated when an earlier hardware mismatch is corrected SBE Log Disabled correctable memory error logging disabled was asserted Critical This event is generated when the ECC single bit error rate is exceeded CPU Protocol Err transition to non recoverable Critical This event is generated when the processor protocol enters a non recoverable state CPU Bus PERR transition to non recoverable Critical This event is generated when the processor bus PERR enters a non recoverable state CPU Init Err transition to non recoverable Critical This event is generated when the processor initialization enters a non recoverable state CPU Machine Chk transition to non recove
161. up complete 13 Service tag changed 91 Single bit ECC error limit exceeded 98 Single bit ECC error 115 Single bit ECC error The DIMM is critically degraded 122 Single bit ECC error The DIMM is critically degraded There will be no further reporting 122 Single bit ECC error The DIMM is degrading 122 Smart configuration change 77 Smart FPT exceeded 77 SMART thermal shutdown is disabled 107 SMART thermal shutdown is enabled 107 Smart warning 78 Smart warning degraded 80 Smart warning temperature 79 SMBIOS data is absent 14 System Event Log Messages 43 system management data manager started 14 system management data manager stopped 14 T Temperature dropped below the minimum failure threshold 76 Temperature dropped below the minimum warning threshold 75 Temperature exceeded the maximum failure threshold 76 Temperature exceeded the maximum warning threshold 75 temperature sensor 6 Temperature sensor detected a failure value 17 Temperature sensor detected a non recoverable value 17 Temperature sensor detected a warning value 16 Temperature Sensor Events 43 Temperature sensor has failed 15 43 temperature sensor messages 15 43 Temperature sensor returned to a normal value 16 43 Temperature sensor value unknown 15 43 The AC power supply cable has been removed 123 The background initialization BGI rate has c
162. urposes Action None Clear Alert Number None Related Alert Number None LRA Number None 751 2366 Dedicated spare imported as global due to missing arrays Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 901 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 133 2367 Rebuild not possible as SAS SATA is not supported in the same virtual disk Warning Non critical Cause The physical disk is using an incompatible technology Action All physical disks in the virtual disk must use the same technology You cannot use both SAS and SATA physical disks in the same virtual disk Remove the physical disk and insert a new physical disk that uses the correct technology If the rebuild does not start automatically after you have inserted a suitable physical disk then run the Rebuild task Clear Alert Number None Related Alert Number 2326 LRA Number 2070 903 2368 The SCSI Enclosure Processor SEP has been rebooted as part of the firmware download operation and will be unavailable until the operation completes Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Nu
163. vent Message Severity Cause BMC OS Watchdog timer expired Information This event is generated when the BMC watchdog timer expires and no action is set BMC OS Watchdog performed system reboot Critical This event is generated when the BMC watchdog detects that the system has crashed timer expired because no response was received from Host and the action is set to reboot BMC OS Watchdog performed system power off Critical This event is generated when the BMC watchdog detects that the system has crashed timer expired because no response was received from Host and the action is set to power off BMC OS Watchdog performed system power cycle Critical This event is generated when the BMC watchdog detects that the system has crashed timer expired because no response was received from Host and the action is set to power cycle System Event Log Messages for IPMI Systems 49 Memory Events The memory modules can be configured in different ways in particular systems These messages monitor the status warning and configuration information about the memory modules in the system Hardware Log Sensor Events The hardware logs provide hardware status messages to the system management software On particular systems the subsequent hardware messages are not displayed when the log is full These messages provide status and warning messages when the logs are full Table 3 8 Memory Events Event Message Severity Cause Memo
164. w w w d e l l c o m s u p p o r t d e l l c o m Dell OpenManage Server Administrator Messages Reference Guide Notes and Notices NOTE A NOTE indicates important information that helps you make better use of your computer NOTICE A NOTICE indicates either potential damage to hardware or loss of data and tells you how to avoid the problem ____________________ Information in this document is subject to change without notice 2003 2007 Dell Inc All rights reserved Reproduction in any manner whatsoever without the written permission of Dell Inc is strictly forbidden Trademarks used in this text Dell the DELL logo and Dell OpenManage are trademarks of Dell Inc Microsoft and Windows are registered trademarks and Windows Server is a trademark of Microsoft Corporation Red Hat is a registered trademark of Red Hat Inc SUSE is a registered trademark of Novell Inc in the United States and other countries Other trademarks and trade names may be used in this document to refer to either the entities claiming the marks and names or their products Dell Inc disclaims any proprietary interest in trademarks and trade names other than its own October 2007 Contents 3 Contents 1 Introduction 5 What s New in this Release 5 Messages Not Described in This Guide
165. xt Displayed in the Storage Management Service Documentation Message Text Displayed in the Alert Log with Variable Information Supplied 2127 Background Initialization started Background Initialization started Virtual Disk 3 Virtual Disk 3 Controller 1 PERC 5 E Adapter 2334 Controller event log Controller event log Current capacity of the battery is above threshold Controller 1 PERC 5 E Adapter Table 4 2 Message Format with Variables for Each Storage Object Storage Object Message Variables A B C and X Y Z in the following examples are variables representing the storage object name or number Controller Message Format Controller A Name Message Format Controller A Example 2326 A foreign configuration has been detected Controller 1 PERC 5 E Adapter NOTE The controller name is not always displayed Battery Message Format Battery X Controller A Example 2174 The controller battery has been removed Battery 0 Controller 1 SCSI Physical Disk Message Format Physical Disk X Y Controller A Connector B Example 2049 Physical disk removed Physical Disk 0 14 Controller 1 Connector 0 SAS Physical Disk Message Format Physical Disk X Y Z Controller A Connector B Example 2049 Physical disk removed Physical Disk 0 0 14 Controller 1 Connector 0 Storage Management Message Reference 59 Virtual Disk Message Format Virtual Disk X Name Controller A Name Message Format Virtual Disk X Co
166. y degraded Information The fan specified by lt Sensor Name Location gt may have failed and hence the redundancy has been degraded lt Sensor Name Location gt Fan Redundancy sensor redundancy lost Critical The fan specified by lt Sensor Name Location gt may have failed and hence the redundancy that was degraded previously has been lost lt Sensor Name Location gt Fan Redundancy sensor redundancy regained Information The fan specified by lt Sensor Name Location gt may have started functioning again and hence the redundancy has been regained 46 System Event Log Messages for IPMI Systems Processor Status Events The processor status messages monitor the functionality of the processors in a system These messages provide processor health and warning information of a system Table 3 4 Processor Status Events Event Message Severity Cause lt Processor Entity gt status processor sensor IERR where lt Processor Entity gt is the processor that generated the event For example PROC for a single processor system and PROC for multiprocessor system Critical IERR internal error generated by the lt Processor Entity gt lt Processor Entity gt status processor sensor Thermal Trip Critical The processor generates this event before it shuts down because of excessive heat caused by lack of cooling or heat synchronization lt Processor Entity gt status processor sensor recovered from IERR Info
167. y detect that a virtual disk has been deleted and generate this alert Action None Clear Alert Number None Related Alert Number None LRA Number 2080 1203 2055 Virtual disk configuration changed Ok Normal Cause This alert is for informational purposes Action None Clear Alert Number None Related Alert Number None LRA Number None 1201 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 67 2056 Virtual disk failed Critical Failure Error Cause One or more physical disks included in the virtual disk have failed If the virtual disk is non redundant does not use mirrored or parity data then the failure of a single physical disk can cause the virtual disk to fail If the virtual disk is redundant then more physical disks have failed than can be rebuilt using mirrored or parity information Action Create a new virtual disk and restore from a backup Clear Alert Number None Related Alert Number 2048 2049 2050 2076 2079 2081 2129 2346 LRA Number 2081 1204 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers 68 Storage Management Message Reference 2057 Virtual disk degraded Warning Non critical Cause 1 This alert
168. y have been lost this alert does not always indicate that relevant or user data has been lost Action Verify that the battery and memory are functioning properly Clear Alert Number None Related Alert Number None LRA Number 2060 753 2187 Single bit ECC error limit exceeded Warning Non critical Cause The system memory is malfunctioning Action Replace the battery pack Clear Alert Number None Related Alert Number None LRA Number 2060 753 Table 4 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related Alert Information SNMP Trap Numbers Storage Management Message Reference 99 2188 The controller write policy has been changed to Write Through Ok Normal Cause The controller battery is unable to maintain cached data for the required period of time For example if the required period of time is 24 hours the battery is unable to maintain cached data for 24 hours It is normal to receive this alert during the battery Learn cycle as the Learn cycle discharges the battery before recharging it When discharged the battery cannot maintain cached data Action Check the health of the battery If the battery is weak replace the battery pack Clear Alert Number None Related Alert Number None LRA Number None 1151 2189 The controller write policy has been changed to Write Back Ok Normal Cause This alert i
169. ystem The device may have been added to the system incorrectly 40 Event Message Reference Battery Sensor Messages Battery sensors monitor how well a battery is functioning Battery messages listed in Table 2 15 provide status and warning information for batteries in a particular chassis Table 2 15 Battery Sensor Messages Event ID Description Severity Cause 1700 Battery sensor has failed Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Battery sensor status lt status gt Information A battery sensor in the specified system is not functioning The sensor location chassis location previous state and battery sensor status are provided 1701 Battery sensor value unknown Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Battery sensor status lt status gt Information A battery sensor in the specified system could not retrieve a reading The sensor location chassis location previous state and battery sensor status are provided 1702 Battery sensor returned to a normal value Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Battery sensor status lt status gt Information A battery sensor in the specified system detected that a battery transitioned back to a normal
Download Pdf Manuals
Related Search
Related Contents
Imation 2GB Atom PDFカタログはこちら Instructions HWAT-Eco-04 Samsung NA64H3000AK/PC Manuel de l'utilisateur RK-750 Technaxx TX-22 FC-POSの詳細パンフレットはこちらから - フジミック新潟 Aurora AS610C User's Manual 通信7月号&夏休みの開館状況 Copyright © All rights reserved.
Failed to retrieve file