Home
Sun Fire V1280/Netra 1280 System Administration Guide
Contents
1. Local I2C PCF8591 Test VOLT_AD Device Test channel 00000001 Voltage 0x00000099 1 49 channel 00000002 Voltage 0x0000009D 3 37 channel 00000003 Voltage 0x0000009A 5 1 channel 00000004 Voltage 0x00000000 0 0 Local I2C LM75 Test EMPO IIep Device Test Temparature 24 50 Degree C Local I2C LM75 Test EMP 1 Rio Device Test Temparature 23 50 Degree C Chapter 6 Running POST 65 CODE EXAMPLE 6 3 SCPOST Output with Diagnostic Level Set to min Continued Local I2C LM75 EMP 2 CBH Temparature Local I2C PCF8574 Sc CSR Console Bus Hub CBH Register Access POST Complete Test Device Test 32 0 Degree C Test Device Test Test Test 66 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CHAPTER 7 Troubleshooting This chapter provides troubleshooting information for a system administrator The chapter describes the following topics System Faults on page 72 Displaying Diagnostic Information on page 82 Assisting Sun Service Personnel in Determining Causes of Failure on page 83 Recovering a Hung System on page 77 Device Mapping The physical address represents a physical characteristic that is unique to the device Examples of physical addresses include the bus address and the slot number The slot number indicates where the device is installed You reference a physical device by the node identifier
2. TABLE 9 2 TABLE 9 3 TABLE 9 4 TABLE 9 5 TABLE 9 6 TABLE 9 7 TABLE 9 8 TABLE 9 9 Tables Selected Management Tasks 3 System Indicator LED Functions 5 POST Configuration Parameters 60 CPU and Memory Agent ID Assignment 68 I O Assembly Type and Number of Slots 69 Number and Name of I O Assemblies per System 69 I O Controller Agent ID Assignments 69 IB_SSC Assembly PCI Device Mapping 70 System Fault Indicator States 73 Blacklisting Component Names 75 Checking Temperature Conditions Using the showenvironment Command 79 Types of DR Operation 94 Board Receptacle States 95 Board Occupant States 96 Board Conditions 96 Component Occupant States 97 Component Conditions 97 Component Types 97 DR Board States from the System Controller SC 99 cfgadm c Command Options 102 xi TABLE 9 10 cfgadm x Command Options 102 TABLE 9 11 Diagnostic Levels 103 xii Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Code Samples CODE EXAMPLE 2 1 CODE EXAMPLE 2 2 CODE EXAMPLE 2 3 CODE EXAMPLE 2 4 CODE EXAMPLE 2 5 CODE EXAMPLE 5 1 CODE EXAMPLE 5 2 CODE EXAMPLE 5 3 CODE EXAMPLE 5 4 CODE EXAMPLE 5 5 CODE EXAMPLE 5 6 CODE EXAMPLE 6 1 CODE EXAMPLE 6 2 CODE EXAMPLE 6 3 CODE EXAMPLE 8 1 CODE EXAMPLE 8 2 CODE EXAMPLE 9 1 CODE EXAMPLE 9 2 Hardware Reset Output from the System Controller 18 Output From the setupnetwork Command 21 Installing the LOM Drive
3. 87 97 1 1 1 1 1 shutdown 63 degC 107 degC 107 degC 107 degC 107 degC 107 degC 107 degC 107 degC 107 degC 107 degC 107 degC 107 degC 107 degC degC degC degC degC degC degC degC degC 07 degC 07 degC 07 degC 07 degC 07 degC To check the internal temperature of the system and also the system s warning and shutdown threshold temperatures type Chapter 5 Using Lights Out Management and the System Controller from Solaris 53 CODE EXAMPLE 5 6 Sample Output from the 1om t Command Continued 39 SB2 0 SB2 SB2 SB2 SB2 SB2 SB2 SB2 SB2 SB2 SB2 IB6 IB6 IB6 IB6 IB6 IB6 IB6 IB6 IB6 D KB DDB HBR D BoD O 0 JU amp WN EF O1 O1 O1 O1 O1 O1 O1 O1 O1 JU amp NN EH PO PO P1 P1 P2 P2 P3 P3 t_dx3 53 degC warning 102 degC shutdown 107 degC t_sbbc0 52 degC warning 102 degC shutdown 107 degC t_sbbcl 42 degC warning 102 degC shutdown 107 degC Ambient 27 degC warning 82 degC shutdown 87 degC Die 54 degC warning 92 degC shutdown 97 degC Ambient 26 degC warning 82 degC shutdown 87 degC Die 53 degC warning 92 degC shutdown 97 degC Ambient 27 degC warning 82 degC shutdown 87 degC Die 51 degC warning 92 degC shutdown 97 degC Ambient 27 degC warning 82 degC shutdown 87 degC Die 51 degC warning 92 degC shutdown 97 degC t_ambient0 29 degC warning 82 degC shutdown 87 degC t_ambientl 29 degC warning 82 degC
4. Agent ID AID The AID ranges from 0 to 31 in decimal notation 0 to 1f in hexadecimal In the device path beginning with ssm 0 0 the first number 0 is the node ID CPU Memory Mapping CPU Memory board and memory agent IDs AIDs range from 0 to 23 in decimal notation 0 to 17 in hexadecimal The system can have up to three CPU Memory boards 67 Each CPU Memory board has four CPUs depending on your configuration Each CPU Memory board has up to four banks of memory Each bank of memory is controlled by one memory management unit MMU which is the CPU The following code example shows a device tree entry for a CPU and its associated memory ssm 0 0 SUNW UltraSPARC III b 0 ssm 0 0 SUNW memory controller b 400000 where in b 0 a bis the CPU agent identifier AID a Ois the CPU register inb 400000 a bis the memory agent identifier AID m 400000 is the memory controller register There are up to four CPUs on each CPU Memory board TABLE 7 1 CPUs with agent IDs 0 3 reside on board name SBO CPUs with agent IDs 8 11 on board name SB2 and so on TABLE 7 1 CPU and Memory Agent ID Assignment CPU Memory Board Name Agent IDs On Each CPU Memory Board CPU 0 CPU 1 CPU 2 CPU 3 SBO 0 0 1 1 2 2 3 3 SB2 8 8 9 9 10 a 11 b SB4 16 10 17 11 18 12 19 13 The first number in the columns of agent IDs is a decimal number The number or letter in parentheses is in hexadecimal notation 68 Sun
5. Chapter 7 Troubleshooting 79 TABLE 7 8 Checking Temperature Conditions Using the showenvironment Command Continued SSC1 Board 0 Temp 0 22 Degrees C 1 sec OK SSC1 Board 0 Temp 1 22 Degrees C 1 sec OK SSC1 Board 0 Temp 2 28 Degrees C 1 sec OK SSC1 Board 0 1 5 VDC 0 1 49 Volts DC 1 sec OK SSC1 Board 0 3 3 VDC 0 3 35 Volts DC 1 sec OK SSC1 Board 0 5 VDC 0 4 98 Volts DC 1 sec OK N0 PSO Input 0 Volt 0 1 sec OK N0 PSO 48 VDC 0 Volt 0 48 00 Volts DC 1 sec OK NO PS1 Input 0 Volt 0 5 sec OK NO PS1 48 VDC 0 Volt 0 48 00 Volts DC 5 sec OK NO FTO Fan 0 Cooling 0 Auto 5 sec OK NO FTO Fan 1 Cooling 0 Auto 5 sec OK NO FTO Fan 2 Cooling 0 Auto 5 sec OK NO FTO Fan 3 Cooling 0 Auto 5 sec OK NO FTO Fan 4 Cooling 0 Auto 5 sec OK NO FTO Fan 5 Cooling 0 Auto 5 sec OK NO FTO Fan 6 Cooling 0 Auto 5 sec OK NO FTO Fan 7 Cooling 0 Auto 5 sec OK NO RPO Board 0 1 5 VDC 0 1 49 Volts DC 5 sec OK NO RPO Board 0 3 3 VDC 0 3 37 Volts DC 5 sec OK NO RPO Board 0 Temp 0 20 Degrees C 5 sec OK NO RPO Board 0 Temp 1 19 Degrees C 5 sec OK NO RPO SDC 0 Temp 0 55 Degrees C 5 sec OK NO RPO AR 0 Temp 0 45 Degrees C 5 sec OK NO RPO DX 0 Temp 0 57 Degrees C 5 sec OK NO RPO DX 1 Temp 0 59 Degrees C 5 sec OK NO RP2 Board 0 5 VDC 0 1 48 Volts DC 5 sec OK NO RP2 Board 0 3 3 VDC 0 3 37 Volts DC 5 sec OK N0 RP2 Board 0 Temp 0 22 Degrees C 5 sec OK N0 RP2 Board 0 Temp 1 22 Degrees C 5 sec OK NO RP2 SDC 0 Temp
6. Programming NO SBO PROM 0 Comparing image and flash Image and flash are different proceeding BEASITNG cstepiaaronio Gaye Done Programming airone rate Done Verteying caps Piana Done May 22 14 46 40 commando lw8 NO SBO PROM 002 Programming NO SBO PROM 1 Comparing image and flash Image and flash are different proceeding Erasing Spie nada giate Done Programnthig sosia Done VerLEeybno Lure Done ay 22 14 47 08 commando 1w8 NO SBO PROM 002 Programming N0 SB2 PROM 0 Comparing image and flash Image and flash are different proceeding Erasing i lesione Done Programming pei Done VOT EY IMG Histo Done ay 22 14 47 36 commando lw8 N0 SB2 PROM 002 Programming N0 SB2 PROM 1 Comparing image and flash Image and flash are different proceeding BraAstEngo Lo aes lento Done PHOGEAMMING a susie heave sue le Done VECLEYIDNG si di pra bae Done May 22 14 48 10 commando lw8 N0 SB2 PROM 002 Firmware update complet with update 0 updated with version 5 13 5 05 17 2 with update 1 updated with version 5 13 5 05 17 2 with update 0 updated with version 5 13 5 05 17 2 with update 1 updated with version 5 13 5 05 17 2 You must reboot Solaris to load the new firmware 90 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CHAPTER 9 CPU Memory Board Replacement and Dynamic Reconfiguration DR This chapter describes how to dynamically
7. RAS 6 Reliability 6 Disabling Components or Boards and Power On Self Test POST Manual Disabling of Components 7 Environmental Monitoring 7 Availability 7 Dynamic Reconfiguration 8 Power Failure 8 System Controller Reboot 8 Host Watchdog 8 Serviceability 8 LEDs 9 7 Nomenclature 9 System Controller Error Logging 9 System Controller XIR eXternally Initiated Reset Support 9 2 Starting and Setting Up the Sun Fire V1280 Netra 1280 11 Installing and Cabling Hardware 12 Using the Power On Standby Switch 13 Powering On and Off 14 Powering On 14 Y Initial Power On 14 v Powering On from Standby Mode 14 Bringing the System to Standby Mode 15 After Powering On 18 Setting Up the System 19 v ToSetthe Date and Time 19 v To Set Up the Password 20 v To Configure Network Parameters 20 Installing and Booting the Solaris Operating Environment 22 v To Install and Boot the Solaris Operating Environment 22 v To Install the Lights Out Management Packages 23 v To Install the LOM Drivers 23 v To Install the LOM Utility 25 v To Install the LOM Manual Pages 27 Resetting the System 27 v To Forcibly Reset the System 27 v To Reset the System Controller 28 3 Console Navigation Procedures 29 Establishing a LOM Console Connection 30 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Accessing the LOM Console Using the Serial Port 30 v To Connect to an ASCII Terminal 30 v To Connect to a Network Terminal Server 32 v To Conne
8. byte 19 700000 scsi 3 disk block ssm 0 0 pci ssm 0 0 pci ssm 0 0 pci ssm 0 0 pci ssm 0 0 pci ssm 0 0 pci ssm 0 0 pci aD Q A 19 700000 SUNW qlc 2 scsi fcp 19 700000 SUNW glc 2 fp 0 0 fp 19 700000 SUNW qlc 2 fp 0 0 disk block ssm 0 0 pci ssm 0 0 pci ssm 0 0 pci oO 19 700000 SUNW qlc 1 scsi fcp 19 700000 SUNW glc 1 fp 0 0 fp 19 700000 SUNW qlc 1 fp 0 0 disk block ssm 0 0 pci ssm 0 0 pci ssm 0 0 pci oO A System Faults A system fault is any condition that is considered to be unacceptable for normal system operation When the system has a fault the Fault LED um will turn on The system indicators are shown in FIGURE 7 2 72 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 On Stand by switch Sun microsystems Locator System Power UNIX Running Source A and Source B System Fault Top Access Required Alarmi and Alarm2 FIGURE 7 2 System Indicators The indicator states are shown in TABLE 7 6 You must take immediate action to eliminate a system fault TABLE 7 6 System Fault Indicator States System Fault Fault indicator lit indicator lit Top Access when fault on FRU lit on FRU FRU name detected fault fault Comments System Board Yes Yes Yes Includes processors Ecache and DIMMs Level 2 repeater Yes Yes Yes IB_SSC Yes Yes Yes System Controller No Yes Yes IB_SSC fault LED lit Fan Yes
9. on page 16 Press the On Standby switch to the left and hold it for at least four seconds The system will power down to standby mode The only indicator LEDs to be illuminated on the system indicator board are the Source A and Source B indicators The IB_SSC assembly Active LED will also be illuminated but not visible from the front of the system Chapter 2 Starting and Setting Up the Sun Fire V1280 Netra 1280 17 After Powering On You will see the following output on the System Controller serial port connection CODE EXAMPLE 2 1 Hardware Reset Output from the System Controller Hardware Reset SYSTEM CONTROLLER SC POST 23 2002 03 22 18 03 PSR 0x044010e5 PCR 0x04004000 Basic sanity checks done Skipping POST ERI Device Present Getting MAC address for SSCl Using SCC MAC address AC address is 0 3 xx xx xx xx Hostname some_name Address xxx XXX XXX XXX etmask 255 255 255 0 Attached TCP IP interface to eri unit 0 Attaching interface 100 done Gateway XXX XXX XXX XXX interrupt 100 Mbps half duplex link up Copyright 2001 2002 Sun Microsystems Use is subject to license terms Lights Out Management Firmware RTOS version 23 ScApp version 5 13 0007 LW8_build0 7 SC POST diag level off Inc All rights reserved The date is Friday July 19 2002 3 48 50 PM BST Fri Jul 19 15 48 51 some name lom Boot ScApp 5 13 0007 RTOS 23 Fri Jul 19 15 48 54 some_name lom SBBC Reset
10. BO B1 PCIO PCM PCI2 PCI3 PCI4 PCI5 lt slot gt RPO RP2 Blacklist a component or device if you believe it might be failing intermittently or is failing Troubleshoot a device you believe is having problems There are three system controller commands for blacklisting m disablecomponent m enablecomponent m showcomponent Chapter 7 Troubleshooting 75 The disablecomponent and enablecomponent commands only update the blacklist They do not directly affect the state of the currently configured system boards The updated lists will take effect when you do one the following m Reboot the system m Use dynamic reconfiguration to configure the board containing the blacklisted component out of and then back into the system In order to use disablecomponent and enablecomponent on the Repeater Boards RP0 RP2 the system first has to be shut down to Standby using the poweroff command When the disablecomponent or enablecomponent command is issued for a Repeater Board RP0 RP2 the System Controller will be automatically reset to make use of the new settings If a replacement Repeater Board is inserted it is necessary to manually reset the System Controller using the reset sc command Refer to the Sun Fire V1280 Netra 1280 System Controller Command Reference Manual for a description of this command Special Considerations for CPU Memory Boards In the unlikely event that a CPU Memory board fails the interconnect t
11. Board Configuration Failure Cannot Configure Either CPUO or CPU1 While the Other Is Configured Before you try to configure either CPUO or CPU1 make sure that the other CPU is unconfigured Once both CPU0 and CPU1 are unconfigured it is then possible to configure both of them CPUs on a Board Must Be Configured Before Memory Before configuring memory all CPUs on the system board must be configured If you try to configure memory while one or more CPUs are unconfigured the system displays an error message such as 110 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 cfgadm Hardware specific failure configure NO SB2 memory Can t config memory if not all cpus are online ssm 0 0 memorycontroller Error Logging Error messages from Solaris are logged using syslog and SunMC Error messages from the System Controller are also logged to SunMC Chapter 9 CPU Memory Board Replacement and Dynamic Reconfiguration DR 111 112 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Glossary ap_id Attachment point cfgadm command Condition Configuration system Configuration board Connection Detachability Disconnection DR Attachment point identifier an ap_id specifies the type and location of the attachment point in the system and is unambiguous There are two types of identifier physical and logical A physical identifier contains a fully specified p
12. H NH Sun Fire V1280 OpenFirmware version 5 13 0007 07 18 02 12 45 Copyright 2001 Sun Microsystems Inc All rights reserved SmartFirmware Copyright C 1996 2001 All rights reserved 16384 MB memory installed Serial 9537054 Ethernet address 8 0 xx xx xx xx Host ID 80xxxxxx OTICE obp_ main Extended diagnostics are now switched on 0 ok Controlling POST With the bootmode Command The System Controller bootmode command allows you to specify the boot configuration for the next system reboot only This removes the necessity for taking the system down to the OpenBoot PROM to make these changes for instance to the diag level variable For example use the following command to force the highest level of POST tests to be run prior to the next reboot lom gt shutdown lom gt bootmode diag lom gt poweron Chapter 6 Running POST 63 To force the lowest level of POST tests to be run prior to the next reboot use lom gt shutdown lom gt bootmode skipdiag lom gt poweron If the system is not rebooted within 10 minutes of the bootmode command being issued the bootmode setting is returned to normal and the previously set values of diag level and verbosity level are applied For a fuller description of these commands refer to the Sun Fire V1280 Netra 1280 System Controller Command Reference Manual Controlling the System Controller POST The System Controller Power On Self Test is
13. Local POST sgcpu flash contains CPU Memory Board Local POST and OBP sgsc flash contains LOM System Controller firmware sgrtos flash contains LOM System Controller Real Time Operating System You must place these in a suitable directory for instance var tmp and issue the lom G command with the filename of the file to be downloaded The firmware knows from header information contained in the file which image type is being upgraded These images will be provided in a patch downloadable from www sunsolve sun com or from your SunService representative The patch README file should contain full instructions for installing these new firmware images It is very important that the instructions are followed exactly otherwise you may render your system unbootable Caution Do not interrupt the lom G operation If the lom G command is terminated abnormally the System Controller will go into single use mode and will only be accessible from the serial port Caution Before performing a lom G check the firmware revisions of all boards using the showboards p version command Caution You are strongly recommended to run the lom G command from a Solaris console running on the serial connection so that the results can be fully monitored Caution Before updating CPU Memory boards or the I O Assembly ensure that all boards to be updated are powered on by using the poweron command 88 Sun Fire V1280
14. POST has run If this parameter value is set to false you will obtain the OpenBoot PROM ok prompt after POST runs from which you must type a boot command to boot the Solaris operating environment Controls the behavior of the system after an externally initiated reset XIR as well as a red mode trap Chapter 6 Running POST 61 TABLE 6 1 POST Configuration Parameters Continued Parameter Value Description sync The OpenBoot PROM invokes sync A core file is generated If default value the invocation returns the OpenBoot PROM performs a reboot none The OpenBoot PROM prints a message describing the reset trap that triggered the error reset and passes control to the OpenBoot PROM ok prompt The message describing the reset trap type is platform specific boot The OpenBoot PROM firmware reboots the system A core file is not generated Rebooting a system occurs using the OpenBoot PROM settings for diag device or boot device depending on the value of the OpenBoot PROM configuration variable diag switch If diag switch is set to true the device names in diag device will be the default for boot If diag switch is set to false the device names in boot device will be the default for boot The default output from POST will be similar to CODE EXAMPLE 6 1 CODE EXAMPLE 6 1 POST Output Using max Setting Testing CPU Boards Loading the test table from board SBO PROM 0 N0 SB0 P0 Running CPU POR an
15. Pe OO a w Q D 0 O K Ww FIGURE 9 1 shows details of the display in CODE EXAMPLE 9 2 Occupant State Attachment n Board Component Point ID Receptacle State vu Information NO IB6 connected configured ok powered on assigned Apr 3 18 04 PCI_I O_Boa n devices ssm 0 0 N0 IB6 Busy State Physical ID and location Board Component Type When Connected FIGURE 9 1 Details of the Display for cfgadm av Command Options The options to the cfgadm c command are listed in TABLE 9 9 Chapter 9 CPU Memory Board Replacement and Dynamic Reconfiguration DR 101 102 TABLE 9 9 cfgadm c Command Options cfgadm c Option Function connect The slot provides power to the board and begins monitoring the board The slot is assigned if it was not previously assigned disconnect The system stops monitoring the board and power to the slot is turned off configure The operating system assigns functional roles to a board and loads device drivers for the board and for the devices attached to the board unconfigure The system detaches a board logically from the operating system and takes the associated device drivers offline Environmental monitoring continues but any devices on the board are not available for system use The options provided by the cfgadm x command are listed in TABLE 9 10 TABLE 9 10 cfgadm x Command Options cfgadm x Option Function poweron Powers on a CPU Memory b
16. amp Sun microsystems Sun Fire V1280 Netra 1280 System Administration Guide Sun Microsystems Inc 4150 Network Circle Santa Clara CA 95054 U S A 650 960 1300 Part No 817 0509 10 December 2002 Revision A Send comments about this document to docfeedback sun com Copyright 2002 Sun Microsystems Inc 4150 Network Circle Santa Clara California 95054 U S A All rights reserved Sun Microsystems Inc has intellectual property rights relating to technology embodied in the product that is described in this document In porous and without limitation these intellectual property rights may include one or more of the U S patents listed at ttp www sun com patents and one or more additional patents or pending patent applications in the U S and in other countries This document and the product to which it pertains are distributed under licenses restricting their use copying distribution and decompilation No part of the product or of this document may be reproduced in any form by any means without prior written authorization of Sun and its licensors if any Third party software including font technology is copyrighted and licensed from Sun suppliers Parts of the product may be derived from Berkeley BSD systems licensed from the University of California UNIX is a registered trademark in the U S and in other countries exclusively licensed through X Open Company Ltd Sun Sun Microsystems the Sun logo AnswerBoo
17. and Permanent Memory 97 Limitations 98 Memory Interleaving 98 Reconfiguring Permanent Memory 98 Command Line Interface 99 The cfgadm Command 99 Displaying Basic Board Status 99 Displaying Detailed Board Status 100 Command Options 101 Testing Boards and Assemblies 102 v To Test a CPU Memory Board 102 Installing or Replacing CPU Memory Boards 104 v To Install a New Board 104 v To Hot Swap a CPU Memory Board 104 v To Remove a CPU Memory Board From the System 105 v To Disconnect a CPU Memory Board Temporarily 106 Troubleshooting 107 Unconfigure Operation Failure 107 CPU Memory Board Unconfiguration Failures 107 Configure Operation Failure 110 CPU Memory Board Configuration Failure 110 Error Logging 111 Glossary 113 Index 117 viii Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Figures FIGURE 1 1 FIGURE 1 2 FIGURE 2 1 FIGURE 3 1 FIGURE 4 1 FIGURE 7 1 FIGURE 7 2 FIGURE 9 1 VO Ports 2 System Indicator Board 5 Sun Fire V1280 Netra 1280 System Power On Standby Switch 13 Navigation Procedures 38 System Controller Logging 44 Sun Fire V1280 Netra 1280 IB_SSC PCI Physical Slot Designations for IB6 71 System Indicators 73 Details of the Display for cfgadm av 101 x Sun Fire V1280 Netra 1280 System Administration Guide December 2002 TABLE 1 1 TABLE 1 2 TABLE 6 1 TABLE 7 1 TABLE 7 2 TABLE 7 3 TABLE 7 4 TABLE 7 5 TABLE 7 6 TABLE 7 7 TABLE 7 8 TABLE 9 1
18. balancing failures when multiple network interface cards are attached to a system If a failure occurs in a network adapter and if an alternate adapter is connected to the same IP link the system switches all the network accesses from the failed adapter to the alternate adapter When multiple network adapters are connected to the same IP link any increases in network traffic are spread across multiple network adapters which improves network throughput A DR operation in which hardware is not physically added or removed An example is the deactivation of a failed board that is then left in the slot to avoid changing the flow of cooling air until a replacement is available Hardware resource such as a system board or a disk drive that occupies a DR receptacle or slot A specific Sun Fire system model such as the Sun Fire V1280 Netra 1280 system A DR operation that involves the physical addition or removal of a board See also Logical DR A brief pause in the operating environment to allow an unconfigure and disconnect operation on a system board with non pageable OpenBoot PROM OBP or kernel memory All operating environment and device activity on the backplane must cease for a few seconds during a critical phase of the operation A receiver such as a board slot or SCSI chain A board connector Sun Fire V1280 Netra 1280 System Administration Guide December 2002 SNMP State Suspendability Suspend safe Suspen
19. configured using the LOM setupsc command This enables the System Controller POST level to be set to of f min or max For a fuller description of this command refer to the Sun Fire V1280 Netra 1280 System Controller Command Reference Manual System Controller POST output only appears on the System Controller serial connection To set the SCPOST diagnostic level default to min CODE EXAMPLE 6 2 Setting SCPOST Diagnostic Level to min lom gt setupsc System Controller Configuration SC POST diag Level off min Host Watchdog enabled Rocker Switch enabled Secure Mode off lom gt 64 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 When SCPOST diag level is set to min you will see the following output on the serial port whenever the System Controller is reset CODE EXAMPLE 6 3 SCPOST Output with Diagnostic Level Set to min SYSTEM CONTROLLER SC POST 21 2001 12 11 17 11 PSR 0x044010e5 PCR 0x04004000 SelfTest running at DiagLevel 0x20 SC Boot PROM Test BootPROM CheckSum Test IU Test IU instruction set Test Little endian access Test FPU Test FPU instruction set Test SparcReferenceMMU Test SRMMU TLB RAM Test SRMMU TLB Read miss Test SRMMU page probe Test SRMMU segment probe Test SRMMU region probe Test SRMMU context probe Test lt more SCPOST ouput gt Local I2C AT24C64 Test EEPROM Device Test performing eeprom sequential read
20. degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning warning 102 102 82 82 82 82 53 102 102 102 102 82 53 102 102 102 102 102 102 102 102 102 102 102 102 82 92 82 92 82 92 82 92 102 102 102 102 102 degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC Checking the Internal Temperature Lom t shutdown 107 degC shutdown 107 degC shutdown shutdown shutdown shutdown shutdown shutdown shutdown shutdown shutdown 87 87 87 87 63 degC degC degC degC degC 07 degC 07 degC 07 degC 07 degC shutdown 87 degC hutdown hutdown hutdown hutdown hutdown hutdown hutdown hutdown hutdown hutdown hutdown shutdown shutdown shutdown shutdown shutdown shutdown shutdown shutdown shutdown shutdown shutdown shutdown shutdown shutdown AND NDNA U U U U U YN 87 97 87 97 87 97
21. drv sparcv9 lomp drivers INFO Installing var tmp SUNWlomr reloc platform sun4u kernel drv sparcv9 lomv P P EE P F i drivers INFO Identified drivers lom lomp lomv i drivers INFO Cleaning up old driver lom Cleaning up old devlink entry type ddi_ pseudo name SUNW lom lom i drivers INFO Cleaning up old driver lomp Cleaning up old devlink entry type ddi_pseudo name lomp lomp i drivers INFO Cleaning up old driver lomv Cleaning up old devlink entry type ddi_pseudo name SUNW 1omv MO type ddi_pseudo name lomv M0 i drivers INFO Cleaning up old driver lomh Cleaning up old devlink entry type ddi_pseudo name SUNW lomh lom i drivers INFO Adding driver lomp driver lomp aliases link lomp spec lomp Adding devlink entry type ddi_pseudo name lomp lomp adding driver with aliases perm 0644 root sys devfsadm driver failed to attach lomp Warning Driver lomp successfully added to system but failed to attach i drivers INFO Adding driver lomv driver lomv aliases SUNW lomv link SUNW lomv lomv spec MO Adding devlink entry type ddi_pseudo name SUNW lomv MO Adding devlink entry type ddi_pseudo name lomv MO adding driver with aliases SUNW lomv perm 0644 root sys devfsadm driver failed to attach lomv Wa
22. firmware Monitoring the System From Solaris There are two ways of interrogating the LOM device System Controller or of sending it commands to perform m By executing LOM commands from the lom gt shell prompt For information about how to do this see Chapter 3 m By executing LOM specific Solaris commands from the UNIX prompt These commands are described in this chapter The Solaris commands described in this section which are all available from the UNIX prompt run the usr sbin lom utility Where appropriate the command lines given in this section are accompanied by typical output from the commands Viewing Online LOM Documentation To view the manual pages for the LOM utility type 46 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 man lom Chapter 5 Using Lights Out Management and the System Controller from Solaris 47 Viewing the LOM Configuration Lom c To view the current LOM configuration type CODE EXAMPLE 5 1 Sample Output from the lom c Command lom c LOM configuration settings serial escape sequence serial event reporting default Event reporting level fatal warning amp information Serial security disabled Automatic return to console disabled firmware version 13 7 firmware checksum 0000 product revision 0 0 product ID Netra T12 Checking the Status of the Fault LED and Alarms lom 1 To check whether the Sy
23. hot swappable Note Only suitably trained personnel or SunService are permitted to enter the Restricted Access Location to hotswap PSUs or hard disk drives m CPU Memory Boards SB0 SB2 SB4 can be blacklisted if considered faulty m Repeater Boards RP0 RP2 can be blacklisted if considered faulty If a fault is indicated on any other FRU or a physical replacement of blacklisted FRUs above is required then SunService should be called 74 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Manual Blacklisting while waiting for repair The System Controller supports the blacklisting feature which allows you to disable components on a board TABLE 7 7 Blacklisting provides a list of system board components that will not be tested and will not be configured into the Solaris operating environment The blacklist is stored in nonvolatile memory TABLE 7 7 Blacklisting Component Names System Component Subsystem Component Name Component CPU system slot port physical_bank logical_bank CPU Memory boards slot Ports on the CPU Memory board Physical memory banks on CPU Memory boards Logical banks on CPU Memory boards 1 0 assembly system Repeater system 1 0 assembly Ports on the 1 0 assembly Buses on the I O assembly I O cards in the I O assemblies Repeater board SBO SB2 SB4 PO P1 P2 P3 BO B1 LO L1 L2 L3 slot port bus or slot card IB6 PO P1
24. node 0 zero SB is a system board x is a slot number A slot number can be 0 2 or 4 for a system board m A logical attachment point is an abbreviated name created by the system to refer to the physical attachment point Logical attachment points take the following form NO SBX m Note that cfgadm will also show the I O assembly NO 18B6 but as this is non redundant no DR actions willl be allowed on this attachment point Chapter 9 CPU Memory Board Replacement and Dynamic Reconfiguration DR 93 DR Operations There are four main types of DR operation TABLE 9 1 Types of DR Operation Connect The slot provides power to the board and monitors its temperature Configure The operating environment assigns functional roles to a board and loads device drivers for the board and brings the devices on that board into use by the Solaris operating environment Unconfigure The system detaches a board logically from the operating environment Environmental monitoring continues but devices on the board are not available for system use Disconnect The system stops monitoring the board and power to the slot is turned off If a system board is in use stop its use and disconnect it from the system before you power it off After a new or upgraded system board is inserted and powered on connect its attachment point and configure it for use by the operating environment The cfgadm 1M command can connect and configure or u
25. outside the system have LEDs that indicate their state The System Controller manages all the LEDs in the system with the exception of the power supply LEDs which are managed by the power supplies For a discussion of LED functions refer to the appropriate board or device chapter of the Sun Fire V1280 Netra 1280 Service Manual Nomenclature The System Controller the Solaris operating environment the power on self test POST and the OpenBoot PROM error messages use FRU name identifiers that match the physical labels in the system The only exception is the OpenBoot PROM nomenclature used for I O devices which use the device path names as described in Chapter 7 to indicate I O devices during device probing System Controller Error Logging System Controller error messages are automatically notified to the Solaris operating environment The System Controller also has an internal buffer where error messages are stored You can display the System Controller logged events stored in the System Controller message buffer by using the showlogs command System Controller XIR eXternally Initiated Reset Support The System Controller reset command enables you to recover from a hung system and extract a Solaris operating environment core file Chapter1 Overview 9 10 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CHAPTER 2 Starting and Setting Up the Sun Fire V1280 Netra 1280 This chapter describes how to
26. reconfigure the CPU Memory boards on the Sun Fire V1280 Netra 1280 system Dynamic Reconfiguration Overview DR software is part of the Solaris operating environment With the DR software you can dynamically reconfigure system boards and safely remove them or install them into a system while the Solaris operating environment is running and with minimum disruption to user processes running on the system You can use DR to do the following m Minimize the interruption of system applications while installing or removing a board m Disable a failing device by removing it before the failure can crash the operating system m Display the operational status of boards m Initiate system tests of a board while the system continues to run Command Line Interface The Solaris cfgadm 1M command provides the command line interface for the administration of DR functionality 91 92 DR Concepts Quiescence During the unconfigure operation on a system board with permanent memory OpenBoot PROM or kernel memory the operating environment is briefly paused which is known as operating environment quiescence All operating environment and device activity on the baseplane must cease during a critical phase of the operation Note Quiescence may take several minutes depending on workload and system configuration Before it can achieve quiescence the operating environment must temporarily suspend all processes CPUs and de
27. s D1 s D3 s sta DO s D1 s D2 s D3 s sta DO s 5vdc0 3vdc0 heetah0 heetahl heetah2 heetah3 5vdc0 3vdc0 heetah0 heetahl heetah2 heetah3 CUS OKay CUS OKay CUS OKay CUS OKay CUS OKay CUS OKay CUS OKay CUS OKay CUS OKay CUS OKay CUS OKay CUS OKay CUS OKay CUS OK status 6 status o status o status o status o status o status o status o status o status o status o status o status o status o status o status 6 status o status 6 status o status o tus online tatus okay tatus okay tatus okay tatus okay tus online tatus okay tatus okay tatus okay tatus okay tus online tatus okay AA AAA AAAAAAA AAAAAA AA Chapter 5 Using Lights Out Management and the System Controller from Solaris 51 CODE EXAMPLE 5 5 Sample Output from the lom v Command Continued 27 SB0 P2 B0 D1 status okay 28 SB0 P2 B0 D2 status okay 29 SB0 P2 B0 D3 status okay 30 SB0 P3 status online 31 SB0 P3 B0 DO status okay 32 SB0 P3 B0 D1 status okay 33 SB0 P3 B0 D2 status okay 34 SB0 P3 B0 D3 status okay 35 SB2 status ok 36 SB2 P0 status online 37 SB2 P0 B0 DO status okay 38 SB2 P0 B0 D1 status okay 39 SB2 P0 B0 D2 status okay 0 SB2 P0 B0 D3 status okay SB2 P1 status online SB2 P1 B0 D0 status okay SB2 P1 B0 D1 status okay SB2 P1 B0 D2 status okay SB2 P1 B0 D3 status okay SB2 P2 status online SB2 P2 B0 D0 status okay SB2 P2 B0 D1 status okay SB2 P2 B0 D2 status okay SB2 P2 B0
28. the board back to the Solaris operating environment with the Solaris dynamic reconfiguration cfgadm command cfgadm c configure ap_id where ap_id is one of the following NO SBO NO SB2 or NO SB4 This command powers the board on tests it attaches the board and brings all of its resources back to the Solaris operating environment Verify that the green Power LED is lit To Remove a CPU Memory Board From the System Note Before you begin this procedure make sure you have ready a system board filler panel to replace the system board you are going to remove A system board filler panel is a metal board with slots that allow cooling air to circulate Detach and power off the board from the system by using the cfgadm c disconnect command cfgadm c disconnect ap_id where ap_id is one of the following NO SBO NO SB2 or NO SB4 Chapter 9 CPU Memory Board Replacement and Dynamic Reconfiguration DR 105 refer to the Sun Fire V1280 Netra 1280 Service Manual Failure to follow the stated Caution For complete information about physically removing and replacing boards procedures can result in damage to system boards and other components 2 Remove the board from the system Refer to the Sun Fire V1280 Netra 1280 Service Manual for complete step by step board removal procedures 3 Insert a system board filler panel into the slot within one minute of removing the board to p
29. the system while the operating environment is in quiescence A driver is suspend safe if it supports operating environment quiescence suspend resume A suspend safe driver also guarantees that when a suspend request is successfully completed the device that the driver manages will not attempt to access memory even if the device is open when the suspend request is made A suspend unsafe device allows a memory access or a system interruption to occur while the operating environment is in quiescence Attachment Points An attachment point is a collective term for a board and its slot DR can display the status of the slot the board and the attachment point The DR definition of a board also includes the devices connected to it so the term occupant refers to the combination of board and attached devices m A slot also called a receptacle has the ability to electrically isolate the occupant from the host machine That is the software can put a single slot into low power mode m Receptacles can be named according to slot numbers or can be anonymous for example a SCSI chain To obtain a list of all available logical attachment points use the 1 option with the cfgadm 1M command There are two formats used when referring to attachment points m A physical attachment point describes the software driver and location of the slot An example of a physical attachment point name is devices ssm 0 0 N0 SBx where NO is
30. you press the Return key after each question the current value will not be changed See the Sun Fire V1280 Netra 1280 System Controller Command Reference Manual for full details of the setupnetwork command CODE EXAMPLE 2 2 shows an example of the setupnetwork command 20 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CODE EXAMPLE 2 2 Output From the setupnetwork Command lom gt setupnetwork etwork Configuration Is the system controller on a network yes Use DHCP or static network settings static Hostname hostname sc IP Address 123 XXX XXX XXX etmask 255 255 255 0 Gateway 123 XXX XXX XXX DNS Domain xxx somewhere com Primary DNS Server 123 XXX XXX XXX Secondary DNS Server 123 XXX XXX XXX lom gt Use the information in CODE EXAMPLE 2 2 as a guide for the information you need to enter for each parameter value entry Chapter 2 Starting and Setting Up the Sun Fire V1280 Netra 1280 21 Installing and Booting the Solaris Operating Environment To use LOM commands you must install the Lights Out Management 2 0 packages SUNWlomu SUNWlomr and SUNWlomm from the Solaris Supplementary CD v To Install and Boot the Solaris Operating Environment 1 Access the LOM prompt See Chapter 3 2 Power on the system Type poweron Depending on the setting of the OpenBoot PROM auto boot parameter the system will attempt to boot Solaris or will remain at the
31. 0 53 Degrees C 5 sec OK NO RP2 AR 0 Temp 0 43 Degrees C 5 sec OK NO RP2 DX 0 Temp 0 49 Degrees C 5 sec OK NO RP2 DX 1 Temp 0 52 Degrees C 5 sec OK NO SBO Board 0 5 VDC 0 1 51 Volts DC 5 sec OK NO SBO Board 0 3 3 VDC 0 3 29 Volts DC 5 sec OK NO SBO SDC 0 Temp 0 46 Degrees C 5 sec OK NO SBO AR 0 Temp 0 39 Degrees C 5 sec OK NO SBO DX 0 Temp 0 45 Degrees C 5 sec OK NO SBO DX 1 Temp 0 49 Degrees C 5 sec OK NO SBO DX 2 Temp 0 53 Degrees C 5 sec OK NO SBO DX 3 Temp 0 48 Degrees C 5 sec OK N0 SBO SBBC 0 Temp 0 49 Degrees C 5 sec OK NO SBO Board 1 Temp 0 24 Degrees C 5 sec OK NO SBO Board 1 Temp 1 24 Degrees C 6 sec OK N0 SBO CPU 0 Temp 0 47 Degrees C 6 sec OK NO SBO CPU 0 1 8 VDC 0 1 72 Volts DC 6 sec OK 80 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 TABLE 7 8 Checking Temperature Conditions Using the showenvironment Command Continued NO SBO CPU 1 Temp 0 47 Degrees C 6 sec OK NO SBO CPU 1 1 8 VDC 1 1 72 Volts DC 6 sec OK N0 SBO SBBC 1 Temp 0 37 Degrees C 6 sec OK NO SBO Board 1 Temp 2 24 Degrees C 6 sec OK NO SBO Board 1 Temp 3 24 Degrees C 6 sec OK NO SBO CPU 2 Temp 0 49 Degrees C 6 sec OK NO SBO CPU 2 1 8 VDC 0 1 71 Volts DC 6 sec OK NO SBO CPU 3 Temp 0 46 Degrees C 6 sec OK NO SBO CPU 3 1 8 VDC 1 1 72 Volts DC 7 sec OK N0 SB2 Board 0 1 5 VDC 0 1 51 Volts DC 6 sec OK N0 SB2 Board 0 3 3 VDC 0 3 29 Volts DC 6 sec OK N0 SB2 SDC 0 Temp 0
32. 1 TES j oO 00 O O O O O e O O O O O O O PCIO PCIS eo IG O kaan a IRE ai S S 2 A RICO Ri DLE SEM S SERRE ZERO RRRERR OE E ERRE COCO CRAN C CR CREAR e SORT CCC CCR pin EERROR esata ta CECER ROR LE rates te Su eee ise aan to eC ERRO co OAN f il U W p ole Neat N COLL A i amp ececa a Ca Re Alarms port h i SERENE RE RERE AEAEE Cer o SRO Bo o c o EOR 0 0 0 RORE ON 10 100 Ethernet o g LOM System b Q Controller EEE LE la dada CECE CET CCE CEQ Es Le EEEE O a taj CECECECECECE Ca g i e la dada datata dt Le x Serial ports BRAS RARE LE Bo ane EE ETTI RERO ero CEEEERRERCR ELEC ORFEO ER ERRO FEAR Neto Net CECECECECECECECECRCE CRC so SERRE se ia et0 Net Hao e LG S li Leg la datata tg H asasasasesesetey CR CR CECE CERES O la dada dada dada da Gee a PSU3 input gt gt o RON la D EEE O 2 te R Q Q N H So H oe DOE TN RRRA i Dacca CE GRR SRE la data Sata PSU2 in Ob cue SE FREE RARE LELE ORO E CEE HOES HEBERT CER ceo O amp P PSU1 input o ESISTITI ia EEE g CECECECECERE CE O CRAN 3 COREREEEERERCE CS QE CCR c e GC CREER RERO A FORORO ROROA ROR So go EE E reato LTS PSUO in p ut O erre ete E
33. 2 14 14 Copyright 2000 Sun Microsystems Inc All rights reserved Using lt gt as the package base directory Processing package information Processing system information 4 package pathnames are already properly installed Verifying package dependencies Verifying disk space requirements Checking for conflicts with packages already installed Checking for setuid setgid programs Se SR SH HE Installing LOMlite Utilities usr as lt SUNWlomu gt Installing part 1 of 1 1432 blocks Chapter 2 Starting and Setting Up the Sun Fire V1280 Netra 1280 25 CODE EXAMPLE 2 4 Installing the LOM Utility Installation of lt SUNWlomu gt was successful HE 26 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 v To Install the LOM Manual Pages As root type CODE EXAMPLE 2 5 Installing the LOM Manual Pages pkgadd d SUNWlomm Processing package instance lt SUNWlomm gt from lt cdrom suppcd_s28u7_multi_s28u7_supp 08all Lights_Out_Management_2 0 Product gt LOMlite manual pages sparc 2 0 REV 2000 08 22 14 14 Copyright 2000 Sun Microsystems Inc All rights reserved Using lt gt as the package base directory Processing package information Processing system information 5 package pathnames are already properly installed Verifying disk space requirements Checking for conflicts with packages already installed Checking for setuid setg
34. 55 Degrees C 6 sec OK N0 SB2 AR 0 Temp 0 37 Degrees C 6 sec OK N0 SB2 DX 0 Temp 0 47 Degrees C 6 sec OK N0 SB2 DX 1 Temp 0 50 Degrees C 6 sec OK N0 SB2 DX 2 Temp 0 53 Degrees C 6 sec OK N0 SB2 DX 3 Temp 0 47 Degrees C 6 sec OK N0 SB2 SBBC 0 Temp 0 48 Degrees C 6 sec OK N0 SB2 Board 1 Temp 0 23 Degrees C 7 sec OK N0 SB2 Board 1 Temp 1 24 Degrees C 7 sec OK N0 SB2 CPU 0 Temp 0 45 Degrees C 7 sec OK N0 SB2 CPU 0 1 8 VDC 0 1 72 Volts DC 7 sec OK N0 SB2 CPU 1 Temp 0 46 Degrees C 7 sec OK N0 SB2 CPU 1 1 8 VDC 1 1 73 Volts DC 7 sec OK N0 SB2 SBBC 1 Temp 0 37 Degrees C 7 sec OK N0 SB2 Board 1 Temp 2 24 Degrees C 7 sec OK N0 SB2 Board 1 Temp 3 25 Degrees C 7 sec OK N0 SB2 CPU 2 Temp 0 47 Degrees C 7 sec OK N0 SB2 CPU 2 1 8 VDC 0 1 71 Volts DC 7 sec OK N0 SB2 CPU 3 Temp 0 45 Degrees C 7 sec OK N0 SB2 CPU 3 1 8 VDC 1 1 71 Volts DC 7 sec OK N0 IB6 Board 0 1 5 VDC 0 1 50 Volts DC 7 sec OK N0 IB6 Board 0 3 3 VDC 0 3 35 Volts DC 7 sec OK N0 IB6 Board 0 5 VDC 0 4 95 Volts DC 7 sec OK NO IB6 Board 0 12 VDC 0 11 95 Volts DC 7 sec OK N0 IB6 Board 0 Temp 0 29 Degrees C 7 sec OK N0 IB6 Board 0 Temp 1 28 Degrees C 7 sec OK N0 IB6 Board 0 3 3 VDC 1 3 30 Volts DC 7 sec OK N0 IB6 Board 0 3 3 VDC 2 3 28 Volts DC 7 sec OK N0 IB6 Board 0 1 8 VDC 0 1 81 Volts DC 7 sec OK NO IB6 Board 0 2 5 VDC 0 2 51 Volts DC 7 sec OK N0 IB6 Fan 0 Cooling 0 High 7 sec OK NO IB6 Fan 1 Cooling 0 High 7 sec OK NO IB6 SDC 0 Temp 0 63 Degrees C
35. 7 sec OK N0 IB6 AR 0 Temp 0 77 Degrees C 7 sec OK N0 IB6 DX 0 Temp 0 69 Degrees C 7 sec OK NO IB6 DX 1 Temp 0 73 Degrees C 8 sec OK Chapter 7 Troubleshooting 81 Checking Temperature Conditions Using the showenvironment Command Continued N0 IB6 SBBC 0 Temp 0 ET Degrees C 8 sec OK NO IB6 IOASIC 0 Temp 0 46 Degrees C 8 sec OK NO IB6 IOASIC 1 Temp 1 52 Degrees C 8 sec OK Power Supplies Each power supply unit PSU has its own LEDs as follows m Power Active lit if PSU is supplying main power blinks if PSU is in Standby mode m Faulty lit if PSU has detected a fault condition and has turned off its main output m Predictive Fail lit if PSU has detected a pending internal fault but is still providing main output power degraded PSU fan speed is the only trigger for this condition In addition there are two System LEDs labelled SourceA and SourceB These show the state of the power feeds to the system There are four physical power feeds and they are split into A and B Feed A supplies PSO and PS1 feed B supplies PS2 and PS3 If either PSO or PS1 receives input power then the SourceA indicator is lit If either PS2 or PS3 receives input power then the SourceB indicator is lit If neither of the supplies receives input power the indicator is turned off These indicators are set on the basis of periodic monitoring at least once every 10 seconds Displaying Diagnostic In
36. 78 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Moving System Identity You may decide that the simplest way to restore service in some circumstances is to use a complete replacement system In order to facilitate the rapid transfer of system identity and critical settings from one system to its replacement the System Configuration Card SCC can be physically removed from the SCC Reader SCCR of the faulty system and inserted into the SCCR of the replacement system The following information is stored on the System Configuration Card SCC m MAC addresses m System Controller 10 100 Ethernet Port m Onboard Gigabit Ethernet port NETO m Onboard Gigabit Ethernet port NET1 Hostid m Critical LOM configurations a LOM password m escape sequence SC network settings IP address DHCP gateway etc eventreporting level host watchdog enabled disabled On Standby enabled disabled secure mode enabled disabled ritical OBP configurations auto boot boot device diag device use nvramrc local mac address n E e EEE QE EEEE Temperature One indication of problems may be overtemperature of one or more components Use the showenvironment command to list current status TABLE 7 8 Checking Temperature Conditions Using the showenvironment Command lom gt showenviroment Slot Device Sensor Value Units Age Status SSC1 SBBC 0 Temp 0 34 Degrees C 1 sec OK SSCI CBH 0 Temp 0 41 Degrees C 1 sec OK
37. 9600 8N1 m 9600 baud a 8 data bits No parity a 1 stop bit More details can be found in the Sun Fire V1280 Site Preparation and Installation Guide 12 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Using the Power On Standby Switch Caution The power switch is not an On Off switch it is an On Standby switch It does not isolate the equipment The power On Standby switch of the Sun Fire V1280 Netra 1280 system is a rocker type momentary action switch It controls only low voltage signals and no high voltage circuits pass through it On Standby switch SYSTEM POWER SOURCE Su n microsystems FIGURE 2 1 Sun Fire V1280 Netra 1280 System Power On Standby Switch The symbols on the switch are On m Press and release to power on the server This is the equivalent of the LOM poweron command Standby m Press for less than four seconds to initiate an orderly shutdown of the system into Standby mode This is equivalent to issuing the shut down command at the l1om gt prompt This is the method to use under normal operation m Press and hold down for more than four seconds to perform a system power down to standby mode This is equivalent to issuing the poweroff command at the lom gt prompt This process is not interruptible You should ensure that Solaris is cleanly shut down before powering a system to standby mode otherwise data is at risk of being lost The recommende
38. A board in this state can be unplugged See Dynamic Reconfiguration 113 114 Dynamic Reconfiguration Hot plug Hot swap IP Multipathing IPMP Logical DR Occupant Platform Physical DR Quiescence Receptacle Port Dynamic Reconfiguration DR is software that allows the administrator to 1 view a system configuration 2 suspend or restart operations involving a port storage device or board and 3 reconfigure the system detach or attach hotswappable devices such as disk drives or interface boards without the need to power down the system When DR is used with IPMP or Solstice DiskSuite software and redundant hardware the server can continue to communicate with disk drives and networks without interruption while a service provider replaces an existing device or installs a new device DR supports replacement of a CPU Memory provided the memory on the board is not interleaved with memory on other boards in the system Hot plug boards and modules have special connectors that supply electrical power to the board or module before the data pins make contact Boards and devices that do not have hot plug connectors cannot be inserted or removed while the system is running A hot swap device has special DC power connectors and logic circuitry that allow the device to be inserted without the necessity of turning off the system Internet Protocol multipathing Enables continuous application availability by load
39. D3 status okay SB2 P3 status online SB2 P3 B0 D0 status okay SB2 P3 B0 D1 status okay SB2 P3 B0 D2 status okay SB2 P3 B0 D3 status okay IB6 status ok IB6 FANO status okay IB6 FANI status okay a oases DB DB DB DB PB DB PE PB OO I oO O amp WN EF O1 O1 O1 O1 O1 O1 O1 O1 I oO O1 BWN HE The information output from this command is also contained in the output from the Solaris prtdiag v command 52 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CODE EXAMPLE 5 6 Sample Output from the lom t Command lom t System Temperature Sensors 1 SSC t_sbbc0 2 SSC1 t_cbh0 3 SSC1 t_ambient0 4 SSC1 t_ambientl 5 SSC1 t_ambient2 6 RPO t_ambient0 7 RPO t_ambientl 8 RPO t_sdc0 9 RPO t_ar0 10 RPO t_dx0 11 RPO t_dx1 12 RP2 t_ambient0 13 RP2 t_ambientl 14 RP2 t_sdc0 15 RP2 t_ar0 16 RP2 t_dx0 17 RP2 t_dx1 18 SBO E sdc0 19 SRO t_ar0 20 SBO t_dx0 21 SBO t_dx1 22 SBO t_dx2 23 SBO t_dx3 24 SBO t_sbbc0 25 SBO t_sbbcl 26 SBO PO Ambient 27 SBO PO Die 28 SBO P1 Ambient 29 SBO P1 Die 30 SBO P2 Ambient 31 SBO P2 Die 32 SBO P3 Ambient 33 SBO P3 Die 34 SB2 t_sdc0 35 SB2 t_ar0 36 SB2 t_dx0 37 SB2 t_dx1 38 SB2 t_dx2 36 45 23 21 28 22 22 62 47 62 65 23 22 57 42 5 3 56 48 39 49 54 57 53 53 40 29 57 27 51 2 53 29 50 51 40 52 54 61 degC degC degC degC degC degC degC degC degC degC degC degC degC degC degC
40. EEE D CECREECECECR b SRE DNA DETTO TETRA RARES h Saracena a adele are tata oO D la CREEECEREREEEE ES CSECECECECECECE le Po Ls ee O O O O_O O ic oc FIGURE 1 1 I O Ports 2 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 The console serial port and 10 100 Ethernet port can be used to access the System Controller Use the console serial port to connect directly to an ASCII terminal or a NTS network terminal server Connecting the System Controller board with a serial cable enables you to access the System Controller command line interface with an ASCII terminal or an NTS Use the 10 100 Ethernet port to connect the System Controller to the network LOM Prompt The LOM prompt provides the command line interface for the System Controller It is also the place where console messages are displayed The prompt is lom gt Some of the system management tasks are shown in TABLE 1 1 TABLE 1 1 Selected Management Tasks System Controller Management Tasks System Controller Commands To Use Configuring the System Controller Configuring the system Powering boards on and off and powering the sy
41. Fire V1280 Netra 1280 System Administration Guide December 2002 IB_SSC Assembly Mapping TABLE 7 2 lists the types of I O assembly the number of slots each I O assembly has and the systems the I O assembly types are supported on TABLE 7 2 I O Assembly Type and Number of Slots VO Assembly Type Number of Slots Per I O Assembly PCI 6 TABLE 7 3 lists the number of I O assemblies per system and the I O assembly name TABLE 7 3 Number and Name of I O Assemblies per System Number of I O Assemblies YO Assembly Name 1 IB6 Each I O assembly hosts two I O controllers m I O controller 0 a I O controller 1 When mapping the I O device tree entry to a physical component in the system you must consider up to five nodes in the device tree Node identifier ID ID controller agent ID AID Bus offset PCI slot Device instance TABLE 7 4 lists the AIDs for the two I O controllers in each I O assembly TABLE 7 4 I O Controller Agent ID Assignments Slot Number I O Assembly Name Even I O controller AID Odd I O Controller AID 6 IB6 24 18 25 19 The first number in the column is a decimal number The number or a number and letter combination in parentheses is in hexadecimal notation The I O controller has two bus sides A and B m Bus A which is 66 MHz is referenced by offset 600000 m Bus B which is 33 MHz is referenced by offset 700000 The board slots located in the I O assembly are refer
42. Netra 1280 System Administration Guide December 2002 Examples Downloading the sgpci flash image CODE EXAMPLE 8 1 Downloading the sgpci flash Image lom G sgpci flash WARNING This program will replace LOMlite2 firmware version 5 13 with version 0 1 Are you sure you want to continue Enter C and return to Continue or anything else to Terminat C Transferring 308 kB image to the system controller This may take several minutes Validating image 308 kB IO image transferred Programming N0 IB6 PROM 0 Comparing image and flash Image and flash are different proceeding with update Erasing Done Programming Done Verifying ec Done ay 22 14 28 37 commando lw8 N0 IB6 PROM 0 updated with version 5 13 5 05 17 2 002 Firmware update complet You must reboot Solaris to load the new firmware Downloading the sgcpu flash image CODE EXAMPLE 8 2 Downloading the sgcpu flash Image lom G sgcpu flash WARNING This program will replace LOMlite2 firmware version 5 13 with version 0 1 Are you sure you want to continue Enter C and return to Continue or anything else to Terminat C Transferring 792 kB image to the system controller This may take several minutes Chapter 8 Firmware Upgrade Procedures 89 CODE EXAMPLE 8 2 Downloading the sgcpu flash Image Continued Validating image 792 kB CPU image transferred
43. OpenBoot PROM ok prompt The default setting is true which will attempt to initiate a Solaris boot If the setting of auto boot is false or there is no bootable Solaris image installed then you will get the OpenBoot PROM ok prompt lom gt poweron lt POST messages displayed here gt ok 3 If necessary install the Solaris operating environment Refer to your installation documentation which is available with your Solaris operating environment release 4 At the ok prompt boot the Solaris operating environment by typing the OpenBoot PROM boot command ok boot device For the optional device parameter see the OpenBoot PROM devalias command which displays the predefined aliases 22 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 After the Solaris operating environment is booted the login prompt is displayed login v To Install the Lights Out Management Packages The three LOM packages needed on a Sun Fire V1280 Netra 1280 are SUNWlomu LOMlite Utilities usr SUNWLomm LOMlite manual pages and SUNWlomr LOM drivers These are available on the Solaris Supplementary CD Note The latest patches to these packages is available from SunSolve in patch 110208 It is strongly advised that the latest version of patch 110208 is obtained from SunSolve and is installed on the Sun Fire V1280 Netra 1280 to make use of the latest LOM utility updates v To Inst
44. ROM ok prompt then the system has stopped responding In some circumstances the host watchdog will detect that the Solaris operating environment has stopped responding and will automatically reset the system Assuming that the host watchdog has not been disabled using the setupsc command then the Host Watchdog will cause an automatic reset of the system Also you can issue the reset command default option is x which causes an XIR to be sent to the processors from the lom gt prompt The reset command causes the Solaris operating environment to be terminated Caution When the Solaris operating environment is terminated data in memory might not be flushed to disk This could cause a loss or corruption of the application file system data Before the Solaris operating environment is terminated this action requires confirmation from you To Recover a Hung System Manually Complete the steps in Assisting Sun Service Personnel in Determining Causes of Failure on page 83 Access the LOM shell See Chapter 3 Chapter 7 Troubleshooting 77 3 Type the reset command to force control of the system back to the OpenBoot PROM The reset command sends an externally initiated reset XIR to the system and collects data for debugging the hardware lom gt reset Note An error is displayed if the set secure command has been used to set the system into secure mode You cannot use the reset or break commands
45. Reason s Power On Reset Fri Jul 19 15 48 54 some_name lom Initializing the SC SRAM Fri Jul 19 15 48 59 some name lom Caching ID information Fri Jul 19 15 49 00 some_name lom Clock Source 75MHz Fri Jul 19 15 49 02 some name lom NO PSO Status is OK Fri Jul 19 15 49 03 some name lom N0 PS1 Status is OK Fri Jul 19 15 49 03 some_name lom Chassis is in single partition mode Fri Jul 19 15 49 05 some_name lom Cold boot detected recovering active domains 18 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CODE EXAMPLE 2 1 Hardware Reset Output from the System Controller Continued Hardware Reset Fri Jul 19 15 49 06 some_name lom NOTICE NO FTO is powered off Connected lom gt Setting Up the System After powering on you must set up your system using the System Controller setdate and setupnetwork commands described in this chapter This section contains the following topics m To Set the Date and Time on page 19 m To Configure Network Parameters on page 20 m To Install and Boot the Solaris Operating Environment on page 22 v To Set the Date and Time Note If your time zone area is using daylight or summer time this is set automatically Set the date time and time zone for the system using the setdate command at the LOM prompt The following example shows setting the time zone to Pacific Standard Time PST us
46. The System Controller polls these sensors in a timely manner and makes the environmental data available If necessary the System Controller shuts down various components to prevent damage For instance in the case of an overtemperature the System Controller notifies the Solaris operating environment of the overtemperature and the operating environment takes action In the case of extreme overtemperature the System Controller software can shut down the system without first notifying the operating environment System Indicator Board The system indicator board contains the On Standby switch and indicator LEDs as shown in FIGURE 1 2 4 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 On Stand by switch p SYSTEM ALARM POWER SOURCE O i un microsystems Locator System Active UNIX Running Source A and Source B System Fault Top Access Required Alarm1 and Alarm2 FIGURE 1 2 System Indicator Board The indicator LEDs function as shown in TABLE 1 2 TABLE 1 2 System Indicator LED Functions Name Colour Function Locator White Normally off can be lit by user command System Fault Amber Lights when the LOM detects a fault System Active Green Lights when power is applied to the system Top Access Amber Lights when a fault occurs in a FRU which can only be replaced from the top of the system UNIX Running Green Lights when Solaris is running Alarm1 and Alarm2 Green Light when trig
47. U Memory boards also known as system slots Board Receptacle States A board can have one of three receptacle states empty disconnected or connected Whenever you insert a board the receptacle state changes from empty to disconnected Whenever you remove a board the receptacle state changes from disconnected to empty Caution Physically removing a board that is in the connected state or that is powered on and in the disconnected state crashes the operating system and can result in permanent damage to that system board TABLE 9 2 Board Receptacle States Name Description empty A board is not present disconnected The board is disconnected from the system bus A board can be in the disconnected state without being powered off However a board must be powered off and in the disconnected state before you remove it from the slot connected The board is powered on and connected to the system bus You can view the components on a board only after it is in the connected state Board Occupant States A board can have one of two occupant states configured or unconfigured The occupant state of a disconnected board is always unconfigured Chapter 9 CPU Memory Board Replacement and Dynamic Reconfiguration DR 95 TABLE 9 3 Board Occupant States Name Description configured At least one component on the board is configured unconfigured All of the components on the board are unconfigured Board Conditions A
48. Yes Yes IB Fan fault LED lit Power Supply Yes by Yes No All power supply indicators are lit by the hardware power supply hardware There is also a predicted fault indicator Power supply EEPROM errors do not cause degraded state as there is no indicator control Power distribution board No Yes Yes Can only be degraded Baseplane No Yes Yes Can only be degraded System indicator board No Yes Yes Can only be degraded System configuration No Yes No card Chapter 7 Troubleshooting 73 TABLE 7 6 System Fault Indicator States Continued System Fault Fault indicator lit indicator lit Top Access when fault on FRU lit on FRU FRU name detected fault fault Comments Fan tray Yes Yes No Main fan Yes Yes No Media bay No Yes Yes Disk Yes Yes No This includes faults where the FRU is only degraded Customer Replaceable Units Sun Fire V1280 The following FRUs are considered to be ones on which you can deal with faults m Hard disks hot swappable m PSUs PS0 PS1 PS2 PS3 hot swappable m CPU Memory Boards SB0 SB2 SB4 can be blacklisted if considered faulty m Repeater Boards RP0 RP2 can be blacklisted if considered faulty If a fault is indicated on any other FRU or a physical replacement of blacklisted FRUs above is required then SunService should be called Netra 1280 The following FRUs are considered to be ones on which you can deal with faults m Hard disks hot swappable m PSUs PS0 PS1 PS2 PS3
49. all the LOM Drivers As root type CODE EXAMPLE 2 3 Installing the LOM Drivers pkgadd d SUNWlomr Processing package instance lt SUNWlomr gt from lt var tmp gt LOMlite driver root sparc 2 0 REV 2000 08 22 14 14 Copyright 2000 Sun Microsystems Inc All rights reserved Executing checkinstall script Using lt gt as the package base directory Processing package information Processing system information 9 package pathnames are already properly installed Verifying package dependencies Verifying disk space requirements Checking for conflicts with packages already installed Checking for setuid setgid programs This package contains scripts which will be executed with super user permission during the process of installing this package Do you want to continue with the installation of lt SUNWlomr gt y n y Tnstalling LOMlite driver root as lt SUNWlomr gt Chapter 2 Starting and Setting Up the Sun Fire V1280 Netra 1280 23 CODE EXAMPLE 2 3 Installing the LOM Drivers Continued Installing part 1 of 1 20 blocks i drivers INFO Starting drivers INFO Installing var tmp SUNWlomr reloc platform sun4u kernel drv lom drivers INFO Installing var tmp SUNWlomr reloc platform sun4u kernel drv lomp drivers INFO Installing var tmp SUNWlomr reloc platform sun4u kernel drv sparcv9 lom drivers INFO Installing var tmp SUNWlomr reloc platform sun4u kernel
50. ame 57 6 Running POST 59 OpenBoot PROM Variables for POST Configuration 59 Controlling POST With the bootmode Command 63 Controlling the System Controller POST 64 7 Troubleshooting 67 Device Mapping 67 CPU Memory Mapping 67 IB_SSC Assembly Mapping 69 System Faults 72 Customer Replaceable Units 74 Sun Fire V1280 74 Netra 1280 74 Manual Blacklisting while waiting for repair 75 Special Considerations for CPU Memory Boards 76 Recovering a Hung System 77 v To Recover a Hung System Manually 77 Moving System Identity 79 Temperature 79 Power Supplies 82 vi Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Displaying Diagnostic Information 82 Assisting Sun Service Personnel in Determining Causes of Failure 83 Firmware Upgrade Procedures 85 Using the flashupdate Command 85 flashupdate Command Examples 87 Using the lom GCommand 88 Examples 89 CPU Memory Board Replacement and Dynamic Reconfiguration DR 91 Dynamic Reconfiguration 91 Overview 91 Command Line Interface 91 DR Concepts 92 Quiescence 92 Suspend Safe and Suspend Unsafe Devices 92 Attachment Points 93 DR Operations 94 Hot Plug Hardware 94 Conditions and States 94 Board States and Conditions 95 Board Receptacle States 95 Board Occupant States 95 Board Conditions 96 Component States and Conditions 96 Component Receptacle States 96 Component Occupant States 96 Component Conditions 97 Component Types 97 Contents vii Nonpermanent
51. as been made If the system is in standby mode the 1om prompt is automatically displayed Connected lom gt Otherwise type a single carriage return and the Solaris console prompt will be displayed Connected If a connection to the LOM console is already established over the network port then you will be offered the opportunity to forcefully connect by logging out the other connection Enter Password The console is already in use Host somehost acme com Connected May 24 10 27 Idle time 00 23 17 Force logout of other user y n y Connected lom gt Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Y To Access the LOM Console Using the Telnet Command In order to be able to access the LOM System Controller via telnet to the 10 100 Ethernet port you must first set up the interface Refer to To Configure Network Parameters on page 20 1 Type the telnet command at the Solaris prompt to connect to the System Controller telnet lt system_controller_hostname gt Trying 128 123 129 95 2 Connected to interpol sc Escape character is 2 If the LOM password has been set up you will be prompted for a password telnet lt system_controller_hostname gt TEV ING 120 125 12 34 90 Connected to interpol sc Escape character is Enter password 3 Enter the correct password as previously set up usi
52. athname while a logical identifier contains a shorthand notation A collective term for a board and its card cage slot A physical attachment point describes the software driver and location of the card cage slot A logical attachment point is an abbreviated name created by the system to refer to the physical attachment point cfgadm is the primary command for dynamic reconfiguration on the Sun Fire V1280 Netra 1280 system For information about the command and its options refer to the cfgadm 1M cfgadm_sbd 1M and cfgadm_pci 1M man pages For any late breaking news about this and related commands refer to the Solaris 8 section at the DR web site See Chapter 9 The operational status of an attachment point The collection of attached devices known to the system The system cannot use a physical device until the configuration is updated The operating system assigns functional roles to a board and loads device drivers for the board and for devices attached to the board The operating system assigns functional roles to a board and loads device drivers for the board and for devices attached to the board A board is present in a slot and is electrically connected The temperature of the slot is monitored by the system The device driver supports DDI_DETACH and the device such as an I O board or a SCSI chain is physically arranged so that it can be detached The system stops monitoring the board and power to the slot is turned off
53. board can be in one of four conditions unknown ok failed or unusable TABLE 9 4 Board Conditions Name Description unknown The board has not been tested ok The board is operational failed The board failed testing unusable The board slot is unusable Component States and Conditions This section contains descriptions of the states and conditions for components Component Receptacle States A component cannot be individually connected or disconnected Thus components can have only one state connected Component Occupant States A component can have one of two occupant states configured or unconfigured 96 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 TABLE 9 5 Component Occupant States Name Description configured Component is available for use by the Solaris operating environment unconfigured Component is not available for use by the Solaris operating environment Component Conditions A component can have one of three conditions unknown ok failed TABLE 9 6 Component Conditions Name Description unknown Component has not been tested ok Component is operational failed Component failed testing Component Types You can use DR to configure or to unconfigure several types of component TABLE 9 7 Component Types Name Description cpu Individual CPU memory All the memory on the board Nonpermanent and Permanent Memory Before you can delete a b
54. board in the system The memory on the board is identified by a non zero permanent memory size in the status display produced by the cfgadm av command DR supports reconfiguration of permanent memory from one system board to another only if one of the following conditions is met m The target system board has the same amount of memory as the source system board OR m The target system board has more memory than the source system board In this case the additional memory is added to the pool of available memory 98 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Command Line Interface The following procedures are discussed in this section m To Test a CPU Memory Board on page 102 To Install a New Board on page 104 To Hot Swap a CPU Memory Board on page 104 To Remove a CPU Memory Board From the System on page 105 To Disconnect a CPU Memory Board Temporarily on page 106 Note There is no need to enable dynamic reconfiguration explicitly DR is enabled by default The cfgadm Command The cfgadm 1M command provides configuration administration operations on dynamically reconfigurable hardware resources TABLE 9 8 lists the DR board states TABLE 9 8 DR Board States from the System Controller SC Board States Description Available The slot is not assigned Assigned The board is assigned but the hardware has not been configured to use it The board may be
55. boots the System Controller systemboards causes all CPU Memory boards and the I O Assembly to be updated scapp causes the System Controller application to be updated This action reboots the System Controller rtos causes the System Controller real time operating system to be updated This action reboots the System Controller lt board gt names a specific board to be updated sb0 sb2 sb4 or ib6 c specifies a board as the source of flash images Use this option to update replacement CPU Memory boards lt source_board gt is a pre existing CPU Memory board to be used as the source of the flash image sb0 sb2 or sb4 lt destination_board gt is the CPU Memory board to be updated sb0 sb2 or sb4 u automatically updates all CPU Memory boards with the image from the board which currently has the highest firmware revision Use this option to update replacement CPU Memory boards h displays help for this command A power cycle is required in order to activate the updated OpenBoot PROM Note flashupdate cannot retrieve flash images from a secure userid password protected HTTP URL A message of the form flashupdate failed URL does not contain required file lt file gt will be returned although the file may exist 86 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 a PP Caution Do not interrupt the flashupdate operation If the flashupdate command is terminated abnormal
56. by switch to the left This initiates an orderly shutdown of the system into Standby mode This is equivalent to issuing the shutdown command at the lom gt prompt 16 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Sending the LOM poweroff Command Use the poweroff command when you wish to power down all modules and system chassis to standby mode At the lom gt prompt type lom gt poweroff This will abruptly terminate Solaris Do you want to continue no Only answer yes if you wish to forcibly power down the system regardless of the state of Solaris Under normal operation you should use the shutdown command Type y to continue or press Return to cancel the command The system will power off to standby mode The only indicator LEDs to be illuminated on the system indicator board are the Source A and Source B indicators The IB_SSC assembly Active LED will also be illuminated but not visible from the front of the system Refer to the Sun Fire V1280 Netra 1280 System Controller Command Reference Manual for a full description of the poweroff command Sending the poweroff Command Using the On Standby Switch Only use this method if you wish to forcibly power down the system regardless of the state of Solaris Under normal operation you should send the shutdown command either from the lom gt prompt or from the On Standby Switch see Sending the shutdown Command Using the On Standby Switch
57. ct to Serial Port B of a Workstation 33 v To Access the LOM Console Using the Telnet Command 35 v To Disconnect from the LOM Console 36 Switching Between the Different Consoles 37 v To Break to the LOM Prompt 39 Selecting an Escape Sequence 39 To Connect to the Solaris Console from the LOM Prompt 39 To Break to the LOM Prompt from the OpenBoot PROM 40 To Break to the OpenBoot Prompt when Solaris is Running 41 lt lt lt lt To Terminate a Session If You Are Connected To the System Controller Through the Serial Port 41 v To Terminate a Session If You Are Connected to the System Controller with telnet 42 System Controller Message Logging 43 Using Lights Out Management and the System Controller from Solaris 45 LOM Command Syntax 45 Monitoring the System From Solaris 46 Viewing Online LOM Documentation 46 Viewing the LOM Configuration Lom c 48 Checking the Status of the Fault LED and Alarms lom 1 48 Viewing the Event Log 1om e 49 Checking the Fans Lom f 50 Checking the Internal Voltage Sensors lom v 50 Checking the Internal Temperature Lom t 53 Contents Vv Viewing All Component Status Data and the LOM Configuration Data lom a 54 Other LOM Tasks Performed From Solaris 55 Turning Alarms On and Off Lom A 55 Changing the lom gt Prompt Escape Sequence Lom X 56 Stopping LOM from Sending Reports to the Console When at the LOM Prompt lom E off 56 Upgrading the Firmware Lom G filen
58. d Set Clocks N0 SB0 P1 Running CPU POR and Set Clocks N0 SB0 P2 Running CPU POR and Set Clocks N0 SB0 P3 Running CPU POR and Set Clocks N0 SB0 P0 lpost 5 13 0007 2002 07 18 12 45 N0 SB0 P2 lpost 5 13 0007 2002 07 18 12 45 NO SBO P1 lpost 5 13 0007 2002 07 18 12 45 N0 SB0 P0 Copyright 2001 Sun Microsystems Inc All rights reserved N0 SB0 P0 Subtest Setting Fireplane Config Registers N0 SB0 P0 Subtest Display CPU Version frequency N0 SB0 P0 Version register 003e0015 21000507 N0 SB0 P0 Cpu System ratio 6 cpu actual frequency 900 N0 SB0 P1 Copyright 2001 Sun Microsystems Inc All rights reserved lt more POST ouput gt pci bootbus controller pci Probing ssm 0 0 pci 18 700000 Device 1 Nothing there Probing ssm 0 0 pci 18 700000 Device 2 Nothing there Probing ssm 0 0 pci 18 700000 Device 3 ide disk cdrom 62 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CODE EXAMPLE 6 1 POST Output Using max Setting Continued 18 600000 Device 1 Nothing there 18 600000 Device 2 scsi disk tape scsi disk tape Probing ssm 0 0 pci Probing ssm 0 0 pci pci pei Probing ssm 0 0 pci Probing ssm 0 0 pci Probing ssm 0 0 pci Probing ssm 0 0 pci Probing ssm 0 0 pci 19 700000 Device 19 700000 Device 19 700000 Device 19 600000 Device 19 600000 Device Nothing there Nothing there Nothing there network network D
59. d method of powering down to standby is by using the shut down command at the LOM prompt Use the LOM setupsc command to prevent accidental operation of the On Standby switch Chapter 2 Starting and Setting Up the Sun Fire V1280 Netra 1280 13 Powering On and Off Powering On v Initial Power On 1 Ensure all power cables are connected and external circuit breakers are switched on 2 The system will enter Standby mode The only indicator LEDs to be illuminated on the system indicator board are the Source A and Source B indicators The IB_SSC assembly Active LED will also be illuminated but not visible from the front of the system Y Powering On from Standby Mode Powering the system on from Standby mode can be achieved in either of two ways m Operating the On Standby switch m Sending the poweron command via the LOM port If the auto boot variable has been set in the OBP the system will automatically boot into the Solaris operating environment Using the On Standby Switch 1 Check that power is applied to the system and that it is correctly in Standby mode The only indicator LEDs to be illuminated on the system indicator board are the Source A and Source B indicators The IB_SSC assembly Active LED will also be illuminated but not visible from the front of the system 2 Momentarily press the On Standby switch to the right The system will power on completely The System Active indicator will be illuminated in ad
60. d unsafe System Controller software Unconfiguration Simple Network Management Protocol SNMP is any system listening to SNMP events This is usually the system with the Sun Management Center software installed The operational status of either a receptacle slot or an occupant board To be suitable for DR a device driver must have the ability to stop user threads execute the DDI_SUSPEND call stop the clock and stop the CPUs A suspend safe device is one that does not access memory or interrupt the system while the operating system is in quiescence A driver is considered suspend safe if it supports operating system quiescence suspend resume It also guarantees that when a suspend request is successfully completed the device that the driver manages will not attempt to access memory even if the device is open when the suspend request is made A suspend unsafe device is one that allows a memory access or a system interruption while the operating system is in quiescence The main application that performs all of the System Controller hardware management functions The system detaches a board logically from the operating system and takes the associated device drivers off line Environmental monitoring continues but any devices on the board are not available for system use Glossary 115 116 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Index A alarms checking status 48 alarms setti
61. dition to the Source A and Source B indicators The system will execute the power on self tests POST 14 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Using the LOM poweron Command At the lom gt prompt type lom gt poweron The System Controller will first power on all the power supplies followed by the fan tray Finally the System Controller will power on the system boards If the value of the OpenBoot PROM variable auto boot is true then the system will also boot the Solaris operating environment Individual modules can also be powered on using the poweron command For further details see the Sun Fire V1280 System Controller Command Reference Manual The System Active indicator will be illuminated in addition to the Source A and Source B indicators The system will execute the power on self tests POST Note The poweron all command only powers on individual components it does not boot Solaris Refer to the Sun Fire V1280 Netra 1280 System Controller Command Reference Manual for a full description of the poweron command Bringing the System to Standby Mode This can be achieved in one of five ways m Using the UNIX shut down command Sending the shutdown command via the LOM port Sending the shutdown command using the On Standby switch Sending the poweroff command via the LOM port Sending the poweroff command using the On Standby switch Note You should ensure that Solaris is cl
62. eanly shut down before powering a system to standby mode otherwise data is at risk of being lost Chapter 2 Starting and Setting Up the Sun Fire V1280 Netra 1280 15 Using the Solaris shutdown Command At the system prompt type shutdown i5 The system will power off to standby mode The only indicator LEDs to be illuminated on the system indicator board are the Source A and Source B indicators The IB_SSC assembly Active LED will also be illuminated but not visible from the front of the system Sending the LOM shutdown Command Use the LOM shutdown command to perform a clean shutdown of Solaris followed by a power down of all modules and system chassis to standby mode Note If Solaris is running this command will attempt to halt the system cleanly before powering down the system to standby mode and is the equivalent of the Solaris init 5 command At the lom gt prompt type lom gt shutdown After Solaris has been stopped the system will power off to standby mode The only indicator LEDs to be illuminated on the system indicator board are the Source A and Source B indicators The IB_SSC assembly Active LED will also be illuminated but not visible from the front of the system Refer to the Sun Fire V1280 Netra 1280 System Controller Command Reference Manual for a full description of the LOM shutdown command Sending the shutdown Command Using the On Standby Switch Momentarily press the system On Stand
63. em Administration Guide December 2002 CHAPTER 4 System Controller Message Logging The Sun Fire V1280 Netra 1280 System Controller generates timestamped messages for system events processes such as when powering on booting powering off changes to hot pluggable units environmental warnings etc The messages are initially stored in the System Controller onboard memory in a circular 128 message buffer note that a single message can span multiple lines In addition the System Controller sends the messages to the Solaris host when it is running Solaris and these are processed by the system log daemon syslogd When Solaris is running messages are sent at the time they are generated by the System Controller Retrieval of messages not already copied from the System Controller takes place at Solaris boot time or when the System Controller is reset The logged messages can be viewed at the System Controller 1om gt prompt by using the showlogs command The messages can also be displayed at the Solaris prompt by using the lom 1m utility see Chapter 5 Typically the messages are stored on the Solaris host in the var adm messages file the only limiting factor being the available disk space Messages that are held in the System Controller core memory are volatile and are not retained if the power is removed from the System Controller by loss of both power sources less than two power supplies are operational removal of the IB_SSC
64. ember 2002 v To Break to the OpenBoot Prompt when Solaris is Running When the Solaris operating environment is running the usual effect of sending a break signal to the console is to force entry to the OpenBoot PROM or kernel debugger Do this by using the break command from the LOM prompt lom gt break This will suspend Solaris Do you want to continue no y Type go to resume debugger entered 1 ok v To Terminate a Session If You Are Connected To the System Controller Through the Serial Port If you are at the Solaris prompt or the OpenBoot PROM go to LOM prompt by typing the escape sequence then terminate the LOM prompt session by typing logout followed by a single carriage return lom gt logout If you are connected through a terminal server invoke the terminal server s command to disconnect the connection If the connection was established using a tip command then type the tip exit sequence Chapter 3 Console Navigation Procedures 41 v To Terminate a Session If You Are Connected to the System Controller with telnet If you are at the Solaris prompt or the OpenBoot PROM go to the LOM prompt by typing the escape sequence and terminate the LOM prompt session by using the logout command The telnet session will terminate automatically lom gt logout Connection closed by foreign host o 42 Sun Fire V1280 Netra 1280 Syst
65. enced by the device number Chapter 7 Troubleshooting 69 This section describes the PCI I O assembly slot assignments and provides an example of the device path The following code example gives a breakdown of a device tree entry for a SCSI disk ssm 0 0 pci 19 700000 pci 3 SUNW isptwo 4 sd 5 0 Note The numbers in the device path are hexadecimal where in 19 700000 a 19 is the I O controller agent identifier AID m 700000 is the bus offset in pci 3 a 3 is the device number isptwo is the SCSI host adapter in sd 5 0 a 5 is the SCSI target number for the disk a 0 is the logic unit number LUN of the target disk This section describes the PCI I O assembly slot assignments and provides an example of the device path TABLE 7 5 lists in hexadecimal notation the slot number I O assembly name device path of each I O assembly the I O controller number and the bus TABLE 7 5 IB_SSC Assembly PCI Device Mapping I O Assembly Name Device Path Physical Slot Number 1 O Controller Number Bus IB6 ssm 0 0 pci 18 700000 1 0 0 B ssm 0 0 pci 18 700000 2 1 0 B ssm 0 0 pci 18 700000 3 x 0 B ssm 0 0 pci 18 600000 1 5 0 A ssm 0 0 pci 18 600000 2 w 0 A ssm 0 0 pci 19 700000 1 2 1 B ssm 0 0 pci 19 700000 2 3 1 B 70 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 TABLE 7 5 IB_SSC Assembly PCI Device Mapping Continued VO Assembly Name Device Path Phy
66. est during POST a message similar to the following will appear in POST output Jul 15 15 58 12 noname lom SB0 ar0 Bit in error P3_ADDR 2 Jul 15 15 58 12 noname lom SB0 ar0 Bit in error P3_ADDR 1 Jul 15 15 58 12 noname lom SB0 ar0 Bit in error P3_ADDR 0 Jul 15 15 58 12 noname lom AR Interconnect test System board SB0 ar0 address repeater connections to system board RP2 ar0 failed Jul 15 15 58 13 noname lom SB0 ar0 Bit in error P3_INCOMING 0 Jul 15 15 58 17 noname lom SB0 ar0 Bit in error P3_PREREQ 0 Jul 15 15 58 17 noname lom SB0 ar0 Bit in error P3_ADDR 18 Jul 15 15 58 17 noname lom SB0 ar0 Bit in error P3_ADDR 17 A CPU Memory board failing the interconnect test may prevent the poweron command from completely powering up the system The system will then drop back to the lom gt prompt As a provisional measure before Service intervention is obtained the faulty CPU Memory board can be isolated from the system using the following sequence of commands at the System Controller lom gt prompt 76 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 lom gt disablecomponent SBx lom gt poweroff lom gt resetsc y A subsequent poweron command should now be successful Recovering a Hung System If you cannot log into the Solaris operating environment and typing the break command from the LOM shell did not force control of the system back to the OpenBoot P
67. f the features of the Sun Fire V1280 Netra 1280 system The procedural overview of how to set up your system and detailed procedures is covered in Chapter 2 System Controller The System Controller is an embedded system resident on the IB_SSC Assembly which connects to the system baseplane The system controller is responsible for providing the Lights Out Management LOM functions which include power on sequencing sequencing module power on self tests POST environmental monitoring fault indication and alarms The System Controller provides an RS232 serial interface and one 10 100 Ethernet interface Access to the LOM command line interface and the Solaris OpenBoot PROM console is shared and obtained through these interfaces System Controller functions include Monitoring the system Providing the Solaris and OpenBoot PROM consoles Providing the virtual TOD time of day Performing environmental monitoring Performing system initialization Coordinating POST The software application running on the System Controller provides a command line interface for you to modify system settings I O Ports The following ports are on the rear of the system Console serial RS 232 port RJ45 Reserved serial RS 232 port RJ45 Two Gigabit Ethernet ports RJ 45 Alarms port DB15 System Controller 10 100 Ethernet port RJ45 UltraSCSI port Up to six PCI ports five 33 MHz one 66 MHz Their locations are shown in FIGURE 1
68. formation For information on displaying diagnostic information refer to the Sun Hardware Platform Guide which is available with your Solaris operating environment release Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Assisting Sun Service Personnel in Determining Causes of Failure Provide the following information to Sun service personnel so that they can help you determine the causes of your failure m A verbatim transcript of all output written to the system console leading up to the failure Also include any output printed subsequent to user actions If the transcript does not show certain user actions in a separate file include comments on what actions prompted particular messages m Acopy of the system log file from var adm messages from the time leading up to the failure m The following system controller command output from the LOM shell showsc v command showboards v command showlogs command history date showresetstate showenvironment Chapter 7 Troubleshooting 83 84 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CHAPTER 8 Firmware Upgrade Procedures This chapter explains how to upgrade the system firmware The firmware on the Sun Fire V1280 Netra 1280 system can be updated by two mechanisms m The flashupdate command from the System Controller LOM prompt m The lom G command in the Solaris operating environment The first method requires tha
69. g into the Solaris operating environment and typing the break command did not force control of the system back into the OpenBoot PROM ok prompt after you type the reset command for the first time you must next type reset a in order to reset everything The reset a command is equivalent to the OpenBoot PROM reset all word v To Reset the System Controller Use the reset sc command to reset the System Controller This can be used in the event of a hardware or software problem causing the System Controller Application to malfunction lom gt resetsc Are you sure you want to reboot the system controller now no y This will cause the System Controller to reset execute the System Controller POST level specified using the setupsc command and restart the LOM software 28 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CHAPTER 3 Console Navigation Procedures This chapter explains step by step procedures and provides illustrations for connecting to the system and navigating between the LOM shell and the console It also explains how to terminate a System Controller session Topics covered in this chapter include a Establishing a LOM Console Connection on page 30 To Connect to an ASCII Terminal on page 30 To Connect to a Network Terminal Server on page 32 To Connect to Serial Port B of a Workstation on page 33 To Access the LOM Console Using the Telnet Command
70. g system 78 reliability 6 S SCPOST controlling 64 serviceability 8 setdate command 19 setenv command 60 setupnetwork command 20 setupsc command 64 showcomponent command 75 showenvironment command 79 shutdown command 16 Solaris console accessing 39 Solaris installing and booting 22 standby power off to 15 power on from 14 state component 95 suspend safe devices 92 suspend unsafe devices 92 system hard hung recovering from 78 System Controller POST See SCPOST system faults 72 system identity moving 79 system hung recovering 77 T temperature 79 terminal connecting 30 troubleshooting 67 U use nvramrc OpenBoot variable 61 V verbosity level OpenBoot variable 60 voltage sensors 50 Index 119 120 Sun Fire V1280 Netra 1280 System Administration Guide December 2002
71. gement and the System Controller from Solaris 55 Changing the lom gt Prompt Escape Sequence lom X The character sequence hash dot enables you to escape from Solaris to the lom gt prompt To change the default escape sequence type lom X xy where xy are the alpha numeric characters you want to use Note Quotes may be required for special characters to be interpreted by the shell Note If you are typing at the console and type the first character of the escape sequence there is a one second delay before the character appears on the screen This is because the system waits to see if the next character in the escape sequence is about to be typed If all the characters in the escape sequence are typed then the lom gt prompt appears If the next character to be typed is not the next character in the escape sequence then the characters belonging to the escape sequence that were typed are output to the screen Stopping LOM from Sending Reports to the Console When at the LOM Prompt lom E off LOM event reports can interfere with information you are attempting to send or receive on the console To stop the LOM from sending reports to the console type lom E off To prevent LOM messages displaying when you are at the LOM prompt turn off serial event reporting This is equivalent to the seteventreporting command described in the Sun Fire V1280 Netra 1280 System Controlle
72. gered by events as specified in the LOM Source A and Source B Green Light when the relevant power feeds are present This indicator is repeated on the rear of the system Chapter1 Overview 5 Reliability Availability and Serviceability RAS Reliability availability and serviceability RAS are features of this system The descriptions of these features are Reliability is the probability that a system stays operational for a specified time period when operating under normal environmental conditions Reliability differs from availability in that reliability involves only system failure whereas availability depends on both failure and recovery m Availability also known as average availability is the percentage of time that a system is available to perform its functions correctly Availability can be measured at the system level or in the context of the availability of a service to an end client The system availability is likely to impose an upper limit on the availability of any products built on top of that system m Serviceability measures the ease and effectiveness of maintenance and system repair for the product There is no single well defined metric because serviceability can include both Mean Time to Repair MTTR and diagnosability The following sections provide details on RAS For more hardware related information on RAS refer to the Sun Fire V1280 Service Manual For RAS features that involve the Solaris o
73. i J Fri J Fri J e ul 19 15 Ul 19 15 ul 19 15 ul 19 15 ul 19 15 ion mode ul 19 15 ul 19715 ut AS LO ul 19 15 ul sL9 Lo lom e 11 Event Log Fri Jul 19 15 16 00 commando sc lom Boot ScApp 5 13 0007 RTOS SEG 516 itos 1 6 2165 245 27 PEN ES 528 228 06 08 10 TI T 29 46 52 03 08 commando sc commando sc commando sc commando sc commando sc commando sc commando sc commando sc commando sc commando sc Lom Lom Lom tom tom Lom Lom Lom Lom Lom Caching ID information Clock Source 75MHz NO PSO Status is OK NO PS1 Status is OK Chassis is in single Locator OFF Alarm 1 ON Alarm 2 ON Alarm 1 OFF Alarm 2 OFF Chapter 5 Using Lights Out Management and the System Controller from Solaris 49 Checking the Fans lom f To check status of the fans type CODE EXAMPLE 5 4 Sample Output from the lom f Command lom f Fans 1 OK speed self regulating 2 OK speed self regulating 3 OK speed self regulating 4 OK speed self regulating 5 OK speed self regulating 6 OK speed self regulating 7 OK speed self regulating 8 OK speed self regulating 9 OK speed 100 10 OK speed 100 If you need to replace a fan contact your local Sun sales representative and quote the part number of the component you need For information see the Sun Fire V1280 Netra 1280 Service Manual The information output f
74. id programs Installing LOMlite manual pages as lt SUNWlomm gt Installing part 1 of 1 71 blocks Installation of lt SUNWlomm gt was successful Resetting the System v To Forcibly Reset the System The reset command is used to reset the system in the event of a system hang or hardware problem If Solaris is running then you will be prompted to confirm this action Chapter 2 Starting and Setting Up the Sun Fire V1280 Netra 1280 27 lom gt reset This will abruptly terminate Solaris Do you want to continue no y NOTICE XIR on CPU 3 By default reset uses XIR externally initiated reset to reset the CPU processors in the system The externally initiated reset forces control of the processors into the OpenBoot PROM and begins the OpenBoot PROM s error reset recovery actions The error reset recovery actions preserve most of the Solaris states to allow the collection of data need for debugging the hardware and software including a Solaris operating environment core file After saving the debug information if the value of the OpenBoot PROM variable auto boot is true the system will boot Solaris The OpenBoot PROM s error reset recovery actions are controlled by setting the OpenBoot PROM error reset recovery configuration variable reset is prevented in standby and the message reset not allowed domain A keyswitch is set to off is displayed Note If the system is still hung you cannot lo
75. iguration Failures m Memoryona board is interleaved across boards before an attempt to unconfigure the board A process is bound to a CPU before an attempt to unconfigure the CPU m Memory remains configured on a system board before you attempt a CPU unconfigure operation on that board m The memory on the board is configured in use See Unable to Unconfigure Memory on a Board With Permanent Memory on page 108 m CPUson the board cannot be taken off line See Unable to Unconfigure a CPU on page 110 Chapter 9 CPU Memory Board Replacement and Dynamic Reconfiguration DR 107 Cannot Unconfigure a Board Whose Memory Is Interleaved Across Boards If you try to unconfigure a system board whose memory is interleaved across system boards the system displays an error message such as cfgadm Hardware specific failure unconfigure N0 SB2 memory Memory is interleaved across boards ssm 0 0 memory controller b 400000 Cannot Unconfigure a CPU to Which a Process is Bound If you try to unconfigure a CPU to which a process is bound the system displays an error message such as the following cfgadm Hardware specific failure unconfigure N0 SB2 cpu3 Failed to off line ssm 0 0 SUNW UltraSPARC III Unbind the process from the CPU and retry the unconfigure operation Cannot Unconfigure a CPU Before All Memory is Unconfigured All memory on a system board must be unconfigured before you try to u
76. ing the offset from Greenwich mean time GMT date and time to Thursday April 20 2000 at 18 hours 15 minutes and 10 seconds lom gt setdate t GMT 8 042018152000 10 If Solaris is running use the Solaris date command instead For more information on the setdate command refer to the Sun Fire V1280 Netra 1280 System Controller Command Reference Manual Chapter 2 Starting and Setting Up the Sun Fire V1280 Netra 1280 19 v To Set Up the Password 1 At the LOM prompt type the System Controller password command 2 Atthe Enter new password prompt type in your password 3 Atthe Enter new password again prompt re type in your password lom gt password Enter new password Enter new password again lom gt In the event that your password has been lost or forgotten contact SunService for advice v To Configure Network Parameters The Sun Fire V1280 Netra 1280 system can be administered from the System Controller LOM prompt and from Solaris There are two ways to access the LOM Console connection m Through the System Controller Serial port connection m Through a telnet network connection using the 10 100 Ethernet port Note The system can be administered solely through the serial port but if you want to use the 10 100 Ethernet port it is recommended that a separate secure subnet be used for this connection At the LOM prompt type setupnetwork lom gt setupnetwork Note If
77. internal temperature checking 53 internal voltage sensors 50 L logical attachment point 93 LOM escape sequence changing 56 monitoring the system 46 to online documentation 46 sample Event Log 49 setting the alarms 55 lom A command 55 lom E command 56 lom f command 50 lom G command 88 lom 1 command 48 LOM prompt accessing 39 LOM serial port 56 stopping event reporting 56 lom t command 53 lom v command 50 lom X command 56 maintenance 85 manual blacklisting 75 mapping 67 CPU Memory 67 I O assembly 69 node 67 memory interleaved 98 nonpermanent 97 permanent 97 reconfiguring 98 monitoring environmental conditions 4 N navigation procedures 29 network parameters setting 20 node mapping 67 nonpermanent memory 97 O On Standby switch 13 OpenBoot PROM variables 59 OpenBoot prompt accessing 41 overtemperature 79 P password command 20 password setting 20 permanent memory 97 physical attachment point 93 POST 59 controlling 59 63 118 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 OpenBoot PROM variables 59 power supply 82 powering on hardware 18 power off 15 to standby 15 poweroff command 17 power on 14 from standby 14 initial 14 poweron command 15 power on self test See POST printenv command 60 Q quiescence 92 R RAS 6 reboot on error OpenBoot variable 61 recovering from a hard hun
78. k2 docs sun com and Solaris are trademarks or registered trademarks of Sun Microsystems Inc in the U S and in other countries All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International Inc in the U S and in other countries Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems Inc The OPEN LOOK and Sun Graphical User Interface was developed by Sun Microsystems Inc for its users and licensees Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry Sun holds a non exclusive license from Xerox to the Xerox Graphical User Interface which license also covers Sun s licensees who implement OPEN LOOK GUIs and otherwise comply with Sun s written license agreements U S Government Rights Commercial use Government users are subject to the Sun Microsystems Inc standard license agreement and applicable provisions of the FAR and its supplements DOCUMENTATION IS PROVIDED AS IS AND ALL EXPRESS OR IMPLIED CONDITIONS REPRESENTATIONS AND WARRANTIES INCLUDING ANY IMPLIED WARRANTY OF MERCHANTABILITY FITNESS FOR A PARTICULAR PURPOSE OR NON INFRINGEMENT ARE DISCLAIMED EXCEPT TO THE EXTENT THAT SUCH DISCLAIMERS ARE HELD TO BE LEGALLY INVALID Copyright 2002 Sun Microsystems Inc 4150 Network Circle Santa Clara California 95054 Etats Unis To
79. l status information to the Solaris operating environment and Sun Management Center software for Sun Fire systems If hardware needs to be powered off the System Controller notifies the Solaris operating environment to perform a system shutdown Availability The software availability features include Dynamic Reconfiguration Power Failure System Controller Reboot Host Watchdog Chapter1 Overview 7 Dynamic Reconfiguration The following components can be dynamically reconfigured Hard disk drives CPU Memory boards Power supplies Fans Power Failure On recovery from a power outage the System Controller attempts to restore the system to its previous state System Controller Reboot The System Controller can be rebooted and will start up and resume management of the system The reboot does not disturb the currently running Solaris operating environment Host Watchdog The System Controller monitors the state of the Solaris operating environment and will initiate a reset if Solaris stops responding Serviceability The software serviceability features promote the efficiency and timeliness of providing routine as well as emergency service to the system a LEDs Nomenclature System Controller error logging System Controller XIR eXternally Initiated Reset support 8 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 LEDs All field replaceable units FRUs that are accessible from
80. ly the System Controller will go into single use mode and will only be accesible from the serial port Caution Before performing a flashupdate check the firmware revisions of all boards using the showboards p version command Caution If the System Controller application scapp or real time operating system rtos are to be updated you are strongly recommended to run the flashupdate command from a LOM shell running on the serial connection so that the results can be fully monitored Caution Before updating CPU Memory boards or the I O Assembly ensure that all boards to be updated are powered on by using the poweron command flashupdate Command Examples To update the flash PROMs on the System Controller 1 O Assembly and all CPU Memory boards type lom gt flashupdate f ftp lt host gt lt path gt all To update the System Controller application and real time operating system type lom gt flashupdate f ftp lt host gt lt path gt scapp rtos To bring CPU Memory boards sb2 and sb4 up to the same firmware level as board sb0 type lom gt flashupdate c sb0 sb2 sb4 A power cycle is required in order to activate the updated OpenBoot PROM Chapter 8 Firmware Upgrade Procedures 87 gt gt e gt Using the lom G Command There are four image types which may need to be transferred using this method with names of the following form sgpci flash contains I O board
81. nconfigure a CPU If you try to unconfigure a CPU before all memory on the board is unconfigured the system displays an error message such as cfgadm Hardware specific failure unconfigure NO SB2 cpu0 Can t unconfig cpu if mem online ssm 0 0 memory controller Unconfigure all memory on the board and then unconfigure the CPU Unable to Unconfigure Memory on a Board With Permanent Memory To unconfigure the memory on a board that has permanent memory move the permanent memory pages to another board that has enough available memory to hold them Such an additional board must be available before the unconfigure operation begins 108 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Memory Cannot Be Reconfigured If the unconfigure operation fails with a message such as the following the memory on the board could not be unconfigured cfgadm Hardware specific failure unconfigure N0 SB0 No available memory target ssm 0 0 memory controller 3 400000 Add to another board enough memory to hold the permanent memory pages and then retry the unconfigure operation To confirm that a memory page cannot be moved use the verbose option with the cfgadm command and look for the word permanent in the listing cfgadm av s select type memory Not Enough Available Memory If the unconfigure fails with one of the messages below there will not be enough available memory in
82. nconfigure and disconnect in a single command but if necessary each operation connection configuration unconfiguration or disconnection can be performed separately Hot Plug Hardware Hot plug devices have special connectors that supply electrical power to the board or module before the data pins make contact Boards and devices that have hot plug connectors can be inserted or removed while the system is running The devices have control circuits to ensure they have a common reference and power control during the insertion process The interfaces are not powered on until the board is home and the System Controller instructs them to The CPU Memory boards used in the Sun Fire V1280 Netra 1280 system are hot plug devices Conditions and States A state is the operational status of either a receptacle slot or an occupant board A condition is the operational status of an attachment point 94 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Before you attempt to perform any DR operation on a board or component from a system you must determine state and condition Use the cfgadm 1M command with the 1a options to display the type state and condition of each component and the state and condition of each board slot in the system See the section Component Types on page 97 for a list of the component types Board States and Conditions This section contains descriptions of the states and conditions of CP
83. ng 55 attachment points 93 auto boot OpenBoot variable 61 availability 7 B blacklisting components 75 manual 75 board condition 96 displaying status 99 occupant state 95 receptacle state 95 board status detailed 100 bootmode command 60 63 C cfgadm command 91 99 componens blacklisting 75 disabling 75 component condition 97 occupant state 96 receptacle state 96 state 96 type 97 condition component 95 CPU Memory board replacement 91 CPU Memory mapping 67 D date and time setting 19 device name mapping 67 device path names to physical system devices 67 diag level OpenBoot variable 60 diagnostic information displaying 82 disablecomponent command 75 disabling a component 75 domain console 4 Dynamic Reconfiguration 91 E enablecomponent command 75 environmental monitoring 4 error level OpenBoot variable 60 rror reset recovery OpenBoot variable 61 event reporting 56 117 F failures determining causes 83 fans checking status 50 fault LED checking status remotely 48 fault system 72 firmware upgrading 85 flashupdate command 85 H hangs determining causes 83 hard hung system recovering from 78 hardware powering on 18 hot plug devices 94 hung system recovering 77 hung system recovering from 78 I I O assemblies mapping 69 initial power on 14 interleave mode OpenBoot variable 61 interleave scope OpenBoot variable 61
84. ng the password command 4 If the password is accepted the System Controller indicates that a connection has been made If the system is in standby mode the 1om prompt is automatically displayed Connected lom gt Otherwise type a single carriage return and the Solaris console prompt will be displayed Connected Chapter 3 Console Navigation Procedures 35 5 If a connection to the LOM Console is already established over the serial port then you will be offered the opportunity to forcefully connect by logging out the other connection telnet lt system_controller_hostname gt TEV ING 123 123 123 595 5208 Connected to interpol sc Escape character is The console is already in use Host somehost acme com Connected May 24 10 27 Idle time 00 23 17 Force logout of other user y n y Connected lom gt In this case you should first use the LOM logout command on the serial connection to make the connection available Refer to the Sun Fire V1280 Netra 1280 System Controller Command Reference Manual for further details Y To Disconnect from the LOM Console When you have finished using the LOM Console you can disconnect your connection by using the logout command On the serial port the response is lom gt logout Connection closed When connected over the network the response is lom gt logout Connection closed Connection to lt system controller ho
85. nute to prevent the system overheating Refer to the Sun Fire V1280 Netra 1280 Service Manual for complete step by step board insertion procedures Power on test and configure the board using the cfgadm c configure command cfgadm c configure ap_id where ap_id is one of the following NO SBO NO SB2 or NO SB4 To Hot Swap a CPU Memory Board Caution For complete information about physically removing and replacing boards refer to the Sun Fire V1280 Netra 1280 Service Manual Failure to follow the stated procedures can result in damage to system boards and other components 1 Make sure you are properly grounded using a wrist strap 104 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Power off the board with cfgadm cfgadm c disconnect ap_id where ap_id is one of the following NO SBO NO SB2 or NO SB4 This command removes the resources from the Solaris operating environment and the OpenBoot PROM and powers off the board Verify the state of the Power and Hotplug OK LEDs The green Power LED will flash briefly as the CPU Memory board is cooling down In order to safely remove the board from the systems the green Power LED must be off and the amber Hotplug OK LED must be on Complete the hardware removal and installation of the board For more information refer to the Sun Fire V1280 Netra 1280 Service Manual After removing and installing board bring
86. oard poweroff Powers off a CPU Memory board The cfgadm_sbd man page provides additional information on the cfgadm c and cfgadm x options The sbd library provides the functionality for hot plugging system boards of the class sbd through the cfgadm framework Testing Boards and Assemblies To Test a CPU Memory Board Before you can test a CPU Memory board it must first be powered on and disconnected If these conditions are not met the board test fails You can use the Solaris cfgadm command to test CPU memory boards As superuser type cfgadm t ap id Sun Fire V1280 Netra 1280 System Administration Guide December 2002 To change the level of diagnostics that cfgadm runs supply a diagnostic level for the cfgadm command as follows cfgadm o platform diag lt level gt t ap id where level is a diagnostic level and ap id is one of the following NO SBO NO SB2 or NO SB4 If you do not supply level the default diagnostic level is set to the default The diagnostic levels are TABLE 9 11 Diagnostic Levels Diagnostic Level Description init Only system board initialization code is run No testing is done This is a very fast pass through POST quick All system board components are tested with few tests and test patterns default All system board components are tested with all tests and test patterns except for memory and Ecache modules Note that max and default are the same defini
87. oard the environment must vacate the memory on that board Vacating a board means flushing its nonpermanent memory to swap space and copying its permanent that is kernel and OpenBoot PROM memory to another memory board To relocate permanent memory the operating environment on a system must be temporarily suspended or quiesced The length of the suspension Chapter 9 CPU Memory Board Replacement and Dynamic Reconfiguration DR 97 depends on the system configuration and the running workloads Detaching a board with permanent memory is the only time when the operating environment is suspended therefore you should know where permanent memory resides so that you can avoid significantly impacting the operation of the system You can display the permanent memory by using the cfgadm 1M command with the v option When permanent memory is on the board the operating environment must find another memory component of adequate size to receive the permanent memory If that is not possible the DR operation will fail Limitations Memory Interleaving System boards cannot be dynamically reconfigured if system memory is interleaved across multiple CPU Memory boards Reconfiguring Permanent Memory When a CPU Memory board containing non relocatable permanent memory is dynamically reconfigured out of the system a short pause in all domain activity is required which may delay application response Typically this condition applies to one CPU Memory
88. on page 35 m Switching Between the Different Consoles on page 37 To Break to the LOM Prompt on page 39 To Connect to the Solaris Console from the LOM Prompt on page 39 To Break to the LOM Prompt from the OpenBoot PROM on page 40 To Break to the OpenBoot Prompt when Solaris is Running on page 41 To Terminate a Session If You Are Connected To the System Controller Through the Serial Port on page 41 To Terminate a Session If You Are Connected to the System Controller with telnet on page 42 29 Establishing a LOM Console Connection There are two ways to access the LOM Console connection m Through the System Controller Serial port connection m Through a telnet network connection using the 10 100 Ethernet port Under normal operation when Solaris is running or the system is in the OpenBoot PROM connecting to the LOM Console will automatically select a connection to the Solaris console otherwise a connection to the LOM prompt is made The LOM prompt is lom gt Accessing the LOM Console Using the Serial Port With the serial port you can connect to one of three types of device m ASCII terminal m Network terminal server m Workstation Please see the Sun Fire V1280 Site Preparation and Installation Guide for details of how to make the physical connections The procedure is different for each type of device v To Connect to an ASCII Terminal 1 If the LOM passw
89. or terms Read Chapter 6 in the User s Guide words to be emphasized Command line variable replace with a real name or value These are called class options You must be superuser to do this To delete a file type rm filename Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Shell Prompts Shell Prompt C shell machine_name C shell superuser machine_name Bourne shell and Korn shell Bourne shell and Korn shell superuser LOM shell lom gt Related Documentation Type of Book Title Part Number Service Sun Fire V1280 Netra 1280 Service Manual 817 0510 System Sun Fire V1280 Netra 1280 System Controller 817 0511 Controller Command Reference Manual Accessing Sun Documentation You can view and print a broad selection of Sun documentation including localized versions at http www sun com documentation Preface xvii Sun Welcomes Your Comments Sun is interested in improving its documentation and welcomes your comments and suggestions You can email your comments to Sun at docfeedback sun com Please include the part number of your document which is on the title page in the subject line of your email xviii Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CHAPTER 1 Overview This chapter presents a software overview of the Sun Fire V1280 Netra 1280 system The objective of this chapter is to provide you with a basic understanding o
90. or the System Controller is reset Messages stored on the system disk are available when Solaris is rebooted The display of the messages on the shared Solaris System Controller console port when at the lom gt prompt is controlled by the setevent reporting command see the Sun Fire V1280 Netra 1280 System Controller Command Reference Manual This determines whether a message is printed at the lom gt prompt at the time the message is logged and also whether it is posted to the Solaris logging system so that it is written to var adm messages 43 Main system hardware Main CPU Solaris messages x Disk var adm messages accessible when system is up and running Y Discard FIGURE 4 1 System Controller Logging System Controller LOM history log in ring buffer 128 messages Last messages entered available FIFO LOM writes message LOM port A RE Discard LOM commands gain access to history log whenever system is On or in standby mode i e System Controller not broken or un powered 44 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CHAPTER 5 Using Lights Out Management and the System Controller from Solaris This chapter explains how to use the LOM specific commands available in Solaris for monitoring and managing a Sun Fire V1280 Netra 1280 system To use these commands you should install the Lights Out Managemen
91. or the first time Chapter 3 describes how to navigate within the system controller Chapter 4 explains System Controller message logging Chapter 5 describes how to use the LOM from the Solaris console Chapter 6 describes how to run the power on self test POST Chapter 7 describes troubleshooting information including LEDs system faults displaying diagnostic information displaying system configuration information disabling components blacklisting and mapping device path names to physical system devices XV Chapter 8 provides information on firmware updates including how to update the flash PROMs and the procedure for updating the system controller firmware Chapter 9 describes Dynamic Reconfiguration and the procedures you can use Using UNIX Commands This book assumes you are experienced with the UNIX operating environment If you are not experienced with the UNIX operating environment see one or more of the following for this information m AnswerBook2 online documentation for the Solaris operating environment m Other software documentation that you received with your system Typographic Conventions Typeface Meaning Examples AaBbCc123 The names of commands files Edit your login file and directories on screen Use 1s a to list all files computer output You have mail AaBbCc123 What you type when su contrasted with on screen Password computer output AaBbCc123 Book titles new words
92. ord has been set and the previous connection was logged out you will be prompted for a password Enter Password Enter the correct password as previously set up using the password command 30 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 2 If the password is accepted the System Controller indicates that a connection has been made If the system is in standby mode the 1om prompt is automatically displayed Connected lom gt Otherwise type a single carriage return and the Solaris console prompt will be displayed Connected If a connection to the LOM console is already established over the network port then you will be offered the opportunity to forcefully connect by logging out the other connection Enter Password The console is already in use Host somehost acme com Connected May 24 10 27 Idle time 00 23 17 Force logout of other user y n y Connected lom gt Otherwise type a single carriage return and the Solaris console prompt will be displayed Connected Chapter 3 Console Navigation Procedures 31 v To Connect to a Network Terminal Server 1 You will be provided with a menu of various servers to which you can connect Select the required server 2 If the LOM password has been set and the previous connection was logged out you will be prompted for a password Enter Password Enter the correct password as p
93. perating environment refer to the Sun Hardware Platform Guide Reliability The software reliability features include m Disabling Components or Boards and Power On Self Test POST m Manual Disabling of Components Environmental Monitoring The reliability features also improve system availability 6 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Disabling Components or Boards and Power On Self Test POST The power on self test POST is part of powering on the system If the board or component failed testing POST disables components or boards The showboards command displays the board as either being failed or degraded The system running the Solaris operating environment is booted only with components that have passed POST testing Manual Disabling of Components The system controller provides component level status and user controlled disabling of components which is also referred to as blacklisting You can add a faulty component to a blacklist with the disablecomponent command Components in the blacklist will not be configured You can remove a component from the blacklist with the enablecomponent command The showcomponent command displays status information about the component including whether or not it has been disabled Environmental Monitoring The System Controller monitors the system s temperature cooling and voltage sensors The System Controller provides the latest environmenta
94. power on your system using the System Controller command line interface LOM prompt how to set up the System Controller using the setupnetwork command and how to boot the Solaris operating environment This chapter contains the following topics Installing and Cabling Hardware on page 12 Using the Power On Standby Switch on page 13 Powering On and Off on page 14 Setting Up the System on page 19 Installing and Booting the Solaris Operating Environment on page 22 Resetting the System on page 27 The list below summarizes the major steps you must perform to power on and set up the system which are explained using step by step procedures 1 2 3 4 5 6 7 co Install and cable the hardware Apply external power to the hardware Set the date and time for the system Set the password for the System Controller Set up system specific parameters with the setupnetwork command Power on all hardware with the poweron command If the Solaris operating environment is not pre installed install it Boot the Solaris operating environment Install the Lights Out Management packages from the Solaris Supplementary CD 11 Installing and Cabling Hardware 1 Connect a terminal to the System Controller board serial port Refer to FIGURE 1 1 2 Set up the terminal to use the same baud rate as the serial port The serial port settings of the System Controller board are m
95. r Command Reference Manual 56 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 To turn serial event reporting on again type lom E on Upgrading the Firmware lom G filename For a full description refer to Chapter 8 Chapter 5 Using Lights Out Management and the System Controller from Solaris 57 58 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CHAPTER 6 Running POST Each of the system boards CPU Memory boards and IB_SSC Assembly contain a flash PROM that provides storage for power on self test POST diagnostics POST tests the following CPU chips External cache Memory Bus interconnect I O ASICs I O buses POST provides several diagnostic levels which can be selected using the OpenBoot PROM variable diag level In addition the bootmode command enables the POST settings to be declared for the next system reboot There is a separate POST which runs on the System Controller and which can be controlled using the setupsc command OpenBoot PROM Variables for POST Configuration The OpenBoot PROM enables you to set variables that configure how POST runs These are described in the OpenBoot 4 x Command Reference Manual You can use the OpenBoot printenv command to display the current settings 3 ok printenv diag level diag level init init 59 You can use the OpenBoot PROM setenv command to change the current setting of a
96. reassigned by the chassis port or released Active The board is being actively used You cannot reassign an active board Displaying Basic Board Status The cfgadm program displays information about boards and slots Refer to the cfgadm 1 man page for options to this command Many operations require that you specify the system board names To obtain these system names type cfgadm Chapter 9 CPU Memory Board Replacement and Dynamic Reconfiguration DR 99 100 When used without options cfgadm displays information about all known attachment points including board slots and SCSI buses The following display shows a typical output CODE EXAMPLE 9 1 Output of the Basic cfgadm Command cfgadm Ap_Id Type Receptacle Occupant Condition NO IB6 PCI_I O_Boa connected configured ok NO SBO CPU_Board connected configured unknown NO SB4 unknown emptyunconfigured unknown co scsi bus connected configured unknown cil scsi bus connected unconfigured unknown C2 scsi bus connected unconfigured unknown c3 scsi bus connected configured unknown Displaying Detailed Board Status For a more detailed status report use the command cfgadm av The a option lists attachment points and the v option turns on expanded verbose descriptions CODE EXAMPLE 9 2 is a partial display produced by the cfgadm av command The output appears complicated because the lines wrap around in this display This status report is for the same s
97. revent system overheating v To Disconnect a CPU Memory Board Temporarily You can use DR to power down the board and leave it in place For example you might want to do this if the board fails and a replacement board or a system board filler panel is not available Detach and power off the board using the cfgadm c disconnect command cfgadm c disconnect ap_id where ap_id is one of the following NO0 SBO NO SB2 or NO SB4 106 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Troubleshooting This section discusses common types of failure m Unconfigure Operation Failure on page 35 m Configure Operation Failure on page 41 The following are examples of cfgadm diagnostic messages Syntax error messages are not included here cfgadm cfgadm cfgadm cfgadm cfgadm cfgadm WARNING hardware component is busy try again operation Data error error_text operation Hardware specific failure error_text operation Insufficient privileges operation Operation requires a service interruption System is busy try again Processor number number failed to offline See the following man pages for additional error message detail cfgadm 1M cfgadm_sbd 1M and config_admin 3X Unconfigure Operation Failure An unconfigure operation for a CPU Memory board can fail if the system is not in a correct state before you begin the operation CPU Memory Board Unconf
98. reviously set up using the password command 3 If the password is accepted the System Controller indicates that a connection has been made If the system is in standby mode the 1om prompt is automatically displayed Connected lom gt Otherwise type a single carriage return and the Solaris console prompt will be displayed Connected 32 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 4 If a connection to the LOM console is already established over the network port then you will be offered the opportunity to forcefully connect by logging out the other connection Enter Password The console is already in use Host somehost acme com Connected May 24 10 27 Idle time 00 23 17 Force logout of other user y n y Connected lom gt Otherwise type a single carriage return and the Solaris console prompt will be displayed Connected v To Connect to Serial Port B of a Workstation 1 At the Solaris shell prompt type tip hardwire See the tip man page for a complete description of the tip command 2 If the LOM password has been set and the previous connection was logged out you will be prompted for a password Enter Password Enter the correct password as previously set up using the password command Chapter 3 Console Navigation Procedures 33 34 3 If the password is accepted the System Controller indicates that a connection h
99. rning Driver lomv successfully added to system but failed to attach i drivers INFO Adding driver lom driver lom aliases SUNW lomh SUNW lom link SUNW lomh SUNW lom spec lom Adding devlink entry type ddi_pseudo name SUNW lomh lom Adding devlink entry type ddi_pseudo name SUNW lom lom 24 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CODE EXAMPLE 2 3 Installing the LOM Drivers Continued adding driver with aliases SUNW lomh SUNW lom perm 0644 root sys devfsadm driver failed to attach lom Warning Driver lom successfully added to system but failed to attach i drivers SUCCESS Finished verifying class lt drivers gt Installation of lt SUNWlomr gt was successful Note The WARNING messages concerning lomp lomv and lom driver attachment seen during the installation of the SUWNlomr package can be safely ignored since the SUNWlomr package is not used on the Sun Fire V1280 Netra 1280 system However the presence of the package is required to allow successful upgrade via future patches v To Install the LOM Utility As root type CODE EXAMPLE 2 4 Installing the LOM Utility pkgadd d SUNWlomu Processing package instance lt SUNWlomu gt from lt cdrrom suppcd_s28u7_multi_s28u7_supp 08all Lights_Out_Management_2 0 Product gt LOMlite Utilities usr sparc 2 0 REV 2000 08 2
100. rom this command is also contained in the output from the Solaris prtdiag v command Checking the Internal Voltage Sensors lom v The v option displays the status of the Sun Fire V1280 Netra 1280 system internal voltage sensors To check the status of the supply rails and internal voltage sensors type CODE EXAMPLE 5 5 Sample Output from the lom v Command lom v Supply voltages i SSCL v1 gt vde0 status ok 2 SSCL v_3 3vdc0 status ok 3 SSC1 v_5vdc0 status ok 4 RPO v_1 5vdc0 status ok 5 RPO v_3 3vdc0 status ok 6 RP2 v_1 5vdc0 status ok 7 RP2 v_3 3vdc0 status ok 50 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CODE EXAMPLE 5 5 Sample Output from the lom v Command Continued I oO O1 amp NN OH Lo 19 20 21 22 23 24 25 26 27 SBO SBO SB0 SB0 SB0 SBO SB2 SB2 SB2 SB2 SB2 SB2 IB6 IB6 IB6 IB6 IB6 IB6 IB6 IB6 PO P1 P2 P3 PO PI P2 P3 System status 1 JU DAWN DO J UH BW N D ND ND ON ND ON N QU amp N H Lo PSO PS1 FTO FTO FANO FTO FANI FTO FAN2 FTO FAN3 FTO FAN4 FTO FAN5 FTO FAN6 FTO FAN7 RPO RP2 SBO SBO SB0 SB0 SB0 SB0 SB0 SB0 SB0 SB0 SB0 SB0 SB0 PO P0 B0 P0 B0 P0 B0 P0 B0 P1 P1 B0 P1 B0 P1 B0 P1 B0 P2 P2 B0 v_1 v_3 VE SOS SO OS SOS OS OO TO Wn O G U OU OU U vu vu vu vv VU v ct w DO
101. rs 23 Installing the LOM Utility 25 Installing the LOM Manual Pages 27 Sample Output from the lom c Command 48 Sample Output from the 1om 1 Command 48 Sample LOM Event Log Oldest Event Reported First 49 Sample Output from the 1om f Command 50 Sample Output from the lom v Command 50 Sample Output from the lom t Command 53 POST Output Using max Setting 62 Setting SCPOST Diagnostic Level to min 64 SCPOST Output with Diagnostic Level Set to min 65 Downloading the sgpci flash Image 89 Downloading the sgcpu flash Image 89 Output of the Basic cfgadm Command 100 Output of the cfgadm av Command 100 xiii xiv Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Preface This book presents a step by step description on how to power on and customize the platform setup This book describes information on system controller security software steps to power off the system how to perform firmware updates how to remove and replace system boards CPU Memory boards and I O assemblies PCI cards plus the software steps needed to remove the System Controller and Repeater boards troubleshooting and a glossary of technical terms How This Book Is Organized Chapter 1 describes the system controller and explains board states describes redundant system components minimum system configurations and reliability serviceability and availability Chapter 2 describes how to power on and set up the system f
102. shutdown 87 degC t_sdc0 68 degC warning 102 degC shutdown 107 degC t_ar0 77 degC warning 102 degC shutdown 107 degC t_dx0 76 degC warning 102 degC shutdown 107 degC t_dxl 78 degC warning 102 degC shutdown 107 degC t_sbbc0 51 degC warning 102 degC shutdown 107 degC t_schizo0 48 degC warning 102 degC shutdown 107 degC t_schizol 53 degC warning 102 degC shutdown 107 degC The information output from this command is also contained in the output from the Solaris prtdiag v command Viewing All Component Status Data and the LOM Configuration Data lom a To view all LOM status and configuration data type lom a 54 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Other LOM Tasks Performed From Solaris This section explains how to Turn the alarm indicators on and off Change the LOM escape sequence Stop the LOM from sending reports to the console Upgrade the firmware Turning Alarms On and Off Lom A There are two alarms associated with the LOM They are associated with no specific conditions but are software flags available to be set by your own processes or from the command line To turn an alarm on from the command line type lom A on n where n is the number of the alarm you want to set 1 or 2 To turn the alarm off again type lom A off n where n is the number of the alarm you want to turn off 1 or 2 Chapter 5 Using Lights Out Mana
103. sical Slot Number 1 O Controller Number Bus ssm 0 0 pci 19 700000 3 4 1 B ssm 0 0 pci 19 600000 1 y 1 A ssm 0 0 pci 19 600000 2 Z 1 A where w onboard LSI1010R SCSI controller x onboard CMD646U2 EIDE controller y onboard Gigaswift Ethernet controller 0 z onboard Gigaswift Ethernet controller 1 and is dependent upon the type of PCI card installed in the slot Note the following m 600000 is the bus offset and indicates bus A which operates at 66 MHz m 700000 is the bus offset and indicates bus B which operates at 33 MHz m 3 is the device number In this example 3 means it is the third device on the bus 000002 81 9d 0 0 uwss Z 000002 81 1900 0 0 uss 000002 61 9d 0 0 wss Z 000002 61 190 0 0 uss 000002 61 90 0 0 uss 000009 81 O10d 0 0 wss Slots 0 1 N wo A ol FIGURE 7 1 Sun Fire V1280 Netra 1280 IB_SSC PCI Physical Slot Designations for IB6 Chapter 7 Troubleshooting 71 where is dependent upon the type of PCI card installed in the slot For instance m Dual Differential Ultra SCSI card 375 0006 in Slot 4 m FC AL card 375 3019 in Slot 3 m FC AL card 375 3019 in Slot 2 would generate device paths as follows 19 700000 scsi 3 1 19 700000 scsi 3 1 scsi 2 19 700000 scsi 3 1 tape byte 19 700000 scsi 3 1 disk block 19 700000 scsi 3 scsi 2 19 700000 scsi 3 tape
104. splayed No error messages are displayed The failing test name is displayed 60 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 TABLE 6 1 POST Configuration Parameters Continued Parameter Value max Description All relevant error statuses are displayed interleave scope interleave mode reboot on error use nvramrc auto boot error reset recovery default value within board default value across boards optimal default value fixed off false default value true true false default value true default value false The memory banks on a system board will be interleaved with each other The memory will be interleaved on all memory banks across all of the boards in the system The memory is mixed size interleaving in order to gain optimal performance The memory is fixed size interleaving There is no memory interleaving The system will be paused when there is an error The system will be rebooted This parameter is the same as the OpenBoot PROM nvramrc parameter This parameter uses aliases that are stored in nvramrc The OpenBoot PROM executes the script stored in nvramrc if this parameter is set to true The OpenBoot PROM does not evaluate the script stored in nvramrc if this parameter is set to false Controls booting of the Solaris operating environment If this value is true the system boots automatically after
105. st gt closed by foreign host 36 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Switching Between the Different Consoles The System Controller console connection provides access to the System Controller LOM command line interface or the Solaris OpenBoot PROM console This section describes the procedures how to navigate between the LOM prompt m Solaris system console m OpenBoot PROM These procedures are summarized in FIGURE 3 1 Chapter 3 Console Navigation Procedures 37 Solaris Operating Environment Type the escape sequence OpenBoot PROM Type the escape sequence LOM shell Type lom gt console Type lom gt break FIGURE 3 1 Navigation Procedures 38 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 v To Break to the LOM Prompt When connected to the Solaris console typing the escape sequence will take the console into the LOM prompt By default the escape sequence is set to That is a sign followed by a period For instance if the escape sequence is the default of you will see lom gt Selecting an Escape Sequence If you are typing at the console and type the first character of the escape sequence there is a one second delay before the character appears on the screen This is because the system waits to see if the next character in the escape sequence is about to be typed The second character must be t
106. stem Fault LED and alarms are on or off type CODE EXAMPLE 5 2 Sample Output from the lom 1 Command lom 1 LOM alarm states Alarml off Alarm2 off Alarm3 on Fault LED off Alarms 1 and 2 are software flags They are associated with no specific conditions but are available to be set by your own processes or from the command line see Turning Alarms On and Off Lom A on page 55 Alarm 3 is UNIX Running and is not user configurable 48 Sun Fire V1280 Netra 1280 System Administration Guide December 2002 Viewing the Event Log lom e To see the event log type lom e n x where n is the number of reports up to 128 that you want to see and x specifies the level of reports you are interested in There are four levels of event 1 Fatal events 2 Warning events 3 Information events 4 User events not used on Sun Fire V1280 Netra 1280 systems If you specify a level you will see reports for that level and above For example if you specify level 2 you will see reports of level 2 and level 1 events If you specify level 3 you will see reports of level 3 level 2 and level 1 events If you do not specify a level you will see reports of level 3 level 2 and level 1 events CODE EXAMPLE 5 3 shows a sample event log display CODE EXAMPLE 5 3 Sample LOM Event Log Oldest Event Reported First LOMlit 23 Fri J Fri J Fri J Fri J Fri J partit Fri J Fri J Fr
107. stem on or off Testing the CPU Memory board Resetting the System Controller Marking components as faulty or OK Upgrading firmware Displaying the current System Controller settings password setescape seteventreporting setupnetwork setupsc setalarm setlocator poweron poweroff reset shutdown testboard resetsc disablecomponent enablecomponent flashupdate showescape showeventreporting shownetwork showsc Chapter 1 Overview 3 TABLE 1 1 Selected Management Tasks Continued System Controller Management Tasks Displaying the current system state Setting the date time and time zone Displaying the date and time System Controller Commands To Use showalarm showboards showcomponent showenvironment showfault showhostname showlocator showlogs showmodel showresetstate setdate showdate Solaris Console If the Solaris operating environment the OpenBoot PROM or POST is running you can access the Solaris console When you connect to the Solaris console you will be in one of the following modes of operation m Solaris operating environment console or prompts m OpenBoot PROM ok prompt m System will be running POST and you can view the POST output To switch between these prompts and the LOM prompt refer to Switching Between the Different Consoles on page 37 Environmental Monitoring There are sensors that monitor temperature voltage and cooling
108. t 2 0 packages SUNWlomr SUNWlomu and SUNWlomm from the Solaris Supplemental CD Refer to To Install the Lights Out Management Packages on page 23 for a description of how to install the LOM packages Note The latest patches to these packages is available from SunSolve in patch 110208 It is strongly advised that the latest version of patch 110208 is obtained from SunSolve and is installed on the Sun Fire V1280 Netra 1280 to make use of the latest LOM utility updates The chapter contains the following sections m Monitoring the System From Solaris on page 46 m Other LOM Tasks Performed From Solaris on page 55 LOM Command Syntax lom c 1 f v t a G X lom e lt n gt x lom A on off lt n gt lom E on off where c displays LOM configuration 45 1 displays the status of the Fault and Alarms LEDs e displays the event log f displays fan status This information is also displayed in the output from the Solaris prtdiag v command v displays the status of the voltage sensors This information is also displayed in the output from the Solaris prtdiag v command t displays temperature information This information is also displayed in the output from the Solaris prtdiag v command a displays all component status data A turns alarms on and off X changes the escape sequence E switches event logging to the console on and off G upgrades the
109. t the 10 100 System Controller Ethernet port is connected to a suitable network and is configured so that it can see an external ftp or http server which contains the new firmware images to be downloaded Using the flashupdate Command The flashupdate command requires that the 10 100 Ethernet port can access an external ftp or http server The flashupdate command updates the flash PROMs in the System Controller and the system boards CPU Memory boards and I O assembly The source flash image is normally held on an NFS server In the case of CPU Memory boards you can update one board with the flash image from another The syntax for the flashupdate command is flashupdate yl n f lt url gt all systemboards scapp rtos lt board gt flashupdate y l n c lt source_board gt lt destination_board gt flashupdate y n u where y does not prompt for confirmation n does not execute this command if confirmation is required 85 f specifies a URL as the source of the flash images This option requires a network connection with the flash image held on an NFS server Use this option to install new firmware lt url gt is the URL of the directory containing the flash images and must be of the form ftp lt userid gt lt password gt lt hostname gt lt path gt or http lt hostname gt lt path gt all causes all boards CPU Memory I O Assembly and System Controller to be updated This action re
110. the system if the board is removed cfgadm Hardware specific failure unconfigure N0 SB0 Insufficient memory Reduce the memory load on the system and try again If practical install more memory in another board slot Memory Demand Increased If the unconfigure fails with the following message the memory demand has increased while the unconfigure operation was proceeding cfgadm Hardware specific failure unconfigure N0 SB0 Memory operation failed cfgadm Hardware specific failure unconfigure NO SBO Memory operation refused Reduce the memory load on the system and try again Chapter 9 CPU Memory Board Replacement and Dynamic Reconfiguration DR 109 Unable to Unconfigure a CPU CPU unconfiguration is part of the unconfiguration operation for a CPU Memory board If the operation fails to take the CPU offline the following message is logged to the console WARNING Processor number failed to offline This failure occurs if m The CPU has processes bound to it m The CPU is the last one in a CPU set m The CPU is the last online CPU in the system Unable to Disconnect a Board It is possible to unconfigure a board and then discover that it cannot be disconnected The cfgadm status display lists the board as not detachable This problem occurs when the board is supplying an essential hardware service that cannot be relocated to an alternate board Configure Operation Failure CPU Memory
111. tion max All system board components are tested with all tests and test patterns except for memory and Ecache modules Note that max and default are the same definition meml Runs all tests at the default level plus more exhaustive DRAM and SRAM test algorithms For Memory and Ecache modules all locations are tested with multiple patterns More extensive time consuming algorithms are not run at this level mem2 The same as mem1 with the addition of a DRAM test that does explicit compare operations of the DRAM data Chapter 9 CPU Memory Board Replacement and Dynamic Reconfiguration DR 103 gt Bb A Installing or Replacing CPU Memory Boards Caution Physical board replacement should only be carried out by qualified service personnel To Install a New Board Caution For complete information about physically removing and replacing CPU Memory boards refer to the Sun Fire V1280 Netra 1280 Service Manual Failure to follow the stated procedures can result in damage to system boards and other components Note When replacing boards you sometimes need filler panels If you are unfamiliar with how to insert a board into the system read the Sun Fire V1280 Netra 1280 Service Manual before you begin this procedure Make sure you are properly grounded with a wrist strap After locating an empty slot remove the system board filler panel from the slot Insert the board into the slot within one mi
112. us droits r serv s Sun Microsystems Inc a les droits de propri t intellectuels relatants la technologie incorpor e dans le produit qui est d crit dans ce document En particulier et sans la limitation ces droits de propri t intellectuels peuvent inclure un ou plus des brevets am ricains num r s http www sun com patents et un ou les brevets plus suppl mentaires ou les applications de brevet en attente dans les Etats Unis et dans les autres pays Ce produit ou document est prot g par un copyright et distribu avec des licences qui en restreignent l utilisation la copie la distribution et la d compilation Aucune partie de ce produit ou document ne peut tre reproduite sous aucune forme parquelque moyen que ce soit sans l autorisation pr alable et crite de Sun et de ses bailleurs de licence s il y ena Le logiciel d tenu par des tiers et qui comprend la technologie relative aux polices de caract res est prot g par un copyright et licenci par des fournisseurs de Sun Des parties de ce produit pourront tre d riv es des syst mes Berkeley BSD licenci s par l Universit de Californie UNIX est une marque d pos e aux Etats Unis et dans d autres pays et licenci e exclusivement par X Open Company Ltd Sun Sun Microsystems le logo Sun AnswerBook2 docs sun com et Solaris sont des marques de fabrique ou des marques d pos es de Sun Microsystems Inc aux Etats Unis et dans d autres pa
113. variable diag level quick 1 ok setenv diag level quick For example you can configure POST to run fastest by using diag level init 1 ok setenv diag level init 1 ok setenv verbosity level off verbosity level off This has the same effect as using the System Controller command bootmode skipdiag at the LOM prompt The difference is that by using the OpenBoot command the settings remain permanent until you change them again TABLE 6 1 POST Configuration Parameters Parameter Value Description diag level init default value quick max meml mem2 verbosity level off min default value max error level off min Only system board initialization code is run No testing is done This is a very fast pass through POST All system board components are tested using few tests with few test patterns All system board components are tested with all tests and test patterns except for memory and Ecache modules For memory and Ecache modules all locations are tested with multiple patterns More extensive time consuming algorithms are not run at this level Runs all tests at the default level plus more exhaustive DRAM and SRAM test algorithms This is the same as mem1 with the addition of a DRAM test that does explicit compare operations of the DRAM data No status messages are displayed Test names status messages and error messages are displayed Subtest trace messages are di
114. vice activities It may take a few minutes to achieve quiescence depending on system usage and activities currently in progress If the operating environment cannot achieve quiescence it displays the reasons which may include the following m An execution thread did not suspend m Real time processes are running m A device exists that cannot be paused by the operating environment The conditions that cause processes to fail to suspend are generally temporary Examine the reasons for the failure If the operating environment encountered a transient condition a failure to suspend a process you can try the operation again RPC or TCP Time out or Loss of Connection Time outs occur by default after two minutes Administrators may need to increase this time out value to avoid time outs during a DR induced operating system quiescence which may take longer than two minutes Quiescing a system makes the system and related network services unavailable for a period of time that can exceed two minutes These changes affect both the client and server machines Suspend Safe and Suspend Unsafe Devices When DR suspends the operating environment all of the device drivers that are attached to the operating environment must also be suspended If a driver cannot be suspended or subsequently resumed the DR operation fails Sun Fire V1280 Netra 1280 System Administration Guide December 2002 A suspend safe device does not access memory or interrupt
115. while the system is in secure mode Refer to the Sun Fire V1280 Netra 1280 System Controller Command Reference Manual for more details 4 This step depends on the setting of the Open Boot PROM rror reset recovery configuration variable m If the error reset recovery configuration variable is set to none the system returns immediately to the OpenBoot PROM When the OpenBoot PROM takes control it takes actions based on the setting of the OpenBoot PROM rror reset recovery configuration variable You can type any OpenBoot PROM command from the ok prompt including rebooting the Solaris operating environment with the boot command Also you can force a core file with the sync command The actions that can be configured by this variable might mean that the system will not return to the ok prompt m If the error reset recovery configuration variable is not set to none the OpenBoot PROM will automatically take recovery actions m If the error reset recovery configuration variable is set to sync default the system generates a Solaris operating environment core file and reboots the system m Ifthe OpenBoot PROM error reset recovery configuration variable is set to boot the system is rebooted 5 If the previous actions fail to reboot the system use the poweroff and poweron commands to power cycle the system To power off the system type lom gt poweroff To power on the system type lom gt poweron
116. yped within this one second window If all the characters in the escape sequence are typed then the lom gt prompt appears If the next character to be typed is not the next character in the escape sequence then the characters belonging to the escape sequence that were typed are output to the screen It is recommended that you choose an escape sequence that does not start with a sequence of characters that is frequently typed at the console otherwise the delay between your striking the keys and the character appearing on the screen may be confusing v To Connect to the Solaris Console from the LOM Prompt To connect to the Solaris console use the console command from the LOM prompt then type a carriage return If Solaris is running the system will respond with the Solaris prompt lom gt console Chapter 3 Console Navigation Procedures 39 If the system was in the OpenBoot PROM then the system will respond with the OpenBoot PROM prompt lom gt console 2 ok If the system is in standby mode the following message will be generated lom gt console Solaris is not active v To Break to the LOM Prompt from the OpenBoot PROM The process of moving from the OpenBoot PROM to the LOM prompt is the same as moving from Solaris to the LOM prompt Type the sequence of escape characters default 2 ok lom gt 40 Sun Fire V1280 Netra 1280 System Administration Guide Dec
117. ys Toutes les marques SPARC sont utilis es sous licence et sont des marques de fabrique ou des marques d pos es de SPARC International Inc aux Etats Unis et dans d autres pays Les produits protant les marques SPARC sont bas s sur une architecture d velopp e par Sun Microsystems Inc L interface d utilisation graphique OPEN LOOK et Sun a t d velopp e par Sun Microsystems Inc pour ses utilisateurs et licenci s Sun reconna t les efforts de pionniers de Xerox pour la recherche et le d veloppment du concept des interfaces d utilisation visuelle ou graphique pour l industrie de l informatique Sun d tient une license non exclusive do Xerox sur l interface d utilisation graphique Xerox cette licence couvrant galement les licenci es de Sun qui mettent en place l interface d utilisation graphique OPEN LOOK et qui en outre se conforment aux licences crites de Sun LA DOCUMENTATION EST FOURNIE EN L TAT ET TOUTES AUTRES CONDITIONS DECLARATIONS ET GARANTIES EXPRESSES OU TACITES SONT FORMELLEMENT EXCLUES DANS LA MESURE AUTORISEE PAR LA LOI APPLICABLE Y COMPRIS NOTAMMENT TOUTE GARANTIE IMPLICITE RELATIVE A LA QUALITE MARCHANDE A L APTITUDE A UNE UTILISATION PARTICULIERE OU A L ABSENCE DE CONTREFA ON LD aon CA Adobe PostScript Contents Overview 1 System Controller 1 I O Ports 2 LOM Prompt 3 Solaris Console 4 Environmental Monitoring 4 System Indicator Board 4 Reliability Availability and Serviceability
118. ystem used in CODE EXAMPLE 9 1 FIGURE 9 1 provides details of each display item CODE EXAMPLE 9 2 Output of the cfgadm av Command cfgadm av Ap_Id Receptacle Occupant Condition Information When Type Busy Phys_Id 0 IB6 connected configured ok powered on assigned Apr 3 18 04 PCI_I O_Boa n devices ssm 0 0 N0 IB6 0 IB6 pci0 connected configured ok device ssm 0 0 pci 19 70000 Apr 3 18 04 io n devices ssm 0 0 N0 IB6 pci0 0 IB6 pcil connected configured ok device ssm 0 0 pci 19 600000 Apr 3 18 04 io n devices ssm 0 0 N0 IB6 pcil 0 IB6 pci2 connected configured ok device ssm 0 0 pci 18 700000 Apr 3 18 04 io n devices ssm 0 0 N0 IB6 pci2 0 IB6 pci3 connected configured ok device ssm 0 0 pci 18 600000 Apr 3 18 04 io n devices ssm 0 0 N0 IB6 pci3 0 SBO connected configured unknown powered on assigned Sun Fire V1280 Netra 1280 System Administration Guide December 2002 CODE EXAMPLE 9 2 Output of the cfgadm av Command Continued Apr 3 18 04 CPU Board n devices ssm 0 0 N0 SBO 0 SB0 cpu0 connected configured ok cpuid 0 speed 750 MHz he 8 MBytes 18 04 cpu n devices ssm 0 0 N0 SB0 cpu0 0 SB0 cpul connected configured ok cpuid 1 speed 750 MHz 8 MBytes 18 04 cpu n devices ssm 0 0 N0 SB0 cpul 0 SB0 cpu2 connected configured ok cpuid 2 speed 750 MHz cache 8 MBytes 18 04 cpu n devices ssm 0 0 N0 SB0 cpu2 e Baw Q Ww O N Ww
Download Pdf Manuals
Related Search
Related Contents
ImageQuant™ LAS 500 - GE Healthcare Life Sciences Aga Ranges Legacy 600 CFM User's Manual Système de Télévigilance 7 Professionnel Marantz SR7500 Stereo System User Manual La R&D industrielle, une clé pour l`avenir Manuale Tecnico - VB56 Vernier Motion Detector User Manual Manual de Usuario - Domotica y Seguridad Copyright © All rights reserved.
Failed to retrieve file