Instruction and maintenance manual for the Sun Microsystems Sun Fire T1000
Sun Microsystems, Inc. www.sun.com
Submit comments about this document at: http://www.sun.com/hwdocs/feedback
Sun Fire™ T1000 Server Service Manual
Part No.
Please Recycle
Copyright 2006 Sun Microsystems, Inc., 4150 Network Circle, Santa Clara, California 95054, U.S.A. All rights reserved. Sun Microsystems, Inc.
Contents
Preface
1. Sun Fire T1000 Server Overview
Sun Fire T1000 Server Features
Chip-Multithreaded (CMT) Multicore Processor and Memory Technology
Performance Enhancements
Remote Mana.
Running ALOM Service-Related Commands
Connecting to ALOM
Switching Between the System Console and ALOM
Service-Related ALOM Commands 2.
For further information, refer to the documents that accompany the SunVTS software
3. Removing and Replacing FRUs
Safety Information
Safety Symbols
Electrostatic Discharge Safe.
▼ To Replace the Clock Battery on the Motherboard
Common Procedures for Finishing Up
▼ To Replace the Top Cover
▼ To Reinstall the Server Chassis in the Rack
▼ To Apply Power to the Server
A.
Preface
The Sun Fire T1000 Service Manual provides information to aid in troubleshooting problems with and replacing components within the Sun Fire™ T1000 server. This manual is written for technicians, service personnel, and system administrators who service and repair computer systems.
Sun Fire T1000 Server Service Manual • January 2006
Using UNIX Commands
Use this section to alert readers that not all UNIX commands are provided. For example: This document might not contain information on basic UNIX® commands and procedures such as shutting down the system, booting the system, and configuring devices.
Typographic Conventions
Typeface (1)
(1) The settings on your browser might differ from these settings.
Shell Prompts
Sun Fire T1000 Server Documentation
You can view and print the following documents from the Sun documentation web site at http://www.sun.com/documentation
Accessing Sun Documentation
You can view, print, or purchase a broad selection of Sun™ documentation, including localized versions, at: http://www.sun.
or resources. Sun will not be responsible or liable for any actual or alleged damage or loss caused by or in connection with the use of or reliance on any such content, goods, or services that are available on or through such sites or resources.
CHAPTER 1
Sun Fire T1000 Server Overview
This chapter provides an overview of the features of the Sun Fire T1000 server. The following topics are covered:
■ “Sun Fire T1000 Server Features.
Chip-Multithreaded (CMT) Multicore Processor and Memory Technology
The UltraSPARC® T1 multicore processor is the basis of the Sun Fire T1000 server.
TABLE 1-1 lists feature specifications for the Sun Fire T1000 server. For additional information on the Sun Fire T1000 server features, refer to the Sun Fire T1000 Server Product Overview.
The ALOM-CMT software is preinstalled as firmware, and therefore ALOM initializes as soon as you apply power to the system.
■ PSH automated run time diagnosis capability that takes faulty components off line. For more information about using RAS features, refer to the Sun Fire T1000 Server System Administration Guide.
Predictive Self-Healing
The Sun Fire T1000 server features the latest fault management technologies. With the Solaris 10 Operating System (OS), Sun is introducing a new architecture for building and deploying systems and services capable of predictive self-healing.
FIGURE 1-4 Sun Fire T1000 Server Rear Panel
Additional Service Related Information
In addition to this document, the following resources are available to.
CHAPTER 2
Sun Fire T1000 Server Diagnostics
This chapter describes the diagnostics that are available for monitoring and troubleshooting the Sun Fire T1000 server. This chapter does not provide detailed troubleshooting procedures, but instead describes the Sun Fire T1000 server diagnostics facilities and how to use them.
■ ALOM-CMT firmware – is the system firmware that runs on the system controller. In addition to providing the interface between the hardware and OS, ALOM also tracks and reports the health of key server components.
FIGURE 2-1 Diagnostic Flow Chart
Suspect faulty hardware
3. Are any faults reported by the showfaults command?
9. Do the Solaris logs indicate a faulty FRU?
11. Does POST report any faulty devices?
12.
TABLE 2-1 Diagnostic Flow Chart Actions
Action No. | Diagnostic Action | Resulting Action | For more information, see these sections
1. Check the power supply fault LED.
The amber Fault LED indicates the power cord is unplugged or the power supply is faulty.
8. Identify the cause of the over temperature condition
The over temperature condition may be caused by excessive ambient room temperature, an overheating power supply, or a faulty fan tray assembly.
• If ambient room temperature is too high, reduce room temperature.
Using LEDs to Identify the State of Devices
The Sun Fire T1000 server provides the following groups of LEDs:
■ Front and rear panel LEDs ( .
FIGURE 2-2 Sun Fire T1000 Server Front Panel
FIGURE 2-3 Sun Fire T1000 Server Rear Panel LEDs
Locator LED
Service required LED
Power OK LED/power on.
Front and Rear Panel LEDs
Two LEDs and one LED/button are located in the upper left corner of the front panel (TABLE 2-2).
Power Supply LEDs
The power supply LEDs (TABLE 2-3) are located on the back of the power supply.
Using ALOM For Diagnosis and Repair Verification .
ALOM enables you to run diagnostics remotely, such as power-on self test (POST), that would otherwise require physical proximity to the server's serial port.
manually clear certain types of faults without a FRU replacement or if ALOM was unable to automatically detect the FRU replacement. ALOM does not automatically detect hard drive replacement. Persistent environmental faults can automatically recover.
Switching Between the System Console and ALOM
■ To switch from the console output to the ALOM sc> prompt, type #. (Pound Period).
■ To switch from the sc> prompt to the console, type console.
Note – For the ALOM ASR commands, see TABLE 2-7.
▼ To Run the showfaults Command
The showfaults command displays faults handled by ALOM. Use the showfaults command for the following reasons:
■ To see if any faults have been passed to, or detected by ALOM.
● At the sc> prompt, type the showfaults command.
In this example, showfaults is reporting a memory error at DIMM location MB/CMP0/CH0/R1/D0 (J0701).
▼ To Run the showenvironment Command
The showenvironment command displays a snapshot of the server's environmental status.
--------------------------------------------------------
----------------------------------------------------------
Fans (Speeds Revolution Per Minute):.
Note – Some information might not be available when the server is in standby mode.
▼ To Run the showfru Command
Note – By default, the output of the showfru command for all FRUs is very long.
● At the sc> prompt, enter the showfru command.
sc> showfru -s
FRU_PROM at MB/SEEPROM
SEGMENT: SD
/ManR
/ManR/UNIX_Timestamp32: TUE OCT 18 21:17.
/SPD/Vendor Serial No: d03f623
FRU_PROM at MB/CMP0/CH0/R1/D0/SEEPROM
/SPD/Timestamp: MON OCT 03 12:00:00 2005
/SPD/Description: DDR2 SDRAM, 2048 MB
/SPD/Manufacture Location:
/SPD/Vendor: Infineon (formerly Siemens)
/SPD/Vendor Part No: 72T256220HR3.
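When showfru output has been captured to a text file, the "path: value" lines can be grouped by FRU for easier searching. The following is only a sketch in Python, assuming the output keeps the layout seen in the excerpt above (a "FRU_PROM at ..." header followed by lines that start with "/"). The function name parse_showfru is hypothetical and is not part of ALOM.

```python
def parse_showfru(text):
    """Group 'path: value' lines under the FRU_PROM header that precedes them.

    Assumption: each FRU section starts with a line 'FRU_PROM at <name>' and
    its fields are lines beginning with '/' that contain a colon.
    """
    frus = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("FRU_PROM at "):
            current = line[len("FRU_PROM at "):]
            frus[current] = {}
        elif current is not None and line.startswith("/") and ":" in line:
            # Split on the first colon only, so timestamps keep their colons.
            key, _, value = line.partition(":")
            frus[current][key.strip()] = value.strip()
    return frus


# Sample text copied from the complete lines of the excerpt above.
sample = """\
FRU_PROM at MB/CMP0/CH0/R1/D0/SEEPROM
/SPD/Timestamp: MON OCT 03 12:00:00 2005
/SPD/Description: DDR2 SDRAM, 2048 MB
/SPD/Manufacture Location:
/SPD/Vendor: Infineon (formerly Siemens)
"""

fields = parse_showfru(sample)["MB/CMP0/CH0/R1/D0/SEEPROM"]
print(fields["/SPD/Description"])
```

Lines without a colon, such as /ManR, and lines that do not start with "/", such as SEGMENT: SD, are ignored by this sketch.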
If you do not provide a command-line argument, all FRUs are listed.
Running POST
Power on self test (POST) is a group of PROM-based tests that run when the server is powered on or reset.
TABLE 2-5 lists the ALOM variables used to configure POST and FIGURE 2-5 shows how the variables work together.
FIGURE 2-5 Flowchart of ALOM Variable for POST Configuration.
TABLE 2-6 shows typical combinations of ALOM variables and associated POST mode.
▼ To Change POST Parameters
1.
Reasons to Run POST
You can use POST for basic sanity checking of the server hardware and for troubleshooting as described in the following sections.
Routine Sanity Check of the Hardware
POST tests critical hardware components to verify functionality before the system boots and accesses software.
3. Reset the system so that POST runs.
The following example uses the powercycle command. For other methods, refer to the Sun Fire T1000 Server Administration Guide.
5. Perform further investigation if needed.
When POST is finished running, the system will continue to boot even if POST detects a faulty FRU, provided it does not leave the system without memory or a CPU core.
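The boot rule stated above can be written as a small predicate. This Python sketch is a simplification for illustration only: it treats "memory" as "at least one DIMM still enabled", which is an assumption, and the function name and the counts in the usage lines are hypothetical rather than taken from the firmware.

```python
def can_continue_boot(total_cores, disabled_cores, total_dimms, disabled_dimms):
    """Return True if at least one CPU core and one DIMM remain enabled.

    Assumption: any single remaining DIMM counts as usable memory. The real
    firmware may impose stricter memory configuration rules.
    """
    cores_left = total_cores - len(set(disabled_cores))
    dimms_left = total_dimms - len(set(disabled_dimms))
    return cores_left > 0 and dimms_left > 0


# Hypothetical system with 8 cores and 8 DIMMs, one DIMM disabled by POST.
print(can_continue_boot(8, [], 8, ["MB/CMP0/CH2/R0/D0"]))
# Same system with every DIMM disabled.
print(can_continue_boot(8, [], 8, [f"dimm{i}" for i in range(8)]))
```

The first call models a single disabled DIMM, where the boot proceeds on the remaining memory. The second models the case where no memory is left.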
under test above
c:s > MSG = test-error-message
c:s > END_ERROR
where c = the core number, s = the strand number.
Warning and informational messages use the following syntax:
INFO or WARNING: message
The following is an example of a POST error message.
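Separately from the manual's own example, the "c:s > FIELD = value" syntax described above lends itself to a small parser for captured console logs. This is only a sketch in Python under the assumption that every POST line begins with the core number, a colon, the strand number, and ">". The function name and the sample strings are hypothetical, not real POST output.

```python
import re

# core ':' strand '>' rest-of-line, with optional spaces around each part.
LINE_RE = re.compile(r"^\s*(\d+)\s*:\s*(\d+)\s*>\s*(.*)$")


def parse_post_line(line):
    """Split one POST line into core, strand, field name, and value.

    Returns None for lines that do not start with 'c:s >'. Lines without
    an '=' (for example END_ERROR) get a value of None.
    """
    m = LINE_RE.match(line)
    if m is None:
        return None
    core, strand = int(m.group(1)), int(m.group(2))
    body = m.group(3).strip()
    key, sep, value = body.partition("=")
    if sep:
        return {"core": core, "strand": strand,
                "field": key.strip(), "value": value.strip()}
    return {"core": core, "strand": strand, "field": body, "value": None}


# Hypothetical sample lines written to match the syntax, not captured output.
print(parse_post_line("0:0 > MSG = example failure text"))
print(parse_post_line("0:0 > END_ERROR"))
```

INFO and WARNING lines are not given special handling here. A message that happens to contain "=" would be split at the first one, so a production parser would need to check the keyword first.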
Example: In this example, MB/CMP0/CH2/R0/D0 (DIMM 0 at J0701) is disabled. Until the faulty component is replaced, the system can boot using memory that was not disabled.
Note – You can use ASR commands to display and control disabled components.
The PSH console message provides the following information:
■ Type
■ Severity
■ Description
■ Automated Response
■ Impact
■ Sugges.
▼ To Use the fmdump Command to Identify Faults
The fmdump command displays the list of faults detected by the Solaris PSH facility. Use this command for the following reasons:
■ To see if any faults have been detected by the Solaris PSH facility.
Memory module errors exceeded acceptable levels
Type Fault
Severity Major
Description The Solaris(TM) Fault Manager has determined that the number of correctable (single bit) memory errors reported against a memory DIMM module indicates a fault requiring repair action is present.
c. Follow the suggested actions to repair the fault.
Collecting Information From Solaris OS Files and Commands
With the Solaris OS running on the Sun Fire T1000 server, you have the full complement of Solaris OS files and commands available for collecting information and for troubleshooting.
The /var/adm directory contains several message files. The most recent messages are in the /var/adm/messages file. After a period of time (usually every ten days), a new messages file is automatically created.
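When reviewing a messages file, it can help to filter for lines that mention a problem. The following Python sketch shows one way to do that. The keyword list is an assumption chosen for illustration, the function name is hypothetical, and the sample lines are invented rather than real log entries.

```python
def find_suspect_lines(lines, keywords=("error", "warning", "fault")):
    """Return the lines that contain any keyword, ignoring letter case."""
    lowered = tuple(k.lower() for k in keywords)
    return [ln.rstrip("\n") for ln in lines
            if any(k in ln.lower() for k in lowered)]


# Invented sample lines. Real entries in /var/adm/messages will look different.
sample = [
    "hypothetical line one: service started\n",
    "hypothetical line two: WARNING something needs attention\n",
]
print(find_suspect_lines(sample))
```

To scan the live file, the same function can be given an open file object, for example the result of open("/var/adm/messages"), since iterating over a file yields its lines.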
The ASR commands (TABLE 2-7) enable you to view, and manually add or remove components from the ASR blacklist. These commands are run from the ALOM sc> prompt.
Note – The components (asrkeys) vary from system to system, depending on how many cores and memory are present.
Example with no disabled components:
Example showing a disabled component:
▼ To Run the disablecomponent Command
The disablecomponent command disables a component by adding it to the ASR blacklist.
▼ To Run the enablecomponent Command
The enablecomponent command enables a disabled component by removing it from the ASR blacklist.
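As a conceptual aid, the blacklist behavior described above can be modeled as a set of asrkeys: disabling adds a key and enabling removes it. This Python sketch is a toy model only. It does not talk to ALOM, and the class and method names are hypothetical.

```python
class AsrBlacklist:
    """Toy model of a blacklist: a set of disabled component keys."""

    def __init__(self):
        self._disabled = set()

    def disable(self, asrkey):
        # Models disablecomponent: add the key to the blacklist.
        self._disabled.add(asrkey)

    def enable(self, asrkey):
        # Models enablecomponent: remove the key if it is present.
        self._disabled.discard(asrkey)

    def disabled(self):
        # Sorted list so the output order is stable.
        return sorted(self._disabled)


blacklist = AsrBlacklist()
blacklist.disable("MB/CMP0/CH0/R1/D0")
print(blacklist.disabled())
blacklist.enable("MB/CMP0/CH0/R1/D0")
print(blacklist.disabled())
```

A set is used so that disabling the same component twice has no extra effect, and enabling a component that was never disabled is harmless.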
▼ To Check Whether SunVTS Software Is Installed
1. Check for the presence of SunVTS packages. Type:
■ If SunVTS software is loaded, information about the packages is displayed.
■ If SunVTS software is not loaded, you see an error message for each missing package.
SunVTS software requires that you use one of two security schemes. The security scheme you choose must be properly configured in order for you to perform this procedure. For details, refer to the SunVTS User's Guide.
4. Start SunVTS software. Type:
where display-system is the name of the machine through which you are remotely logged in to the Sun Fire T1000 server. If you have installed SunVTS software in a location other than the default /opt directory, alter the path in this command accordingly.
FIGURE 2-6 The SunVTS GUI Screen.
5. Expand the test lists to see the individual tests.
The test selection area lists tests in categories, such as Network, as shown in FIGURE 2-7.
7. (Optional) Customize individual tests.
You can customize individual tests by right-clicking on the name of the test. For example, in the illustration under FIGURE 2-7, right-clicking on the text string bg0(nettest) brings up a menu that enables you to configure this Ethernet test.
CHAPTER 3
Removing and Replacing FRUs
This chapter describes how to remove and replace field-replaceable units (FRUs) in the Sun Fire T1000 server.
The document, Important Safety Information for Sun Hardware Systems, 816-7190, contains a listing of safety precautions for Sun systems. This document is located in the packing carton of your server.
Use an Antistatic Wrist Strap
Wear an antistatic wrist strap and use an antistatic mat when handling components such as drive assemblies, boards, or cards. When servicing or removing server components, attach an antistatic strap to your wrist and then to a metal area on the chassis.
1. Log in as superuser or equivalent.
Depending on the nature of the problem, you might want to view the system status or the log files, or run diagnostics before you shut down the system. Refer to the Sun Fire T1000 Server Administration Guide for log file information.
▼ To Remove the Server From a Rack
If the server is installed in a rack with the extendable slide rails that were supplied with the server, use this procedure to remove the server chassis from the rack.
1.
6. Press the release buttons on both mounting brackets (FIGURE 3-2) to release the right and left mounting brackets, then pull the server chassis out of the rails.
The mounting brackets slide approximately 4 in (10 cm) further before disengaging.
▼ To Remove the Top Cover
Access to all customer replaceable units (CRUs) requires the removal of the top cover:
Note – Never run the system with the top cover removed. The top cover must be in place for proper air flow.
■ “To Remove the Fan Tray Assembly” on page 60 and “To Replace the Fan Tray Assembly” on page 61
■ “To Remove the Power Supp.
3. On the rear of the chassis, release the retention latch (FIGURE 3-5) that secures the PCI Express card to the chassis.
FIGURE 3-4 Releasing the PCI Express Card Retention Latch
4. Gently work the PCI Express card out of the socket on the PCI Express riser board (FIGURE 3-5) and the retention bracket.
5. Place the PCI Express card on an antistatic mat.
▼ To Add or Replace the Optional PCI Express Card
Use this procedure to replace the PCI Express card.
1. Unpackage the replacement PCI Express card and place it on an antistatic mat.
FIGURE 3-6 Removing the Fan Tray Assembly
4. Remove the fan assembly from the sheet metal mounting brackets.
▼ To Replace the Fan Tray Assembly
1. Unpackage the replacement fan tray assembly and place it on an antistatic mat.
FIGURE 3-7 Removing the Power Supply
▼ To Replace the Power Supply
1. Unpackage the replacement power supply.
2. Slide the power supply into the chassis and engage the two alignment pins in the rear of the chassis that mate with the power supply.
FIGURE 3-8 Replacing the Power Supply
▼ To Remove the Hard Drive
1. Perform the procedures described in “Common Procedures for Parts Replacement” on page 53.
2. Disconnect the cable from the hard drive.
▼ To Replace the Hard Drive
1. Unpackage the replacement hard drive and tray assembly.
2. Slide the hard drive and tray assembly into the chassis until it mates with the front of the chassis (FIGURE 3-10).
▼ To Remove DIMMs
Caution – This procedure requires that you handle components that are sensitive to static discharges that can cause the component to fail.
* DIMM names in messages are displayed with the full name such as MB/CMP0/CH1/R1/D1, but this table lists the DIMM names in an abbreviated way (the preceding MB/CMP0 is omitted) for clarity.
5. Grasp the top corners of the DIMM and remove it from the motherboard.
6. Perform the following steps to clear the memory fault.
a. Gain access to the ALOM sc> prompt.
Refer to the Sun Fire T2000 Server Advanced Lights Out Management (ALOM) Guide for instructions.
c. Switch to the system console to view POST output.
Watch the POST output for possible fault messages.
1. Remove the PCI Express card.
See “To Remove the Optional PCI Express Card” on page 58.
2. Remove the fan tray assembly and cable.
See “To Remove the Fan Tray Assembly” on page 60.
3. Remove the power supply and cable.
6. Replace the memory DIMMs.
See “To Add or Replace DIMMs” on page 66.
7. Replace the socketed system configuration SEEPROM.
The location of this SEEPROM is shown in Appendix A, “Field-Replaceable Units (FRUs)” on page 75.
▼ To Replace the Clock Battery on the Motherboard
1. Unpackage the replacement battery.
2. Press the new battery into the motherboard (FIGURE 3-13) with the + facing upward.
FIGURE 3-13 Replacing the Clock Battery on the Motherboard
3.
Common Procedures for Finishing Up
▼ To Replace the Top Cover
1. Place the top cover on the chassis.
Set the cover down so that the cover hangs over the rear of the server by about an inch (2.
▼ To Reinstall the Server Chassis in the Rack
Refer to the Sun Fire T1000 System Installation Manual for installation instructions. After you have reinstalled the server chassis in the rack, reconnect all cables that you disconnected when you removed the chassis from the rack.
APPENDIX A
Field-Replaceable Units (FRUs)
FIGURE A-1 shows the locations of the field-replaceable units (FRUs) in the Sun Fire T1000 server. TABLE A-1 lists the FRUs.
FIGURE A-1 Field-Replaceable Units
TABLE A-1 Sun Fire T1000 Server FRU List
Item No. | CRU | Replacement Instructions | Description | Location
1 Motherboard and chassis assembly
“To Remove the Motherboard and Chassis” on page 68
The motherboard and chassis are replaced as a single assembly.
TABLE A-2 Location of DIMMs
Connector Number: J0501 J0601 J0701 J0810 J1001 J1101 J1201 J1301
Location: MB/CMP0/CH0/R0/D0 MB/CMP0/CH0/R0/D1 MB/.
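Full DIMM names such as MB/CMP0/CH1/R1/D1 and the abbreviated form that omits the leading MB/CMP0 can be converted mechanically. The following Python sketch shows one way to do this, assuming every DIMM name follows the MB/CMPn/CHn/Rn/Dn pattern seen in this manual. The function name and the dictionary keys are hypothetical.

```python
import re

# Pattern inferred from names in this manual, such as MB/CMP0/CH1/R1/D1.
DIMM_RE = re.compile(r"^MB/CMP(\d+)/CH(\d+)/R(\d+)/D(\d+)$")


def parse_dimm_name(name):
    """Split a full DIMM name into its numeric parts and its short form."""
    m = DIMM_RE.match(name)
    if m is None:
        raise ValueError(f"unrecognized DIMM name: {name!r}")
    cmp_no, ch, r, d = (int(g) for g in m.groups())
    return {
        "cmp": cmp_no,
        "ch": ch,
        "r": r,
        "d": d,
        # Abbreviated form: the leading MB/CMPn is dropped.
        "short": f"CH{ch}/R{r}/D{d}",
    }


print(parse_dimm_name("MB/CMP0/CH1/R1/D1")["short"])
```

Names that do not match the pattern raise ValueError rather than being guessed at, which keeps a typo in a log from being mapped to the wrong connector.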