Sun Fire V1280 server / 4 CPUs installed / Solaris 8
messages:Dec 9 02:08:02 libapr pcisch: [ID 285080 kern.info] NOTICE: correctable error detected by pci0 (safari id 18) during
messages:Dec 9 02:08:02 libapr lw8: [ID 792739 kern.error] /N0/SB0 reported ECC error
messages:Dec 9 02:08:02 libapr lw8: [ID 634739 kern.error] ^M
messages:Dec 9 02:08:02 libapr lw8: [ID 604904 kern.error] /N0/IB6 reported ECC error
messages:Dec 9 02:08:02 libapr lw8: [ID 416664 kern.error] ^M
messages:Dec 9 02:08:02 libapr lw8: [ID 224458 kern.error] /N0/SB0 reported first ECC error
messages:Dec 9 02:08:03 libapr lw8: [ID 606155 kern.error] Bad data read from a DIMM or cache controlled by /N0/SB0/P3
messages:Dec 9 04:01:48 libapr SUNW,UltraSPARC-III+: [ID 869849 kern.info] [AFT0] errID 0x000d3877.57151850 Data Bit 59 was in error and corrected
messages:Dec 9 04:01:48 libapr lw8: [ID 792739 kern.error] /N0/SB0 reported ECC error
messages:Dec 9 04:01:49 libapr lw8: [ID 934999 kern.error] ^M
messages:Dec 9 04:01:49 libapr lw8: [ID 224458 kern.error] /N0/SB0 reported first ECC error
messages:Dec 9 04:01:49 libapr lw8: [ID 606155 kern.error] Bad data read from a DIMM or cache controlled by /N0/SB0/P3
messages:Dec 9 10:12:00 libapr SUNW,UltraSPARC-III+: [ID 632124 kern.info] [AFT0] errID 0x000d4caa.162edb90 Data Bit 59 was in error and corrected
messages:Dec 9 10:12:00 libapr lw8: [ID 792739 kern.error] /N0/SB0 reported ECC error
messages:Dec 9 10:12:00 libapr lw8: [ID 461518 kern.error] ^M
messages:Dec 9 10:12:00 libapr lw8: [ID 224458 kern.error] /N0/SB0 reported first ECC error
messages:Dec 9 10:12:00 libapr lw8: [ID 606155 kern.error] Bad data read from a DIMM or cache controlled by /N0/SB0/P3
messages.1:Nov 25 22:12:00 libapr SUNW,UltraSPARC-III+: [ID 269824 kern.info] [AFT0] errID 0x000927f9.20990a30 Data Bit 59 was in error and corrected
messages.1:Nov 25 22:12:00 libapr lw8: [ID 792739 kern.error] /N0/SB0 reported ECC error
messages.1:Nov 25 22:12:00 libapr lw8: [ID 461518 kern.error] ^M
messages.1:Nov 25 22:12:00 libapr lw8: [ID 224458 kern.error] /N0/SB0 reported first ECC error
messages.1:Nov 25 22:12:00 libapr lw8: [ID 606155 kern.error] Bad data read from a DIMM or cache controlled by /N0/SB0/P3
messages.1:Nov 26 04:03:36 libapr SUNW,UltraSPARC-III+: [ID 485082 kern.info] [AFT0] errID 0x00093b28.62b0fe90 Data Bit 59 was in error and corrected
libapr% tail -f /var/adm/messages
Dec 9 10:12:00 libapr Read transaction for CPU A: DTrans: 0x001 (AID=0, SEQ#=1)^M
Dec 9 10:12:00 libapr ECC Syndrome: 0x034^M
Dec 9 10:12:00 libapr MTag Syndrome: 0x0^M
Dec 9 10:12:00 libapr ^M
Dec 9 10:12:00 libapr Read transaction from CPU D: DTrans: 0x001 (AID=0, SEQ#=1)^M
Dec 9 10:12:00 libapr ECC Syndrome: 0x034^M
Dec 9 10:12:00 libapr MTag Syndrome: 0x0^M
Dec 9 10:12:00 libapr lw8: [ID 224458 kern.error] /N0/SB0 reported first ECC error
Dec 9 10:12:00 libapr lw8: [ID 606155 kern.error] Bad data read from a DIMM or cache controlled by /N0/SB0/P3
System Configuration: Sun Microsystems sun4u Sun Fire V1280
System clock frequency: 150 MHZ
Memory size: 9GB
==================================== CPUs ====================================
                 E$     CPU    CPU   Temperature       Fan
CPU     Freq     Size   Impl.  Mask  Die    Ambient    Speed  Unit
------  -------  -----  -----  ----  -----  -------    -----  ----
SB0/P0  900 MHz  8MB    15     2.3   60 C   38 C
SB0/P1  900 MHz  8MB    15     2.3   58 C   39 C
SB0/P2  900 MHz  8MB    15     2.3   63 C   38 C
SB0/P3  900 MHz  8MB    15     2.3   60 C   39 C
================================= IO Devices =================================
Bus Freq
Brd Type MHz Slot Name Model
--- ---- ---- ---------- -------------------------------- ----------------------
0 pci 66 1 SUNW,qlc-pci1077,2200.1077.4082.+
0 pci 66 1 network-pci108e,abba.11 (network+ SUNW,pci-ce
0 pci 66 2 scsi-pci1000,21.1000.1000.1 (scs+
0 pci 66 2 scsi-pci1000,21.1000.1000.1 (scs+
0 pci 66 2 network-pci108e,abba.11 (network+ SUNW,pci-ce
0 pci 33 3 ide-pci1095,646.1095.646.7 (ide)
0 pci 33 3 scsi-pci1000,f.1000.1000.14 (scs+
0 pci 33 3 scsi-pci1000,f.1000.1000.14 (scs+
0 pci 33 4 bootbus-controller-sgsbbc SUNW,sgsbbc
============================ Memory Configuration ============================
Segment Table:
-----------------------------------------------------------------------
Base Address Size Interleave Factor Contains
-----------------------------------------------------------------------
0x0 8GB 16 BankIDs 0,1,2,3,4,5,8,9,10,11,12,13,14,15
0x200000000 1GB 2 BankIDs 6,7
Bank Table:
-----------------------------------------------------------
Physical Location
ID ControllerID GroupID Size Interleave Way
-----------------------------------------------------------
0 0 0 512MB 9
1 0 1 512MB 11
2 0 0 512MB 13
3 0 1 512MB 15
4 1 0 512MB 4
5 1 1 512MB 6
8 2 0 1GB 0,8
9 2 1 512MB 12
10 2 0 1GB 2,10
11 2 1 512MB 14
12 3 0 512MB 1
13 3 1 512MB 3
14 3 0 512MB 5
15 3 1 512MB 7
6 1 0 512MB 0
7 1 1 512MB 1
Memory Module Groups:
--------------------------------------------------
ControllerID GroupID Labels
--------------------------------------------------
0 0 SB0/P0/B0/D0,SB0/P0/B0/D1,SB0/P0/B0/D2,SB0/P0/B0/D3
0 1 SB0/P0/B1/D0,SB0/P0/B1/D1,SB0/P0/B1/D2,SB0/P0/B1/D3
Memory Module Groups:
--------------------------------------------------
ControllerID GroupID Labels
--------------------------------------------------
1 0 SB0/P1/B0/D0,SB0/P1/B0/D1,SB0/P1/B0/D2,SB0/P1/B0/D3
1 1 SB0/P1/B1/D0,SB0/P1/B1/D1,SB0/P1/B1/D2,SB0/P1/B1/D3
Memory Module Groups:
--------------------------------------------------
ControllerID GroupID Labels
--------------------------------------------------
2 0 SB0/P2/B0/D0,SB0/P2/B0/D1,SB0/P2/B0/D2,SB0/P2/B0/D3
2 1 SB0/P2/B1/D0,SB0/P2/B1/D1,SB0/P2/B1/D2,SB0/P2/B1/D3
Memory Module Groups:
--------------------------------------------------
ControllerID GroupID Labels
--------------------------------------------------
3 0 SB0/P3/B0/D0,SB0/P3/B0/D1,SB0/P3/B0/D2,SB0/P3/B0/D3
3 1 SB0/P3/B1/D0,SB0/P3/B1/D1,SB0/P3/B1/D2,SB0/P3/B1/D3
============================ Environmental Status ============================
Fan Speeds:
---------------------------------------
Location Sensor Speed
---------------------------------------
FT0/FAN3 ft_fan3 self-regulating
FT0/FAN0 ft_fan0 self-regulating
FT0/FAN1 ft_fan1 self-regulating
FT0/FAN2 ft_fan2 self-regulating
FT0/FAN4 ft_fan4 self-regulating
FT0/FAN5 ft_fan5 self-regulating
FT0/FAN6 ft_fan6 self-regulating
FT0/FAN7 ft_fan7 self-regulating
IB6/FAN0 ft_fan0 100%
IB6/FAN1 ft_fan1 100%
--------------------------------------------------
Led State:
--------------------------------------------------
Location Led State Color
--------------------------------------------------
chassis fault OFF amber
chassis power ON green
chassis locator OFF white
chassis top_access OFF amber
chassis alarm1 OFF amber
chassis alarm2 OFF amber
chassis system ON green
chassis supplyA ON green
chassis supplyB ON green
PS0 fault OFF amber
PS0 power ON green
PS0 predicted_fault OFF amber
PS1 fault OFF amber
PS1 power ON green
PS1 predicted_fault OFF amber
PS2 fault OFF amber
PS2 power ON green
PS2 predicted_fault OFF amber
PS3 fault OFF amber
PS3 power ON green
PS3 predicted_fault OFF amber
FT0 fault OFF amber
FT0 power ON green
FT0 ok_to_remove OFF amber
RP0 fault OFF amber
RP0 power ON green
RP0 ok_to_remove OFF amber
RP2 fault OFF amber
RP2 power ON green
RP2 ok_to_remove OFF amber
SB0 fault OFF amber
SB0 power ON green
SB0 ok_to_remove OFF amber
IB6 fault OFF amber
IB6 power ON green
IB6 ok_to_remove OFF amber
DISK0 fault OFF amber
DISK0 power ON green
DISK0 ok_to_remove OFF blue
DISK1 fault OFF amber
DISK1 power ON green
DISK1 ok_to_remove OFF blue
FT0/FAN3 fault OFF amber
FT0/FAN0 fault OFF amber
FT0/FAN1 fault OFF amber
FT0/FAN2 fault OFF amber
FT0/FAN4 fault OFF amber
FT0/FAN5 fault OFF amber
FT0/FAN6 fault OFF amber
FT0/FAN7 fault OFF amber
IB6/FAN0 fault OFF amber
IB6/FAN1 fault OFF amber
---------------------------------------------------------------
Temperature sensors:
---------------------------------------------------------------
Location Sensor Temperature Lo LoWarn HiWarn Hi Status
---------------------------------------------------------------
SSC1 t_sbbc0 41C -12C -2C 102C 107C okay
SSC1 t_cbh0 48C -12C -2C 102C 107C okay
SSC1 t_ambient0 29C -12C -2C 82C 87C okay
SSC1 t_ambient1 27C -12C -2C 82C 87C okay
SSC1 t_ambient2 36C -12C -2C 82C 87C okay
RP0 t_ambient0 28C -12C -2C 82C 87C okay
RP0 t_ambient1 28C -12C -2C 53C 63C okay
RP0 t_sdc0 71C -12C -2C 102C 107C okay
RP0 t_ar0 54C -12C -2C 102C 107C okay
RP0 t_dx0 72C -12C -2C 102C 107C okay
RP0 t_dx1 73C -12C -2C 102C 107C okay
RP2 t_ambient0 28C -12C -2C 82C 87C okay
RP2 t_ambient1 28C -12C -2C 53C 63C okay
RP2 t_sdc0 69C -12C -2C 102C 107C okay
RP2 t_ar0 51C -12C -2C 102C 107C okay
RP2 t_dx0 67C -12C -2C 102C 107C okay
RP2 t_dx1 70C -12C -2C 102C 107C okay
SB0 t_sdc0 58C -12C -2C 102C 107C okay
SB0 t_ar0 44C -12C -2C 102C 107C okay
SB0 t_dx0 61C -12C -2C 102C 107C okay
SB0 t_dx1 67C -12C -2C 102C 107C okay
SB0 t_dx2 68C -12C -2C 102C 107C okay
SB0 t_dx3 64C -12C -2C 102C 107C okay
SB0 t_sbbc0 65C -12C -2C 102C 107C okay
SB0 t_sbbc1 44C -12C -2C 102C 107C okay
SB0/P0 Ambient 38C -12C -2C 82C 87C okay
SB0/P0 Die 60C -12C -2C 92C 97C okay
SB0/P1 Ambient 39C -12C -2C 82C 87C okay
SB0/P1 Die 58C -12C -2C 92C 97C okay
SB0/P2 Ambient 38C -12C -2C 82C 87C okay
SB0/P2 Die 63C -12C -2C 92C 97C okay
SB0/P3 Ambient 39C -12C -2C 82C 87C okay
SB0/P3 Die 60C -12C -2C 92C 97C okay
IB6 t_ambient0 34C -12C -2C 82C 87C okay
IB6 t_ambient1 32C -12C -2C 82C 87C okay
IB6 t_sdc0 74C -12C -2C 102C 107C okay
IB6 t_ar0 64C -12C -2C 102C 107C okay
IB6 t_dx0 68C -12C -2C 102C 107C okay
IB6 t_dx1 62C -12C -2C 102C 107C okay
IB6 t_sbbc0 55C -12C -2C 102C 107C okay
IB6 t_schizo0 55C -12C -2C 102C 107C okay
IB6 t_schizo1 53C -12C -2C 102C 107C okay
----------------------------------------------------------------------
Voltage sensors:
----------------------------------------------------------------------
Location Sensor Voltage Lo LoWarn HiWarn Hi Status
----------------------------------------------------------------------
SSC1 v_1.5vdc0 1.49V 1.35V 1.42V 1.57V 1.65V okay
SSC1 v_3.3vdc0 3.35V 2.97V 3.13V 3.46V 3.63V okay
SSC1 v_5vdc0 5.01V 4.50V 4.75V 5.25V 5.50V okay
PS0 v_input0 - - - - - okay
PS0 v_output0 - - - - - okay
PS1 v_input0 - - - - - okay
PS1 v_output0 - - - - - okay
PS2 v_input0 - - - - - okay
PS2 v_output0 - - - - - okay
PS3 v_input0 - - - - - okay
PS3 v_output0 - - - - - okay
RP0 v_1.5vdc0 1.49V 1.35V 1.42V 1.57V 1.65V okay
RP0 v_3.3vdc0 3.27V 2.97V 3.13V 3.46V 3.63V okay
RP2 v_1.5vdc0 1.48V 1.35V 1.42V 1.57V 1.65V okay
RP2 v_3.3vdc0 3.27V 2.97V 3.13V 3.46V 3.63V okay
SB0 v_1.5vdc0 1.52V 1.35V 1.42V 1.57V 1.65V okay
SB0 v_3.3vdc0 3.31V 2.97V 3.13V 3.46V 3.63V okay
SB0/P0 v_cheetah0 1.64V 1.46V 1.53V 1.70V 1.78V okay
SB0/P1 v_cheetah1 1.63V 1.46V 1.53V 1.70V 1.78V okay
SB0/P2 v_cheetah2 1.64V 1.46V 1.53V 1.70V 1.78V okay
SB0/P3 v_cheetah3 1.63V 1.46V 1.53V 1.70V 1.78V okay
IB6 v_1.5vdc0 1.50V 1.35V 1.42V 1.57V 1.65V okay
IB6 v_3.3vdc0 3.33V 2.97V 3.13V 3.46V 3.63V okay
IB6 v_5vdc0 4.95V 4.50V 4.75V 5.25V 5.50V okay
IB6 v_12vdc0 12.11V 10.80V 11.40V 12.60V 13.20V okay
IB6 v_3.3vdc1 3.34V 2.97V 3.13V 3.47V 3.63V okay
IB6 v_3.3vdc2 3.30V 2.97V 3.13V 3.47V 3.63V okay
IB6 v_1.8vdc0 1.84V 1.62V 1.71V 1.89V 1.98V okay
IB6 v_2.4vdc0 2.55V 2.25V 2.37V 2.62V 2.75V okay
-------------------------
Board Status:
-------------------------
Location Status
-------------------------
PS0 okay
PS1 okay
PS2 okay
PS3 okay
FT0 okay
FT0/FAN3 okay
FT0/FAN0 okay
FT0/FAN1 okay
FT0/FAN2 okay
FT0/FAN4 okay
FT0/FAN5 okay
FT0/FAN6 okay
FT0/FAN7 okay
RP0 okay
RP2 okay
SB0 ok
SB0/P0 online
SB0/P0/B0/D0 okay
SB0/P0/B0/D1 okay
SB0/P0/B0/D2 okay
SB0/P0/B0/D3 okay
SB0/P0/B1/D0 okay
SB0/P0/B1/D1 okay
SB0/P0/B1/D2 okay
SB0/P0/B1/D3 okay
SB0/P1 online
SB0/P1/B0/D0 okay
SB0/P1/B0/D1 okay
SB0/P1/B0/D2 okay
SB0/P1/B0/D3 okay
SB0/P1/B1/D0 okay
SB0/P1/B1/D1 okay
SB0/P1/B1/D2 okay
SB0/P1/B1/D3 okay
SB0/P2 online
SB0/P2/B0/D0 okay
SB0/P2/B0/D1 okay
SB0/P2/B0/D2 okay
SB0/P2/B0/D3 okay
SB0/P2/B1/D0 okay
SB0/P2/B1/D1 okay
SB0/P2/B1/D2 okay
SB0/P2/B1/D3 okay
SB0/P3 online
SB0/P3/B0/D0 okay
SB0/P3/B0/D1 okay
SB0/P3/B0/D2 okay
SB0/P3/B0/D3 okay
SB0/P3/B1/D0 okay
SB0/P3/B1/D1 okay
SB0/P3/B1/D2 okay
SB0/P3/B1/D3 okay
IB6 ok
IB6/FAN0 okay
IB6/FAN1 okay
================================ HW Revisions ================================
ASIC Revisions:
---------------
pci: Rev 4
pci: Rev 4
pci: Rev 4
pci: Rev 4
Is this a CPU problem, or do I need to replace all of the memory installed on P3?
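Could you post the prtdiag command output along with this?
Also, please let us know whether the messages above appeared repeatedly. ^^
In a case like this, the fmdump command may give some hints. It would help to have that output as well.
Please also check with the cputrack and cpustat commands. ^^
Try running showenvironment on the SC.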
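The fmdump, cputrack, and cpustat commands are not supported here. The server is also at a remote site and I cannot connect to the SC, so I have not been able to check the SC side either.
They don't work because this is Solaris 8.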
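With fmdump unavailable on this release, the question of whether the errors repeat can still be answered from the messages files themselves. The following is a minimal Python sketch, not an official Sun tool: it assumes the log lines have the same wording as the excerpt in the post, and the function name `tally_ecc` is made up for illustration. It can be run on any machine that has a copy of /var/adm/messages*.

```python
import re
import sys
from collections import Counter

# lw8 lines, e.g.
#   "... Bad data read from a DIMM or cache controlled by /N0/SB0/P3"
SUSPECT_RE = re.compile(
    r"Bad data read from a DIMM or cache controlled by (/[\w/]+)"
)
# AFT0 lines, e.g.
#   "... [AFT0] errID 0x... Data Bit 59 was in error and corrected"
BIT_RE = re.compile(r"Data Bit (\d+) was in error and corrected")


def tally_ecc(lines):
    """Count suspect components and corrected data bits in syslog lines.

    Hypothetical helper: the patterns only cover the message wording
    seen in this thread, not every ECC message Solaris can emit.
    """
    suspects = Counter()
    bits = Counter()
    for line in lines:
        m = SUSPECT_RE.search(line)
        if m:
            suspects[m.group(1)] += 1
        m = BIT_RE.search(line)
        if m:
            bits[int(m.group(1))] += 1
    return suspects, bits


if __name__ == "__main__":
    # Usage: python tally_ecc.py messages messages.1 ...
    all_lines = []
    for path in sys.argv[1:]:
        with open(path, errors="replace") as fh:
            all_lines.extend(fh)
    suspects, bits = tally_ecc(all_lines)
    for name, count in suspects.most_common():
        print(f"{name}: {count} 'bad data read' reports")
    for bit, count in bits.most_common():
        print(f"Data Bit {bit}: corrected {count} times")
```

If one component and one data bit dominate the tally, as /N0/SB0/P3 and Data Bit 59 do in the excerpt above, that would point to a single repeatable fault rather than random events. It does not by itself tell a DIMM apart from the CPU's cache, since the message names both.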