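The AIX command errpt -a displays detailed entries from the system error log. Running errpt -a -j <ErrorID> is usually enough to see the details of a specific error, which gives you a basis for deciding what to do next.
Here is an example: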
LABEL: TAPE_ERR1
IDENTIFIER: 4865FA9B
Date/Time: Wed Nov 8 06:21:28 EDT 2006
Sequence Number: 123875
Machine Id: xxxxxxxxxxxx
Node Id: xxxxxxxx
Class: H
Type: PERM
Resource Name: rmt713
Resource Class: tape
Resource Type: 3580
Location: U7311.D11.xxxxxxx-P1-C1-T1-Wxxxxxxxxxxxxxxxx-L0
VPD:
Manufacturer................IBM
Machine Type and Model......ULT3580-TD2
Serial Number...............xxxxxxxxxx
Device Specific.(FW)........5AT0
Description
TAPE OPERATION ERROR
Probable Causes
TAPE
User Causes
MEDIA DEFECTIVE
DIRTY READ/WRITE HEAD
Recommended Actions
FOR REMOVABLE MEDIA, CHANGE MEDIA AND RETRY
PERFORM PROBLEM DETERMINATION PROCEDURES
Detail Data
SENSE DATA
0600 0000 1101 0000 0E00 0000 0000 0000 0102 0000 F000 0300 0000 011C 0000 0000
1400 3600 6353 7282 0001 4243 3034 3037 4C36 0000 C2B3 AD23 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
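In most cases, the information that precedes Detail Data is enough to identify the failing device and the nature of the fault. When that information does not point clearly to a cause, the sense data can be analyzed as well.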
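As a rough illustration of reading the part of a report that precedes Detail Data, the sketch below collects the "Name: value" lines into a dictionary. The function name parse_errpt_header and the shortened SAMPLE text are hypothetical, and the code assumes the report has the same layout as the example above.

```python
# Shortened copy of the example report; real reports contain more lines.
SAMPLE = """LABEL: TAPE_ERR1
IDENTIFIER: 4865FA9B
Date/Time: Wed Nov 8 06:21:28 EDT 2006
Class: H
Type: PERM
Resource Name: rmt713
VPD:
Manufacturer................IBM
Description
TAPE OPERATION ERROR
Detail Data
SENSE DATA
"""


def parse_errpt_header(report):
    """Return the 'Name: value' fields found before the 'Detail Data' line."""
    fields = {}
    for raw in report.splitlines():
        line = raw.strip()
        if line == "Detail Data":
            break  # everything after this point is the sense data dump
        # Split on the first colon only, so time stamps keep their colons.
        name, sep, value = line.partition(":")
        if sep and value.strip():
            fields[name.strip()] = value.strip()
    return fields


if __name__ == "__main__":
    for name, value in parse_errpt_header(SAMPLE).items():
        print(f"{name}: {value}")
```

Lines without a value after the colon (such as VPD:) and free-text lines (such as the description) are skipped on purpose; this sketch only keeps the labeled fields.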
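The sense data is a hexadecimal dump. Four values matter most: the SCSI Response Code, the SCSI Sense Key, the SCSI Additional Sense Code (ASC), and the SCSI Additional Sense Code Qualifier (ASCQ). Taken together, they identify the specific category of failure.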
These four values are at bytes 20, 22, 32, and 33 of the sense data, counting from byte 0. In the example above they are: SCSI Response Code: F0, SCSI Sense Key: 03, SCSI Additional Sense Code (ASC): 14, SCSI Additional Sense Code Qualifier (ASCQ): 00.
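The byte positions can be checked with a short script. This is a minimal sketch that assumes the offsets are counted from byte 0 and that the dump is pasted exactly as errpt prints it; the names SENSE_DUMP, sense_bytes, and key_fields are made up for this illustration. Only the first two rows of the example dump are included, since the remaining rows are not needed for these four offsets.

```python
# First two rows of the SENSE DATA block from the example report.
SENSE_DUMP = """
0600 0000 1101 0000 0E00 0000 0000 0000 0102 0000 F000 0300 0000 011C 0000 0000
1400 3600 6353 7282 0001 4243 3034 3037 4C36 0000 C2B3 AD23 0000 0000 0000 0000
"""


def sense_bytes(dump):
    """Turn the whitespace-separated hex groups into a bytes object."""
    return bytes.fromhex("".join(dump.split()))


def key_fields(dump):
    """Pick out the four values discussed in the text (offsets from byte 0)."""
    data = sense_bytes(dump)
    return {
        "response_code": data[20],
        "sense_key": data[22] & 0x0F,  # keep only the low four bits
        "asc": data[32],
        "ascq": data[33],
    }


if __name__ == "__main__":
    print({name: f"{value:02X}" for name, value in key_fields(SENSE_DUMP).items()})
    # {'response_code': 'F0', 'sense_key': '03', 'asc': '14', 'ascq': '00'}
```

Each row of the dump holds 32 bytes, so bytes 32 and 33 are the first two bytes of the second row.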
The SCSI Response Code takes the following values:
* 70 - Current Error (Direct Access Logical Block NOT From Sense Data).
* F0 - Current Error (Direct Access Logical Block IS From Sense Data).
* 71 - Deferred Error (Direct Access Logical Block NOT From Sense Data).
* 7F - Vendor Spec. Error (Direct Access Logical Block NOT From Sense Data).
* EE - Encryption Error
* F1 - Deferred Error (Direct Access Logical Block IS From Sense Data).
* FF - Vendor Spec. Error (Direct Access Logical Block IS From Sense Data).
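The list above follows a pattern: each "IS From Sense Data" code is the matching "NOT" code with the high bit set (70 and F0, 71 and F1, 7F and FF), while EE stands alone. A minimal sketch of a decoder built on that observation follows; the function name describe_response_code and the wording of the returned strings are assumptions made for this illustration.

```python
# Base codes from the list above, with the high bit cleared.
RESPONSE_KINDS = {
    0x70: "Current Error",
    0x71: "Deferred Error",
    0x7F: "Vendor Specific Error",
}


def describe_response_code(code):
    """Describe a SCSI Response Code byte using the table in the text."""
    if code == 0xEE:
        return "Encryption Error"  # listed on its own, no high-bit variant
    kind = RESPONSE_KINDS.get(code & 0x7F)
    if kind is None:
        return f"Unknown response code {code:02X}"
    # High bit set means the logical block information comes from the sense data.
    source = "IS" if code & 0x80 else "NOT"
    return f"{kind} (logical block {source} from sense data)"


if __name__ == "__main__":
    for code in (0x70, 0xF0, 0x71, 0xF1, 0x7F, 0xFF, 0xEE):
        print(f"{code:02X} - {describe_response_code(code)}")
```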
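The SCSI Sense Key values are as follows (X stands for the upper four bits of the byte, which are not part of the sense key):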
X0 - No Sense
X1 - Recovered Error
X2 - Not Ready
X3 - Medium Error
X4 - Hardware Error
X5 - Illegal Request
X6 - Unit Attention
X7 - Data Protect
X8 - Blank Check
X9 - Vendor Specific
XA - Copy Aborted
XB - Aborted Command
XC - Equal
XD - Volume Overflow
XE - Miscompare
XF - RESERVED
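The sense key table maps naturally onto a 16-entry lookup. The sketch below assumes that X in the list stands for the upper four bits of the byte, which the lookup masks off; the names SENSE_KEYS and describe_sense_key are hypothetical.

```python
# Index = low four bits of the sense key byte, in the order listed above.
SENSE_KEYS = (
    "No Sense",
    "Recovered Error",
    "Not Ready",
    "Medium Error",
    "Hardware Error",
    "Illegal Request",
    "Unit Attention",
    "Data Protect",
    "Blank Check",
    "Vendor Specific",
    "Copy Aborted",
    "Aborted Command",
    "Equal",
    "Volume Overflow",
    "Miscompare",
    "Reserved",
)


def describe_sense_key(byte_value):
    """Return the sense key name for the byte at offset 22 of the sense data."""
    return SENSE_KEYS[byte_value & 0x0F]


if __name__ == "__main__":
    print(describe_sense_key(0x03))  # Medium Error
```

For the example report, byte 22 is 03, which matches the MEDIA DEFECTIVE and DIRTY READ/WRITE HEAD causes that errpt lists in its text section.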
The list of ASC and ASCQ codes is long; it is available at the following link:
https://www.t10.org/lists/asc-num.htm
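Combining the text description from errpt with the Response Code, Sense Key, ASC, and ASCQ therefore gives a fuller picture of the fault and a basis for drawing up a diagnostic plan.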