用户自己在巡检过程中,执行了smitty clverify同步配置操作,同步完成后lsvg -p datavg发现两个节点同一块PV missing了。完全搞不懂存储盘怎么会掉。。存储上检查也没发现什么报错。
#lsvg -p datavg
V_NAME PV STATE TOTAL PPs FREE PPs FREE DISTRIBUTION
hdisk6 active 1599 0 00..00..00..00..00
hdisk7 missing 1599 0 00..00..00..00..00
hdisk8 active 1599 0 00..00..00..00..00
hdisk9 active 1599 0 00..00..00..00..00
hdisk17 active 1599 0 00..00..00..00..00
hdisk18 active 1599 0 00..00..00..00..00
hdisk19 active 1599 0 00..00..00..00..00
hdisk20 active 1599 0 00..00..00..00..00
hdisk12 active 1599 0 00..00..00..00..00
hdisk22 active 1599 0 00..00..00..00..00
hdisk23 active 1599 467 00..00..00..147..320
hdisk25 active 1599 0 00..00..00..00..00
hdisk26 active 1599 0 00..00..00..00..00
hdisk27 active 1599 317 00..00..00..00..317
datavg中共14块PV,其中7块在A存储,另7块在B存储,做了VG镜像。
#lsvg -l datevg
backupvg:
LV NAME TYPE LPs PPs PVs LV STATE MOUNT POINT
journal jfs2 800 1600 2 open/syncd /journal
loglv02 jfs2log 1 2 2 open/stale N/A
wij jfs2 240 480 2 open/syncd /wij
dthealth jfs2 160 320 2 open/syncd /dthealth
date jfs2 9600 19200 14 open/stale /date
状态变成stale了。
errpt 报错:
#errpt
IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION
EAA3D429 0311124816 U S LVDD PHYSICAL PARTITION MARKED STALE
EAA3D429 0311124816 U S LVDD PHYSICAL PARTITION MARKED STALE
EAA3D429 0311124716 U S LVDD PHYSICAL PARTITION MARKED STALE
EAA3D429 0311122216 U S LVDD PHYSICAL PARTITION MARKED STALE
EAA3D429 0311111716 U S LVDD PHYSICAL PARTITION MARKED STALE
F7DDA124 0311105216 U H LVDD PHYSICAL VOLUME DECLARED MISSING
52715FA5 0311105216 U H LVDD FAILED TO WRITE VOLUME GROUP STATUS AREA
E86653C3 0311105216 P H LVDD I/O ERROR DETECTED BY LVM
B6267342 0311105216 P H hdisk7 DISK OPERATION ERROR
EAA3D429 0311105216 U S LVDD PHYSICAL PARTITION MARKED STALE
E86653C3 0311105216 P H LVDD I/O ERROR DETECTED BY LVM
B6267342 0311105216 P H hdisk7 DISK OPERATION ERROR
3D32B80D 0310164516 P S topsvcs NIM thread blocked
3D32B80D 0310164516 P S topsvcs NIM thread blocked
0C10BB8C 0310164516 I H hdisk3 ARRAY CONFIGURATION CHANGED
0C10BB8C 0310164516 I H hdisk3 ARRAY CONFIGURATION CHANGED
#errpt -aj B6267342 |more
---------------------------------------------------------------------------
LABEL: SC_DISK_ERR2
IDENTIFIER: B6267342
Date/Time: Fri Mar 11 10:52:00 BEIST 2016
Sequence Number: 43005
Machine Id: 00F73C484C00
Node Id: cacheserver1
Class: H
Type: PERM
Resource Name: hdisk7
Resource Class: disk
Resource Type: mpioapdisk
Location: U78A0.001.DNWKK8P-P1-C2-T1-W20240080E52C0296-L5000000000000
VPD:
Manufacturer................IBM
Machine Type and Model......1814 FAStT
ROS Level and ID............31303630
Serial Number...............
Device Specific.(Z0)........0000053245005032
Device Specific.(Z1)........
Description
DISK OPERATION ERROR
Probable Causes
DASD DEVICE
Failure Causes
DISK DRIVE
DISK DRIVE ELECTRONICS
Recommended Actions
PERFORM PROBLEM DETERMINATION PROCEDURES
Detail Data
PATH ID
1
SENSE DATA
0A00 2E00 0000 0080 0000 1204 0000 0000 0000 0000 0000 0000 0118 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0D7D 0D21 000D B7C0 0000 0000 0000 0000 0000 0000 0000 0093 0000
0000 003D 0017
---------------------------------------------------------------------------
LABEL: SC_DISK_ERR2
IDENTIFIER: B6267342
Date/Time: Fri Mar 11 10:52:00 BEIST 2016
Sequence Number: 43002
Machine Id: 00F73C484C00
Node Id: cacheserver1
Class: H
Type: PERM
Resource Name: hdisk7
Resource Class: disk
Resource Type: mpioapdisk
Location: U78A0.001.DNWKK8P-P1-C2-T1-W20240080E52C0296-L5000000000000
VPD:
Manufacturer................IBM
Machine Type and Model......1814 FAStT
ROS Level and ID............31303630
Serial Number...............
Device Specific.(Z0)........0000053245005032
Device Specific.(Z1)........
Description
DISK OPERATION ERROR
Probable Causes
DASD DEVICE
Failure Causes
DISK DRIVE
DISK DRIVE ELECTRONICS
Recommended Actions
PERFORM PROBLEM DETERMINATION PROCEDURES
Detail Data
PATH ID
1
SENSE DATA
0A00 2A00 0502 B848 0000 0804 0000 0000 0000 0000 0000 0000 0118 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0D7D 0D21 000D B7C0 0000 0000 0000 0000 0000 0000 0000 0093 0000
0000 003D 0017
#errpt -aj E86653C3 |more
---------------------------------------------------------------------------
LABEL: LVM_IO_FAIL
IDENTIFIER: E86653C3
Date/Time: Fri Mar 11 10:52:00 BEIST 2016
Sequence Number: 43006
Machine Id: 00F73C484C00
Node Id: cacheserver1
Class: H
Type: PERM
Resource Name: LVDD
Resource Class: NONE
Resource Type: NONE
Location:
Description
I/O ERROR DETECTED BY LVM
Probable Causes
POWER, DRIVE, ADAPTER, OR CABLE FAILURE
Recommended Actions
RUN DIAGNOSTICS AGAINST THE FAILING DEVICE
Detail Data
PHYSICAL VOLUME DEVICE MAJOR/MINOR
8000 000F 0000 0008
ERROR CODE AS DEFINED IN sys/errno.h
16
BLOCK NUMBER
128
LOGICAL VOLUME DEVICE MAJOR/MINOR
8000 002F 0000 0000
PHYSICAL BUFFER TRANSACTION TIME
0
RESIDUAL COUNT
9216
NUMBER OF BLOCKS
9216
I/O TYPE
LVM META DATA
SENSE DATA
0000 0000 0000 0000 00F7 3C48 0000 4C00 0000 0136 E2F4 B667 00F7 3C48 3477 4766
0000 0000 0000 0000
---------------------------------------------------------------------------
LABEL: LVM_IO_FAIL
IDENTIFIER: E86653C3
Date/Time: Fri Mar 11 10:52:00 BEIST 2016
Sequence Number: 43003
Machine Id: 00F73C484C00
Node Id: cacheserver1
Class: H
Type: PERM
Resource Name: LVDD
Resource Class: NONE
Resource Type: NONE
Location:
Description
I/O ERROR DETECTED BY LVM
Probable Causes
POWER, DRIVE, ADAPTER, OR CABLE FAILURE
Recommended Actions
RUN DIAGNOSTICS AGAINST THE FAILING DEVICE
Detail Data
PHYSICAL VOLUME DEVICE MAJOR/MINOR
8000 000F 0000 0008
ERROR CODE AS DEFINED IN sys/errno.h
16
BLOCK NUMBER
84064328
LOGICAL VOLUME DEVICE MAJOR/MINOR
8000 002F 0000 0002
PHYSICAL BUFFER TRANSACTION TIME
0
RESIDUAL COUNT
4096
NUMBER OF BLOCKS
4096
I/O TYPE
USER DATA
SENSE DATA
0000 0000 0002 815C 00F7 3C48 0000 4C00 0000 0136 E2F4 B667 00F7 3C48 3477 4766
0000 0000 0000 0000
#errpt -aj F7DDA124 |more
---------------------------------------------------------------------------
LABEL: LVM_SA_PVMISS
IDENTIFIER: F7DDA124
Date/Time: Fri Mar 11 10:52:00 BEIST 2016
Sequence Number: 43008
Machine Id: 00F73C484C00
Node Id: cacheserver1
Class: H
Type: UNKN
Resource Name: LVDD
Resource Class: NONE
Resource Type: NONE
Location:
Description
PHYSICAL VOLUME DECLARED MISSING
Probable Causes
POWER, DRIVE, ADAPTER, OR CABLE FAILURE
Detail Data
MAJOR/MINOR DEVICE NUMBER
8000 000F 0000 0008
SENSE DATA
00F7 3C48 0000 4C00 0000 0136 E2F4 B667 00F7 3C48 3477 4766 0000 0000 0000 0000
试了chpv -va hdisk7命令,提示hdisk7上的逻辑卷是打开的。
#chpv -va hdisk7
0516-1010 chpv: Warning, the physical volume hdisk7 has open logical
volumes. Continuing with change.
我现思路是把HACMP停掉后varyoffvg就可以通过chpv -va hdisk7命令改回active状态?
如果chpv -va hdisk7命令执行不成功,是否需要解除镜像把hdisk7提出VG,删除hdisk7再加入VG,然后重新做镜像才行呢?希望大神们能支支招啊。