问题描述:在HMC中,VIOS和LPAR都建立了相应的SCSI虚拟适配器,并已做了关联:hscroot@hmcsvr:~> lshwres -m 9117-MMB-xxxx -r virtualio --rsubtype scsi
lpar_name=D09_DFP_VIOS1,lpar_id=2,slot_num=319,state=0,is_required=0,adapter_type=server,remote_lpar_id=21,remote...
显示全部问题描述:
在HMC中,VIOS和LPAR都建立了相应的SCSI虚拟适配器,并已做了关联:
hscroot@hmcsvr:~> lshwres -m 9117-MMB-xxxx -r virtualio --rsubtype scsi
lpar_name=D09_DFP_VIOS1,lpar_id=2,slot_num=319,state=0,is_required=0,adapter_type=server,remote_lpar_id=21,remote_lpar_name=LPAR19_10.2.0.7_itsmapp01,remote_slot_num=319
lpar_name=D09_DFP_VIOS1,lpar_id=2,slot_num=318,state=1,is_required=0,adapter_type=server,remote_lpar_id=20,remote_lpar_name=LPAR18_10.2.0.41_NCDB,remote_slot_num=318
但是在VIOS通过cfgdev
却无法识别到这个设备,于是尝试在HMC里删除这个设备,重新添加一下,于是出现如下报错:
hscroot@hmcsvr:~> chhwres -r virtualio --rsubtype scsi -m 9117-MMB-xxxx -o r -s 319 --id 2 --force
HSCL294D Dynamic remove of virtual I/O resources failed:
The operation to remove slot(s) U9117.MMB.xxxxxx-V2-C319 has failed on partition D09_DFP_VIOS1. The requested amount of slot(s) to be removed is 1 and the completed amount is 0.
The OS standard output is:
..build_tree
ROOT DIR=/usr/lib/dr/scripts
Syslog ch=DRMGR
cal_dr_scriptinfo_file_checksum : Checksum : 0xd17d68289d3792bf
File read: string table
s_script:
file_name: 0x20126f08(/usr/lib/dr/scripts/all/IBM.CSMAgentRM_dr.sh)
script_version: 0x20126f58(2)
script_vendor_info: 0x20126f63(IBM)
script_creation_date: 0x20126f5a(05252010)
script_info: 0x20126f35(DR script to manage IBM.CSMAgentRM)
s_resource(Before - offsets):
resource_name: 0x5f
resource_use_description: 0x69
s_resource:(After - ptrs)
resource_name: 0x20126f67(pmig)
resource_use_description: 0x20126f71(Partition migration for IBM.CSMAgentRM)
s_resource(Before - offsets):
resource_name: 0x64
resource_use_description: 0x90
s_resource:(After - ptrs)
resource_name: 0x20126f6c(phib)
resource_use_description: 0x20126f98(Partition hibernation for IBM.CSMAgentRM)
s_script:
file_name: 0x20126fc1(/usr/lib/dr/scripts/all/aud_acct_dr)
script_version: 0x20127010(1)
script_vendor_info: 0x2012701b(IBM)
script_creation_date: 0x20127012(03232007)
script_info: 0x20126fe5(WPAR DR Script for Auditing and Accounting)
s_resource(Before - offsets):
resource_name: 0x117
resource_use_description: 0x134
s_resource:(After - ptrs)
resource_name: 0x2012701f(wmig-checkpoint)
resource_use_description: 0x2012703c(Checkpoint of Auditing and Accounting within a WPAR)
s_resource(Before - offsets):
resource_name: 0x127
resource_use_description: 0x168
s_resource:(After - ptrs)
resource_name: 0x2012702f(wmig-restart)
resource_use_description: 0x20127070(Restart of Auditing and Accounting within a WPAR)
s_script:
file_name: 0x201270a1(/usr/lib/dr/scripts/all/ctrmc_MDdr)
script_version: 0x201270f9(2)
script_vendor_info: 0x20127104(IBM)
script_creation_date: 0x201270fb(05252010)
script_info: 0x201270c4(DR script to refresh Management Domain configuration)
s_resource(Before - offsets):
resource_name: 0x200
resource_use_description: 0x20a
s_resource:(After - ptrs)
resource_name: 0x20127108(pmig)
resource_use_description: 0x20127112(Partition migration for RSCT Management Domain)
s_resource(Before - offsets):
resource_name: 0x205
resource_use_description: 0x239
s_resource:(After - ptrs)
resource_name: 0x2012710d(phib)
resource_use_description: 0x20127141(Partition hibernation for RSCT Management Domain)
s_script:
file_name: 0x20127172(/usr/lib/dr/scripts/all/viosdr)
script_version: 0x201271a1(1)
script_vendor_info: 0x201271ac(IBM Corp.)
script_creation_date: 0x201271a3(11192012)
script_info: 0x20127191(VIOS DR Handler)
s_resource(Before - offsets):
resource_name: 0x2ae
resource_use_description: 0x2b3
s_resource:(After - ptrs)
resource_name: 0x201271b6(slot)
resource_use_description: 0x201271bb(Virtual I/O Slot Handler)
..remove_a_slot
..query_a_slot
chosen_slot=0x2010b578 name=U9117.MMB.1095DFP-V2-C319
No OF node for U9117.MMB.1095DFP-V2-C319
The OS standard error is:
0931-009 You specified a drc_name for a resource which is
not assigned to this partition.
个人判断:怀疑是HMC和VIOS两边的RMC数据库不一致导致,遂在VIOS上执行了/usr/sbin/rsct/install/bin/recfgct
,但还是不行,因为目前拿不到HMC
的root
权限,最后的办法只有重置下HMC的RMC数据库试试。请教各位,还有更好的办法吗?
收起