目前遇到一个问题,请大家帮忙
2台p750+1台vnx5500做ha。
P750安装aix7.1+ha5.4.1 (符合兼容列表,已查)
/etc/hosts文件
192.168.1.1 node1_boot1
192.168.2.1 node1_boot2
192.168.1.2 node2_boot1
192.168.2.2 node2_boot2
10.234.10.1 node1_per
10.234.10.2 node2_per
10.234.10.10 clu_svc
配置HA过程
建一个集群:cls
分别通过两个boot1建两个node:node1,node2
建一个网络netXX01,并把4个boot地址全部加入。
配两个per地址,属于netXX01
建一个服务地址,clu_svc
建一个app,并配置好相应脚本
在node1建datavg,包括hdiskpower1和hdiskpower2,并varyoffvg,再在node2上importvg datavg
建资源组rg_HIS,配置成主备模式,并never fallback 将服务IP、app、datavg加入资源组。
再校验同步,此时未出错。
(说明,这时未配置非IP的心跳网络)
然后smitty clstart启双机,这时双机启动失败
hacmp.out中相关的日志:
+rg_HIS:cl_activate_vgs[213] [[ -n == -n ]]
+rg_HIS:cl_activate_vgs[215] SYNCFLAG=-n
+rg_HIS:cl_activate_vgs[216] shift
/usr/es/sbin/cluster/events/utils/cl_activate_vgs: line 216: syntax error at line 221: ` $# -ne 0 ' unexpected
+rg_HIS:process_resources[process_volume_groups+15] RC=3
+rg_HIS:process_resources[process_volume_groups+15] [[ 3 != 0 ]]
+rg_HIS:process_resources[process_volume_groups+15] [[ 3 != 11 ]]
+rg_HIS:process_resources[process_volume_groups+19] export GROUPNAME
+rg_HIS:process_resources[process_volume_groups+22] ALLVGS=All_volume_groups
+rg_HIS:process_resources[process_volume_groups+23] cl_RMupdate resource_error All_volume_groups process_resources
Reference string: Fri.Jun.21.11:30:17.CST.2013.process_resources.All_volume_groups.rg_HIS.ref
+rg_HIS:process_resources[process_volume_groups+23] [[ 3 != 0 ]]
+rg_HIS:process_resources[process_volume_groups+29] STAT=3
+rg_HIS:process_resources[process_volume_groups+52] return 3
+rg_HIS:process_resources[process_volume_groups_main+187] STAT=3
+rg_HIS:process_resources[process_volume_groups_main+190] return 3
+rg_HIS:process_resources[+2471] RC=3
+rg_HIS:process_resources[+2471] [[ ACQUIRE = RELEASE ]]
+rg_HIS:process_resources[+2318] true
+rg_HIS:process_resources[+2320] set -a
+rg_HIS:process_resources[+2323] clRGPA
+rg_HIS:clRGPA[+49] [[ high = high ]]
+rg_HIS:clRGPA[+49] version=1.16
+rg_HIS:clRGPA[+51] usingVer=clrgpa
+rg_HIS:clRGPA[+56] clrgpa
+rg_HIS:clRGPA[+57] exit 0
+rg_HIS:process_resources[+2323] eval JOB_TYPE=ERROR RESOURCE_GROUPS="rg_HIS"
+rg_HIS:process_resources[+2323] JOB_TYPE=ERROR RESOURCE_GROUPS=rg_HIS
+rg_HIS:process_resources[+2325] RC=0
+rg_HIS:process_resources[+2326] set +a
+rg_HIS:process_resources[+2328] [ 0 -ne 0 ]
+rg_HIS:process_resources[+2334] RESOURCE_GROUPS=rg_HIS
+rg_HIS:process_resources[+2590] set_resource_group_state ERROR
+rg_HIS:process_resources[set_resource_group_state+4] STAT=0
+rg_HIS:process_resources[set_resource_group_state+7] export GROUPNAME
+rg_HIS:process_resources[set_resource_group_state+8] [ ERROR != DOWN ]
+rg_HIS:process_resources[set_resource_group_state+10] [ REAL = EMUL ]
+rg_HIS:process_resources[set_resource_group_state+15] clchdaemons -d clstrmgr_scripts -t resource_locator -n DBHIS2 -o rg_HIS -v ERROR
+rg_HIS:process_resources[set_resource_group_state+16] [ 0 -ne 0 ]
+rg_HIS:process_resources[set_resource_group_state+27] [ ERROR = ACQUIRING ]
+rg_HIS:process_resources[set_resource_group_state+32] [ ERROR = RELEASING ]
+rg_HIS:process_resources[set_resource_group_state+37] [ ERROR = UP ]
+rg_HIS:process_resources[set_resource_group_state+42] [ ERROR = DOWN ]
+rg_HIS:process_resources[set_resource_group_state+47] [ ERROR = ERROR ]
+rg_HIS:process_resources[set_resource_group_state+49] cl_RMupdate rg_error rg_HIS process_resources
Reference string: Fri.Jun.21.11:30:18.CST.2013.process_resources.rg_HIS.ref
+rg_HIS:process_resources[set_resource_group_state+50] continue
+rg_HIS:process_resources[set_resource_group_state+81] return 0
(其中DBHIS2就是node2)
如果我把vg从资源组中去掉,校验同步,再启HA,资源组和应用的状态就是online的
请高手查看下是什么原因?
谢谢了。
我的hdiskpower1和2没有把reserve_policy设置为no_reserve
是不是HA主备模式时,EMC存储也要设置为no_reserve?
还有,是不是如果不加disk method和cfgscsi_id也会出现cluster启动报错的情况?
收起