硬件生产HA

HA问题,大家帮忙下,,急

今天遇到了一个HA停止的问题:当停止HA时候,一个节点的HA进程停止了,另一个节点的HA进程还在.先说下环境,2台P740做了HA,应用是SAP.操作系统AIX6100-06-08-1216   HA版本:6.1sp09配置完成HA后,空脚本执行启动和停止HA都没有问题,然后再用命令启动应用(DB2和SAP,...显示全部
今天遇到了一个HA停止的问题:当停止HA时候,一个节点的HA进程停止了,另一个节点的HA进程还在.
先说下环境,2台P740做了HA,应用是SAP.
操作系统AIX6100-06-08-1216   HA版本:6.1sp09
配置完成HA后,空脚本执行启动和停止HA都没有问题,
然后再用命令启动应用(DB2和SAP,这里用脚本执行,但资源组里的脚本还是空脚本),启动和停止HA仍然没有问题

现在情况,在资源组里的脚本加上上面测试的脚本,启动没有问题,但是停止的时候,一个节点A的HA进程都停了,
但是B节点的HA还在,如下图,不知道什么问题麻烦大家帮忙解决下.
停止HA后,HA进程应该如下图:inoperative状态,但是另一个节点的状态还是active

clstart.png


另一个B节点的HA关闭不了,

clstop.png




以下是hacmp.out的一些log
:cl_sel[148] ls -rt1 /tmp/ibmsupt/hacmp/eventlogs.2012.10.10.16.06.Z /tmp/ibmsupt/hacmp/event
logs.2012.10.10.16.11.Z /tmp/ibmsupt/hacmp/eventlogs.2012.10.10.17.20.Z /tmp/ibmsupt/hacmp/ev
entlogs.2012.10.10.17.26.Z /tmp/ibmsupt/hacmp/eventlogs.2012.10.10.21.55.Z /tmp/ibmsupt/hacmp
/eventlogs.2012.10.10.22.01.Z
:cl_sel[148] FFDC_LIST=/tmp/ibmsupt/hacmp/eventlogs.2012.10.10.16.06.Z
:cl_sel[151] rm -f /tmp/ibmsupt/hacmp/eventlogs.2012.10.10.16.06.Z
:cl_sel[155] dspmsg scripts.cat 10059 'FFDC event log collection saved to /tmp/ibmsupt/hacmp/
eventlogs.2012.10.10.22.01n' /tmp/ibmsupt/hacmp/eventlogs.2012.10.10.22.01
FFDC event log collection saved to /tmp/ibmsupt/hacmp/eventlogs.2012.10.10.22.01
:cl_sel[157] exit 0
WARNING: Cluster sapcluster has been running recovery program 'TE_FAIL_NODE' for 360 seconds.
Please check cluster status.
WARNING: Cluster sapcluster has been running recovery program 'TE_FAIL_NODE' for 390 seconds.
Please check cluster status.
WARNING: Cluster sapcluster has been running recovery program 'TE_FAIL_NODE' for 420 seconds.
Please check cluster status.
WARNING: Cluster sapcluster has been running recovery program 'TE_FAIL_NODE' for 450 seconds.
Please check cluster status.
WARNING: Cluster sapcluster has been running recovery program 'TE_FAIL_NODE' for 480 seconds.
Please check cluster status.
WARNING: Cluster sapcluster has been running recovery program 'TE_FAIL_NODE' for 540 seconds.
Please check cluster status.



加入现在要重新启动HA的话,报了ha的进程还在
Command: failed        stdout: yes           stderr: no

Before command completion, additional instructions may appear below.

Verifying Cluster Configuration Prior to Starting Cluster Services.
WARNING: Node(s):  prd1 requested to start cluster services.
These nodes are already running cluster services and will not be started.


麻烦大家帮忙下.....................收起
参与15

查看其它 13 个回答jim567的回答

jim567jim567系统架构师上海天玑科技股份有限公司
停止应用-停止数据库-查看脚本的权限(HA当中脚本权限755)两边节点的脚本内容以及路径是否正确-同步HA查看同步信息。
互联网服务 · 2012-10-11
浏览4364

回答者

jim567
系统架构师上海天玑科技股份有限公司
擅长领域: 灾备存储数据安全

jim567 最近回答过的问题

回答状态

  • 发布时间:2012-10-11
  • 关注会员:1 人
  • 回答浏览:4364
  • X社区推广