HA 5.1 + AIX 5.3: strange behavior

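Each machine has two network adapters, the heartbeat runs over disk, and the resource group is of type rotating. I unplugged both network cables from machine A and machine B took over. When I plugged A's cables back in, everything moved back to A and machine B went down immediately. Where should I start looking for the cause?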


redliquid (software architect, sms)
[root@p5509bf:/#]lssrc -g cluster
Subsystem         Group            PID          Status
clsmuxpdES       cluster          372748       active
clstrmgrES       cluster          348320       active
clinfoES         cluster          385162       active
[root@p5509bf:/#]lssrc -ls topsvcs
Subsystem         Group            PID     Status
topsvcs          topsvcs          266316  active
Network Name   Indx Defd  Mbrs  St   Adapter ID      Group ID
net_ether_01_0 [ 0] 2     2     S    192.168.0.254   192.168.0.240  
net_ether_01_0 [ 0]                 (192.168.0.215  )
net_ether_01_0 [ 0] en0              0x4150d94c      0x4150d96c
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 0 Current group: 0
Packets sent    : 142 ICMP 0 Errors: 0 No mbuf: 0
Packets received: 194 ICMP 0 Dropped: 0
NIM's PID: 409672
net_ether_01_1 [ 1] 2     2     S    10.10.10.10     10.10.10.11   
net_ether_01_1 [ 1] en1              0x4150d912      0x4150d96b
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 0 Current group: 0
Packets sent    : 141 ICMP 0 Errors: 0 No mbuf: 0
Packets received: 194 ICMP 0 Dropped: 0
NIM's PID: 356452
diskhb_0       [ 2] 2     2     S    255.255.10.0    255.255.10.1   
diskhb_0       [ 2] rhdisk5          0x8150d913      0x8150d989
HB Interval = 2.000 secs. Sensitivity = 4 missed beats
Missed HBs: Total: 0 Current group: 0
Packets sent    : 52 ICMP 0 Errors: 0 No mbuf: 0
Packets received: 56 ICMP 0 Dropped: 0
NIM's PID: 303324
  2 locally connected Clients with PIDs:
haemd(389178) hagsd(294950)
  Dead Man Switch Enabled:
     reset interval = 1 seconds
     trip  interval = 20 seconds
  Client Heartbeating Disabled.
  Configuration Instance = 50
  Daemon employs no security
  Segments pinned: Text Data.
  Text segment size: 810 KB. Static data segment size: 1543 KB.
  Dynamic data segment size: 4033. Number of outstanding malloc: 183
  User time 0 sec. System time 0 sec.
  Number of page faults: 230. Process swapped out 0 times.
  Number of nodes up: 2. Number of nodes down: 0.
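The heartbeat settings in this listing determine how quickly each network is declared down, which matters when reproducing the cable-pull test. Below is a minimal sketch that reads `lssrc -ls topsvcs` output on stdin and prints an estimated detection time per network. It assumes the commonly documented rule of thumb that detection time is roughly HB interval × sensitivity × 2; the function name `hb_detection_times` is made up for illustration.

```shell
# Sketch: estimate per-network failure detection time from
# `lssrc -ls topsvcs` output read on stdin.
# Assumption: detection time ~= HB interval * sensitivity * 2 (seconds).
hb_detection_times() {
    awk '
        # A network line starts with the network name followed by "[ idx]".
        /^ *[A-Za-z0-9_]+ +\[ *[0-9]+\]/ { name = $1 }
        # Example: "HB Interval = 1.000 secs. Sensitivity = 10 missed beats"
        # Field 4 is the interval, field 8 is the sensitivity.
        /HB Interval =/ { printf "%s %.0f\n", name, $4 * $8 * 2 }
    '
}
```

It could be run as `lssrc -ls topsvcs | hb_detection_times`. With the values listed above, and under that assumption, this gives about 20 seconds for each Ethernet network and about 16 seconds for diskhb_0.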
[root@p5509bf:/#]cd /usr/es/sbin/cluster/u*
[root@p5509bf:/usr/es/sbin/cluster/utilities#]./cltopinfo
Cluster Description of Cluster: test_cluster
Cluster Security Level: Standard
There are 2 node(s) and 2 network(s) defined
NODE p550_node:
        Network net_diskhb_01
                p550_node_hdisk5_01     /dev/hdisk5
        Network net_ether_01
                ati_svr 192.168.0.254
                p5509bf_boot    192.168.0.215
                p550bf_stb      10.10.10.10
NODE p615_node:
        Network net_diskhb_01
                p615_node_hdisk6_01     /dev/hdisk6
        Network net_ether_01
                ati_svr 192.168.0.254
                p615_boot       192.168.0.240
                p615_stb        10.10.10.11

Resource Group test_app
        Behavior                 rotating
        Participating Nodes      p615_node p550_node
        Service IP Label                 ati_svr
[root@p5509bf:/usr/es/sbin/cluster/utilities#]./clfindres
-----------------------------------------------------------------------------
Group Name     Type       State      Location   
-----------------------------------------------------------------------------
test_app       rotating   OFFLINE    p615_node   
                          ONLINE     p550_node   
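When repeating the cable-pull test, it can help to record which node holds the resource group before and after each step. The following is a rough sketch that reads `clfindres` output on stdin and prints each group together with the node where it is ONLINE. It assumes the column layout shown in the listing above; the function name `rg_online_nodes` is hypothetical.

```shell
# Sketch: print "group node" for every resource group instance that
# clfindres reports as ONLINE. Reads clfindres output on stdin.
# Assumption: the four-column layout shown in the listing above.
rg_online_nodes() {
    awk '
        # Full row: group name, type, state, location.
        NF == 4 { group = $1; state = $3; node = $4 }
        # Continuation row for the same group: state, location.
        NF == 2 { state = $1; node = $2 }
        # Header (5 fields), separator (1 field) and blank lines are skipped.
        (NF == 4 || NF == 2) && state == "ONLINE" { print group, node }
    '
}
```

For example, `./clfindres | rg_online_nodes` run before unplugging the cables, after the takeover, and after reconnecting would show whether the rotating group actually moved back.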

[root@p5509bf:/usr/es/sbin/cluster/utilities#]./cllstopsvcs -c        
#runFixedPri:fixedPriLevel:tsLogLength:gsLogLength:instanceNum
1:38:5000:5000:50
[root@p5509bf:/usr/es/sbin/cluster/utilities#]./cllstopsvcs   
runFixedPri:    1
fixedPriLevel:  38
tsLogLength:    5000
gsLogLength:    5000
instanceNum     50

[root@p5509bf:/usr/es/sbin/cluster/utilities#]ls *rsct*
clrsctinfo
[root@p5509bf:/usr/es/sbin/cluster/utilities#]./clrsctinfo -cp cllsif
p550_node_hdisk5_01:service:net_diskhb_01:diskhb:serial:p550_node:/dev/rhdisk5::hdisk5::
p5509bf_boot:boot:net_ether_01:ether:public:p550_node:192.168.0.215::en0::255.255.255.0
ati_svr:service:net_ether_01:ether:public::192.168.0.254::::255.255.255.0
p550bf_stb:standby:net_ether_01:ether:public:p550_node:10.10.10.10::en1::255.255.255.0
p615_node_hdisk6_01:service:net_diskhb_01:diskhb:serial:p615_node:/dev/hdisk6::hdisk6::
p615_boot:boot:net_ether_01:ether:public:p615_node:192.168.0.240::en0::255.255.255.0
ati_svr:service:net_ether_01:ether:public::192.168.0.254::::255.255.255.0
p615_stb:standby:net_ether_01:ether:public:p615_node:10.10.10.11::en1::255.255.255.0
[root@p5509bf:/usr/es/sbin/cluster/utilities#]./clrsctinfo -cp cllsnw
net_diskhb_01:serial:disable::p550_node:p550_node_hdisk5_01:::p615_node:p615_node_hdisk6_01::
net_ether_01:public:disable::p550_node:ati_svr:p550bf_stb:::p615_node:ati_svr:p615_stb::
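The colon-separated `cllsif` records above are hard to scan by eye. As a convenience, here is a small sketch that condenses them to one line per adapter: node, role, label, and address or device. The field positions are inferred from the listing above rather than from documentation, and the function name `adapters_by_node` is invented for this example.

```shell
# Sketch: summarize `clrsctinfo -cp cllsif` records read on stdin.
# Assumption (inferred from the listing): field 1 = label, 2 = role,
# 6 = node (empty for the shared service label), 7 = address or device.
adapters_by_node() {
    awk -F: '
        # Skip exact duplicate records (the service label is listed twice).
        !seen[$0]++ {
            node = ($6 == "") ? "(shared)" : $6
            printf "%s %s %s %s\n", node, $2, $1, $7
        }
    '
}
```

A possible invocation is `./clrsctinfo -cp cllsif | adapters_by_node`, which makes it easier to compare the boot, standby, and disk heartbeat definitions of the two nodes side by side.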
IT consulting services · 2008-12-24