Energy and mining · hacmp · aix 6 · aix 6.1

Question about the "NIM thread blocked" error

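Problem description: a routine inspection found the following errors in the error report (errpt) on the AIX server:
3D32B80D   0107190114 P S topsvcs        NIM thread blocked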
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3C81E43F   0107190114 P U topsvcs        Late in sending heartbeat
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
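When the summary is this repetitive, tallying entries per error identifier makes it easier to see which messages dominate. The following is a minimal sketch, assuming the summary has been saved as plain text in the six-column layout shown above; the function name `summarize_errpt` and the sample string are illustrative only and are not part of any AIX tool.

```python
from collections import Counter

# A few lines copied from the listing above. Columns: identifier,
# timestamp, type, class, resource name, description.
ERRPT_SAMPLE = """\
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3C81E43F   0107190114 P U topsvcs        Late in sending heartbeat
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
"""


def summarize_errpt(text):
    """Count entries per (identifier, description) in errpt summary text."""
    counts = Counter()
    for line in text.splitlines():
        # Split into at most six fields so the description keeps its spaces.
        fields = line.split(None, 5)
        # Skip blank or short lines and the IDENTIFIER header, if present.
        if len(fields) < 6 or fields[0] == "IDENTIFIER":
            continue
        identifier, description = fields[0], fields[5].strip()
        counts[(identifier, description)] += 1
    return counts


if __name__ == "__main__":
    for (identifier, description), n in summarize_errpt(ERRPT_SAMPLE).most_common():
        print(f"{identifier}  {n:3d}  {description}")
```

In the listing above, nearly every entry is "NIM thread blocked", with a single "Late in sending heartbeat" entry among them.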

Detailed contents of the corresponding log entry:
Description
NIM thread blocked

Probable Causes
A thread in a Topology Services Network Interface Module (NIM) process
was blocked
Topology Services NIM process cannot get timely access to CPU
The system clock was set forward

User Causes
Excessive memory consumption is causing high memory contention
Excessive disk I/O is causing high memory contention
The system clock was manually set forward

        Recommended Actions
        Examine I/O and memory activity on the system
        Reduce load on the system
        Tune virtual memory parameters
        Call IBM Service if problem persists

Failure Causes
Excessive virtual memory activity prevents NIM from making progress
Excessive disk I/O traffic is interfering with paging I/O

        Recommended Actions
        Examine I/O and memory activity on the system
        Reduce load on the system
        Tune virtual memory parameters
        Call IBM Service if problem persists

Detail Data
DETECTING MODULE
rsct,nim_control.C,1.39.1.41,7916            
ERROR ID
6BUfAx.LuxmG/klb09g.2g0...................
REFERENCE CODE

Thread which was blocked
send thread
Interval in seconds during which process was blocked
          60
Interface name
en4

en0 reports the same error.
Output of lssrc -ls topsvcs:
Network Name   Indx Defd  Mbrs  St   Adapter ID      Group ID
net_ether_01_0 [ 0] 3     3     S    172.16.12.11    172.16.12.13   
net_ether_01_0 [ 0] en4              0x40100e25      0x40100e7b
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 2 Current group: 2
Packets sent    : 48017343 ICMP 0 Errors: 0 No mbuf: 0
Packets received: 59525397 ICMP 0 Dropped: 0
NIM's PID: 11272248
net_ether_01_1 [ 1] 3     3     S    172.16.11.11    172.16.11.13   
net_ether_01_1 [ 1] en0              0x410f4e51      0x410f4e53
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 977 Current group: 1
Packets sent    : 45498064 ICMP 719 Errors: 0 No mbuf: 0
Packets received: 59184866 ICMP 1819 Dropped: 0
NIM's PID: 12189704
rs232_0        [ 2] 2     2     S    255.255.0.1     255.255.0.3   
rs232_0        [ 2] tty1             0x80100e27      0x80100e81
HB Interval = 2.000 secs. Sensitivity = 5 missed beats
Missed HBs: Total: 1 Current group: 1
Packets sent    : 31484275 ICMP 0 Errors: 0 No mbuf: 0
Packets received: 32652406 ICMP 0 Dropped: 0
NIM's PID: 14287078
rs232_1        [ 3] 2     2     S    255.255.0.0     255.255.0.2   
rs232_1        [ 3] tty0             0x80100e28      0x80100e44
HB Interval = 2.000 secs. Sensitivity = 5 missed beats
Missed HBs: Total: 0 Current group: 0
Packets sent    : 31484270 ICMP 0 Errors: 0 No mbuf: 0
Packets received: 32646900 ICMP 0 Dropped: 0
NIM's PID: 12386476
  2 locally connected Clients with PIDs:
haemd(11993156) hagsd(8323148)
  Fast Failure Detection available but off.
  Dead Man Switch Enabled:
     reset interval = 1 seconds
     trip  interval = 20 seconds
  Client Heartbeating Disabled.
  Configuration Instance = 6
  Daemon employs no security
  Segments pinned: Text Data.
  Text segment size: 862 KB. Static data segment size: 1497 KB.
  Dynamic data segment size: 8705. Number of outstanding malloc: 210
  User time 2246 sec. System time 1836 sec.
  Number of page faults: 11. Process swapped out 0 times.
  Number of nodes up: 3. Number of nodes down: 0.
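To compare the heartbeat networks at a glance, the "Missed HBs" totals can be pulled out per adapter. This is a rough sketch that assumes the output layout shown above (an adapter line carrying two hexadecimal IDs, followed later by a "Missed HBs" line); the regular expressions and the function name `missed_heartbeats` are my own and may need adjusting for other RSCT levels.

```python
import re

# Excerpt of the lssrc -ls topsvcs output shown above.
LSSRC_SAMPLE = """\
net_ether_01_0 [ 0] 3     3     S    172.16.12.11    172.16.12.13
net_ether_01_0 [ 0] en4              0x40100e25      0x40100e7b
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 2 Current group: 2
net_ether_01_1 [ 1] 3     3     S    172.16.11.11    172.16.11.13
net_ether_01_1 [ 1] en0              0x410f4e51      0x410f4e53
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 977 Current group: 1
"""

# "<network> [ n] <adapter> 0x... 0x..." names the local adapter.
ADAPTER_RE = re.compile(
    r"^(\S+)\s+\[\s*\d+\]\s+(\S+)\s+0x[0-9a-fA-F]+\s+0x[0-9a-fA-F]+"
)
MISSED_RE = re.compile(r"^Missed HBs:\s+Total:\s+(\d+)\s+Current group:\s+(\d+)")


def missed_heartbeats(text):
    """Map (network, adapter) to the total missed-heartbeat count."""
    results = {}
    current = None
    for line in text.splitlines():
        match = ADAPTER_RE.match(line)
        if match:
            current = (match.group(1), match.group(2))
            continue
        match = MISSED_RE.match(line)
        if match and current is not None:
            results[current] = int(match.group(1))
            current = None
    return results


if __name__ == "__main__":
    for (network, adapter), total in missed_heartbeats(LSSRC_SAMPLE).items():
        print(f"{network:16s} {adapter:6s} missed={total}")
```

On the output above, this makes the imbalance visible: en0 has 977 missed heartbeats in total, while en4 has 2.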

According to posts online, the fix is to change the NIM (Network Module) settings in HACMP. I checked the current values:
Current values for Network Module :

Description                     [Ethernet Protocol]
Parameters                      []
Grace Period                    60
Supports gratuitous arp         [True]
Type                            Address (IP)
Entry type                      [adapter_type]
Next generic type               [transport]
Next generic name               [Generic_UDP]
Supports source routing         True
Failure Cycle                   10
Heartbeat Rate (seconds)        1.00
Failure Detection Rate  (Normal)

  10 * 1.00 * 2 = 20.00 seconds
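The arithmetic on the line above is failure cycle × heartbeat rate × 2. A small sketch of that formula follows; the function name is hypothetical, and the 12 / 2.00 pair in the last call is only an illustrative "slower" combination, not a value taken from this cluster.

```python
def failure_detection_time(failure_cycle, heartbeat_rate_secs):
    """Failure detection time in seconds: failure cycle * heartbeat rate * 2."""
    return failure_cycle * heartbeat_rate_secs * 2


if __name__ == "__main__":
    # Ethernet module values shown above: Failure Cycle 10, Heartbeat Rate 1.00.
    print(failure_detection_time(10, 1.00))  # 20.0
    # rs232 values from the lssrc output: sensitivity 5, interval 2.000.
    print(failure_detection_time(5, 2.00))   # 20.0
    # Hypothetical slower setting, for illustration only.
    print(failure_detection_time(12, 2.00))  # 48.0
```

Raising either the failure cycle or the heartbeat rate lengthens the detection window, which gives a blocked NIM thread more time to recover before the interface is declared failed, at the cost of slower failure detection.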


The configuration looks correct to me. How should this parameter be set?

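colins (systems engineer, financial industry):
Try changing the cluster configuration. One of its values can be set to Slow (presumably the Failure Detection Rate, which is shown as Normal above); see whether that helps.
Banking · 2014-01-08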