Energy & Mining · hacmp · aix 6 · aix 6.1

Question about the "NIM thread blocked" error

Problem description: a routine inspection found the following errors reported by errpt on the AIX server:
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3C81E43F   0107190114 P U topsvcs        Late in sending heartbeat
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
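
To make a listing like the one above easier to read, here is a minimal Python sketch that decodes the errpt summary columns and counts entries per minute and description. It assumes the usual errpt summary layout (IDENTIFIER, TIMESTAMP as MMDDhhmmYY, type, class, resource, description); the function names and the shortened SAMPLE text are made up for illustration.

```python
from collections import Counter
from datetime import datetime

# Shortened copy of the errpt summary lines quoted in the question.
SAMPLE = """\
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
3C81E43F   0107190114 P U topsvcs        Late in sending heartbeat
3D32B80D   0107190114 P S topsvcs        NIM thread blocked
"""


def parse_errpt_line(line):
    """Split one errpt summary line into named fields."""
    ident, stamp, err_type, err_class, resource, description = line.split(None, 5)
    return {
        "id": ident,
        # errpt prints the timestamp as MMDDhhmmYY, e.g. 0107190114.
        "time": datetime.strptime(stamp, "%m%d%H%M%y"),
        "type": err_type,
        "class": err_class,
        "resource": resource,
        "description": description.strip(),
    }


def summarize(text):
    """Count entries per (timestamp, description) pair."""
    entries = [parse_errpt_line(l) for l in text.splitlines() if l.strip()]
    return Counter((e["time"], e["description"]) for e in entries)


if __name__ == "__main__":
    for (when, desc), count in summarize(SAMPLE).items():
        print(when.strftime("%Y-%m-%d %H:%M"), desc, count)
```

Decoded this way, the timestamp 0107190114 reads as 2014-01-07 19:01, so all of the listed entries fall within the same minute.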

Details of the corresponding log entry:
Description
NIM thread blocked

Probable Causes
A thread in a Topology Services Network Interface Module (NIM) process
was blocked
Topology Services NIM process cannot get timely access to CPU
The system clock was set forward

User Causes
Excessive memory consumption is causing high memory contention
Excessive disk I/O is causing high memory contention
The system clock was manually set forward

        Recommended Actions
        Examine I/O and memory activity on the system
        Reduce load on the system
        Tune virtual memory parameters
        Call IBM Service if problem persists

Failure Causes
Excessive virtual memory activity prevents NIM from making progress
Excessive disk I/O traffic is interfering with paging I/O

        Recommended Actions
        Examine I/O and memory activity on the system
        Reduce load on the system
        Tune virtual memory parameters
        Call IBM Service if problem persists

Detail Data
DETECTING MODULE
rsct,nim_control.C,1.39.1.41,7916            
ERROR ID
6BUfAx.LuxmG/klb09g.2g0...................
REFERENCE CODE

Thread which was blocked
send thread
Interval in seconds during which process was blocked
          60
Interface name
en4

The same error is reported for en0.
Output of lssrc -ls topsvcs:
Network Name   Indx Defd  Mbrs  St   Adapter ID      Group ID
net_ether_01_0 [ 0] 3     3     S    172.16.12.11    172.16.12.13   
net_ether_01_0 [ 0] en4              0x40100e25      0x40100e7b
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 2 Current group: 2
Packets sent    : 48017343 ICMP 0 Errors: 0 No mbuf: 0
Packets received: 59525397 ICMP 0 Dropped: 0
NIM's PID: 11272248
net_ether_01_1 [ 1] 3     3     S    172.16.11.11    172.16.11.13   
net_ether_01_1 [ 1] en0              0x410f4e51      0x410f4e53
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 977 Current group: 1
Packets sent    : 45498064 ICMP 719 Errors: 0 No mbuf: 0
Packets received: 59184866 ICMP 1819 Dropped: 0
NIM's PID: 12189704
rs232_0        [ 2] 2     2     S    255.255.0.1     255.255.0.3   
rs232_0        [ 2] tty1             0x80100e27      0x80100e81
HB Interval = 2.000 secs. Sensitivity = 5 missed beats
Missed HBs: Total: 1 Current group: 1
Packets sent    : 31484275 ICMP 0 Errors: 0 No mbuf: 0
Packets received: 32652406 ICMP 0 Dropped: 0
NIM's PID: 14287078
rs232_1        [ 3] 2     2     S    255.255.0.0     255.255.0.2   
rs232_1        [ 3] tty0             0x80100e28      0x80100e44
HB Interval = 2.000 secs. Sensitivity = 5 missed beats
Missed HBs: Total: 0 Current group: 0
Packets sent    : 31484270 ICMP 0 Errors: 0 No mbuf: 0
Packets received: 32646900 ICMP 0 Dropped: 0
NIM's PID: 12386476
  2 locally connected Clients with PIDs:
haemd(11993156) hagsd(8323148)
  Fast Failure Detection available but off.
  Dead Man Switch Enabled:
     reset interval = 1 seconds
     trip  interval = 20 seconds
  Client Heartbeating Disabled.
  Configuration Instance = 6
  Daemon employs no security
  Segments pinned: Text Data.
  Text segment size: 862 KB. Static data segment size: 1497 KB.
  Dynamic data segment size: 8705. Number of outstanding malloc: 210
  User time 2246 sec. System time 1836 sec.
  Number of page faults: 11. Process swapped out 0 times.
  Number of nodes up: 3. Number of nodes down: 0.
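
The per-network counters in this output are easy to miss by eye. The following Python sketch pulls out the "Missed HBs" total for each network and interface. It is only an illustration that assumes the line layout shown above; the regular expressions, the function name, and the shortened SAMPLE text are hypothetical and not part of any IBM tool.

```python
import re

# Shortened copy of the topsvcs status output quoted above.
SAMPLE = """\
net_ether_01_0 [ 0] 3     3     S    172.16.12.11    172.16.12.13
net_ether_01_0 [ 0] en4              0x40100e25      0x40100e7b
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 2 Current group: 2
net_ether_01_1 [ 1] 3     3     S    172.16.11.11    172.16.11.13
net_ether_01_1 [ 1] en0              0x410f4e51      0x410f4e53
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 977 Current group: 1
"""

# Second line of each network block: name, index, interface, hex adapter ID.
NET_LINE = re.compile(r"^(\S+)\s+\[\s*\d+\]\s+(\S+)\s+0x")
MISSED = re.compile(r"^Missed HBs: Total: (\d+) Current group: (\d+)")


def missed_heartbeats(text):
    """Map (network, interface) to the total missed heartbeat count."""
    result = {}
    current = None
    for line in text.splitlines():
        m = NET_LINE.match(line)
        if m:
            current = (m.group(1), m.group(2))
            continue
        m = MISSED.match(line)
        if m and current:
            result[current] = int(m.group(1))
    return result


if __name__ == "__main__":
    for (net, iface), total in missed_heartbeats(SAMPLE).items():
        print(net, iface, total)
```

On the values quoted in the question, en0 stands out with 977 missed heartbeats in total, against 2 for en4.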

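According to what I found online, the fix is to change the NIM (network module) settings in HACMP. I checked the current values: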
Current values for Network Module :

Description                     [Ethernet Protocol]
Parameters                      []
Grace Period                    60
Supports gratuitous arp         [True]
Type                            Address (IP)
Entry type                      [adapter_type]
Next generic type               [transport]
Next generic name               [Generic_UDP]
Supports source routing         True
Failure Cycle                   10
Heartbeat Rate (seconds)        1.00
Failure Detection Rate  (Normal)

  10 * 1.00 * 2 = 20.00 seconds
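
The arithmetic behind that line can be sketched in Python as follows. The formula (failure cycle times heartbeat rate times 2) is taken from the panel output above; the function name is made up, and the comparison with the 60-second blocked interval from the errpt detail is only an illustration of why the two numbers are worth looking at together.

```python
def failure_detection_seconds(failure_cycle, heartbeat_rate):
    """Failure detection time as shown on the panel: cycle * rate * 2."""
    return failure_cycle * heartbeat_rate * 2


# Values from the Ethernet network module shown above (Normal setting).
ethernet_normal = failure_detection_seconds(10, 1.00)

# Values from the rs232 networks in the topsvcs output
# (HB Interval = 2.000 secs, Sensitivity = 5 missed beats).
rs232 = failure_detection_seconds(5, 2.000)

# Interval reported in the errpt detail for the blocked send thread.
blocked_interval = 60

if __name__ == "__main__":
    print("Ethernet detection time:", ethernet_normal, "seconds")
    print("rs232 detection time:", rs232, "seconds")
    print("Blocked longer than detection time:", blocked_interval > ethernet_normal)
```

With the values from this thread, both network types come out at 20 seconds, which is shorter than the 60 seconds for which the send thread was reported blocked.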


The configuration looks correct to me. How should this parameter be set?

7 peer answers

colins, systems engineer, financial industry
Try changing the cluster configuration. There is a value in it that you can change to Slow.
Banking · 2014-01-08
ITRookie, network engineer, hua-cloud
I have run into the same problem and am waiting for an expert to answer.
Internet services · 2015-07-14
马蓝山小许, storage engineer, 柏科数据
Thanks!
Hardware manufacturing · 2014-05-05
bensys, systems engineer, 18
Waiting for an expert to explain how to solve this. Learning from the thread.
Systems integration · 2014-03-27
hufeng719, alliance member, systems engineer, a steel company
So how exactly should this be handled?
Energy & Mining · 2014-01-08
zwz99999, systems engineer, dcits
It is probably caused by network latency.
Systems integration · 2014-01-08
hufeng719, alliance member, systems engineer, a steel company
Could you say specifically which file's configuration needs to be changed? I am not sure.
Energy & Mining · 2014-01-08

Asked by

hufeng719
Systems engineer, a steel company
Areas of expertise: databases, storage, servers

Question status

  • Published: 2014-01-08
  • Followers: 1
  • Views: 16398
  • Latest answer: 2015-07-14