Problem description: during a routine inspection I found the following errors in the error report on our AIX server:

3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3C81E43F 0107190114 P U topsvcs Late in sending heartbeat
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3D32B80D 0107190114 P S topsvcs NIM thread blocked
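To see at a glance how often each error occurs, the summary lines can be tallied by identifier and description. This is only a rough sketch: it assumes the six-column layout shown above (identifier, timestamp, type, class, resource name, description), and `tally_errpt` is a helper name I made up for illustration.

```python
from collections import Counter


def tally_errpt(summary_text):
    """Count errpt summary entries by (identifier, description).

    Assumes each entry has the columns:
    IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION
    """
    counts = Counter()
    for line in summary_text.splitlines():
        # Split into at most 6 fields so the description keeps its spaces.
        fields = line.split(None, 5)
        # Skip blank lines, the header row, and anything that does not
        # start with an 8-character identifier.
        if len(fields) < 6 or len(fields[0]) != 8:
            continue
        counts[(fields[0], fields[5].strip())] += 1
    return counts


sample = """\
3D32B80D 0107190114 P S topsvcs NIM thread blocked
3C81E43F 0107190114 P U topsvcs Late in sending heartbeat
3D32B80D 0107190114 P S topsvcs NIM thread blocked
"""

for (ident, desc), n in tally_errpt(sample).most_common():
    print(n, ident, desc)
```

On a live system the text would come from the saved output of `errpt`; here a short sample stands in for it.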
Detailed content of the corresponding log entry:

Description
NIM thread blocked

Probable Causes
A thread in a Topology Services Network Interface Module (NIM) process was blocked
Topology Services NIM process cannot get timely access to CPU
The system clock was set forward

User Causes
Excessive memory consumption is causing high memory contention
Excessive disk I/O is causing high memory contention
The system clock was manually set forward

        Recommended Actions
        Examine I/O and memory activity on the system
        Reduce load on the system
        Tune virtual memory parameters
        Call IBM Service if problem persists

Failure Causes
Excessive virtual memory activity prevents NIM from making progress
Excessive disk I/O traffic is interfering with paging I/O

        Recommended Actions
        Examine I/O and memory activity on the system
        Reduce load on the system
        Tune virtual memory parameters
        Call IBM Service if problem persists

Detail Data
DETECTING MODULE
rsct,nim_control.C,1.39.1.41,7916
ERROR ID
6BUfAx.LuxmG/klb09g.2g0...................
REFERENCE CODE

Thread which was blocked
send thread
Interval in seconds during which process was blocked
60
Interface name
en4
The entries for en0 look the same.

Output of lssrc -ls topsvcs:

Network Name   Indx Defd Mbrs St Adapter ID   Group ID
net_ether_01_0 [ 0] 3 3 S 172.16.12.11 172.16.12.13
net_ether_01_0 [ 0] en4 0x40100e25 0x40100e7b
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 2 Current group: 2
Packets sent : 48017343 ICMP 0 Errors: 0 No mbuf: 0
Packets received: 59525397 ICMP 0 Dropped: 0
NIM's PID: 11272248
net_ether_01_1 [ 1] 3 3 S 172.16.11.11 172.16.11.13
net_ether_01_1 [ 1] en0 0x410f4e51 0x410f4e53
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 977 Current group: 1
Packets sent : 45498064 ICMP 719 Errors: 0 No mbuf: 0
Packets received: 59184866 ICMP 1819 Dropped: 0
NIM's PID: 12189704
rs232_0 [ 2] 2 2 S 255.255.0.1 255.255.0.3
rs232_0 [ 2] tty1 0x80100e27 0x80100e81
HB Interval = 2.000 secs. Sensitivity = 5 missed beats
Missed HBs: Total: 1 Current group: 1
Packets sent : 31484275 ICMP 0 Errors: 0 No mbuf: 0
Packets received: 32652406 ICMP 0 Dropped: 0
NIM's PID: 14287078
rs232_1 [ 3] 2 2 S 255.255.0.0 255.255.0.2
rs232_1 [ 3] tty0 0x80100e28 0x80100e44
HB Interval = 2.000 secs. Sensitivity = 5 missed beats
Missed HBs: Total: 0 Current group: 0
Packets sent : 31484270 ICMP 0 Errors: 0 No mbuf: 0
Packets received: 32646900 ICMP 0 Dropped: 0
NIM's PID: 12386476
  2 locally connected Clients with PIDs:
haemd(11993156) hagsd(8323148)
  Fast Failure Detection available but off.
  Dead Man Switch Enabled:
     reset interval = 1 seconds
     trip interval = 20 seconds
  Client Heartbeating Disabled.
  Configuration Instance = 6
  Daemon employs no security
  Segments pinned: Text Data.
  Text segment size: 862 KB. Static data segment size: 1497 KB.
  Dynamic data segment size: 8705. Number of outstanding malloc: 210
  User time 2246 sec. System time 1836 sec.
  Number of page faults: 11. Process swapped out 0 times.
  Number of nodes up: 3. Number of nodes down: 0.
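The number that stands out in this output is the missed heartbeat total per interface (977 on en0 versus 2 on en4). A small sketch for pulling those totals out of saved `lssrc -ls topsvcs` output follows. It assumes each network stanza has a line of the form `<network> [ n] <interface> 0x... 0x...` followed later by a `Missed HBs: Total: N` line, as in the output above; `missed_heartbeats` is a hypothetical helper name, not part of any IBM tool.

```python
import re

# Stanza line carrying the interface name, e.g.
# "net_ether_01_0 [ 0] en4 0x40100e25 0x40100e7b"
IFACE_RE = re.compile(
    r"^\s*\S+\s+\[\s*\d+\]\s+(\S+)\s+0x[0-9a-fA-F]+\s+0x[0-9a-fA-F]+"
)
MISSED_RE = re.compile(r"Missed HBs:\s*Total:\s*(\d+)")


def missed_heartbeats(lssrc_text):
    """Return {interface: total missed heartbeats} from lssrc -ls topsvcs text."""
    totals = {}
    current_iface = None
    for line in lssrc_text.splitlines():
        m = IFACE_RE.match(line)
        if m:
            current_iface = m.group(1)
            continue
        m = MISSED_RE.search(line)
        if m and current_iface is not None:
            totals[current_iface] = int(m.group(1))
    return totals


sample = """\
net_ether_01_0 [ 0] 3 3 S 172.16.12.11 172.16.12.13
net_ether_01_0 [ 0] en4 0x40100e25 0x40100e7b
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 2 Current group: 2
net_ether_01_1 [ 1] 3 3 S 172.16.11.11 172.16.11.13
net_ether_01_1 [ 1] en0 0x410f4e51 0x410f4e53
HB Interval = 1.000 secs. Sensitivity = 10 missed beats
Missed HBs: Total: 977 Current group: 1
"""

for iface, total in missed_heartbeats(sample).items():
    print(iface, total)
```

Comparing these totals across nodes, or across two runs some time apart, might help tell whether one network is consistently worse than the other.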
From what I have read online, the suggested fix is to change the NIM (Network Module) settings in HACMP. I checked the current settings:

Current values for Network Module:

Description                   [Ethernet Protocol]
Parameters                    []
Grace Period                  60
Supports gratuitous arp       [True]
Type                          Address (IP)
Entry type                    [adapter_type]
Next generic type             [transport]
Next generic name             [Generic_UDP]
Supports source routing       True
Failure Cycle                 10
Heartbeat Rate (seconds)      1.00
Failure Detection Rate        (Normal)
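For reference, the HACMP documentation, as far as I recall, estimates the time needed to detect a failure as heartbeat rate × failure cycle × 2. Please treat that formula as an assumption to verify against the documentation for your release. A quick sketch applying it to the values shown above:

```python
def failure_detection_seconds(heartbeat_rate, failure_cycle):
    """Approximate failure detection time in seconds.

    Assumes the rule of thumb: heartbeat rate * failure cycle * 2.
    Verify this against the HACMP documentation for your version.
    """
    return heartbeat_rate * failure_cycle * 2


# Values from the Network Module settings above.
print(failure_detection_seconds(1.00, 10))  # 20.0
```

With Heartbeat Rate 1.00 and Failure Cycle 10 this gives about 20 seconds, while the error log reports the NIM send thread being blocked for 60 seconds. So raising these values would lengthen the detection window but would not by itself explain why the thread was blocked that long.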