工业制造其它Db2集群hadr切换

DB2的HADR切换异常,导致集群状态都是offline?

    DB2主数据库因其网络断开导致TSA进行主备切换,切换过程中备库也出现网络问题,无法将备库切换为主库,且用lssam命令查看,资源均为offline。请各位知悉的大佬帮忙看下如何解。1、网络连接图原主库syslog:Apr 19 10:07:39 hammdbp01 daemon:notice cthats[12255374]: (Re...显示全部

    DB2主数据库因其网络断开导致TSA进行主备切换,切换过程中备库也出现网络问题,无法将备库切换为主库,且用lssam命令查看,资源均为offline。请各位知悉的大佬帮忙看下如何解。
1、网络连接图

原主库syslog:Apr 19 10:07:39 hammdbp01 daemon:notice cthats[12255374]: (Recorded using libct_ffdc.a cv 2):::Error ID: 6zV5DL.fZVLW/ugM/12.V8....................:::Reference ID: :::
Template ID: 173c787f:::Details File:  :::Location: rsct,nim_control.C,1.39.1.43,6717             :::**TS_LOC_DOWN_ST Possible malfunction on local adapter Adapter inter
face name en0 Adapter offset 0 Adapter IP address 10.8.1.3**
原备库syslog:Apr 19 10:07:39 hammdbp11 daemon:notice cthats[14483462]: (Recorded using libct_ffdc.a cv 2):::Error ID: 6zV5DL.fZVLW/BkF/82.V8....................:::Reference ID: :::Template ID: 173c787f:::Details File:  :::Location: rsct,nim_control.C,1.39.1.43,6717
         :::TS_LOC_DOWN_ST Possible malfunction on local adapter Adapter interface name en0 Adapter offset 0 Adapter IP address 10.8.1.10
2、lssam命令查看状态如下:


3、DB2数据库diaglog
原主:
2022-04-19-10.07.46.004311+480 E38169532A649        LEVEL: Error
PID     : 35848234             TID : 22190          PROC : db2sysc 0
INSTANCE: istsm                NODE : 000           DB   : AMSMDB  
HOSTNAME: hasmdbp01
EDUID   : 22190                EDUNAME: db2hadrp.0.1 (AMSMDB) 0
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduAcceptEvent, probe:20200
MESSAGE : Did not receive anything through HADR connection for the duration of
          HADR_TIMEOUT. Closing connection.
DATA #1 : String, 30 bytes
hdrCurrentTime/hdrLastRecvTime
DATA #2 : unsigned integer, 4 bytes
1650334066
DATA #3 : unsigned integer, 4 bytes
1650334055

2022-04-19-10.07.46.004820+480 I38170182A373        LEVEL: Error
PID     : 35848234             TID : 22190          PROC : db2sysc 0
INSTANCE: istsm                NODE : 000           DB   : AMSMDB  
HOSTNAME: hasmdbp01
EDUID   : 22190                EDUNAME: db2hadrp.0.1 (AMSMDB) 0
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduAcceptEvent, probe:20200

2022-04-19-10.15.21.276554+480 I38191820A543        LEVEL: Error
PID     : 19923062             TID : 6684           PROC : db2sysc 0
INSTANCE: istsm                NODE : 000           DB   : AMSMDB
APPHDL  : 0-7                  APPID: 10.132.1.5.53485.220419021519
AUTHID  : SIVIEW               HOSTNAME: hasmdbp01
EDUID   : 6684                 EDUNAME: db2agent (AMSMDB) 0
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrResolveHostNameAndPort, probe:1540
MESSAGE : ZRC=0x810F0032=-2129723342=SQLO_HOST_UNKNOWN "Host unknown"
原备:
PID     : 23920726             TID : 18763          PROC : db2sysc 0
INSTANCE: istsm                NODE : 000           DB   : AMSMDB  
HOSTNAME: hasmdbp11
EDUID   : 18763                EDUNAME: db2hadrs.0.0 (AMSMDB) 0
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduAcceptEvent, probe:20200
MESSAGE : Did not receive anything through HADR connection for the duration of
          HADR_TIMEOUT. Closing connection.
DATA #1 : String, 30 bytes
hdrCurrentTime/hdrLastRecvTime
DATA #2 : unsigned integer, 4 bytes
1650334064
DATA #3 : unsigned integer, 4 bytes
1650334053

2022-04-19-10.07.44.035335+480 I73471990A373        LEVEL: Error
PID     : 23920726             TID : 18763          PROC : db2sysc 0
INSTANCE: istsm                NODE : 000           DB   : AMSMDB  
HOSTNAME: hasmdbp11
EDUID   : 18763                EDUNAME: db2hadrs.0.0 (AMSMDB) 0
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduAcceptEvent, probe:20200

2022-04-19-10.07.55.037739+480 E73474647A649        LEVEL: Error
PID     : 23920726             TID : 18763          PROC : db2sysc 0
INSTANCE: istsm                NODE : 000           DB   : AMSMDB  
HOSTNAME: hasmdbp11
EDUID   : 18763                EDUNAME: db2hadrs.0.0 (AMSMDB) 0
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduAcceptEvent, probe:20200
MESSAGE : Did not receive anything through HADR connection for the duration of
          HADR_TIMEOUT. Closing connection.
DATA #1 : String, 30 bytes
hdrCurrentTime/hdrLastRecvTime
DATA #2 : unsigned integer, 4 bytes
1650334075
DATA #3 : unsigned integer, 4 bytes
1650334064

收起
参与3

返回zhmwang的回答

zhmwangzhmwangPDOceanBase

在进行类似操作时, 最好将TSA disable,网络的情况涉及到TSA  心跳设备的探活。
对于目前的问题,只能重建TSA+HADR啦。

互联网服务 · 2022-05-15
浏览977

回答者

zhmwang
PDOceanBase
擅长领域: 数据库服务器国产数据库

zhmwang 最近回答过的问题

回答状态

  • 发布时间:2022-05-15
  • 关注会员:2 人
  • 回答浏览:977
  • X社区推广