非常感谢,下午我把相关的日志贴上来。
tail -f /var/adm/ras/mmfs.log.latest
Thu Jan 3 13:39:09.812 2019: [D] Leave protocol detail info: LA: 65 LFLG: 17184422 LFLG delta: 65
Thu Jan 3 13:39:09.815 2019: [I] Recovering nodes: 10.20.30.11
Thu Jan 3 13:39:09.817 2019: [I] Recovery: db2instance, delay 4 sec. for safe recovery.
Thu Jan 3 13:39:13.833 2019: [I] Recovered 1 nodes for file system db2data.
db2inst1@db2sr03:~> db2instance -list
ID TYPE STATE HOME_HOST CURRENT_HOST ALERT PARTITION_NUMBER LOGICAL_PORT NETNAME
0 MEMBER STARTED db2sr02 db2sr02 NO 0 0 db2sr02
1 MEMBER STARTED db2sr04 db2sr04 NO 0 0 db2sr04
128 CF ERROR db2sr01 db2sr01 YES - 0 db2sr01
129 CF PEER db2sr03 db2sr03 NO - 0 db2sr03
HOSTNAME STATE INSTANCE_STOPPED ALERT
db2sr03 ACTIVE NO NO
db2sr02 ACTIVE NO NO
db2sr04 ACTIVE NO NO
db2sr01 INACTIVE NO YES
There is currently an alert for a member, CF, or host in the data-sharing instance. For more information on the alert, its impact, and how to clear it, run the following command: 'db2cluster -cm -list -alert'.
db2inst1@db2sr03:~>
db2sr03:/var/ct/db2domain_20181224120019/log/mc/IBM.GblResRM # lssam
Online IBM.ResourceGroup:ca_db2inst1_0-rg Control=MemberInProblemState Nominal=Online
'- Online IBM.Application:ca_db2inst1_0-rs Control=MemberInProblemState
|- Failed offline IBM.Application:ca_db2inst1_0-rs:db2sr01 Node=Offline
'- Online IBM.Application:ca_db2inst1_0-rs:db2sr03
Online IBM.ResourceGroup:db2_db2inst1_0-rg Nominal=Online
'- Online IBM.Application:db2_db2inst1_0-rs
|- Online IBM.Application:db2_db2inst1_0-rs:db2sr02
'- Offline IBM.Application:db2_db2inst1_0-rs:db2sr04
Online IBM.ResourceGroup:db2_db2inst1_1-rg Nominal=Online
'- Online IBM.Application:db2_db2inst1_1-rs
|- Offline IBM.Application:db2_db2inst1_1-rs:db2sr02
'- Online IBM.Application:db2_db2inst1_1-rs:db2sr04
Online IBM.ResourceGroup:db2mnt-db2data-rg Nominal=Online
'- Online IBM.Application:db2mnt-db2data-rs
|- Online IBM.Application:db2mnt-db2data-rs:db2sr02
'- Online IBM.Application:db2mnt-db2data-rs:db2sr04
Online IBM.ResourceGroup:db2mnt-db2instance-rg Control=MemberInProblemState Nominal=Online
'- Online IBM.Application:db2mnt-db2instance-rs Control=MemberInProblemState
|- Failed offline IBM.Application:db2mnt-db2instance-rs:db2sr01 Node=Offline
|- Online IBM.Application:db2mnt-db2instance-rs:db2sr02
|- Online IBM.Application:db2mnt-db2instance-rs:db2sr03
'- Online IBM.Application:db2mnt-db2instance-rs:db2sr04
Online IBM.ResourceGroup:idle_db2inst1_997_db2sr02-rg Nominal=Online
'- Online IBM.Application:idle_db2inst1_997_db2sr02-rs
'- Online IBM.Application:idle_db2inst1_997_db2sr02-rs:db2sr02
Online IBM.ResourceGroup:idle_db2inst1_997_db2sr04-rg Nominal=Online
'- Online IBM.Application:idle_db2inst1_997_db2sr04-rs
'- Online IBM.Application:idle_db2inst1_997_db2sr04-rs:db2sr04
Online IBM.ResourceGroup:idle_db2inst1_998_db2sr02-rg Nominal=Online
'- Online IBM.Application:idle_db2inst1_998_db2sr02-rs
'- Online IBM.Application:idle_db2inst1_998_db2sr02-rs:db2sr02
Online IBM.ResourceGroup:idle_db2inst1_998_db2sr04-rg Nominal=Online
'- Online IBM.Application:idle_db2inst1_998_db2sr04-rs
'- Online IBM.Application:idle_db2inst1_998_db2sr04-rs:db2sr04
Online IBM.ResourceGroup:idle_db2inst1_999_db2sr02-rg Nominal=Online
'- Online IBM.Application:idle_db2inst1_999_db2sr02-rs
'- Online IBM.Application:idle_db2inst1_999_db2sr02-rs:db2sr02
Online IBM.ResourceGroup:idle_db2inst1_999_db2sr04-rg Nominal=Online
'- Online IBM.Application:idle_db2inst1_999_db2sr04-rs
'- Online IBM.Application:idle_db2inst1_999_db2sr04-rs:db2sr04
Pending online IBM.ResourceGroup:primary_db2inst1_900-rg Control=MemberInProblemState Nominal=Online
'- Offline IBM.Application:primary_db2inst1_900-rs Control=MemberInProblemState
** |- Failed offline IBM.Application:primary_db2inst1_900-rs:db2sr01 Node=Offline**
'- Offline IBM.Application:primary_db2inst1_900-rs:db2sr03
Online IBM.Equivalency:ca_db2inst1_0-rg_group-equ
01/03/19 13:21:07.426482 T(4123798384) _GBD Monitor detect OpState change for resource Name=primary_db2inst1_900-rs OldOpState=5 NewOpState=1 Handle=0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff7 0x08efa5f8
01/03/19 13:29:45.748125 T(4104928112) _GBD Monitor detect OpState change for resource Name=cacontrol_db2inst1_129_db2sr03 OldOpState=1 NewOpState=2 Handle=0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff5 0xdb520678
01/03/19 13:29:45.766898 T(4103945072) _GBD Taking application resource offline: Name=primary_db2inst1_900-rs Handle=0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff7 0x08efa5f8
01/03/19 13:29:45.766986 T(4103748464) _GBD Monitor detect OpState change for resource Name=primary_db2inst1_900-rs OldOpState=1 NewOpState=6 Handle=0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff7 0x08efa5f8
01/03/19 13:29:47.062209 T(4123798384) _GBD STOP command for application resource "primary_db2inst1_900-rs" (handle 0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff7 0x08efa5f8) succeeded with exit code 0
01/03/19 13:29:48.032666 T(4104928112) _GBD Monitor detect OpState change for resource Name=primary_db2inst1_900-rs OldOpState=6 NewOpState=2 Handle=0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff7 0x08efa5f8
01/03/19 13:29:48.050650 T(4103945072) _GBD Taking application resource offline: Name=ca_db2inst1_0-rs Handle=0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff6 0x09c38568
01/03/19 13:29:48.050725 T(4103748464) _GBD Monitor detect OpState change for resource Name=ca_db2inst1_0-rs OldOpState=1 NewOpState=6 Handle=0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff6 0x09c38568
01/03/19 13:29:49.473035 T(4123798384) _GBD STOP command for application resource "ca_db2inst1_0-rs" (handle 0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff6 0x09c38568) succeeded with exit code 0
01/03/19 13:29:50.142250 T(4104928112) _GBD Monitor detect OpState change for resource Name=ca_db2inst1_0-rs OldOpState=6 NewOpState=2 Handle=0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff6 0x09c38568
01/03/19 13:31:39.917030 T(4104928112) _GBD Monitor detect OpState change for resource Name=cacontrol_db2inst1_129_db2sr03 OldOpState=2 NewOpState=1 Handle=0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff5 0xdb520678
01/03/19 13:31:40.007051 T(4103945072) _GBD Bringing application resource online: Name=ca_db2inst1_0-rs Handle=0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff6 0x09c38568
01/03/19 13:31:40.007552 T(4103748464) _GBD Monitor detect OpState change for resource Name=ca_db2inst1_0-rs OldOpState=2 NewOpState=5 Handle=0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff6 0x09c38568
01/03/19 13:31:45.062421 T(4123798384) _GBD START command for application resource "ca_db2inst1_0-rs" (handle 0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff6 0x09c38568) succeeded with exit code 0
01/03/19 13:31:45.062549 T(4123798384) _GBD Monitor detect OpState change for resource Name=ca_db2inst1_0-rs OldOpState=5 NewOpState=1 Handle=0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff6 0x09c38568
01/03/19 13:38:12.155458 T(4103748464) _GBD Running cleanup command "/db2home/db2inst1/sqllib/adm/db2rocme 1 PRIMARY db2inst1 900 CLEANUP" for resource 0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff7 0x08efa5f8. Supporter: 0x0000 0x0000 0x00000000 0x00000000 0x00000000 0x00000000.
01/03/19 13:38:12.157403 T(4103748464) _GBD Running cleanup command "/db2home/db2inst1/sqllib/adm/db2rocme 1 CF db2inst1 128 CLEANUP" for resource 0x6028 0xffff 0x6b8042d8 0xa01d9c44 0x15732ff6 0x09c38568. Supporter: 0x6028 0xffff 0xf6b0c7f8 0x181dbce7 0x15732ff3 0xd40ef300.
#####附件是 tsa 详细日志 trace.8.sp.txt
附件:
trace.8.sp.txt (2.06 MB)
trace.3.sp.txt (1.46 MB)