小弟刚在两台P520的机器上配好了HA7.1,也测试完成。功能什么的都正常,但是不知道为什么从开始配置HA到现在。
系统中总会报下面的错误。
# errpt
IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION
E509DBCA 0515081414 P S ConfigRM An internal error was encountered in the
DE84C4DB 0515081414 I O ConfigRM IBM.ConfigRM daemon has started.
CB4A951F 0515081414 I S SRC SOFTWARE PROGRAM ERROR
68FD23E8 0515081414 P S ConfigRM The peer domain configuration manager da
E509DBCA 0515081314 P S ConfigRM An internal error was encountered in the
DE84C4DB 0515081314 I O ConfigRM IBM.ConfigRM daemon has started.
CB4A951F 0515081314 I S SRC SOFTWARE PROGRAM ERROR
68FD23E8 0515081314 P S ConfigRM The peer domain configuration manager da
E509DBCA 0515081314 P S ConfigRM An internal error was encountered in the
DE84C4DB 0515081314 I O ConfigRM IBM.ConfigRM daemon has started.
CB4A951F 0515081314 I S SRC SOFTWARE PROGRAM ERROR
68FD23E8 0515081314 P S ConfigRM The peer domain configuration manager da
E509DBCA 0515081314 P S ConfigRM An internal error was encountered in the
DE84C4DB 0515081314 I O ConfigRM IBM.ConfigRM daemon has started.
CB4A951F 0515081314 I S SRC SOFTWARE PROGRAM ERROR
68FD23E8 0515081314 P S ConfigRM The peer domain configuration manager da
E509DBCA 0515081214 P S ConfigRM An internal error was encountered in the
DE84C4DB 0515081214 I O ConfigRM IBM.ConfigRM daemon has started.
CB4A951F 0515081214 I S SRC SOFTWARE PROGRAM ERROR
68FD23E8 0515081214 P S ConfigRM The peer domain configuration manager da
E509DBCA 0515081214 P S ConfigRM An internal error was encountered in the
DE84C4DB 0515081214 I O ConfigRM IBM.ConfigRM daemon has started.
CB4A951F 0515081214 I S SRC SOFTWARE PROGRAM ERROR
68FD23E8 0515081214 P S ConfigRM The peer domain configuration manager da
E509DBCA 0515081214 P S ConfigRM An internal error was encountered in the
DE84C4DB 0515081214 I O ConfigRM IBM.ConfigRM daemon has started.
CB4A951F 0515081214 I S SRC SOFTWARE PROGRAM ERROR
68FD23E8 0515081214 P S ConfigRM The peer domain configuration manager da
E509DBCA 0515081214 P S ConfigRM An internal error was encountered in the
DE84C4DB 0515081214 I O ConfigRM IBM.ConfigRM daemon has started.
CB4A951F 0515081214 I S SRC SOFTWARE PROGRAM ERROR
LABEL: CONFIGRM_STARTED_ST
IDENTIFIER: DE84C4DB
Date/Time: Thu May 15 08:13:13 BEIDT 2014
Sequence Number: 122697
Machine Id: 00C933BF4C00
Node Id: p520b
Class: O
Type: INFO
WPAR: Global
Resource Name: ConfigRM
Description
IBM.ConfigRM daemon has started.
Probable Causes
The RSCT Configuration Manager daemon (IBM.ConfigRMd) has been started.
User Causes
The RSCT Configuration Manager daemon (IBM.ConfigRMd) has been started.
Recommended Actions
None
Detail Data
DETECTING MODULE
RSCT,IBM.ConfigRMd.C,1.57,347
ERROR ID
REFERENCE CODE
---------------------------------------------------------------------------
LABEL: SRC_RSTRT
IDENTIFIER: CB4A951F
Date/Time: Thu May 15 08:13:13 BEIDT 2014
Sequence Number: 122696
Machine Id: 00C933BF4C00
Node Id: p520b
Class: S
Type: INFO
WPAR: Global
Resource Name: SRC
Description
SOFTWARE PROGRAM ERROR
Probable Causes
APPLICATION PROGRAM
Failure Causes
SOFTWARE PROGRAM
Recommended Actions
VERIFY SUBSYSTEM RESTARTED AUTOMATICALLY
Detail Data
SYMPTOM CODE
0
SOFTWARE ERROR CODE
-9035
ERROR CODE
0
DETECTING MODULE
[url=mailto:]'srchevn.c'@line:'234'[/url]
FAILING MODULE
IBM.ConfigRM
---------------------------------------------------------------------------
LABEL: CONFIGRM_EXIT_ONLIN
IDENTIFIER: 68FD23E8
Date/Time: Thu May 15 08:13:13 BEIDT 2014
Sequence Number: 122695
Machine Id: 00C933BF4C00
Node Id: p520b
Class: S
Type: PERM
WPAR: Global
Resource Name: ConfigRM
Description
The peer domain configuration manager daemon (IBM.ConfigRMd) is exiting due
to encountering an error in the process of making a domain online.
The configuration manager daemon will restart automatically, synchronize
the node configuration with the domain and rejoin the domain if possible.
Probable Causes
A problem exists with the Group Services or Topology Services subsystem.
A problem exists with the System Resource Controller.
Failure Causes
A problem exists with the Group Services or Topology Services subsystem.
A problem exists with the System Resource Controller.
Recommended Actions
No action is necessary since recovery should be automatic.
Detail Data
DETECTING MODULE
RSCT,PeerDomain.C,1.99.22.59,21705
ERROR ID
REFERENCE CODE
---------------------------------------------------------------------------
LABEL: CONFIGRM_ONLINEFAIL
IDENTIFIER: E509DBCA
Date/Time: Thu May 15 08:12:58 BEIDT 2014
Sequence Number: 122694
Machine Id: 00C933BF4C00
Node Id: p520b
Class: S
Type: PERM
WPAR: Global
Resource Name: ConfigRM
Description
An internal error was encountered in the configuration manager daemon (IBM.ConfigRMd).
Probable Causes
Failure in the various reasons. See the detailed error fields
for the specific error.
Failure Causes
Failure in the various reasons. See the detailed error fields
for the specific error.
Recommended Actions
Resolve the problem indicated in the detailed data fields.
Try bringing the node online via the 'startrpnode' or 'startrpdomain' command.
Detail Data
DETECTING MODULE
RSCT,PeerDomain.C,1.99.22.59,13920
ERROR ID
REFERENCE CODE
Error Code
0001 802D
Message Catalog Name
dummy
Message Set
1
Message Identifier
1
Message Inserts
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000
---------------------------------------------------------------------------
LABEL: CONFIGRM_STARTED_ST
IDENTIFIER: DE84C4DB
Date/Time: Thu May 15 08:12:57 BEIDT 2014
Sequence Number: 122693
Machine Id: 00C933BF4C00
Node Id: p520b
Class: O
Type: INFO
WPAR: Global
Resource Name: ConfigRM
Description
IBM.ConfigRM daemon has started.
Probable Causes
The RSCT Configuration Manager daemon (IBM.ConfigRMd) has been started.
User Causes
The RSCT Configuration Manager daemon (IBM.ConfigRMd) has been started.
Recommended Actions
None
Detail Data
DETECTING MODULE
RSCT,IBM.ConfigRMd.C,1.57,347
ERROR ID
REFERENCE CODE
---------------------------------------------------------------------------
LABEL: SRC_RSTRT
IDENTIFIER: CB4A951F
Date/Time: Thu May 15 08:12:57 BEIDT 2014
Sequence Number: 122692
Machine Id: 00C933BF4C00
Node Id: p520b
Class: S
Type: INFO
WPAR: Global
Resource Name: SRC
Description
SOFTWARE PROGRAM ERROR
Probable Causes
APPLICATION PROGRAM
Failure Causes
SOFTWARE PROGRAM
Recommended Actions
VERIFY SUBSYSTEM RESTARTED AUTOMATICALLY
Detail Data
SYMPTOM CODE
0
SOFTWARE ERROR CODE
-9035
ERROR CODE
0
DETECTING MODULE
[url=mailto:]'srchevn.c'@line:'234'[/url]
FAILING MODULE
IBM.ConfigRM
查了下,好像是说有个叫IBM.ConfigRM的子系统总是启不来,然后在循环重启,所以不停的循环报错。我重新删除HA重新配置了,发现这个问题出现在在一个节点初始化集群然后向另外一个节点同步配置后,这个时候资源组资源什么的都没有添加,请问各位大侠,有遇到过的么,有什么解决的方法么。
收起