今天客户跟我说他用errpt检查出了错误,错误信息如下,之前我也没接触过这方面的东西,errpt这个命令我倒是知道,但还没去研究。现在的环境是两台p570装的oracle 10g rac,因为现在系统软件和硬件都运行正常,我知道应当问题不大,但用户问到底是怎么回事,我也一下答不上来,也不知道如何入手。 去年有系统工程师去项目上做过巡检,所以去年的errpt信息就不管了,有些可能是项目上线前或初期报的错,今年好像就是报SOFTWARE PROGRAM ABNORMALLY TERMINATED,从详细信息来看好像和oracle有什么关系,因为有CORE FILE NAME /oracle/products/10.2.0/db_1/afcdb02_afc2/sysman/emd/core PROGRAM NAME 字样,但现在数据库都跑得好好的,也没看哪里受什么影响。明天叫用户把alert日志传过来看能不能找到点问题。 麻烦大家看下下面的信息都是些什么原因造成的,谢谢,最主要是SOFTWARE PROGRAM ABNORMALLY TERMINATED这条。
IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION A924A5FC 0215085911 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED A924A5FC 0203064311 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED F3931284 1208172710 I H ent2 ETHERNET NETWORK RECOVERY MODE EC0BCCD4 1208172610 T H ent2 ETHERNET DOWN CAD234BE 1208164810 U H LVDD QUORUM LOST, VOLUME GROUP CLOSING AB59ABFF 1206152210 U U LIBLVM Remote node Concurrent Volume Group fail AB59ABFF 1206152210 U U LIBLVM Remote node Concurrent Volume Group fail AB59ABFF 1125144710 U U LIBLVM Remote node Concurrent Volume Group fail AB59ABFF 1125144710 U U LIBLVM Remote node Concurrent Volume Group fail F3931284 1120144010 I H ent1 ETHERNET NETWORK RECOVERY MODE EC0BCCD4 1120143910 T H ent1 ETHERNET DOWN F3931284 1120143610 I H ent1 ETHERNET NETWORK RECOVERY MODE EC0BCCD4 1120143510 T H ent1 ETHERNET DOWN F3931284 1120143210 I H ent1 ETHERNET NETWORK RECOVERY MODE EC0BCCD4 1120141210 T H ent1 ETHERNET DOWN F3931284 1120111710 I H ent2 ETHERNET NETWORK RECOVERY MODE F3931284 1120111710 I H ent0 ETHERNET NETWORK RECOVERY MODE EC0BCCD4 1120111010 T H ent2 ETHERNET DOWN EC0BCCD4 1120111010 T H ent0 ETHERNET DOWN F3931284 1120110810 I H ent2 ETHERNET NETWORK RECOVERY MODE F3931284 1120110810 I H ent0 ETHERNET NETWORK RECOVERY MODE EC0BCCD4 1120105710 T H ent2 ETHERNET DOWN EC0BCCD4 1120105710 T H ent0 ETHERNET DOWN EC0BCCD4 1120104710 T H ent0 ETHERNET DOWN F3931284 1120103110 I H ent1 ETHERNET NETWORK RECOVERY MODE EC0BCCD4 1120102810 T H ent1 ETHERNET DOWN AB59ABFF 0813140810 U U LIBLVM Remote node Concurrent Volume Group fail AB59ABFF 0813140810 U U LIBLVM Remote node Concurrent Volume Group fail A29426DA 0810185110 P U topsvcs Local adapter misconfiguration detected 37F3CC40 0810121010 P U RMCdaemon RSCT has detected that system time has m
Adapter interface name en1 Adapter offset 0 Adapter expected IP address 172.168.10.2 --------------------------------------------------------------------------- LABEL: RMCD_2610_120_ER IDENTIFIER: 37F3CC40 Date/Time: Tue Aug 10 12:10:26 GMT+08:00 2010 Sequence Number: 544 Machine Id: 00CBDEF54C00 Node Id: afcdb02 Class: U Type: PEND WPAR: Global Resource Name: RMCdaemon Resource Class: NONE Resource Type: NONE Location: Description RSCT has detected that system time has moved backward. Probable Causes The system time has been set back. User Causes Via system operator or NTP action, the system time has been set backward. Recommended Actions The RSCT components rely on system time to always increase. If the system time has moved backward, RSCT components may hang or present undefined behavior. If PowerHA is installed, refer to the PowerHA documentation. If SAMP is installed, refer to the SAMP documentation. Otherwise, if a Peer Domain is online it should be forced offline via the forcerpoffline command. Once the Peer Domain is offline, or if there is no Peer Domain, run the command rmcctrl -z followed by the command rmcctrl -s. Refer to the RSCT documentation for more information. Detail Data DETECTING MODULE RSCT,rmcd_utils.c,1.69.1.53,6656 ERROR ID 6.lwwr.m2BMA/rNM.0cU08.................... REFERENCE CODE
Current system time obtained by RSCT Tue Aug 10 12:10:26 2010 Last system time saved by RSCT Tue Aug 10 20:07:54 2010