
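IBM 8204-E8A 6100-06-05-1115 HA 6.1: unexpected reboots

I have an IBM 8204-E8A at 6100-06-05-1115 running HA 6.1. It has gone down twice in the past month and I cannot find the cause. Any help would be appreciated.

[root@Jxmail3:/]#errpt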
IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
AFA89905   0210180015 I O grpsvcs        Group Services daemon started
97419D60   0210180015 I O topsvcs        Topology Services daemon started
A6DF45AA   0210175915 I O RMCdaemon      The daemon is started.
D221BD55   0210175915 I O perftune       RESTRICTED TUNABLES MODIFIED AT REBOOT
67145A39   0210175915 U S SYSDUMP        SYSTEM DUMP
F48137AC   0210175815 U O minidump       COMPRESSED MINIMAL DUMP
9D035E4D   0210175815 P S SYSVMM         DATA STORAGE INTERRUPT, PROCESSOR
9DBCFDEE   0210175915 T O errdemon       ERROR LOGGING TURNED ON
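
The TIMESTAMP column in the errpt summary uses the mmddhhmmyy layout, so 0210175815 reads as 10 Feb 2015, 17:58. As a rough aid for reading the listing, here is a minimal Python sketch that decodes the timestamps and sorts the entries oldest first. The helper name `parse_errpt_ts` is hypothetical, the entries are copied from the listing above, and the two-digit year is assumed to fall in the 2000s.

```python
from datetime import datetime

def parse_errpt_ts(ts: str) -> datetime:
    """Decode an errpt summary timestamp (mmddhhmmyy) into a datetime."""
    # strptime's %y maps 00-68 to 2000-2068, which is assumed to be right here.
    return datetime.strptime(ts, "%m%d%H%M%y")

# (identifier, timestamp, resource) copied from the errpt summary above.
entries = [
    ("AFA89905", "0210180015", "grpsvcs"),
    ("97419D60", "0210180015", "topsvcs"),
    ("A6DF45AA", "0210175915", "RMCdaemon"),
    ("D221BD55", "0210175915", "perftune"),
    ("67145A39", "0210175915", "SYSDUMP"),
    ("F48137AC", "0210175815", "minidump"),
    ("9D035E4D", "0210175815", "SYSVMM"),
    ("9DBCFDEE", "0210175915", "errdemon"),
]

# sorted() is stable, so entries logged in the same minute keep their listed order.
timeline = sorted(entries, key=lambda e: parse_errpt_ts(e[1]))

for ident, ts, resource in timeline:
    print(parse_errpt_ts(ts).strftime("%Y-%m-%d %H:%M"), ident, resource)
```

The summary only has minute resolution, so the order of entries within the same minute cannot be recovered from it.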


kdb output
vsopdb1:/arch1/mail3_dump150210/dump#kdb dump unix
dump mapped from @ 700000000000000 to @ 70000023539a5ca
Preserving 1909104 bytes of symbol table [unix]
Component Names:
1)  minidump [2 entries]
2)  dmp_minimal [10 entries]
3)  proc [634 entries]
4)  thrd [7651 entries]
5)  mtrc [49 entries]
6)  lfs [4 entries]
7)  bos [6 entries]
8)  vmm [14 entries]
9)  alloc_kheap [2055 entries]
10)  alloc_other [46 entries]
11)  mst [1 entries]
12)  ras.dump.livedump [3 entries]
13)  ras.errlg [2 entries]
14)  efcdd [1 entries]
15)  efcdd.fcs1 [8 entries]
16)  efcdd.fcs0 [8 entries]
17)  scsidiskdd [3 entries]
18)  scsidiskdd.hdisk0 [3 entries]
19)  scsidiskdd.hdisk0.mpio0.pcm0 [2 entries]
20)  scsidiskdd.hdisk1 [3 entries]
21)  scsidiskdd.hdisk1.mpio1.pcm1 [2 entries]
22)  scsidiskdd.hdisk15 [3 entries]
23)  scsidiskdd.hdisk15.mpio15.pcm15 [5 entries]
24)  scsidiskdd.hdisk17 [3 entries]
25)  scsidiskdd.hdisk17.mpio17.pcm17 [5 entries]
26)  scsidiskdd.hdisk6 [3 entries]
27)  scsidiskdd.hdisk6.mpio6.pcm6 [5 entries]
28)  scsidiskdd.hdisk14 [3 entries]
29)  scsidiskdd.hdisk14.mpio14.pcm14 [5 entries]
30)  scsidiskdd.hdisk7 [3 entries]
31)  scsidiskdd.hdisk7.mpio7.pcm7 [5 entries]
32)  scsidiskdd.hdisk10 [3 entries]
33)  scsidiskdd.hdisk10.mpio10.pcm10 [5 entries]
34)  scsidiskdd.hdisk12 [3 entries]
35)  scsidiskdd.hdisk12.mpio12.pcm12 [5 entries]
36)  scsidiskdd.hdisk4 [3 entries]
37)  scsidiskdd.hdisk4.mpio4.pcm4 [5 entries]
38)  scsidiskdd.hdisk9 [3 entries]
39)  scsidiskdd.hdisk9.mpio9.pcm9 [5 entries]
40)  scsidiskdd.hdisk11 [3 entries]
41)  scsidiskdd.hdisk11.mpio11.pcm11 [5 entries]
42)  scsidiskdd.hdisk16 [3 entries]
43)  scsidiskdd.hdisk16.mpio16.pcm16 [5 entries]
44)  scsidiskdd.hdisk8 [3 entries]
45)  scsidiskdd.hdisk8.mpio8.pcm8 [5 entries]
46)  scsidiskdd.hdisk18 [3 entries]
47)  scsidiskdd.hdisk18.mpio18.pcm18 [5 entries]
48)  scsidiskdd.hdisk13 [3 entries]
49)  scsidiskdd.hdisk13.mpio13.pcm13 [5 entries]
50)  scsidiskdd.hdisk19 [3 entries]
51)  scsidiskdd.hdisk19.mpio19.pcm19 [5 entries]
52)  scsidiskdd.hdisk20 [3 entries]
53)  scsidiskdd.hdisk20.mpio20.pcm20 [5 entries]
54)  scsidiskdd.hdisk22 [3 entries]
55)  scsidiskdd.hdisk22.mpio22.pcm22 [5 entries]
56)  scsidiskdd.hdisk21 [3 entries]
57)  scsidiskdd.hdisk21.mpio21.pcm21 [5 entries]
58)  scsidiskdd.hdisk23 [3 entries]
59)  scsidiskdd.hdisk23.mpio23.pcm23 [5 entries]
60)  scsidiskdd.hdisk24 [3 entries]
61)  scsidiskdd.hdisk24.mpio24.pcm24 [5 entries]
62)  scsidiskdd.hdisk25 [3 entries]
63)  scsidiskdd.hdisk25.mpio25.pcm25 [5 entries]
64)  scsidiskdd.hdisk26 [3 entries]
65)  scsidiskdd.hdisk26.mpio26.pcm26 [5 entries]
66)  scsidiskdd.hdisk29 [3 entries]
67)  scsidiskdd.hdisk29.mpio29.pcm29 [5 entries]
68)  scsidiskdd.hdisk27 [3 entries]
69)  scsidiskdd.hdisk27.mpio27.pcm27 [5 entries]
70)  scsidiskdd.hdisk30 [3 entries]
71)  scsidiskdd.hdisk30.mpio30.pcm30 [5 entries]
72)  scsidiskdd.hdisk31 [3 entries]
73)  scsidiskdd.hdisk31.mpio31.pcm31 [5 entries]
74)  scsidiskdd.hdisk32 [3 entries]
75)  scsidiskdd.hdisk32.mpio32.pcm32 [5 entries]
76)  scsidiskdd.hdisk33 [3 entries]
77)  scsidiskdd.hdisk33.mpio33.pcm33 [5 entries]
78)  scsidiskdd.hdisk28 [3 entries]
79)  scsidiskdd.hdisk28.mpio28.pcm28 [5 entries]
80)  scsidiskdd.hdisk34 [3 entries]
81)  scsidiskdd.hdisk34.mpio34.pcm34 [5 entries]
82)  scsidiskdd.hdisk35 [3 entries]
83)  scsidiskdd.hdisk35.mpio35.pcm35 [5 entries]
84)  scsidiskdd.hdisk37 [3 entries]
85)  scsidiskdd.hdisk37.mpio37.pcm37 [5 entries]
86)  scsidiskdd.hdisk39 [3 entries]
87)  scsidiskdd.hdisk39.mpio39.pcm39 [5 entries]
88)  scsidiskdd.hdisk36 [3 entries]
89)  scsidiskdd.hdisk36.mpio36.pcm36 [5 entries]
90)  scsidiskdd.hdisk38 [3 entries]
91)  scsidiskdd.hdisk38.mpio38.pcm38 [5 entries]
92)  scsidiskdd.hdisk40 [3 entries]
93)  scsidiskdd.hdisk40.mpio40.pcm40 [5 entries]
94)  scsidiskdd.hdisk41 [3 entries]
95)  scsidiskdd.hdisk41.mpio41.pcm41 [5 entries]
96)  scsidiskdd.hdisk5 [3 entries]
97)  scsidiskdd.hdisk5.mpio5.pcm5 [5 entries]
98)  lvm [37 entries]
99)  lvm.rootvg [4 entries]
100)  lvm.rootvg.userdata [0 entries]
101)  lvm.rootvg.metadata.lvs.rootvg_dalv [2 entries]
102)  lvm.rootvg.metadata.pvs.hdisk0 [2 entries]
103)  lvm.rootvg.metadata.pvs.hdisk1 [2 entries]
104)  lvm.rootvg.metadata.lvs.hd5 [3 entries]
105)  lvm.rootvg.userdata.lvs.hd6 [1 entries]
106)  lvm.rootvg.metadata.lvs.hd6 [3 entries]
107)  lvm.rootvg.userdata.lvs.hd8 [1 entries]
108)  lvm.rootvg.metadata.lvs.hd8 [3 entries]
109)  lvm.rootvg.userdata.lvs.hd4 [1 entries]
110)  lvm.rootvg.metadata.lvs.hd4 [3 entries]
111)  lvm.rootvg.userdata.lvs.hd2 [1 entries]
112)  lvm.rootvg.metadata.lvs.hd2 [3 entries]
113)  lvm.rootvg.userdata.lvs.hd9var [1 entries]
114)  lvm.rootvg.metadata.lvs.hd9var [3 entries]
115)  lvm.rootvg.userdata.lvs.hd3 [1 entries]
116)  lvm.rootvg.metadata.lvs.hd3 [3 entries]
117)  lvm.rootvg.userdata.lvs.hd1 [1 entries]
118)  lvm.rootvg.metadata.lvs.hd1 [3 entries]
119)  lvm.rootvg.userdata.lvs.hd10opt [1 entries]
120)  lvm.rootvg.metadata.lvs.hd10opt [3 entries]
121)  lvm.rootvg.userdata.lvs.hd11admin [1 entries]
122)  lvm.rootvg.metadata.lvs.hd11admin [3 entries]
123)  lvm.rootvg.userdata.lvs.lg_dumplv [1 entries]
124)  lvm.rootvg.metadata.lvs.lg_dumplv [2 entries]
125)  lvm.rootvg.userdata.lvs.livedump [1 entries]
126)  lvm.rootvg.metadata.lvs.livedump [3 entries]
127)  lvm.rootvg.userdata.lvs.bnmslv [1 entries]
128)  lvm.rootvg.metadata.lvs.bnmslv [3 entries]
129)  lvm.rootvg.userdata.lvs.paging00 [1 entries]
130)  lvm.rootvg.metadata.lvs.paging00 [3 entries]
131)  efscsidd [1 entries]
132)  efscsidd.fscsi0 [5 entries]
133)  efscsidd.fscsi1 [5 entries]
134)  lvm.vg_maile [4 entries]
135)  lvm.vg_maile.userdata [0 entries]
136)  lvm.vg_maile.metadata.lvs.vg_maile_dalv [2 entries]
137)  lvm.vg_maile.metadata.pvs.hdisk4 [2 entries]
138)  lvm.vg_maile.metadata.pvs.hdisk5 [2 entries]
139)  lvm.vg_maile.metadata.pvs.hdisk6 [2 entries]
140)  lvm.vg_maile.metadata.pvs.hdisk7 [2 entries]
141)  lvm.vg_maile.metadata.pvs.hdisk8 [2 entries]
142)  lvm.vg_maile.userdata.lvs.loglv00 [1 entries]
143)  lvm.vg_maile.metadata.lvs.loglv00 [2 entries]
144)  lvm.vg_maile.userdata.lvs.fslv00 [1 entries]
145)  lvm.vg_maile.metadata.lvs.fslv00 [2 entries]
146)  lvm.vg_mailf [4 entries]
147)  lvm.vg_mailf.userdata [0 entries]
148)  lvm.vg_mailf.metadata.lvs.vg_mailf_dalv [2 entries]
149)  lvm.vg_mailf.metadata.pvs.hdisk9 [2 entries]
150)  lvm.vg_mailf.metadata.pvs.hdisk10 [2 entries]
151)  lvm.vg_mailf.metadata.pvs.hdisk11 [2 entries]
152)  lvm.vg_mailf.metadata.pvs.hdisk12 [2 entries]
153)  lvm.vg_mailf.metadata.pvs.hdisk13 [2 entries]
154)  lvm.vg_mailf.userdata.lvs.loglv01 [1 entries]
155)  lvm.vg_mailf.metadata.lvs.loglv01 [2 entries]
156)  lvm.vg_mailf.userdata.lvs.fslv01 [1 entries]
157)  lvm.vg_mailf.metadata.lvs.fslv01 [2 entries]
158)  ldr [3 entries]
159)  iplcb [1 entries]
160)  ipc [7 entries]
161)  rtastrc [1 entries]
162)  sissas [2 entries]
163)  lvm [2 entries]
164)  jfs2 [1 entries]
165)  tty [4 entries]
166)  netstat [10 entries]
167)  goent_dd [10 entries]
168)  dump_failures [1 entries]
169)  dump_statistics [1 entries]
WARNING: Version mismatch between unix file and command kdb
           START              END
0000000000001000 0000000004070000 start+000FD8
F00000002FF47600 F00000002FFDF948 __ublock+000000
000000002FF22FF4 000000002FF22FF8 environ+000000
000000002FF22FF8 000000002FF22FFC errno+000000
F1000F0A00000000 F1000F0A10000000 pvproc+000000
F1000F0A10000000 F1000F0A18000000 pvthread+000000
Dump analysis on CHRP_SMP_PCI POWER_PC POWER_6 machine with 16 available CPU(s)  (64-bit registers)
Processing symbol table...
.......................done
read vscsi_scsi_ptrs OK, ptr = 0x0
(4)> stat
SYSTEM_CONFIGURATION:
CHRP_SMP_PCI POWER_PC POWER_6 machine with 16 available CPU(s)  (64-bit registers)

SYSTEM STATUS:
sysname... AIX
nodename.. Jxmail3
release... 1
version... 6
build date Apr  6 2011
build time 12:40:25
label..... 1114A_61N
machine... 00CB04E64C00
nid....... CB04E64C
time of crash: Tue Feb 10 17:49:41 2015
age of system: 17 day, 19 hr., 28 min., 40 sec.
xmalloc debug: enabled
FRRs active... 0
FRRs started.. 0

CRASH INFORMATION:
CPU 4 CSA F1000815B012FD00 at time of crash, error code for LEDs: 30000000
pvthread+080200 STACK:
[003074D8]net_kmem_rmlist+0003F8 (F1000E0008319C00, 00000000082ABA00,
   0000000000000009, 0000000000000003 [??])
[003059A8]net_malloc_cpu+000F08 (??, ??, ??, ??)
[00450B08]net_malloc+000028 (??, ??, ??)
[046D34BC]tcp_output+0027DC (??)
[046C683C]tcp_input0+00839C (??, ??, ??, ??)
[046C82C0]tcp_input+000060 (F1000E00087AD600, 0000001400000014)
[0461C034]ipintr_noqueue_post_fw+000954 (F1000A1C0AA30000, F1000E00087AD600,
   F1000A001A55FF40)
[0461D2A0]ipintr_noqueue+000120 (??, ??, ??)
[0461E8BC]in_newstack+000020 ()
[04617390]in_flip_and_run+000070 (??, ??, ??)
[04616728]dogisr+0003C8 (F1000A0019844068, F1000E00087AD600,
   F1000E0008679000)           
[041D2BD8]eth_std_receive+000378 (??, ??, ??)
[041D0BA8]eth_receive+0001A8 (??, ??)
[041AB230]rx_handler+000790 (??, ??)
[041AD8D4]goent_slih+000794 (??)
[0026752C]i_poll_soft+00012C (??)
[00266E40]i_softmod+000620 ()
[00141C8C]flih_util+000258 ()
____ Exception (F00000002FF47600) ____
iar   : 000000000008B984  msr   : 8000000000009032  cr    : 24028824
lr    : 000000000006E084  ctr   : 0000000000000000  xer   : 20000000
mq    : 00000000  asr   : 00000003AF1AD001  amr   : F3FC000000000000  
r0  : 0000000000000000  r1  : 0FFFFFFFF3FFFD70  r2  : 0000000002B655E0
r3  : 0000000000000000  r4  : 0000000000000080  r5  : 0000000000000000
r6  : 0000000000000080  r7  : 0000000000000000  r8  : 0000000000000000
r9  : 0000000000000000  r10 : 0000000000000000  r11 : 0000000000000001
r12 : 0000000000000000  r13 : F1000A0C00220C00  r14 : 00000000DEADBEEF
r15 : 0000000000000000  r16 : F1000A0C003A2580  r17 : 000000002CDF5BDC
r18 : 000000003B9ACA00  r19 : 0000000000000001  r20 : 0000000000934BB8
r21 : 00000000000034C8  r22 : 0000000000000000  r23 : 0000000000000000
r24 : 0000000002BD8C24  r25 : F1000F0A10080278  r26 : 0000000000000000
r27 : 0000000002416100  r28 : 0000000002BD8CB0  r29 : F1000A3E00241000
r30 : 0000000002BD8000  r31 : 0000000000000000  

prev      0000000000000000 stackfix  0000000000000000 int_ticks 0000
cfar      00000000000A53A0
kjmpbuf   0000000000000000 excbranch 0000000000000000 no_pfault 00
intpri    0B               backt     00               flags     00
hw_fru_id 00000001         hw_cpu_id 00000002
fpscr     0000000000000000 fpscrx    00000000         fpowner   00
fpeu      00               fpinfo    00               alloc     F000
o_iar     0000000000000000 o_toc     0000000000000000
o_arg1    0000000000000000 o_vaddr   0000000000000000
krlockp   0000000000000000 rmgrwa    0000000000000000
amrstackhigh  F00000002FFCCFF0 amrstacklow   F00000002FFCC000
amrstackcur   F00000002FFCCFF0 amrstackfix   0000000000000000
kstackhigh    0000000000000000 kstacksize    00000000
frrstart  700DFEED00000000 frrend    700DFEED00000000
frrcur    700DFEED00000000 frrstatic 0000 kjmpfrroff 0000
frrovcnt  0000 frrbarrcnt 0000 frrmask 00 callrmgr 00
Except :
excp_type 00000000  
ex[0] 0000000000000000 ex[1] 0000000000000000
ex[2] 0000000000000000 ex[3] 0000000000000000 ex[4] 0000000000000000
[0008B984].h_cede+000014 ()
[0006E080]waitproc+000780 ()
[0035ABB0]procentry+000010 (??, ??, ??, ??)
[kdb_read_mem] no real storage @ FFFFFFFFFFF65B0
(4)>
(4)>
(4)> cpu 4
current cpu
(4)>
(4)>
(4)> status
CPU     TID  TSLOT     PID  PSLOT  PROC_NAME
  0    20005      2   20004      2  wait
  1   1B0043     27   E0028     14  wait
  2   1C0045     28   F002A     15  wait
  3   1D0047     29  10002C     16  wait
  4    2014B   2050   2012E   1026  wait
  5    3014D   2051   30130   1027  wait
  6    4014F   2052   40132   1028  wait
  7    50151   2053   50134   1029  wait
  8    20253   4098   20236   2050  wait
  9    30255   4099   30238   2051  wait
 10    40257   4100   4023A   2052  wait
 11    50259   4101   5023C   2053  wait
 12    2035B   6146   2033E   3074  wait
 13    3035D   6147   30340   3075  wait
 14    4035F   6148   40342   3076  wait
 15    50361   6149   50344   3077  wait
16-127   Disabled
(4)> cpu 0
(0)> > ; status
invalid command
CPU     TID  TSLOT     PID  PSLOT  PROC_NAME
  0    20005      2   20004      2  wait
  1   1B0043     27   E0028     14  wait
  2   1C0045     28   F002A     15  wait
  3   1D0047     29  10002C     16  wait
  4    2014B   2050   2012E   1026  wait
  5    3014D   2051   30130   1027  wait
  6    4014F   2052   40132   1028  wait
  7    50151   2053   50134   1029  wait
  8    20253   4098   20236   2050  wait
  9    30255   4099   30238   2051  wait
 10    40257   4100   4023A   2052  wait
 11    50259   4101   5023C   2053  wait
 12    2035B   6146   2033E   3074  wait
 13    3035D   6147   30340   3075  wait
 14    4035F   6148   40342   3076  wait
 15    50361   6149   50344   3077  wait
16-127   Disabled
(0)> dr air
air is not a valid register name
Usage: dr ?
(0)> drair
invalid command
(0)>  dr iar
iar   : 000000000008B984
.h_cede+000014        ori    r0,r0,0             <0000000000000000> r0=0
(0)> proc
              SLOT NAME     STATE      PID    PPID          ADSPACE  CL #THS

pvproc+000800    2*wait     ACTIVE 0020004 0000000 0000000830003190   0 0001

NAME....... wait
STATE...... stat  :07  .... xstat :0000
FLAGS...... flag  :00000201 LOAD KPROC
........... flag2 :00000003 64BIT WAITPROC
........... flag3 :00000102 NOSWAP FIXPRI
........... atomic :00000000
........... secflag:0001 ROOT
LINKS...... child      :0000000000000000
........... siblings   :F1000F0A00000400
........... uidinfo    :00000000022AAB60
........... ganchor    :0000000000000000
THREAD..... threadlist :F1000F0A10000200
DISPATCH... synch      :FFFFFFFFFFFFFFFF
AACCT...... projid      :00000000  ........... sprojid     :00000000
........... subproj     :0000000000000000
........... file id     :0000000000000000 0000000000000000 00000000
........... kcid       :00000000
........... flags       :0000
WLM........ class/wlm  :00/0000
........... time of SIGTERM:00000000
........... wlm_nvpages      :0000000000000000  0
........... totalcputime     :000154BD3D0EEA08
........... totalscputime    :000154BD3D0EEA08
........... totaldiskio      :0000000000000000
IDENTIFIER. uid        :00000000  ........... suid       :00000000
........... pid        :00020004  ........... ppid       :00000000
........... sid        :00000000  ........... pgrp       :00000000
MISC...... lock       @ F1000F0A000008F0 0000000000000000
.......... lock_d     @ F1000F0A000009A8 0000000000000000
..... parent_lock     @ F1000F0A000009A0 0000000000000000
..... session_lock    @ F1000F0A00000998 0000000000000000
........... pgrpl      :0000000000000000
........... pgrpb      :0000000000000000  ... ttyl       :0000000000000000
........... ipc        :0000000000000000  ... sigs_queued:0
........... dblist     :0000000000000000  ... dbnext     :0000000000000000
........... eyec       :7076707250524F43  (pvprPROC)
STATISTICS. nframes    :0000000000000020  ... npsblks    :0000000000000000
........... nvpages    :0000000000000020  ... auditmask  :00000000
........... ncpages    :0000000000000000
SCHEDULER.. sched_next :0000000000000000  ... sched_back :0000000000000000
......... usched_lock @ F1000F0A00000910 0000000000000000
........... uschedp    :0000000000000000
........... asyncio    :0000000000000000
CHECKPOINT. crid       :00000000  ........... crid_token :FFFFFFFF
........... cridnext   :0000000000000000  ... chksynch   :FFFFFFFF
........... vpid       :00000000  ........... vppid      :00000000
........... vsid       :00000000  ........... vpgrp      :00000000
PROCFS..... procfsvn   :0000000000000000
NUMA....... rset       :0000000000000000
EWLM....... ewlmproc   :0000000000000000
PROC....... procp      :F1000A3C00250C00  ... size       :00000328
    ....... pri        :FF  ................. policy     :01
BOP........ bop_flags  :0001  .............. monitor_count :0000

FLAGS...... flag  :00000200 KPROC
........... flag2 :00000001 64BIT
........... int   :00000000
........... atomic:00000000
THREAD..... threadcount:00000001  ........... active     :00000001
........... suspended  :00000000  ........... terminating:00000000
........... local      :00000000  ........... wlm        :00000001
........... wlmoc      :00000000
SCHEDULE... nice       :     255  ........... sched_pri  :     255
DISPATCH... pevent     :0000000000000000
IDENTIFIER. pid        :00020004
MISC....... adspace    :0000000830003190
........... adtable    :0000000F8F52D001  ... adspace_ldr:00007FFFFFFFF080
........... eyec       :70726F6350524F43  (procPROC)
........... uprobe     :0000000000000000   ... forktime   :00A9563622985A6C
SIGNAL..... infoq      :0000000000000000
........... pending    :[3] 0000000000000000
........................[2] 0000000000000000
........................[1] 0000000000000000
........................[0] 0000000000000000
........... sigignore  :[3] 0000000000000000
........................[2] 0000000000000000
........................[1] 0000000000000000
........................[0] 0000000000000000
........... sigcatch   :[3] 0000000000000000
........................[2] 0000000000000000
........................[1] 0000000000000000
........................[0] 0000000000000000
........... siginfo    :[3] 0000000000000000
........................[2] 0000000000000000
........................[1] 0000000000000000
........................[0] 0000000000000000
STATISTICS. page size  :0000000000000000  ... minflt     :0000000000000000
........... majflt     :0000000000000000  ... pctcpu     :00000000
....... inputdiskio    :000000003C9CA000
....... inputio ops    :0000000000002309
....... outputdiskio   :0000000282EE1800
....... outputio ops   :00000000000F330D
....... logdiskio      :0000000000000000
....... logio ops      :0000000000000000
SCHEDULER.. repage     :0000000000000000  ... sched_count:00000000
........... cpticks    :0000....  ........... msgcnt     :0000
........... majfltsec  :00000000
........... rs_attinfo :0000000000000000  ........... sradassign :0000
........... rs_rss     :0000000000000000  ........... boundcount :0000
. no. of threads w/rset:               0  ...........     w/srad :   0
CHECKPOINT. chkblock   :00000000  ........... chkfile    :0000000000000000
POSIX RT TIMERS        :Data not present in dump.

CPU-time... clock ticks:08C7833A
........... active     :0000000000000000
PROCFS..... prtrcset   :0000000000000000
PVPROC..... pvprocp    :F1000F0A00000800  ... size       :00000400
(0)>                           
(0)> th
                SLOT NAME     STATE    TID PRI   RQ CPUID  CL  WCHAN

pvthread+000200    2*wait     RUN   020005 0FF    0 00000   0  

PVTHREAD EYECATCHER INVALID
NAME................ wait
FLAGS............... KTHREAD
.................tid :0000000000020005  ......tsleep :FFFFFFFFFFFFFFFF
...............flags :00001000  ..............flags2 :00000000
...........pmcontext :00000000
DATA.........pvprocp :F1000F0A00000800
LINKS.....prevthread :F1000F0A10000200
..........nextthread :F1000F0A10000200
DISPATCH.......synch :FFFFFFFFFFFFFFFF
SCHEDULER...affinity :00000000  .................pri :000000FF
.............boosted :00000000  ...............wchan :0000000000000000
...............state :00000002  ...............wtype :00000000
MISC       ..tv_eyec :0000000000000000 ()
CHECKPOINT......vtid :00000000  .............chkfile :0000000000000000
LOCK........ lock_d @ F1000F0A10000230 0000000000000000
PROCFS......procfsvn :0000000000000000
NUMA............rset :0000000000000000
PROFILING.....prbase :0000000000000000  ....prpinned :0000000000000000
.....prflags :00000000  ..........prbufcount :00000000
WLM........class/wlm :00/0000
.............wlm_tag :
THREAD.......threadp :F1000A0A00220C00  ........size :00000100

FLAGS............... KTHREAD FUNNELLED
.................tid :0000000000020005  ......stackp :0000000000000000
.................scp :0000000000000000  .......ulock :0000000000000000
...............uchan :0000000000000000  ....userdata :0000000000000000
..................cv :0000000000000000  .......flags :0000000000003000
..............atomic :0000000000000000  ......flags2 :0000000000000000
DATA...........procp :F1000A3C00250C00
...........pvthreadp :F1000F0A10000200
...............userp :F00000002FF48000 <__ublock+000A00>
............uthreadp :F00000002FF47600 <__ublock+000000>
SLEEP/LOCK......usid :0000000000000000  ......wchan1 :0000000000000000
..............wchan2 :0000000000000000  ......swchan :0000000000000000
...........eventlist :0000000000000000  ......result :00000000
.............polevel :00000000  ..............pevent :0000000000000000
..............wevent :0000000000000000  .......slist :0000000000000000
...........wchan1sid :0000000000000000  wchan1offset :00000000
...........lockcount :00000000  ..........adsp_flags :0000
DISPATCH.......ticks :00000000  ...............prior :0000000000000000
................next :0000000000000000  ......dispct :0000000013A13A10
...............fpuct :0000000000000000  ...homecount :00000000
............pri_band :00        .........near_dispct :0000000000000000
..........far_dispct :0000000000000000
........allowed_cpus :0-511
.......prefunnel_cpu :00000000  .......dispatch_hist :00
......threadcontrolp :0000000000000000
MISC........graphics :0000000000000000  .ulock_listp :0000000000000000
...........lockowner :0000000000000000  ..kthreadseg :00007FFFFFFFF080
..........time_start :000172C9A987A118  .......credp :0000000000000000
....spurr_time_start :000172C9560B9B80
..........wlm_charge :0  ..........wlm_evtcnt :00000000
............ipc_data :0000000000000000
..............t_eyec :7468726450524F43
............t_waitTm :0000000000000000 (thrdPROC)
...............iopri :00000000
......t_smt_priority :4 NORMAL     

VMM...........t_delw :0000000000000000
SIGNAL........sigproc:00000000  ..............cursig :00000000
......(pending) sig  :[3] 0000000000000000
......................[2] 0000000000000000
......................[1] 0000000000000000
......................[0] 0000000000000000
............sigmask  :[3] 0000000000000000
......................[2] 0000000000000000
......................[1] 0000000000000000
......................[0] 0000000000000000
SCHEDULER......cpuid :00000000  ..............scpuid :00000000
.........affinity_ts :1A11B7B9  ..............policy :00000001
.................cpu :00000000  .............lockpri :00000000
.............wakepri :000000FF  ...........rehome_tb :0000000000000000
.............ceiling :000000FF  ................time :00000000
.............sav_pri :000000FF  ..............t_nice :000000FF
...........run_queue :F1000A3C00231000  ......cpu_tb :00000000
.............home_rq :F1000A3C00225380  .home_sradid :0000
............ldispcpu :0001
......... rs_attinfo :0000000000000000
.............suspend :00000001  .............fsflags :00000000
..........norun_secs :00000000  .......reaffin_count :0000
CHECKPOINT..chkerror :0000      ............chkblock :00000000
TIMERS...clock ticks :08C7833A
PROCFS.......whystop :00000000  ............whatstop :00000000
PVTHREAD...pvthreadp :F1000F0A10000200  ........size :00000100
11 peer answers

北京荣歆咨询 · System Architect, 北京荣歆咨询有限公司
Reply to #11 杨红1989
Do you have any routine VMM monitoring data, for example records from vmstat, topas, nmon, or lsps?
Which applications are running, and what was the reasoning for changing the AIX 6.1 defaults (minperm% = 3, lru_file_repage = 0, maxperm% = 90)? Have you monitored and analyzed how the VMM has behaved since the change?
IT consulting services · 2015-02-12
杨红1989 · System Engineer, 上海天玑
[root@Jxmail3:/etc/tunables]#more lastboot.log               
Restoring schedo values
=======================

Restoring vmo values
====================
Setting minperm% to 10
Setting maxperm% to 20
Warning: a restricted tunable has been modified
Setting maxclient% to 20
Warning: a restricted tunable has been modified
Setting lru_file_repage to 1
Warning: a restricted tunable has been modified

Restoring ioo values
====================
Setting j2_syncPageLimit to 16

Restoring raso values
=====================

Restoring no values
===================
Setting ipignoreredirects to 0
Setting ipsendredirects to 1
Setting ipsrcrouteforward to 1
Setting ipsrcroutesend to 1
Setting poolbuckets to 1
Setting tcp_nodelayack to 1

Restoring nfso values
=====================


[root@Jxmail3:/etc/tunables]#no -a |grep ipignoreredirects
        ipignoreredirects = 0
[root@Jxmail3:/etc/tunables]#no -a |grep ipsendredirects
          ipsendredirects = 1
[root@Jxmail3:/etc/tunables]#no -a |grep ipsrcrouteforward
        ipsrcrouteforward = 1
[root@Jxmail3:/etc/tunables]#no -a |grep ipsrcroutesend
           ipsrcroutesend = 1
[root@Jxmail3:/etc/tunables]#no -a |grep poolbuckets
              poolbuckets = 2
[root@Jxmail3:/etc/tunables]#no -a |grep tcp_nodelayack
           tcp_nodelayack = 1


Only one value differs: poolbuckets = 2 (lastboot.log sets it to 1).
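
The manual comparison above (one `no -a | grep` per tunable) can also be done in one pass. Here is a minimal Python sketch, assuming the two text blocks are pasted in as shown; the function names are hypothetical and the sample data is copied from this post. It is an illustration of the comparison, not a replacement for checking the live system.

```python
import re

# Copied from the "Restoring no values" part of lastboot.log above.
LASTBOOT_NO = """\
Setting ipignoreredirects to 0
Setting ipsendredirects to 1
Setting ipsrcrouteforward to 1
Setting ipsrcroutesend to 1
Setting poolbuckets to 1
Setting tcp_nodelayack to 1
"""

# Copied from the `no -a | grep ...` output above.
LIVE_NO = """\
        ipignoreredirects = 0
          ipsendredirects = 1
        ipsrcrouteforward = 1
           ipsrcroutesend = 1
              poolbuckets = 2
           tcp_nodelayack = 1
"""

def parse_lastboot(text):
    """Return {tunable: value} from 'Setting <name> to <value>' lines."""
    return dict(re.findall(r"^Setting (\S+) to (\S+)$", text, flags=re.M))

def parse_live(text):
    """Return {tunable: value} from '<name> = <value>' lines."""
    return dict(re.findall(r"^\s*(\S+) = (\S+)\s*$", text, flags=re.M))

def diff_tunables(boot, live):
    """List (name, boot_value, live_value) for every tunable whose values differ."""
    return [(n, v, live.get(n)) for n, v in boot.items() if live.get(n) != v]

mismatches = diff_tunables(parse_lastboot(LASTBOOT_NO), parse_live(LIVE_NO))
for name, boot_value, live_value in mismatches:
    print(f"{name}: lastboot.log={boot_value} live={live_value}")
```

Values are compared as strings, which is enough for the integer tunables shown here.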
Hardware manufacturing · 2015-02-12
杨红1989 · System Engineer, 上海天玑
Reply to #8 北京荣歆咨询


    [root@Jxmail3:/]#errpt -aj 9D035E4D
---------------------------------------------------------------------------
LABEL:          DSI_PROC
IDENTIFIER:     9D035E4D

Date/Time:       Tue Feb 10 17:58:17 GMT+08:00 2015
Sequence Number: 999
Machine Id:      00CB04E64C00
Node Id:         Jxmail3
Class:           S
Type:            PERM
WPAR:            Global
Resource Name:   SYSVMM         

Description
DATA STORAGE INTERRUPT, PROCESSOR

Probable Causes
SOFTWARE PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        IF PROBLEM PERSISTS THEN DO THE FOLLOWING
        CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
DATA STORAGE INTERRUPT STATUS REGISTER
0000 0000 4200 0000
SEGMENT REGISTER, SEGREG
0000 7FFF FFFF D080
DATA STORAGE INTERRUPT ADDRESS REGISTER
0000 0000 082A BA08
EXVAL
0000 0000 0000 0115




[root@Jxmail3:/]#errpt -aj D221BD55
---------------------------------------------------------------------------
LABEL:          TUNE_RESTRICTED
IDENTIFIER:     D221BD55

Date/Time:       Tue Feb 10 17:59:20 GMT+08:00 2015
Sequence Number: 1003
Machine Id:      00CB04E64C00
Node Id:         Jxmail3
Class:           O
Type:            INFO
WPAR:            Global
Resource Name:   perftune        

Description
RESTRICTED TUNABLES MODIFIED AT REBOOT

Probable Causes
SYSTEM TUNING

User Causes
TUNABLE PARAMETER OF TYPE RESTRICTED HAS BEEN MODIFIED

        Recommended Actions
        REVIEW TUNABLE LISTS IN DETAILED DATA

Detail Data
LIST OF TUNABLE COMMANDS CONTROLLING MODIFIED RESTRICTED TUNABLES AT REBOOT, SEE FILE /etc/tunables/lastboot.log
vmo
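
One piece of arithmetic may help when reading the DSI_PROC entry next to the kdb stack in the original question: the DATA STORAGE INTERRUPT ADDRESS REGISTER value can be compared with the second argument kdb shows for net_kmem_rmlist. Here is a small Python sketch of that comparison; `reg_to_int` is a hypothetical helper and both values are copied from this thread. Note that kdb warned about a version mismatch between the unix file and the kdb command, so the displayed arguments may not be reliable, and the offset on its own does not identify a cause.

```python
def reg_to_int(text: str) -> int:
    """Convert a space-grouped hex register value from errpt into an int."""
    return int(text.replace(" ", ""), 16)

# DATA STORAGE INTERRUPT ADDRESS REGISTER from the errpt detail above.
dar = reg_to_int("0000 0000 082A BA08")

# Second argument shown for net_kmem_rmlist in the kdb stack trace.
stack_arg = int("00000000082ABA00", 16)

# Distance in bytes between the faulting address and that argument.
offset = dar - stack_arg
print(f"DAR={dar:#x} arg={stack_arg:#x} offset={offset}")
```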
Hardware manufacturing · 2015-02-12
杨红1989 · System Engineer, 上海天玑
Reply to #8 北京荣歆咨询


    [root@Jxmail3:/]#errpt -aj 9D035E4D
---------------------------------------------------------------------------
LABEL:          DSI_PROC
IDENTIFIER:     9D035E4D

Date/Time:       Tue Feb 10 17:58:17 GMT+08:00 2015
Sequence Number: 999
Machine Id:      00CB04E64C00
Node Id:         Jxmail3
Class:           S
Type:            PERM
WPAR:            Global
Resource Name:   SYSVMM         

Description
DATA STORAGE INTERRUPT, PROCESSOR

Probable Causes
SOFTWARE PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        IF PROBLEM PERSISTS THEN DO THE FOLLOWING
        CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
DATA STORAGE INTERRUPT STATUS REGISTER
0000 0000 4200 0000
SEGMENT REGISTER, SEGREG
0000 7FFF FFFF D080
DATA STORAGE INTERRUPT ADDRESS REGISTER
0000 0000 082A BA08
EXVAL
0000 0000 0000 0115
Hardware manufacturing · 2015-02-11
北京荣歆咨询 · System Architect, 北京荣歆咨询有限公司
Everyone troubleshoots differently. I would start by looking at the output of
errpt -aj D221BD55
errpt -aj 9D035E4D
IT consulting services · 2015-02-11
杨红1989 · System Engineer, 上海天玑
Reply to #5 杨红1989


    [root@Jxmail3:/var/hacmp/adm/history]#more cluster.02102015
Feb 10 18:01:13 EVENT START: node_up mail3
Feb 10 18:01:17 EVENT COMPLETED: node_up mail3 0
Feb 10 18:01:20 EVENT START: rg_move_release mail3 1
Feb 10 18:01:20 EVENT START: rg_move mail3 1 RELEASE
Feb 10 18:01:20 EVENT COMPLETED: rg_move mail3 1 RELEASE 0
Feb 10 18:01:20 EVENT COMPLETED: rg_move_release mail3 1 0
Feb 10 18:02:45 EVENT START: rg_move_fence mail3 1
Feb 10 18:02:46 EVENT COMPLETED: rg_move_fence mail3 1 0
Feb 10 18:02:48 EVENT START: rg_move_fence mail3 2
Feb 10 18:02:48 EVENT COMPLETED: rg_move_fence mail3 2 0
Feb 10 18:02:48 EVENT START: rg_move_acquire mail3 2
Feb 10 18:02:48 EVENT START: rg_move mail3 2 ACQUIRE
Feb 10 18:02:49 EVENT START: acquire_service_addr
Feb 10 18:02:51 EVENT START: acquire_aconn_service en2 net_ether_01
Feb 10 18:02:51 EVENT COMPLETED: acquire_aconn_service en2 net_ether_01 0
Feb 10 18:02:53 EVENT START: acquire_aconn_service en0 net_ether_01
Feb 10 18:02:53 EVENT COMPLETED: acquire_aconn_service en0 net_ether_01 0
Feb 10 18:02:53 EVENT COMPLETED: acquire_service_addr 0
Feb 10 18:03:00 EVENT COMPLETED: rg_move mail3 2 ACQUIRE 0
Feb 10 18:03:01 EVENT COMPLETED: rg_move_acquire mail3 2 0
Feb 10 18:03:01 EVENT START: rg_move_complete mail3 2
Feb 10 18:03:02 EVENT START: start_server app_maila
Feb 10 18:03:02 EVENT START: start_server app_mailb
Feb 10 18:03:02 EVENT COMPLETED: start_server app_maila 0
Feb 10 18:03:02 EVENT COMPLETED: start_server app_mailb 0
Feb 10 18:03:03 EVENT COMPLETED: rg_move_complete mail3 2 0
Feb 10 18:03:05 EVENT START: node_up_complete mail3
Feb 10 18:03:05 EVENT COMPLETED: node_up_complete mail3 0
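
To see how long each cluster event took after the reboot, the START and COMPLETED lines can be paired up. Here is a minimal Python sketch using a few lines copied from the history file above. The function name is hypothetical, the year is taken from the file name (cluster.02102015), and the trailing number on each COMPLETED line is assumed to be the event's exit code.

```python
from datetime import datetime

# A few lines copied from cluster.02102015 above.
HISTORY = """\
Feb 10 18:01:13 EVENT START: node_up mail3
Feb 10 18:01:17 EVENT COMPLETED: node_up mail3 0
Feb 10 18:02:48 EVENT START: rg_move mail3 2 ACQUIRE
Feb 10 18:03:00 EVENT COMPLETED: rg_move mail3 2 ACQUIRE 0
Feb 10 18:03:05 EVENT START: node_up_complete mail3
Feb 10 18:03:05 EVENT COMPLETED: node_up_complete mail3 0
"""

def event_durations(text, year=2015):
    """Pair START/COMPLETED lines; return (event, seconds, exit_code) tuples."""
    started = {}
    results = []
    for line in text.splitlines():
        stamp, _, rest = line.partition(" EVENT ")
        if not rest:
            continue  # not an event line
        kind, _, event = rest.partition(": ")
        # %b expects English month names, which holds in the default C locale.
        when = datetime.strptime(f"{year} {stamp}", "%Y %b %d %H:%M:%S")
        if kind == "START":
            started[event] = when
        elif kind == "COMPLETED":
            # COMPLETED lines repeat the event text and append one more token.
            name, _, rc = event.rpartition(" ")
            if name in started:
                delta = when - started.pop(name)
                results.append((name, int(delta.total_seconds()), int(rc)))
    return results

durations = event_durations(HISTORY)
for name, seconds, rc in durations:
    print(f"{name}: {seconds}s rc={rc}")
```

Nested events work as long as each START text is unique while it is open, which is the case in the log shown here.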
Hardware manufacturing · 2015-02-11
杨红1989 · System Engineer, 上海天玑
Reply to #4 北京宝汇德: ASM shows no recent errors.
Hardware manufacturing · 2015-02-11
杨红1989 · System Engineer, 上海天玑
The hacmp.log file is too long to paste; see the attachment.

Attachment: hacmp.log (395.41 KB)

Hardware manufacturing · 2015-02-11
北京宝汇德 · Deputy General Manager / Vice President, 北京宝汇德技术服务有限公司
Please post the HMC logs and the ASMI logs so we can take a look.
System integration · 2015-02-11
zwz99999 · System Engineer, dcits
Please post cluster.log and hacmp.out so we can take a look.
System integration · 2015-02-11

Question status

  • Posted: 2015-02-10
  • Latest answer: 2015-02-12