Author |
Message
|
mayur2378 |
Posted: Tue May 17, 2005 7:19 am Post subject: Problem with the RAC |
|
|
Apprentice
Joined: 26 May 2004 Posts: 47
|
Hey Guys,
We have been having this peculiar problem for a while. We are on AIX5.2, MB 5.0 FP 4.
Whenever we start the RAC on the AIX box that we have our execution group keeps restarting periodically and we keep getting core dumps. So in short havent been able to use the debugger on the box. We have been working closely with the IBM support but so far none of their solutions seemed to have worked.
So my question here is, has anybody on the forum ever faced this problem and if so did they get a viable solution. Looking fwd to the responses
Mayur |
|
Back to top |
|
 |
JT |
Posted: Tue May 17, 2005 5:35 pm Post subject: |
|
|
Padawan
Joined: 27 Mar 2003 Posts: 1564 Location: Hartford, CT.
|
What version of the Agent Controller do you have installed? |
|
Back to top |
|
 |
mayur2378 |
Posted: Wed May 18, 2005 6:30 am Post subject: |
|
|
Apprentice
Joined: 26 May 2004 Posts: 47
|
Hey JT,
Its version of Agent controller is 5.0.2
Mayur |
|
Back to top |
|
 |
JT |
Posted: Wed May 18, 2005 6:51 am Post subject: |
|
|
Padawan
Joined: 27 Mar 2003 Posts: 1564 Location: Hartford, CT.
|
Take a look at this post from earlier this year: http://www.mqseries.net/phpBB2/viewtopic.php?t=20973
Apparently there's a specific version of the v5.0.2 identified as a fix for the AIX Agent Controller. According to the post it can't be downloaded, rather it must be requested from IBM. |
|
Back to top |
|
 |
mayur2378 |
Posted: Wed May 18, 2005 8:00 am Post subject: |
|
|
Apprentice
Joined: 26 May 2004 Posts: 47
|
Thankz JT, will look into it
Mayur |
|
Back to top |
|
 |
chenulu |
Posted: Wed May 18, 2005 9:43 am Post subject: |
|
|
Voyager
Joined: 27 Mar 2002 Posts: 87 Location: Research Triangle Park, NC
|
Hi Mayur,
Can you please post the abend stack?
Regards, Chenulu |
|
Back to top |
|
 |
mayur2378 |
Posted: Wed May 18, 2005 10:02 am Post subject: |
|
|
Apprentice
Joined: 26 May 2004 Posts: 47
|
I am not sure i follow ,
Do you wanna have a look at the core dump that we got?
Mayur |
|
Back to top |
|
 |
chenulu |
Posted: Wed May 18, 2005 10:37 am Post subject: |
|
|
Voyager
Joined: 27 Mar 2002 Posts: 87 Location: Research Triangle Park, NC
|
Not the core dump... but the contents of abend file in /var/mqsi/errors that is generated when you start the EG |
|
Back to top |
|
 |
mayur2378 |
Posted: Wed May 18, 2005 12:27 pm Post subject: |
|
|
Apprentice
Joined: 26 May 2004 Posts: 47
|
Heres the Abend,
I dont have the latest with me right now but this is the one we had sent for the PMR. There were multiple abends. This is just one of those
Mayur
abend record for pid 1744932 tid 2828 time in seconds since 01/01/1970: 1113340134
File: /build/S500_P/src/CommonServices/Unix/ImbAbend.cpp
Line: 458
Function: signal received
---- Inserts ----
11
@(#) 1.33.2.2 CommonServices/Unix/ImbAbend.cpp, CommonServices, S500, S500-CSD03 03/11/25 15:51:20 [4/27/04 20:23:00]
972764592
-----------------
----------------------------- Stack dump for current thread ( 2828)
(0xd7be02b0+0x00000030) readRASTRINGFromBuffer [/usr/lib/libraclco.so]
(0xd7be1524+0x000009cc) ra_readMessageFromBuffer [/usr/lib/libraclco.so]
(0xd7bd9c38+0x0000021c) messageLoop [/usr/lib/librabnd.so]
(0xd7bd956c+0x00000200) PipeServer [/usr/lib/librabnd.so]
(0xd004d33c+0x000000ec) _pthread_body [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0x00000000) <invalid code address>
----------------------------------------------------------------------
------------------------------------------ Stack dumps for all threads
=== Thread 1
(0x00000000) <invalid code address>
=== Thread 258
(0x00000000) <invalid code address>
(0xd00580a4+0x00000058) _event_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0063f38+0x0000043c) _cond_wait_local [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0064794+0x00000074) _cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0065328+0x00000204) pthread_cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd5da8af8+0x000000c8) timerThreadFunction__20ImbGlobalMutexHelperFPv [/usr/opt/mqsi/lib/libCommonServices.a(libCommonServices.a.so)]
(0xd5da8a1c+0x00000048) startTimerThread [/usr/opt/mqsi/lib/libCommonServices.a(libCommonServices.a.so)]
(0xd004d33c+0x000000ec) _pthread_body [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0x00000000) <invalid code address>
=== Thread 515
(0x00000000) <invalid code address>
(0xd01e93e0+0x000000b4) nsleep [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd01ec330+0x0000004c) usleep [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd5c6ef78+0x00000014) imbSleep__Fi [/usr/opt/mqsi/lib/libCommonServices.a(libCommonServices.a.so)]
(0xd7803938+0x00000040) runState__30ImbStatsCollectorState_WaitingFP17ImbStatsCollector [/usr/opt/mqsi/lib/libDataFlowDLL.a(libDataFlowDLL.a.so)]
(0xd77e274c+0x000001cc) runCollectorCycle__17ImbStatsCollectorFP11ImbOsThread [/usr/opt/mqsi/lib/libDataFlowDLL.a(libDataFlowDLL.a.so)]
(0xd77e2650+0x00000048) run__Q2_17ImbStatsCollector10ParametersFP11ImbOsThread [/usr/opt/mqsi/lib/libDataFlowDLL.a(libDataFlowDLL.a.so)]
(0xd5c6b410+0x00000070) run__27ImbThreadPoolThreadFunctionFP11ImbOsThread [/usr/opt/mqsi/lib/libCommonServices.a(libCommonServices.a.so)]
(0xd5c5e9b0+0x00000054) threadRun__11ImbOsThreadFv [/usr/opt/mqsi/lib/libCommonServices.a(libCommonServices.a.so)]
(0xd5c5e554+0x00000064) threadBootStrap__11ImbOsThreadFPv [/usr/opt/mqsi/lib/libCommonServices.a(libCommonServices.a.so)]
(0xd004d33c+0x000000ec) _pthread_body [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0x00000000) <invalid code address>
=== Thread 772
(0x00000000) <invalid code address>
(0xd01e93e0+0x000000b4) nsleep [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd028612c+0x00000034) sleep [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd7955190+0x0000006c) sysSignalWait [/usr/java131/jre/bin/libhpi.a]
(0xd6fc607c+0x0000006c) signalDispatcherThread [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6ecf678+0x00000148) xmExecuteThread [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6eb5788+0x00000034) threadStart [/usr/java131/jre/bin/classic/libjvm.a]
(0xd7947d78+0x00000058) _start [/usr/java131/jre/bin/libhpi.a]
(0xd004d33c+0x000000ec) _pthread_body [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0x00000000) <invalid code address>
=== Thread 1029
(0x00000000) <invalid code address>
(0xd00580a4+0x00000058) _event_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0063f38+0x0000043c) _cond_wait_local [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0064794+0x00000074) _cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0065328+0x00000204) pthread_cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd794d8bc+0x00000050) condvarWait [/usr/java131/jre/bin/libhpi.a]
(0xd794c528+0x000000c4) sysMonitorWait [/usr/java131/jre/bin/libhpi.a]
(0xd6f11bcc+0x000001f4) lkMonitorWait [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6e84358+0x00000140) JVM_MonitorWait [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6eb16c8+0x0000019c) sysInvokeNative [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6e8b938+0x000002a0) mmisInvokeJniMethodHelper [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6eb1d90+0xfffffec4) changeCodes (name probably invalid) [/usr/java131/jre/bin/classic/libjvm.a]
=== Thread 1286
(0x00000000) <invalid code address>
(0xd00580a4+0x00000058) _event_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0063f38+0x0000043c) _cond_wait_local [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0064794+0x00000074) _cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0065328+0x00000204) pthread_cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd794d8bc+0x00000050) condvarWait [/usr/java131/jre/bin/libhpi.a]
(0xd794c528+0x000000c4) sysMonitorWait [/usr/java131/jre/bin/libhpi.a]
(0xd6f11bcc+0x000001f4) lkMonitorWait [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6e84358+0x00000140) JVM_MonitorWait [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6eb16c8+0x0000019c) sysInvokeNative [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6e8b938+0x000002a0) mmisInvokeJniMethodHelper [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6eb1d90+0xfffffec4) changeCodes (name probably invalid) [/usr/java131/jre/bin/classic/libjvm.a]
=== Thread 1543
(0x00000000) <invalid code address>
(0xd00580a4+0x00000058) _event_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0063f38+0x0000043c) _cond_wait_local [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0064794+0x00000074) _cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0065328+0x00000204) pthread_cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd794d8bc+0x00000050) condvarWait [/usr/java131/jre/bin/libhpi.a]
(0xd794c528+0x000000c4) sysMonitorWait [/usr/java131/jre/bin/libhpi.a]
(0xd6f92688+0x0000018c) gcHelper [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6ecf678+0x00000148) xmExecuteThread [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6eb5788+0x00000034) threadStart [/usr/java131/jre/bin/classic/libjvm.a]
(0xd7947d78+0x00000058) _start [/usr/java131/jre/bin/libhpi.a]
(0xd004d33c+0x000000ec) _pthread_body [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0x00000000) <invalid code address>
=== Thread 1800
(0x00000000) <invalid code address>
(0xd00580a4+0x00000058) _event_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0063f38+0x0000043c) _cond_wait_local [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0064794+0x00000074) _cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0065328+0x00000204) pthread_cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd794d8bc+0x00000050) condvarWait [/usr/java131/jre/bin/libhpi.a]
(0xd794c528+0x000000c4) sysMonitorWait [/usr/java131/jre/bin/libhpi.a]
(0xd6f92688+0x0000018c) gcHelper [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6ecf678+0x00000148) xmExecuteThread [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6eb5788+0x00000034) threadStart [/usr/java131/jre/bin/classic/libjvm.a]
(0xd7947d78+0x00000058) _start [/usr/java131/jre/bin/libhpi.a]
(0xd004d33c+0x000000ec) _pthread_body [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0x00000000) <invalid code address>
=== Thread 2057
(0x00000000) <invalid code address>
(0xd00580a4+0x00000058) _event_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0063f38+0x0000043c) _cond_wait_local [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0064794+0x00000074) _cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0065328+0x00000204) pthread_cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd794d8bc+0x00000050) condvarWait [/usr/java131/jre/bin/libhpi.a]
(0xd794c528+0x000000c4) sysMonitorWait [/usr/java131/jre/bin/libhpi.a]
(0xd6f92688+0x0000018c) gcHelper [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6ecf678+0x00000148) xmExecuteThread [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6eb5788+0x00000034) threadStart [/usr/java131/jre/bin/classic/libjvm.a]
(0xd7947d78+0x00000058) _start [/usr/java131/jre/bin/libhpi.a]
(0xd004d33c+0x000000ec) _pthread_body [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0x00000000) <invalid code address>
=== Thread 2314
(0x00000000) <invalid code address>
(0xd00580a4+0x00000058) _event_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0063f38+0x0000043c) _cond_wait_local [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0064794+0x00000074) _cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0065328+0x00000204) pthread_cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd794d8bc+0x00000050) condvarWait [/usr/java131/jre/bin/libhpi.a]
(0xd794c528+0x000000c4) sysMonitorWait [/usr/java131/jre/bin/libhpi.a]
(0xd6f92688+0x0000018c) gcHelper [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6ecf678+0x00000148) xmExecuteThread [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6eb5788+0x00000034) threadStart [/usr/java131/jre/bin/classic/libjvm.a]
(0xd7947d78+0x00000058) _start [/usr/java131/jre/bin/libhpi.a]
(0xd004d33c+0x000000ec) _pthread_body [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0x00000000) <invalid code address>
=== Thread 2571
(0x00000000) <invalid code address>
(0xd00580a4+0x00000058) _event_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0063f38+0x0000043c) _cond_wait_local [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0064794+0x00000074) _cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd0065328+0x00000204) pthread_cond_wait [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0xd794d8bc+0x00000050) condvarWait [/usr/java131/jre/bin/libhpi.a]
(0xd794c528+0x000000c4) sysMonitorWait [/usr/java131/jre/bin/libhpi.a]
(0xd6f92688+0x0000018c) gcHelper [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6ecf678+0x00000148) xmExecuteThread [/usr/java131/jre/bin/classic/libjvm.a]
(0xd6eb5788+0x00000034) threadStart [/usr/java131/jre/bin/classic/libjvm.a]
(0xd7947d78+0x00000058) _start [/usr/java131/jre/bin/libhpi.a]
(0xd004d33c+0x000000ec) _pthread_body [/usr/lib/libpthreads.a(shr_xpg5.o)]
(0x00000000) <invalid code address>
=== Thread 2828
No stack information available |
|
Back to top |
|
 |
chenulu |
Posted: Wed May 18, 2005 1:29 pm Post subject: |
|
|
Voyager
Joined: 27 Mar 2002 Posts: 87 Location: Research Triangle Park, NC
|
Hi Mayur,
What is the maintenance level on AIX? You can get that by issuing oslevel -r command.
Regards, Chenulu |
|
Back to top |
|
 |
mayur2378 |
Posted: Thu May 19, 2005 6:30 am Post subject: |
|
|
Apprentice
Joined: 26 May 2004 Posts: 47
|
hey chenulu,
we are on level 7
Mayur |
|
Back to top |
|
 |
chenulu |
Posted: Thu May 19, 2005 8:48 am Post subject: |
|
|
Voyager
Joined: 27 Mar 2002 Posts: 87 Location: Research Triangle Park, NC
|
Hi Mayur,
Comment out the line MALLOCTYPE=buckets in your mqsistart script (/usr/opt/mqsi/bin directory) and restart your broker.. The problem is that a bug has been introduced with ML7 of AIX and APAR IY67718 addresses this problem.
Commenting out the MALLOCTYPE is a workaround and you can use this while you request a fix from AIX team
Regards, Chenulu |
|
Back to top |
|
 |
mayur2378 |
Posted: Thu May 19, 2005 10:32 am Post subject: |
|
|
Apprentice
Joined: 26 May 2004 Posts: 47
|
Hey Chenulu,
We did that yesterday but still have the same problem. The Support team had recommended that change
Mayur |
|
Back to top |
|
 |
vennela |
Posted: Thu May 19, 2005 10:47 am Post subject: |
|
|
 Jedi Knight
Joined: 11 Aug 2002 Posts: 4055 Location: Hyderabad, India
|
Quote: |
The Support team had recommended that change |
Where do you think chenulu is from?
I would guess he is from the same nest. |
|
Back to top |
|
 |
mayur2378 |
Posted: Thu May 19, 2005 12:44 pm Post subject: |
|
|
Apprentice
Joined: 26 May 2004 Posts: 47
|
well i kinda figured he is from the same nest. Chenulu recommeded some changes but we had already tried those, so wanted to let him know that, those changes didnt help
Mayur |
|
Back to top |
|
 |
|