ASG
IBM
Zystems
Cressida
Icon
Netflexity
 
  MQSeries.net
Search  Search       Tech Exchange      Education      Certifications      Library      Info Center      SupportPacs      LinkedIn  Search  Search                                                                   FAQ  FAQ   Usergroups  Usergroups
 
Register  ::  Log in Log in to check your private messages
 
RSS Feed - WebSphere MQ Support RSS Feed - Message Broker Support

MQSeries.net Forum Index » General IBM MQ Support » the one where the semaphore owner died...

Post new topic  Reply to topic
 the one where the semaphore owner died... « View previous topic :: View next topic » 
Author Message
sebulba_dz
PostPosted: Mon May 08, 2006 1:28 am    Post subject: the one where the semaphore owner died... Reply with quote

Novice

Joined: 02 Jun 2003
Posts: 20
Location: UK

All cylinders were firing as usual when suddenly some channels started to terminate unexpectedly like these:
Code:

----- amqxfdcx.c : 722 --------------------------------------------------------
05/03/06  16:51:59
AMQ9604: Channel 'CLNT_LONDLG01_IM' terminated unexpectedly

EXPLANATION:
The process or thread executing channel 'CLNT_LONDLG01_IM' is no longer
running. The check process system call returned 545284357 for process 25563.
ACTION:
No immediate action is required because the channel entry has been removed from
the list of running channels. Inform the system administrator who should
examine the operating system procedures to determine why the channel process
has terminated.
----- amqxfdcx.c : 688 --------------------------------------------------------
05/03/06  16:51:59
AMQ6184: An internal WebSphere MQ error has occurred on queue manager LONDLG01.

EXPLANATION:
An error has been detected, and the WebSphere MQ error recording routine has
been called. The failing process is process 27051.
ACTION:
Use the standard facilities supplied with your system to record the problem
identifier, and to save the generated output files. Contact your IBM support
center.  Do not discard these files until the problem has been resolved.
----- amqxfdcx.c : 722 --------------------------------------------------------

This wouldn't be a problem if the clients reconnect, but there's always a few that doesn't. There were loads of FDCs generated, most of them had a probe ID RM487001 and error code rrcE_CHANNEL_TERMINATED. But there was one FDC which is alien to me:
Code:

+-----------------------------------------------------------------------------+
|                                                                             |
| WebSphere MQ First Failure Symptom Report                                   |
| =========================================                                   |
|                                                                             |
| Date/Time         :- Wednesday May 03 16:50:29 BST 2006                     |
| Host Name         :- mq1 (SunOS 5.9)                                        |
| PIDS              :- 5724B4103                                              |
| LVLS              :- 530.10  CSD10                                          |
| Product Long Name :- WebSphere MQ for Sun Solaris                           |
| Vendor            :- IBM                                                    |
| Probe Id          :- XC308040                                               |
| Application Name  :- MQM                                                    |
| Component         :- xlsReleaseMutex                                        |
| Build Date        :- May 13 2005                                            |
| CMVC level        :- p530-10-L050504                                        |
| Build Type        :- IKAP - (Production)                                    |
| UserID            :- 00000102 (mqm)                                         |
| Program Name      :- runmqlsr_nd                                            |
| Process           :- 00006553                                               |
| Thread            :- 00000001                                               |
| QueueManager      :- LONDLG01                                               |
| Major Errorcode   :- xecL_W_SEM_OWNER_DIED                                  |
| Minor Errorcode   :- OK                                                     |
| Probe Type        :- INCORROUT                                              |
| Probe Severity    :- 3                                                      |
| Probe Description :- AMQ6125: An internal WebSphere MQ error has occurred.  |
| FDCSequenceNumber :- 0                                                      |
|                                                                             |
+-----------------------------------------------------------------------------+

MQM Function Stack
ccxListener
rppStartResponder
rppChoosePool
xcsReleaseMutexSem
xlsReleaseMutex
xcsFFST


It appears that the above errored first, and those channel_terminates followed after. I didn't have to restart anything on MQ and just one client that had problems reconnecting.

The two hits from google on the probe id is on AS400, so it doesn't really apply to me. Any suggestions?

I'm running 5.3, csd10 on solaris sparc.
Back to top
View user's profile Send private message
Nigelg
PostPosted: Mon May 08, 2006 2:35 am    Post subject: Reply with quote

Grand Master

Joined: 02 Aug 2004
Posts: 1046

The FDC looks like it is a report that another process had diesd unexpectedly, and is informational only.
You should be concerned about the RM487001 FDCs, because that means that a channel pool process (amqrmppa) hads died and taken all the channels it is running with it.
The most obvious cause of this is a user exit crashing the process.
_________________
MQSeries.net helps those who help themselves..
Back to top
View user's profile Send private message
Tibor
PostPosted: Mon May 08, 2006 7:31 am    Post subject: Reply with quote

Grand Master

Joined: 20 May 2001
Posts: 1033
Location: Hungary

sebulba_dz,

You have to search on the ibm websphere mq support site, rather. I found it very useful in generally, but not in this case.

Nigel,

I have already seen similar error (semaphore owner died), even if there is no user exit.
Back to top
View user's profile Send private message
sebulba_dz
PostPosted: Wed May 10, 2006 12:00 am    Post subject: Reply with quote

Novice

Joined: 02 Jun 2003
Posts: 20
Location: UK

Thanks Nigel, now I have something to put the blame on!

Seriouly speaking, we use the BlockIP2 security exit and we had some suspicion that it was the suspect. The version we have on is very very old and has been there for ages. looks like i should get the updated version into prod in a hurry.

Tibor: I did search on the websphere mq support site, but like you said, it wasn't useful in this case. i don't usually use google unless i can't find anything here or in the ibm site
Back to top
View user's profile Send private message
Display posts from previous:   
Post new topic  Reply to topic Page 1 of 1

MQSeries.net Forum Index » General IBM MQ Support » the one where the semaphore owner died...
Jump to:  



You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum
Protected by Anti-Spam ACP
 
 


Theme by Dustin Baccetti
Powered by phpBB © 2001, 2002 phpBB Group

Copyright © MQSeries.net. All rights reserved.