Author |
Message
|
bay hoe san |
Posted: Wed May 09, 2007 12:43 am Post subject: Error in SYSTEM.CLUSTER.COMMAND.QUEUE |
|
|
Centurion
Joined: 27 Nov 2006 Posts: 117
|
Hello,
( 1) Full repository is at the 2 servers. 2 partial qmgrs at z/OS.
( 2) I encountered the following errors in 1 of my z/OS cluster qmgrs:
CSQX037E +MQT9 CSQXREPO Unable to get message from
SYSTEM.CLUSTER.COMMAND.QUEUE, MQCC=2 MQRC=2016
CSQX448E +MQT9 CSQXREPO Repository manager stopping because of errors. Restart in 600 seconds
CSQX449I +MQT9 CSQXREPO Repository manager restarted
(the above error msgs repeated for many times)
CSQX448E +MQT9 CSQXREPO Repository manager stopping because of 977 errors. Restart in 600 seconds
CSQX449I +MQT9 CSQXREPO Repository manager restarted
CSQX036E +MQT9 CSQXREPO Unable to open SYSTEM.CLUSTER.COMMAND.QUEUE, MQCC=2 MQRC=2042
CSQX411I +MQT9 CSQXREPO Repository manager stopped
( 2) Is giving MQRC=2016 and today it gives 2042, what could be the possible cause of error?
( 3) How to rectify? What I did was to recycle the CHIN after 2 hours of Problem determination.
( 4) Pls advice.
( 5) Thank you and have a nice day ahead.
.Hoe San. |
|
Back to top |
|
 |
Vitor |
Posted: Wed May 09, 2007 1:42 am Post subject: |
|
|
 Grand High Poobah
Joined: 11 Nov 2005 Posts: 26093 Location: Texas, USA
|
Looking up 2016 reveals it to be a GET_INHIBITED error. I'd investigate why the command queue was in such a state, what caused it, fix that and then see what is using it (2042 is an OBJECT_IN_USE) if it's still using it when the original problem is resolved.
What led you to believe recycling the CHIN would be an appropriate response to this problem, i.e what came from the 2 hours of problem determination as it's not an obvious leap from a 2016...? _________________ Honesty is the best policy.
Insanity is the best defence. |
|
Back to top |
|
 |
bay hoe san |
Posted: Wed May 09, 2007 1:59 am Post subject: |
|
|
Centurion
Joined: 27 Nov 2006 Posts: 117
|
Hello,
( 1) I am an inexperienced MQ admin.
( 2) I have investigated but couldn't determine the cause, not even a hint and since users are pressing me to resolve it ASAP and the servers guy does not wana recycle the qmgrs too, so the last resort is I recycle the CHIN. After that, everything is back to normal, here comes, since I recycle the problem now lies with host and I have to provide an explanation.
( 3) I have check the definition of SYSTEM.CLUSTER.COMMAND.QUEUE and put & get are enabled. I done a comparsion between the queues in the 2 qmgrs @host z/OS since 1 of them is ok when the user submit a batch job which does a put to 1 of the cluster queues.
( 4) May you or anyone advise me on what could be the possible causes or what should I do next?
( 5) Thanks.
.Hoe San. |
|
Back to top |
|
 |
Vitor |
Posted: Wed May 09, 2007 2:08 am Post subject: |
|
|
 Grand High Poobah
Joined: 11 Nov 2005 Posts: 26093 Location: Texas, USA
|
bay hoe san wrote: |
( 4) May you or anyone advise me on what could be the possible causes or what should I do next?
|
It could be a number of things.....
I would theorise that the queue was get inhibited by the system to prevent some problem but aside from that you've not really posted a lot to go on. What came out of these 2 hours? What did you find and where? _________________ Honesty is the best policy.
Insanity is the best defence. |
|
Back to top |
|
 |
bay hoe san |
Posted: Wed May 09, 2007 8:15 am Post subject: |
|
|
Centurion
Joined: 27 Nov 2006 Posts: 117
|
Hello,
( 1) My 2 hours of PD doesn't seem to yield any result.
( 2) The user is doing a MQPUT via a batch job to 1 of the cluster queue and unsuccessful. I guessed he has tried a couple of times but failed, so he check the started task and discovered those error messages that I have posted and confronted me.
( 3) I searched the website and found some postings on system.cluster.command.queue - PTF/APAR which I checked have applied the superseded, check whether any storage shortage in syslog but found none. Check any MQ related messages but found similar to those I mentioned.
( 4) User insisted is a host issue since the error msgs are at host. Servers' support personnel claimed no change at their end. Cluster-senders/receivers are in running state. In fact, the error msgs occured sometime back but is just that no one using qmgrs at host. The 2 host qmgrs are up since Jan 2007
( 5) Seeing that it already past 2 hours, I decided to recycle the channel and after recycle this particular qmgr, the batch job complete successfully.
( 6) Pls advise. I really have no clue as to what could be the cause.
( 7) Thank you.
.Hoe San. |
|
Back to top |
|
 |
rameshtdp |
Posted: Fri May 11, 2007 5:05 am Post subject: hi |
|
|
 Novice
Joined: 11 May 2007 Posts: 18 Location: India
|
Is your mq problem solved?
If not I can give some solution.
Rgds,
Ramesh |
|
Back to top |
|
 |
bay hoe san |
Posted: Sat May 12, 2007 8:05 am Post subject: |
|
|
Centurion
Joined: 27 Nov 2006 Posts: 117
|
Hello Ramesh,
( 1) Well, I recycled my channel initiator and that solved my problem. However, the user is pressing for root cause analysis and preventive measure.
( 2) I found it rather strange, my the other cluster qmgr @host has the same error messages as the one having trouble but it can still function.
( 3) Yes, pls give me some solutions/hints. Appreciate greatly.
Thank you.
.Hoe San. |
|
Back to top |
|
 |
kevinf2349 |
Posted: Thu May 31, 2007 7:38 am Post subject: |
|
|
 Grand Master
Joined: 28 Feb 2003 Posts: 1311 Location: USA
|
Did you issue a cluster refresh command during that 2 hours of diagnostics? |
|
Back to top |
|
 |
bay hoe san |
Posted: Thu May 31, 2007 4:04 pm Post subject: |
|
|
Centurion
Joined: 27 Nov 2006 Posts: 117
|
Hello,
( 1) Yes, cluster refresh was done at host end. I checked with the SA of the servers, he claimed he has done the refresh too. Cannot be sure what type of refresh he did as I told him to do a refresh repos(yes) as the full repos is at the servers end.
( 2) Pls advise.
Thanks.
.Hoe San. |
|
Back to top |
|
 |
|