|
RSS Feed - WebSphere MQ Support
|
RSS Feed - Message Broker Support
|
 |
|
Installation problem |
« View previous topic :: View next topic » |
Author |
Message
|
Boomn4x4 |
Posted: Wed Jun 13, 2012 10:32 am Post subject: Installation problem |
|
|
Disciple
Joined: 28 Nov 2011 Posts: 172
|
I have written a script to install and configure MQ.
I have one script that creates the mqm user and sets some environment variables.
I have a second script that as root installs MQ via the RPMs. I have a subscript that contains all the commands to create a queue manager, start it, and configure the channels, listeners, queues, etc. The first script executes this script as mqm... (ie. su - mqm -c configure.sh). Control then returns to the original script. This script stops the newly created/started/configured script as user mqm (su - mqm -c "endmqm -w $QMGR"). From there the qmgr is backed up and copied to another server. Finally the script starts the qmgr backup.
All of this worked fine in testing. It goes into production, and fails. The failure occured after the qmgr was created/started/configured on the endmqm. Unfortunalty, due to sloppy scripting on my part, I didn't get the actual return code of the failure of the mqm. I fixed the script to get a better return code, but extra unfortunatly, I ran the script again on another test system and it installed fine from start to finish.
I've tried uninstalling MQ by uninstalling the RPMS and deleting the /var/mqm directories and then reinstalling the script, but I cannot seem to duplicate the problem.
I'm at wits end trying to figure out what could have gone wrong with the endmqm.
Any help would be appreciated. |
|
Back to top |
|
 |
bruce2359 |
Posted: Wed Jun 13, 2012 1:19 pm Post subject: |
|
|
 Poobah
Joined: 05 Jan 2008 Posts: 9469 Location: US: west coast, almost. Otherwise, enroute.
|
You provided no useful information - other than "It didn't work." Like all other supplied control programs, endmqm documentation will tell you what return codes the program issues. It must be one of those non-zero return codes.
Or something else. _________________ I like deadlines. I like to wave as they pass by.
ב''ה
Lex Orandi, Lex Credendi, Lex Vivendi. As we Worship, So we Believe, So we Live. |
|
Back to top |
|
 |
Boomn4x4 |
Posted: Thu Jun 14, 2012 4:25 am Post subject: |
|
|
Disciple
Joined: 28 Nov 2011 Posts: 172
|
bruce2359 wrote: |
You provided no useful information - other than "It didn't work." Like all other supplied control programs, endmqm documentation will tell you what return codes the program issues. It must be one of those non-zero return codes.
Or something else. |
That's because I didn't get any useful information... the return code I got was actually a "0". Which, due to sloppy scripting, was actually the return code of the su... not the endmqm. The endmqm, when it failed, was hardcoded to return 1... not the actual rc of the endmqm call. Now, after I've fixed to poor coding, I have been unable to duplicate the problem to get an actual rc.
Do the mq logs have the return code anywhere? I've tried looking through /var/mqm/qmgrs/errors/AM* but didn't see anything. |
|
Back to top |
|
 |
Boomn4x4 |
Posted: Thu Jun 14, 2012 4:51 am Post subject: |
|
|
Disciple
Joined: 28 Nov 2011 Posts: 172
|
Is it possible that it is a timing issue? That the QMGR failed to end because it wasn't fully started?
I'm looking my script log and the failure happened at 23:46:23
Looking at the MQ error logs:
At 23:46:22 - the listener started
At 23:46:23 - the cluster sender channel started
At 23:46:34 - the cluster receiver channel started.
....
At 23:46:45 - the last of the channels started
If the QMGR was in the process of connecting to a cluster, is it plausible that this would cause and endmqm to fail? |
|
Back to top |
|
 |
exerk |
Posted: Thu Jun 14, 2012 4:55 am Post subject: |
|
|
 Jedi Council
Joined: 02 Nov 2006 Posts: 6339
|
Boomn4x4 wrote: |
Is it possible that it is a timing issue? That the QMGR failed to end because it wasn't fully started?
I'm looking my script log and the failure happened at 23:46:23
Looking at the MQ error logs:
At 23:46:22 - the listener started
At 23:46:23 - the cluster sender channel started
At 23:46:34 - the cluster receiver channel started.
....
At 23:46:45 - the last of the channels started
If the QMGR was in the process of connecting to a cluster, is it plausible that this would cause and endmqm to fail? |
Unlikely, and it defies logic to expect the endmqm command to wait for channels to be in a certain condition before it runs successfully. _________________ It's puzzling, I don't think I've ever seen anything quite like this before...and it's hard to soar like an eagle when you're surrounded by turkeys. |
|
Back to top |
|
 |
bruce2359 |
Posted: Thu Jun 14, 2012 6:02 am Post subject: |
|
|
 Poobah
Joined: 05 Jan 2008 Posts: 9469 Location: US: west coast, almost. Otherwise, enroute.
|
exerk wrote: |
Unlikely, and it defies logic to expect the endmqm command to wait for channels to be in a certain condition before it runs successfully. |
This would be easy to test outside your script.
Run the strmqm command; wait a few seconds, then issue the endmqm command. _________________ I like deadlines. I like to wave as they pass by.
ב''ה
Lex Orandi, Lex Credendi, Lex Vivendi. As we Worship, So we Believe, So we Live. |
|
Back to top |
|
 |
|
|
 |
|
Page 1 of 1 |
|
You cannot post new topics in this forum You cannot reply to topics in this forum You cannot edit your posts in this forum You cannot delete your posts in this forum You cannot vote in polls in this forum
|
|
|
|