|
RSS Feed - WebSphere MQ Support
|
RSS Feed - Message Broker Support
|
 |
|
z/OS and Unix's Channel Communication |
« View previous topic :: View next topic » |
Author |
Message
|
suffolk |
Posted: Tue Apr 01, 2008 8:19 pm Post subject: z/OS and Unix's Channel Communication |
|
|
Novice
Joined: 03 Mar 2008 Posts: 24
|
Hi All,
Recently, we met with an issue, it starts like this, we were running HA on Unix's side and a few days ago, the disk was full and it caused a log corruption. We tried restarting the QM but to no avail and went to replace amqalchk.fil by creating a temp QM and copied the new amqalchk.fil into the existing QM and it started up.
Everything works fine except that the sender channel between z/OS and receiving channel on Unix works on a intermittent basis, sometimes it's up and sometimes it's down and other times just take a long time on retrying. We reset the affected channels on both ends but to no avail too.
We saw this in the z/OS's log:
15.26.08 STC04534 +CSQX517E +CSQP CSQXSUPR Error in SYSTEM.CHANNEL.SYNCQ - channel CH.CSQP.CPM.C01 repeated
20.22.19 STC04534 +CSQX206E +CSQP CSQXRCTL Error sending data,
channel CH.PCPMMQ1.CPM.C02,
connection 10.20.1.245,
TRPTYPE=TCP RC=0000008C
The affected channel in question is CH.PCPMMQ1.CPM.C02, rest of the logs just repeatedly writes:
07.51.30 STC04534 +CSQX500I +CSQP CSQXRCTL Channel CH.PCPMMQ1.CPM.C02 started
07.51.30 STC04534 +CSQX524E +CSQP CSQXRCTL Remote queue manager unavailable for CH.PCPMMQ1.CPM.C02
07.51.30 STC04534 +CSQX599E +CSQP CSQXRCTL Channel CH.PCPMMQ1.CPM.C02 ended abnormally |
|
Back to top |
|
 |
vennela |
Posted: Tue Apr 01, 2008 8:37 pm Post subject: |
|
|
 Jedi Knight
Joined: 11 Aug 2002 Posts: 4055 Location: Hyderabad, India
|
What is happenning on the UNIX side
What does the error logs say on UNIX side
How often is the Queue Manager failing over to the other box? |
|
Back to top |
|
 |
suffolk |
Posted: Tue Apr 01, 2008 8:41 pm Post subject: |
|
|
Novice
Joined: 03 Mar 2008 Posts: 24
|
Unix's log is reporting this:
AMQ6119: An internal WebSphere MQ error has occurred (No such file or
directory(2): stat: /MQHA/QM.PCPMMQ1.CPM/data/qmgrs/QM!PCPMMQ1!CPM/@ipcc/isem:)
EXPLANATION:
MQ detected an unexpected error when calling the operating system. The MQ error
recording routine has been called.
ACTION:
Use the standard facilities supplied with your system to record the problem
identifier, and to save the generated output files. Contact your IBM support
center. Do not discard these files until the problem has been resolved.
03/31/08 00:38:57
AMQ9228: The TCP/IP responder program could not be started.
EXPLANATION:
An attempt was made to start an instance of the responder program, but the
program was rejected.
ACTION:
The failure could be because either the subsystem has not been started (in this
case you should start the subsystem), or there are too many programs waiting
(in this case you should try to start the responder program later). The reason
code was 0.
Other queues and channels on both z/OS and Unix are working fine and that really puzzled us. Failing over to the other node is very remote and once we intentionally tried to failover and surprising the affected channel(sdr and rcv) started but only lasted a while and went to retrying mode again. |
|
Back to top |
|
 |
mvic |
Posted: Wed Apr 02, 2008 3:08 am Post subject: Re: z/OS and Unix's Channel Communication |
|
|
 Jedi
Joined: 09 Mar 2004 Posts: 2080
|
suffolk wrote: |
replace amqalchk.fil by creating a temp QM and copied the new amqalchk.fil into the existing QM and it started up. |
This is an odd procedure. Did IBM recommend it?
Quote: |
Everything works fine except that the sender channel between z/OS and receiving channel on Unix works on a intermittent basis |
Sounds as if the steps taken to retrieve the situation (ie. the above described steps) may not have given you a complete solution.
If IBM Support were going to work on this they would need a -t all -t detail trace, and might even say that the procedure using amqalchk.fil was not a supported operation (not 100pc sure though).
IMHO I'd say raise a PMR if you don't get it fixed. |
|
Back to top |
|
 |
suffolk |
Posted: Wed Apr 02, 2008 8:35 am Post subject: |
|
|
Novice
Joined: 03 Mar 2008 Posts: 24
|
It was a tech note given by IBM and we tried to remove a node from the HA today and it worked, seems like configurations for the HA is not done correctly. Will be monitoring it for the next few days. |
|
Back to top |
|
 |
|
|
 |
|
Page 1 of 1 |
|
You cannot post new topics in this forum You cannot reply to topics in this forum You cannot edit your posts in this forum You cannot delete your posts in this forum You cannot vote in polls in this forum
|
|
|
|