MQSeries.net :: View topic - MQ xmit channel failure from hard NW break...

hguapluas · Posted: Fri Aug 26, 2005 1:55 pm Post subject:

Hi all,

Just wondering if anybody else has run into a situation similar to below. Have run into this a few times now and pattern for restoration seems to be same in each case.

In a large multi-network environment running through two or more firewalls when there is a firewall shutdown or other hard failure in network connectivity, when connections are restored the xmit channels on one or both ends will not reconnect and resume traffic flow. What is usually required is manual intervention on one or both sides, usually on the side where messages are getting queued up. MQ guy has to bounce the xmit channel several times and usually send a fresh (new) message and then channel status goes active and traffic flow resumes. Just doing a simple channel start on one or both ends does not resume traffic flow. It always seems to take a series of channel starts on the sender channel to resume traffic flow.

In all cases, there is an associated complete network outage tied in with this between the sender/receiver. Other channels on either end that are on network paths that were not affected by the outage continue as normal. It is only the channels that were impacted by the hard network break.

MQ vers on Windows side is 5.3 CSD's range from 06-09. On mainframe side, they are using zOS and not sure what version of MQ but it is an older version. In almost all cases of this, it was an issue of Mainframe to Windows connectivity. I do not think this has happened on any Windows-to-Windows MQ connections which would have all been at v5.3.

Some of the outages were caused by hardware failure. Others were caused by decision to shutdown firewall connections suddenly to prevent spread of recent worm attack from propagating between networks. The worm itself did not impact MQ traffic.

Curious minds want to know your experiences and what you've done to remedy the situation. Is there a more elegant way to restore service short of manually bouncing the sender channel multiple times?!?

Thanks.