Author |
Message
|
pettitma |
Posted: Tue Mar 03, 2009 4:40 am Post subject: Connectivity Issue.. |
|
|
Novice
Joined: 26 Apr 2005 Posts: 18
|
Hi,
Hoping you can help with a rather frustrating connectivity issue we're having.
A new qmgr has been developed but is not connecting to the remote qmgr.
When the Sdr channel is started it does not connect, just goes into a BINDING state then Retrying. That behaviour makes sense as I can not perform an MQPING. However, I can perform a telnet to the remote port successfully.
Details of the channel/AMQ logs below:
SDR Channel:
CHANNEL(MARS1) CHLTYPE(SDR)
ALTDATE(2009-03-03) ALTTIME(11.57.20)
BATCHHB(0) BATCHINT(0)
BATCHSZ(50) COMPHDR(NONE)
COMPMSG(NONE) CONNAME(nnn.nnn.nnn.nnn(1417)) <-ip removed
CONVERT(YES) DESCR(Sender Channel)
DISCINT(3600) HBINT(240)
KAINT(AUTO) LOCLADDR( )
LONGRTY(999999999) LONGTMR(1200)
MAXMSGL(4194304) MCANAME( )
MCATYPE(PROCESS) MCAUSER(mqm)
MODENAME( ) MONCHL(QMGR)
MSGDATA( ) MSGEXIT( )
NPMSPEED(FAST) PASSWORD( )
RCVDATA( ) RCVEXIT( )
SCYDATA( ) SCYEXIT( )
SENDDATA( ) SENDEXIT( )
SEQWRAP(999999999) SHORTRTY(10)
SHORTTMR(60) SSLCIPH( )
SSLPEER( ) STATCHL(QMGR)
TPNAME( ) TRPTYPE(TCP)
USERID( ) XMITQ(BGQD.SMS1)
RCV Channel on the remote side:
CHANNEL(MARS1) CHLTYPE(RCVR)
ALTDATE(2009-02-12) ALTTIME(14.20.14)
BATCHSZ(50) COMPHDR(NONE)
COMPMSG(NONE) DESCR( )
HBINT(240) KAINT(AUTO)
MAXMSGL(1048576) MCAUSER(mqm)
MONCHL(QMGR) MRDATA( )
MREXIT( ) MRRTY(10)
MRTMR(1000) MSGDATA( )
MSGEXIT( ) NPMSPEED(FAST)
PUTAUT(DEF) RCVDATA( )
RCVEXIT( ) SCYDATA( )
SCYEXIT( ) SENDDATA( )
SENDEXIT( ) SEQWRAP(999999999)
SSLCAUTH(REQUIRED) SSLCIPH( )
SSLPEER( ) STATCHL(QMGR)
TRPTYPE(TCP)
Output of MQPING:
ping channel (MARS1)
7 : ping channel (MARS1)
AMQ9208: Error on receive from host 172 (nnn.nnn.nnn.nnn) <-ip removed
UNIX telnet (with ip masked) - telnet works on the remote side also back to the sender:
mqm@bcap105> telnet nnn.nnn.nnn.nnn 1417
Trying nnn.nnn.nnn.nnn...
Connected to nnn.nnn.nnn.nnn.
Escape character is '^]'.
AMQ error log on Sender side:
03/03/2009 12:25:04 - Process(18612.1) User(mqm) Program(runmqchl_nd)
AMQ9002: Channel 'MARS1' is starting.
EXPLANATION:
Channel 'MARS1' is starting.
ACTION:
None.
-------------------------------------------------------------------------------
03/03/2009 12:25:16 - Process(18612.1) User(mqm) Program(runmqchl_nd)
AMQ9208: Error on receive from host 172 (nnn.nnn.nnn.nnn).
EXPLANATION:
An error occurred receiving data from 172 (nnn.nnn.nnn.nnn) over TCP/IP. This may
be due to a communications failure.
ACTION:
The return code from the TCP/IP (read) call was 131 (X'83'). Record these
values and tell the systems administrator.
----- amqccita.c : 3070 -------------------------------------------------------
03/03/2009 12:25:16 - Process(18612.1) User(mqm) Program(runmqchl_nd)
AMQ9999: Channel program ended abnormally.
EXPLANATION:
Channel program 'MARS1' ended abnormally.
ACTION:
Look at previous error messages for channel program 'MARS1' in the error
files to determine the cause of the failure.
----- amqrccca.c : 777 --------------------------------------------------------
03/03/2009 12:34:52 - Process(23511.1) User(mqm) Program(amqrcmla)
AMQ9208: Error on receive from host 172 (nnn.nnn.nnn.nnn).
EXPLANATION:
An error occurred receiving data from 172 (nnn.nnn.nnn.nnn) over TCP/IP. This may
be due to a communications failure.
ACTION:
The return code from the TCP/IP (read) call was 131 (X'83'). Record these
values and tell the systems administrator.
----- amqccita.c : 3070 -------------------------------------------------------
Any idea why I can telnet to the remote side but can't MQ ping/connect? |
|
Back to top |
|
 |
Vitor |
Posted: Tue Mar 03, 2009 4:46 am Post subject: Re: Connectivity Issue.. |
|
|
 Grand High Poobah
Joined: 11 Nov 2005 Posts: 26093 Location: Texas, USA
|
pettitma wrote: |
Any idea why I can telnet to the remote side but can't MQ ping/connect? |
Can you telnet back the other way? _________________ Honesty is the best policy.
Insanity is the best defence. |
|
Back to top |
|
 |
mqjeff |
Posted: Tue Mar 03, 2009 5:03 am Post subject: |
|
|
Grand Master
Joined: 25 Jun 2008 Posts: 17447
|
pettitma wrote: |
- telnet works on the remote side also back to the sender: |
Are there channel errors on the remote side? If the channel is stuck in BINDING, then you won't be able to do anything, and one side or the other should be telling you why. |
|
Back to top |
|
 |
pettitma |
Posted: Tue Mar 03, 2009 6:04 am Post subject: |
|
|
Novice
Joined: 26 Apr 2005 Posts: 18
|
Sorry, I should've put more info from the remote side perspective. The behaviour on the remote side appears identical to the sender side.
I can telnet from the remote side back yes. This is the confusing aspect, if I couldn't telnet then simple, network related - but as I can (and I'm definitely telnet'ing to the ip in the sdr channel with the correct port) - then I'm a bit confused as to what the problem could be.
Within the AMQ error log on the remote side it is simiar to the sender side, the below occurs when I try and start the remote side's sender channel. The log in the remote side contains no information when I try and start the sender channel on the sender side.
03/03/09 14:33:25 - Process(269.1) User(mqm) Program(runmqchl_nd)
AMQ9002: Channel 'JUPITER1' is starting.
EXPLANATION:
Channel 'JUPITER1' is starting.
ACTION:
None.
-------------------------------------------------------------------------------
03/03/09 14:37:10 - Process(269.1) User(mqm) Program(runmqchl_nd)
AMQ9209: Connection to host '172 (nnn.nnn.nnn.nnn)' closed.
EXPLANATION:
An error occurred receiving data from '172 (nnn.nnn.nnn.nnn)' over TCP/IP. The
connection to the remote host has unexpectedly terminated.
ACTION:
Tell the systems administrator.
----- amqccita.c : 3094 -------------------------------------------------------
03/03/09 14:37:10 - Process(269.1) User(mqm) Program(runmqchl_nd)
AMQ9999: Channel program ended abnormally.
EXPLANATION:
Channel program 'JUPITER1' ended abnormally.
ACTION:
Look at previous error messages for channel program 'JUPITER1' in
the error files to determine the cause of the failure.
----- amqrccca.c : 777 --------------------------------------------------------
So, both sides are reporting TCP errors in their respective logs, yet I can telnet to the remote side (from either end).
What I have just noticed is this though:
An error occurred receiving data from '172 (172.29.21.146)' - not sure why the connection id would be '172'.. |
|
Back to top |
|
 |
gunter |
Posted: Tue Mar 03, 2009 7:02 am Post subject: |
|
|
Partisan
Joined: 21 Jan 2004 Posts: 307 Location: Germany, Frankfurt
|
The output is ok.
The error is "Connection reset by peer". Mostly it is a network problem, maybe a firewall or to less resources. _________________ Gunter Jeschawitz
IBM Certified System Administrator - Websphere MQ, 5.3 |
|
Back to top |
|
 |
pettitma |
Posted: Tue Mar 03, 2009 7:09 am Post subject: |
|
|
Novice
Joined: 26 Apr 2005 Posts: 18
|
Thanks for replying Gunter.
If there was a network issue wouldn't that surely prevent the telnet test from failing? |
|
Back to top |
|
 |
kevinf2349 |
Posted: Tue Mar 03, 2009 9:04 am Post subject: |
|
|
 Grand Master
Joined: 28 Feb 2003 Posts: 1311 Location: USA
|
Did you look up the TCPIP return code? |
|
Back to top |
|
 |
fjb_saper |
Posted: Tue Mar 03, 2009 8:51 pm Post subject: |
|
|
 Grand High Poobah
Joined: 18 Nov 2003 Posts: 20756 Location: LI,NY
|
Reread the posts guys.
We may just have a channel mismatch here.
on one qmgr the channel name is MARS1 on the other the channel name is JUPITER1. Those will NEVER connect to each other.
Connecting channel pairs SENDER/RECEIVER MUST have the same name! (including case). intercommunication guide and read as well the conventions for channel names. Makes everything easier to understand.
Have fun  _________________ MQ & Broker admin |
|
Back to top |
|
 |
exerk |
Posted: Tue Mar 03, 2009 11:48 pm Post subject: |
|
|
 Jedi Council
Joined: 02 Nov 2006 Posts: 6339
|
Post re-read, and saw this (original post):
Quote: |
SDR Channel:
CHANNEL(MARS1) CHLTYPE(SDR)...
RCVR Channel on the remote side:
CHANNEL(MARS1) CHLTYPE(RCVR)... |
Then in a later post it states:
Quote: |
...the below occurs when I try and start the remote side's sender channel. The log in the remote side contains no information when I try and start the sender channel on the sender side.
03/03/09 14:33:25 - Process(269.1) User(mqm) Program(runmqchl_nd)
AMQ9002: Channel 'JUPITER1' is starting.
EXPLANATION:
Channel 'JUPITER1' is starting.
ACTION:
None.
-------------------------------------------------------------------------------
etc. |
I would expect the logs to state that no instance of the channel exists at the remote end, rather than show what has been posted  _________________ It's puzzling, I don't think I've ever seen anything quite like this before...and it's hard to soar like an eagle when you're surrounded by turkeys. |
|
Back to top |
|
 |
queuemanager |
Posted: Thu Mar 05, 2009 11:57 pm Post subject: |
|
|
Apprentice
Joined: 28 Nov 2006 Posts: 43 Location: Bangalore
|
Which platform are your queuemanagers on??? |
|
Back to top |
|
 |
|