MQSeries.net :: View topic

jason_e · Posted: Sat Sep 13, 2003 3:03 pm Post subject:

Hi,
My WebSphere MQ 5.3 box has been running fine for a few days until for
some reason I was unable to receive any messages.

I need to determine why this happened and find out how to prevent it
from happening again. The event logs and FFSR reports is not of much
use since I don't really know what they are trying to tell me.

Below is a copy of my event logs and extracts from my FFST reports,
I have quite a few FFST reports but it seem to be the same type of
prodlem over and over again (xllLongLockRequest & cciTcpReceive).

What can I do to troubleshoot there errors?

The "Transactions rolled back to release log space." is very
concerning since AFAIK my applications shouldn't be causing
that to happen. How can I determine what transactions are
causing the problems?

Regards,
Jason

===========
NT EVENT LOGS
===========
Program cannot update queue manager object

The attempt to update object '%CHLBATCH.4'
on queue manager 'ZEPELTRA' failed with reason code 2003.

-----------------

Channel program ended abnormally.

Channel program 'SEPEL.ZEPELTRA' ended abnormally.

Look at previous error messages for channel
program 'SEPEL.ZEPELTRA' in the error files
to determine the cause of the failure.

------------------

Transactions rolled back to release log space.

The log space for the queue manager is becoming full. One or
more long-running transactions have been rolled back to release
log space so that the queue manager can continue to process requests.

Try to ensure that the duration of your transactions is not
excessive. Consider increasing the size of the log to
allow transactions to last longer before the log starts
to become full.

------------------
Error on receive from host xxx.xxx.xxx.xxx

An error occurred receiving data from xxx.xxx.xxx.xxx over TCP/IP. This
may be due to a communications failure.

The return code from the TCP/IP (recv)
call was 10054 (X'2746'). Record these values
and tell the systems administrator.

===========
FFSR REPORTS
===========

| WebSphere MQ First Failure Symptom Report |
| ========================================= | Date/Time :- |
| Host Name :- Windows 2000 Build 2195: Service Pack 4 |
| PIDS :- - |
| LVLS :- - |
| Product Long Name :- WebSphere MQ for Windows |
| Vendor :- IBM |
| Probe Id :- - |
| Application Name :- MQM |
| Component :- xllLongLockRequest |
| Build Date :- Oct 12 2002 |
| CMVC level :- p000-L021011 |
| Build Type :- IKAP - (Production) |
| UserID :- MUSR_MQADMIN |
| Process Name :- C:\Program Files\IBM\WebSphere MQ\bin\amqzlaa0.exe |
| Process :- 00001584 |
| Thread :- 00000002 |
| QueueManager :- ZEPELTRA |
| Major Errorcode :- STOP |
| Minor Errorcode :- OK |
| Probe Type :- HALT6109 |
| Probe Severity :- 1 |
| Probe Description :- AMQ6109: An internal WebSphere MQ error has occurred. |
| FDCSequenceNumber :- 0 |
| |
+-----------------------------------------------------------------------------+

+-----------------------------------------------------------------------------+
| |
| WebSphere MQ First Failure Symptom Report |
| ========================================= |
| |
| Date/Time :- |
| Host Name :- (Windows 2000 Build 2195: Service Pack 4) |
| PIDS :- |
| LVLS :- |
| Product Long Name :- WebSphere MQ for Windows |
| Vendor :- IBM |
| Probe Id :- |
| Application Name :- MQM |
| Component :- cciTcpReceive |
| Build Date :- Oct 12 2002 |
| CMVC level :- p000-L021011 |
| Build Type :- IKAP - (Production) |
| UserID :- MUSR_MQADMIN |
| Process Name :- C:\Program Files\IBM\WebSphere MQ\bin\AMQRMPPA.EXE |
| Process :- 00001628 |
| Thread :- 00000015 |
| Major Errorcode :- rrcE_BAD_DATA_RECEIVED |
| Minor Errorcode :- OK |
| Probe Type :- MSGAMQ9207 |
| Probe Severity :- 2 |
| Probe Description :- AMQ9207: The data received from host 'XXX.XXX.XXX.XXX' |
| is not valid. |
| FDCSequenceNumber :- 0 |
| Comment1 :- XXX.XXX.XXX.XXX |
| |
| Comment2 :- TCP/IP |
| |
| Comment3 :- |
| |
| |
+-----------------------------------------------------------------------------+