MQSeries.net :: View topic - excessive sd + preferred way to restart DataFlowEngine

jcv · Posted: Thu Apr 24, 2008 5:36 am Post subject:

Hello!

$ uname
AIX
$ ulimit -a|grep desc
nofiles(descriptors) 2000
$ mqsiservice
BIPv600 hr HR
ucnv Console CCSID 912 dft ucnv CCSID 912
ICUW ibm-912_P100-1995 ICUA ibm-912_P100-1995

BIP8071I: Successful command completion.

lsof command reports excessive socket descriptor list on DataFlowEngine process running message flows which issue http requests through http request node. In fact, the limit of 2000 is reached:

$ lsof|grep DataFlow|grep 335944|wc -l
2001
$

Entries look like this:

DataFlowE 335944 mqm 1999u IPv4 0xf1000d0002d1f288 0t0 TCP host1:*->host2:XXXX

netstat does not show network connections between those two hosts in any state. Is this some known issue? We are going to apply some patches there... Is there possible error in message flow?
It's hard for me to trace now under what circumstances this leak happens, but obviously it's not every time http request node processes request, because in that case limit would be reached after several minutes.

Besides that, when I restarted message flows by using toolkit commands (corresponding to mqsistopmsgflow and mqsistartmsgflow), I have somehow resolved part of the problem, being able to process further messages in that execution group, but the process DataFlowEnging itself is not restarted by that command, hence no descriptors were freed. I know all processes would be restarted if I stop the whole broker, and I don't want to do that. What's left? To kill the process on os level to free descriptors? Am I missing some mqsi command?