MQSeries.net :: View topic

jonasb · Posted: Thu Jun 11, 2009 2:45 am Post subject:

Hi

When dealing with EDIFACT messages, one file (an interchange) can contain
several messages. Our idea is to have general MessageSets that can parse any EDIFACT interchange, so that we can then split out the messages and process them one by one.

A typical interchange can look like this (simplified).
(This interchange contains two messages.)

UNA:+.? '
UNB+UNOA:1+FROM+TO+090610:0920+SOMEREF'
UNH+MSGREF+1:2+68+1:2'
BGM+MSGREF+1:2+68+1:2'
UNT+3+MSGREF'
UNH+MSGREF2+1:2+68+1:2'
BGM+MSGREF2+1:2+68+1:2'
UNT+3+MSGREF2'
UNZ+1+SOMEREF'

Each line is called a segment. Every segment starts with a three-letter identifier (e.g. UNA) and ends with a "'".
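To make the segment rule concrete outside the broker, here is a minimal Python sketch that splits an interchange into segments. The function name `split_segments` is hypothetical. It assumes that an optional leading UNA segment is exactly nine characters long and that its seventh and ninth characters give the release character and the segment terminator, as in the sample above; without UNA it falls back to "?" and "'".

```python
def split_segments(interchange):
    """Split an EDIFACT interchange into segments (terminators removed)."""
    text = interchange.strip()
    terminator, release = "'", "?"   # defaults when no UNA is present
    segments = []

    if text.startswith("UNA"):
        una = text[:9]               # UNA is fixed length: tag + 6 service characters
        release, terminator = una[6], una[8]
        segments.append(una[:8])     # keep UNA without its terminator
        text = text[9:]

    current = []
    escaped = False
    for ch in text:
        if escaped:
            # Character after the release character is data, even a terminator.
            current.append(ch)
            escaped = False
        elif ch == release:
            current.append(ch)
            escaped = True
        elif ch == terminator:
            segment = "".join(current).lstrip()  # drop line breaks between segments
            if segment:
                segments.append(segment)
            current = []
        else:
            current.append(ch)

    tail = "".join(current).strip()
    if tail:
        segments.append(tail)        # unterminated trailing data, kept as-is
    return segments
```

The release character handling matters because a "'" preceded by "?" is data, not the end of a segment.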

We would like to be able to parse this type of interchange into a structure like this one:

INTERCHANGE
    UNA(0,1)
    UNB(1,1)
    MESSAGE(0,-1)
        UNH(1,1)
        "ANY"(0,-1) <ANY SEGMENT EXCEPT UNT/UNZ>
        UNT(1,1)
    UNZ(1,1)
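For reference, the intended grouping can be sketched in plain Python. This is not a MessageSet definition, only an illustration of the target structure; `group_interchange` and the dictionary keys are hypothetical names, and the input is assumed to be a list of segment strings with the terminators already removed.

```python
def group_interchange(segments):
    """Group segments into INTERCHANGE -> MESSAGE(UNH, body, UNT)."""
    result = {"UNA": None, "UNB": None, "messages": [], "UNZ": None}
    current = None  # the message currently being collected, if any

    for segment in segments:
        tag = segment[:3]
        if tag in ("UNA", "UNB", "UNZ"):
            if current is not None:
                raise ValueError(f"{tag} found inside an open message")
            result[tag] = segment
        elif tag == "UNH":
            if current is not None:
                raise ValueError("UNH found before the previous UNT")
            current = {"UNH": segment, "body": [], "UNT": None}
        elif tag == "UNT":
            if current is None:
                raise ValueError("UNT found without a preceding UNH")
            current["UNT"] = segment
            result["messages"].append(current)
            current = None
        else:
            # Any other tag is a body segment and must sit inside a message.
            if current is None:
                raise ValueError(f"{tag} found outside a message")
            current["body"].append(segment)

    if current is not None:
        raise ValueError("last message has no UNT")
    return result
```

The point of the sketch is that the "ANY" group only has to stop at UNT; UNH and UNZ are recognised by their position, not by listing every other tag.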

We have not been able to find anything that matches every type of
segment except UNT/UNZ. In our attempts, the "ANY" segment also consumes UNT, UNZ and UNH, resulting in something like this:

INTERCHANGE
    UNA = UNA:+.? '
    UNB = UNB+UNOA:1+FROM+TO+090610:0920+SOMEREF'
    MESSAGE
        UNH = UNH+MSGREF+1:2+68+1:2'
        "ANY" = BGM+MSGREF+1:2+68+1:2'
        "ANY" = UNT+3+MSGREF'
        "ANY" = UNH+MSGREF2+1:2+68+1:2'
        "ANY" = BGM+MSGREF2+1:2+68+1:2'
        "ANY" = UNT+3+MSGREF2'
        "ANY" = UNZ+1+SOMEREF'

One way of doing it is of course to name all 26^3 possible tags (excluding UNT and UNZ), but that does not seem like a good solution.
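What we are after is a single pattern meaning "any three-letter tag except the envelope tags" instead of an enumeration. In Python's regular expression syntax that can be written with a negative lookahead, as in the sketch below. Whether the message set's own pattern syntax accepts a lookahead is not something this sketch assumes; `is_body_segment` is a hypothetical helper, and UNH is excluded as well because it starts the next message.

```python
import re

# Three uppercase letters at the start of the segment, but not UNH, UNT or UNZ.
BODY_SEGMENT = re.compile(r"(?!UNH|UNT|UNZ)[A-Z]{3}")


def is_body_segment(segment):
    """Return True if the segment tag is anything except UNH, UNT or UNZ."""
    return BODY_SEGMENT.match(segment) is not None
```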

If anyone has any pointers they would be much appreciated.

Kind Regards,
Jonas