EDI/Edifact Plugin: Unterschied zwischen den Versionen
| Cg (Diskussion | Beiträge) | Cg (Diskussion | Beiträge)  | ||
| (74 dazwischenliegende Versionen von 2 Benutzern werden nicht angezeigt) | |||
| Zeile 1: | Zeile 1: | ||
| == Edi/Edifact Plugin == | |||
| The Edifact Plugin provides extensive support for EDIFACT message processing and adds a hierarchical structure editor to the GUIBrowser user interface. There, EDIFACT messages can be interactively edited and action sequences constructed to operate on it. | |||
| == Edi/Edifact Message Plugin == | |||
| The Edifact Message plugin provides extensive support for Edifact message processing. | |||
| === Background Information === | === Background Information === | ||
| EDIFACT is an international standard for the exchange of business transaction messages, such as invoices, orders, quotes etc. | |||
| EDIFACT transactions are heavily used in B2B communications, and supported by SAP, Oracle, IBM and other business processing systems. | |||
| EDIFACT messages consist of a number of records (called "segments"), which themselves contain a number of data elements. | |||
| Both non-composite and composite data elements are possible. | Both non-composite and composite data elements are possible. | ||
| EDIFACT messages are surrounded by a header- and trailer segment. | |||
| An  | An EDIFACT transaction may contain multiple individual messages, and is itself surrounded by a transaction header segment and a transaction trailer segment. | ||
| For comprehensive documentation and a full specification, see [http://www.unece.org United Nations Economic  | For comprehensive documentation and a full specification, see [http://www.unece.org United Nations Economic Commission for Europe] in general, and [http://www.unece.org/cefact/edifact/welcome.html Introducing UN/EDIFACT] in particular. | ||
| and [http://www.unece.org/cefact/edifact/welcome.html Introducing UN/Edifact] in particular. | |||
| ===  | === EDIFACT Messages === | ||
| UN/ | UN/EDIFACT defines the overall format and layout of EDIFACT messages. Standard specifications are typically published twice a year. | ||
| New revisions typically add additional data elements, additional segment types and messages. | New revisions typically add additional data elements, additional segment types and messages. | ||
| New revisions are typically backward compatible. That  | New revisions are typically backward compatible. That means that previous definitions are seldom (if ever) obsoleted. | ||
| All messages and segments  | All messages and segments contain a tag (or message type) field, which allows for the type of message/segment to be identified. | ||
| Thus, in theory, it is possible to extract field values from arbitrary messages, even if the concrete message layout is unknown. | Thus, in theory, it is possible to extract field values from arbitrary messages, even if the concrete message layout is unknown. | ||
| Zeile 28: | Zeile 26: | ||
| For example, a DTM-segment's datetime value can be extracted easily, but its interpretation (delivery date, invoice date, quotation date etc.) is | For example, a DTM-segment's datetime value can be extracted easily, but its interpretation (delivery date, invoice date, quotation date etc.) is | ||
| only possible if the overall message structure is known. | only possible if the overall message structure is known. | ||
| === Message Subsets === | === Message Subsets === | ||
| The definitions as published by UN/Unice define the overall structure of messages and segments. | The definitions as published by UN/Unice define the overall structure of messages and segments. | ||
| Zeile 34: | Zeile 32: | ||
| Many segments are specified as optional and not supported/expected by many concrete users. | Many segments are specified as optional and not supported/expected by many concrete users. | ||
| For this typical B2B transaction systems expect and handle only subsets of the  | For this, typical B2B transaction systems expect and handle only subsets of the overall message set and subsets of the possible segment set within those messages. | ||
| === Meta Descriptions === | === Meta Descriptions === | ||
| In order for a correct semantic interpretation, modification and verification of data elements, a meta description is required, | In order for a correct semantic interpretation, modification and verification of data elements, a meta description is required, | ||
| which specifies the format of a message (segment order and grouping) and of data  | which specifies the format of a message (segment order and grouping) and of data elements (alphanumeric, numeric, minimum/maximum size and value etc.). | ||
| Such specifications are published by UN/Unice for the overall framework, and by companies for the B2B partners. | Such specifications are published by UN/Unice for the overall framework, and by companies for the B2B partners. | ||
| Specifications exist in both formal (machine readable) formats and in non-formal (human readable) formats. | Specifications exist in both formal (machine readable) formats and in non-formal (human readable) formats. | ||
| Formal specification formats are SEF (Structured  | Formal specification formats are SEF (Structured EDIFACT Format Specification) and XSD (XML Schema definitions). | ||
| Non-formal specification are PDF, HTML and Text formats. | Non-formal specification are PDF, HTML and Text formats. | ||
| The biggest challenges in dealing with  | The biggest challenges in dealing with EDIFACT is the trouble to get formal (machine readable) specifications. | ||
| SEF format files provide all required information and would be perfect for this task - however, SEF seems to be obsolete and SEF specifications for | SEF format files provide all required information and would be perfect for this task - however, SEF seems to be obsolete and SEF specifications for | ||
| newer versions are hard to find (which is a pity). SEF files are not available from UN/Unice. | newer versions are hard to find (which is a pity). SEF files are not available from UN/Unice. | ||
| XSD seems to be the way to go, but official XSD  | XSD seems to be the way to go, but official XSD specifications are rare and not published by UN. | ||
| DFDL is a new XML-based definition standard, which is very promising in theory. However, it is relatively new. | DFDL is a new XML-based definition standard, which is very promising in theory. However, it is relatively new. | ||
| Only a small subset of  | Only a small subset of EDIFACT messages is formally specified in this format. | ||
| Official documents from UN/Unice include PDF, HTML and Text files. | Official documents from UN/Unice include PDF, HTML and Text files. | ||
| Although being informal, existing  | Although being informal, existing HTML and text files seem to follow a common layout and structure. | ||
| == EDIFACT MessageBrowser Plugin == | |||
| === Edifact Plugin Components === | |||
| This plugin adds a structured EDIFACT message inspector and editor to expecco. | |||
| === Edifact Smalltalk Class Library === | |||
| It provides an interface very similar to the GUIBrowser's interface, showing the hierarchical structure of EDIFACT messages. It also provides a UI to interactively compose action sequences to analyze, modify and test an EDIFACT message. | |||
| This library contains all real functionality of the edifact plugin.  | |||
| == EDIFACT Plugin Components == | |||
| === EDIFACT Smalltalk Class Library === | |||
| This library contains all real functionality of the EDIFACT plugin. | |||
| The expecco action library (edifact.ets) contains interfacing blocks to the functions found there. | The expecco action library (edifact.ets) contains interfacing blocks to the functions found there. | ||
| ====  | ==== EDIFACT Message Decoder ==== | ||
| Includes a general message/segment/data decoder, which is able to read arbitrary (unknown) messages. | Includes a general message/segment/data decoder, which is able to read arbitrary (unknown) messages. | ||
| Such textual messages (wire data) is decoded and presented as a hierarchy of objects, which model the message, segments and elements. | Such textual messages (wire data) is decoded and presented as a hierarchy of objects, which model the message, segments and elements. | ||
| The decoder can decode arbitrary messages - i.e. a meta description is not required for basic decoding. | The decoder can decode arbitrary messages - i.e. a meta description is not required for basic decoding. | ||
| ====  | ==== EDIFACT Message Encoder ==== | ||
| This encodes a message object back into the external  | This encodes a message object back into the external EDIFACT wire representation. | ||
| It cares for the character set, separator definitions etc. | It cares for the character set, separator definitions etc. | ||
| In general, the original input is regenerated (with minor differences) when the a decoded  | In general, the original input is regenerated (with minor differences) when the a decoded EDIFACT message is encoded back to wire data. | ||
| ====  | ==== EDIFACT Object Representation Framework ==== | ||
| These classes model  | These classes model EDIFACT objects as messages, segments, composite- and non-composite data elements. | ||
| Without a meta  | Without a meta description of the message set, data elements can by accessed by numeric indices (segment#, element#). | ||
| With a meta description, elements are  | With a meta description, elements are accessible by more user friendly names, such as the official UN standard field names (C009, C123, 4056 etc), or user assigned application field names (deliveryDate, deliveryLocation, street, city etc.) | ||
| or user assigned application field names (delivaryDate, deliveryLocation, street, city etc.) | |||
| ====  | ==== EDIFACT Schema Definition Framework ==== | ||
| This set of classes represent a meta description of  | This set of classes represent a meta description of EDIFACT objects. They can be hand-written, dynamically constructed or read in from external | ||
| specification files in various formats. | specification files in various formats. | ||
| ====  | ==== EDIFACT Schema Definition Readers ==== | ||
| These classes read external meta specifications in various formats and create a schema object. | These classes read external meta specifications in various formats and create a schema object. | ||
| Schema objects are needed to correctly interpret element data, to  | Schema objects are needed to correctly interpret element data, to access elements by user friendly names, and to create new messages which | ||
| have the correct segment and segment group structure. | have the correct segment and segment group structure. | ||
| Readers exist for the following formats: | Readers exist for the following formats: | ||
| Zeile 99: | Zeile 101: | ||
| ==== Smalltalk Library API ==== | ==== Smalltalk Library API ==== | ||
| The EDIFACT class library contains functions to be called from elementary code (and also code which is called by the blocks contained in the EDIFACT block library, which is described below). | |||
| Included are parsers for various formats (regular EDIFACT and XML-EDIFACT), generators for those, parsers for meta-definitions and data objects to represent EDIFACT messages (segments, groups, fields etc.) | |||
| ===== Decoding a Message ===== | ===== Decoding a Message ===== | ||
| given the following  | given the following EDIFACT message in wire data form: | ||
|  UNA:+.? ' | |||
| <CODE><PRE> | |||
|  UNB+UNOC:3+Senderkennung+Empfaengerkennung+060620:0931+1++1234567' | |||
| UNA:+.? ' | |||
|  UNH+1+ORDERS:D:96A:UN' | |||
| UNB+UNOC:3+Senderkennung+Empfaengerkennung+060620:0931+1++1234567' | |||
|  BGM+220+B10001' | |||
| UNH+1+ORDERS:D:96A:UN' | |||
|  DTM+4:20060620:102' | |||
| BGM+220+B10001' | |||
|  NAD+BY+++Bestellername+Strasse+Stadt++23436+xx' | |||
| DTM+4:20060620:102' | |||
|  LIN+1++Produkt Schrauben:SA' | |||
| NAD+BY+++Bestellername+Strasse+Stadt++23436+xx' | |||
|  QTY+1:1000' | |||
| LIN+1++Produkt Schrauben:SA' | |||
|  UNS+S' | |||
| QTY+1:1000' | |||
|  CNT+2:1' | |||
|  UNT+9+1' | |||
|  UNZ+1+1234567' | |||
| UNT+9+1' | |||
| UNZ+1+1234567' | |||
| </PRE></CODE> | |||
| assuming, the above message (as String) is held in the "msg" variable, | assuming, the above message (as String) is held in the "msg" variable, | ||
| it can be decoded with: | it can be decoded with: | ||
| <CODE><PRE> | |||
|     document := EdifactDecoder decode:msg. |     document := EdifactDecoder decode:msg. | ||
| </PRE></CODE> | |||
| This returns a structured (hierarchical) message object in "document", which consists of segments and data fields. | |||
| ===== Encoding a Message ===== | |||
| for the opposite direction, a number of EdifactCoder classes exist, which take a decoded or constructed EDIFACT message and generate an external representation from it: | |||
|     EdifactCoder encode: document | |||
| generates a standard EDIFACT representation (the above). | |||
| For XML output, various different formats are supported, which use different tags for the elements. | |||
| <br>For example, | |||
|     EdifactXMLCoder encode: document | |||
| generates the following XML representation: | |||
|   <?xml version="1.0" encoding="UTF-8"?> | |||
|   <edifact> | |||
|     <UNB> | |||
|         <element> | |||
|             <value>UNOC</value> | |||
|             <value>3</value> | |||
|         </element> | |||
|         <element> | |||
|             <value>Senderkennung</value> | |||
|         </element> | |||
|         <element> | |||
|             <value>Empfaengerkennung</value> | |||
|         </element> | |||
|         <element> | |||
|             <value>060620</value> | |||
|             <value>0931</value> | |||
|         </element> | |||
|         ... | |||
|     </UNB> | |||
|     ... | |||
|     <CNT> | |||
|         <element> | |||
|             <value>2</value> | |||
|             <value>1</value> | |||
|         </element> | |||
|     </CNT> | |||
|     <UNT> | |||
|         <element> | |||
|             <value>9</value> | |||
|         </element> | |||
|         <element> | |||
|             <value>1</value> | |||
|         </element> | |||
|     </UNT> | |||
|     <UNZ> | |||
|         <element> | |||
|             <value>1</value> | |||
|         </element> | |||
|         <element> | |||
|             <value>1234567</value> | |||
|         </element> | |||
|     </UNZ> | |||
|   </edifact> | |||
| whereas: | |||
|     EdifactXMLEdifactCoder encode: document | |||
| would generate: | |||
|   <?xml version="1.0" encoding="UTF-8"?> | |||
|   <EDIFACT> | |||
|     <S_UNB> | |||
|         <C_S001> | |||
|             <D_0001>UNOC</D_0001> | |||
|             <D_0002>3</D_0002> | |||
|         </C_S001> | |||
|         <C_S002> | |||
|             <D_0004>Senderkennung</D_0004> | |||
|         </C_S002> | |||
|         <C_S003> | |||
|             <D_0010>Empfaengerkennung</D_0010> | |||
|         </C_S003> | |||
|         <C_S004> | |||
|             <D_0017>060620</D_0017> | |||
|             <D_0019>0931</D_0019> | |||
|         </C_S004> | |||
|         <D_0020>1</D_0020> | |||
|         <C_S005></C_S005> | |||
|         <D_0026>1234567</D_0026> | |||
|     </S_UNB> | |||
|     <ORDERS> | |||
|         <S_UNH> | |||
|             <D_0062>1</D_0062> | |||
|             <C_S009> | |||
|                 <D_0065>ORDERS</D_0065> | |||
|                 <D_0052>D</D_0052> | |||
|                 <D_0054>96A</D_0054> | |||
|                 <D_0051>UN</D_0051> | |||
|             </C_S009> | |||
|         </S_UNH> | |||
|         <S_BGM> | |||
|             <C_C002> | |||
|                 <D_1001>220</D_1001> | |||
|             </C_C002> | |||
|             <D_1004>B10001</D_1004> | |||
|         </S_BGM> | |||
|         <S_DTM> | |||
|             <C_C507> | |||
|                 <D_2005>4</D_2005> | |||
|                 <D_2380>20060620</D_2380> | |||
|                 <D_2379>102</D_2379> | |||
|             </C_C507> | |||
|         </S_DTM> | |||
|         <S_NAD> | |||
|             <D_3035>BY</D_3035> | |||
|             <C_C082></C_C082> | |||
|             <C_C058></C_C058> | |||
|             <C_C080> | |||
|                 <D_3036>Bestellername</D_3036> | |||
|             </C_C080> | |||
|             <C_C059> | |||
|                 <D_3042>Strasse</D_3042> | |||
|             </C_C059> | |||
|             <D_3164>Stadt</D_3164> | |||
|             <D_3229></D_3229> | |||
|             <D_3251>23436</D_3251> | |||
|             <D_3207>xx</D_3207> | |||
|         </S_NAD> | |||
|         <S_LIN> | |||
|             <D_1082>1</D_1082> | |||
|             <D_1229></D_1229> | |||
|             <C_C212> | |||
|                 <D_7140>Produkt Schrauben</D_7140> | |||
|                 <D_7143>SA</D_7143> | |||
|             </C_C212> | |||
|         </S_LIN> | |||
|         <S_QTY> | |||
|             <C_C186> | |||
|                 <D_6063>1</D_6063> | |||
|                 <D_6060>1000</D_6060> | |||
|             </C_C186> | |||
|         </S_QTY> | |||
|         <S_UNS> | |||
|             <D_0081>S</D_0081> | |||
|         </S_UNS> | |||
|         <S_CNT> | |||
|             <C_C270> | |||
|                 <D_6069>2</D_6069> | |||
|                 <D_6066>1</D_6066> | |||
|             </C_C270> | |||
|         </S_CNT> | |||
|         <S_UNT> | |||
|             <D_0074>9</D_0074> | |||
|             <D_0062>1</D_0062> | |||
|         </S_UNT> | |||
|     </ORDERS> | |||
|     <S_UNZ> | |||
|         <D_0036>1</D_0036> | |||
|         <D_0020>1234567</D_0020> | |||
|     </S_UNZ> | |||
|   </EDIFACT> | |||
| many more (customizable) formats are supported; please contact eXept for details. | |||
| ===== Accessing Document and Message Components ===== | ===== Accessing Document and Message Components ===== | ||
| Zeile 127: | Zeile 290: | ||
| ====== EdifactDocument API ====== | ====== EdifactDocument API ====== | ||
| "document" will be an instance of EdifactDocument  | "document" will be an instance of EdifactDocument. It represents a single transmission or document (possibly containing multiple messages). | ||
| It supports the API: | |||
| *  | * ''aDocument'' '''numberOfMessages'''<br>- returns the number of individual messages in the transaction | ||
| *  | * ''aDocument'' '''messages'''<br>- a collection of messages contained in the transaction | ||
| *  | * ''aDocument'' '''adviceSegment'''<br>- returns the transmission's advice segment (the 'UNA' segment) | ||
| *  | * ''aDocument'' '''headerSegment'''<br>- returns the transaction's header segment (the 'UNB' segment) | ||
| *  | * ''aDocument'' '''trailerSegment'''<br>- returns the transaction's trailer segment (the 'UNZ' segment) | ||
| Individual messages are fetched as instances of EdifactMessage via "document messages at: | Individual messages are fetched as instances of EdifactMessage via "''document'' '''messages''' '''at:''' ''index''" or enumerated with "''document'' '''messages''' '''do:'''[:each | ...]". | ||
| <br> | <br> | ||
| Examples: | Examples: | ||
| Zeile 143: | Zeile 307: | ||
| ====== EdifactMessage API ====== | ====== EdifactMessage API ====== | ||
| EdifactMessage supports the API: | EdifactMessage is a single message inside an edifactDocument (or transmission). It consists of segments, and supports the API: | ||
| *  | * ''aMessage'' '''segments'''<br>- a collection of individual segments inside the message (excludes header and trailer). This is a "raw" (e.g. flat) list of segments, as present in the original message. No segment group information is reflected in this list. | ||
| * ''aMessage'' '''groupedSegments'''<br>- a hierarchical collection of segments and segment groups (excludes header and trailer). This is a hierarchical list where group-segments are founder under segment group objects. | |||
| * headerSegment - returns the message header segment (the 'UNH' segment) | |||
| * trailerSegment - returns the message trailer segment (the 'UNT' segment) | |||
| *  | * ''aMessage'' '''headerSegment'''<br>- returns the message header segment (the 'UNH' segment) | ||
| * ''aMessage'' '''trailerSegment'''<br>- returns the message trailer segment (the 'UNT' segment) | |||
| The list of version d14b message types is found at [http://www.unece.org/fileadmin/DAM/trade/untdid/d14b/trmd/trmdi2.htm] and [http://www.unece.org/fileadmin/DAM/trade/untdid/d14b/timd/timdi2.htm]. | |||
| * controllingAgency - extracts the standards controlling agency which defined the message structure. In the above, this would be 'UN'. | |||
| * version - extracts the message structure standard version number. In the above message, this is '96A'. | |||
| * release - extracts the message structure standard release number. In the above message, this is 'D'. | |||
| * ''aMessage'' '''messageType'''<br>- extracts the message type from the message header as a string. In the above example, this would be 'ORDERS'.<br>The list of version d14b message types is found at [http://www.unece.org/fileadmin/DAM/trade/untdid/d14b/trmd/trmdi2.htm] and [http://www.unece.org/fileadmin/DAM/trade/untdid/d14b/timd/timdi2.htm]. | |||
| * messageSchema - return the message's schema definition (see below) | |||
| *  | * ''aMessage'' '''controllingAgency'''<br>- extracts the standards controlling agency which defined the message structure. In the above, this would be 'UN'. | ||
| * ''aMessage'' '''version'''<br>- extracts the message structure standard version number. In the above message, this is '96A'. | |||
| * ''aMessage'' '''release'''<br>- extracts the message structure standard release number. In the above message, this is 'D'. | |||
| * ''aMessage'' '''messageSchema'''<br>- return the message's schema definition (see below) | |||
| * ''aMessage'' '''edifactVersion'''<br>- return the messageSet which contains the message and segment schemas (see below) | |||
| The "controllingAgency", "version" and "release" fields are required to determine which meta schema definition is to be used (see below for details) | The "controllingAgency", "version" and "release" fields are required to determine which meta schema definition is to be used (see below for details) | ||
| Zeile 162: | Zeile 327: | ||
| ====== EdifactSegment API ====== | ====== EdifactSegment API ====== | ||
| Segments as returned by any of the above understand the following API: | Segments as returned by any of the above. They consist of a tag (segment type) and a number of data elements. They understand the following API: | ||
| *  | * ''aSegment'' '''tag'''<br>- the segments's id (e.g. 'UNA', 'DTM', 'BGM' etc.) The list of version d14b tags is found at [http://www.unece.org/fileadmin/DAM/trade/untdid/d14b/trsd/trsdi1.htm]. | ||
| * dataElementAt: | * ''aSegment'' '''dataElementAt:''' ''index''<br>- return a possibly composite data element. This is a wrapper object, which holds on both the schema definition (if known) and the data value. | ||
| * dataElementStringAt: | * ''aSegment'' '''dataElementStringAt:''' ''index''<br>- return the datum as a string. If the element is not present, an empty string is returned. | ||
| * dataElementValueAt: | * ''aSegment'' '''dataElementValueAt:''' ''index''<br>- return the datum as a string or number or nil, if the element is not present. | ||
| * dataElementAt:index1 at: | * ''aSegment'' '''dataElementAt:''' ''index1'' '''at:''' ''index2''<br>- similar to the above, but return a composite element's subcomponent | ||
| * dataElementStringAt:index1 at: | * ''aSegment'' '''dataElementStringAt:''' ''index1'' '''at:''' ''index2''<br>- similar to the above, but return a composite element's subcomponent's string | ||
| * dataElementValueAt:index1 at: | * ''aSegment'' '''dataElementValueAt:''' ''index1'' '''at:''' ''index2''<br>- similar to the above, but return a composite element's subcomponent' value | ||
| * dataElementAt:index put:newElement | * ''aSegment'' '''dataElementAt:''' ''index'' '''put:''' ''newElement''<br>- | ||
| * dataElementStringAt:index put:newString | * ''aSegment'' '''dataElementStringAt:''' ''index'' '''put:''' ''newString''<br>- | ||
| * dataElementValueAt:index put:newObject | * ''aSegment'' '''dataElementValueAt:''' ''index'' '''put:''' ''newObject''<br>- | ||
| * dataElementAt:index1 at:index2 put:newElement | * ''aSegment'' '''dataElementAt:''' ''index1'' '''at:''' ''index2'' '''put:''' ''newElement''<br>- | ||
| * dataElementStringAt:index1 at:index2 put:newString | * ''aSegment'' '''dataElementStringAt:''' ''index1'' '''at:''' ''index2'' '''put:''' ''newString''<br>- | ||
| * dataElementValueAt:index1 at:index2 put:newObject | * ''aSegment'' '''dataElementValueAt:''' ''index1'' '''at:''' ''index2'' '''put:''' ''newObject''<br>- | ||
| Examples: | Examples: | ||
| Zeile 188: | Zeile 353: | ||
|     (dtmSegment dataElementStringAt:1 at:2) -> '20060620' |     (dtmSegment dataElementStringAt:1 at:2) -> '20060620' | ||
| ======  | ====== Well-known Segment Types ====== | ||
| Although any data element can be accessed either by an index or its UN-standard name (e.g. C107 or 4063), | |||
| For some wellknown, heavily used segment types, user friendly accessors are available: | |||
| for some well-known, often used segment types, additional user friendly accessors are provided. | |||
| The idea is to make program code more readable, by using human readable field names, instead of the more | |||
| technical UN-standard names. | |||
| ====== UNB Segment API ====== | ====== UNB (Interchange Header) Segment API ====== | ||
| UNB (transmission header): | |||
| * interchangeRecipient | * ''unbSegment'' '''interchangeRecipient''' | ||
| * interchangeSender | * ''unbSegment'' '''interchangeSender''' | ||
| * processingPriorityCode | * ''unbSegment'' '''processingPriorityCode''' | ||
| * testIndicator | * ''unbSegment'' '''testIndicator''' | ||
| * -- more to be documented -- | * -- more to be documented -- | ||
| ====== UNH Segment API ====== | ====== UNH (Message Header) Segment API ====== | ||
| UNH (message header): | |||
| * messageIdentifier - the contents of the S009 element as a composite string (with colons) | * ''unhSegment'' '''messageIdentifier'''<br> - the contents of the S009 element as a composite string (with colons) | ||
| * messageType - the contents of the S009:0065 element | * ''unhSegment'' '''messageType'''<br> - the contents of the S009:0065 element | ||
| * messageVersionNumber - the contents of the S009:0052 element | * ''unhSegment'' '''messageVersionNumber'''<br> - the contents of the S009:0052 element | ||
| * messageReleaseNumber - the contents of the S009:0054 element | * ''unhSegment'' '''messageReleaseNumber'''<br> - the contents of the S009:0054 element | ||
| * controllingAgency - the contents of the S009:0051 element | * ''unhSegment'' '''controllingAgency'''<br> - the contents of the S009:0051 element | ||
| * associationAssignedCode - the contents of the S009:0057 element | * ''unhSegment'' '''associationAssignedCode'''<br> - the contents of the S009:0057 element | ||
| * -- more to be documented -- | * -- more to be documented -- | ||
| ====== UNT Segment API ====== | ====== UNT (Message Trailer) Segment API ====== | ||
| UNT (message trailer): | |||
| * messageReferenceNumber | * ''untSegment'' '''messageReferenceNumber'''<br> - | ||
| * -- more to be documented -- | * -- more to be documented -- | ||
| ====== UNZ Segment API ====== | ====== UNZ (Interchange Trailer) Segment API ====== | ||
| UNZ (interchange trailer): | |||
| * numberOfMessages - the number of messages in the transmission (document) | * ''unzSegment'' '''numberOfMessages'''<br> - the number of messages in the transmission (document) | ||
| * interchangeControlReference | * ''unzSegment'' '''interchangeControlReference'''<br> - | ||
| * -- more to be documented -- | * -- more to be documented -- | ||
| ====== DTM Segment API ====== | ====== DTM (Date Time) Segment API ====== | ||
| DTM (date time segment): | |||
| * dateTimeString - retrieves the 2nd component, which is the date time as a string | |||
| * dateTimeFormat - retrieves the 3rd component, which specifies the format of the dateTimeString | |||
| * dateTime - converts the dateTimeString as specified by dateTimeFormat and returns a Date, Time, Timestamp, TimeDuration or similar object. | |||
| * ''dtmSegment'' '''dateTimeString'''<br> - retrieves the 2nd component, which is the date time as a string | |||
| ====== NAD Segment API ====== | |||
| * ''dtmSegment'' '''dateTimeFormat'''<br> - retrieves the 3rd component, which specifies the format of the dateTimeString | |||
| NAD (name and address segment): | |||
| * ''dtmSegment'' '''dateTime'''<br> - converts the dateTimeString as specified by dateTimeFormat and returns a Date, Time, Timestamp, TimeDuration or similar object. | |||
| * cityName - retrieves the city component | |||
| * countryNameCode | |||
| ====== NAD (Name and Address) Segment API ====== | |||
| * nameAndAddress | |||
| * partyName | |||
| * ''nadSegment'' '''cityName'''<br> - retrieves the city component | |||
| * postalIDCode | |||
| * ''nadSegment'' '''countryNameCode'''<br> - | |||
| * street - retrieves the street component | |||
| * ''nadSegment'' '''nameAndAddress'''<br> - | |||
| * ''nadSegment'' '''partyName'''<br> - | |||
| * ''nadSegment'' '''postalIDCode'''<br> - | |||
| * ''nadSegment'' '''street'''<br> - retrieves the street component | |||
| * - to be documented - | * - to be documented - | ||
| ====== QTY Segment API ====== | ====== QTY (Quantity) Segment API ====== | ||
| QTY (quantity): | |||
| * quantity - the contents of the 6060 element as a string | * ''qtySegment'' '''quantity'''<br> - the contents of the 6060 element as a string | ||
| * quantityDetails - the contents of the C186 element as a composite string (with colons) | * ''qtySegment'' '''quantityDetails'''<br> - the contents of the C186 element as a composite string (with colons) | ||
| * quantityTypeCodeQualifier - the contents of the 6063 element | * ''qtySegment'' '''quantityTypeCodeQualifier'''<br> - the contents of the 6063 element | ||
| * -- more to be documented - | * -- more to be documented - | ||
| ====== MOA Segment API ====== | ====== MOA (Monetary Amount) Segment API ====== | ||
| MOA (monetary amount) | |||
| * amount | * ''moaSegment'' '''amount'''<br> - | ||
| * currencyIdCode | * ''moaSegment'' '''currencyIdCode'''<br> - | ||
| * currencyTypeCodeQualifier | * ''moaSegment'' '''currencyTypeCodeQualifier'''<br> - | ||
| * statusDescriptionCode | * ''moaSegment'' '''statusDescriptionCode'''<br> - | ||
| * typeCode | * ''moaSegment'' '''typeCode'''<br> - | ||
| * -- more to be documented - | * -- more to be documented - | ||
| ===== Getting a Schema Definition ===== | ===== Getting a Schema Definition ===== | ||
| Schema definitions are normally automatically fetched as required, whenever a segment- or messageSchema is needed. For example, when asking a message for its groupedSegments, or when asking a segment for a named field, the message is consulted for its messageIdentifier, and a corresponding schema definition is tried to be loaded from a folder containing the schema definition files. | |||
| Schema definitions are imported via the schema parsers. These parsers read specifications in various formats and | |||
| return an instance of EdifactMessageSet. This holds the definitions of segments and messages. | |||
| However, you can also import schemas manually via the schema parsers. These parsers read specifications in various formats and | |||
| return an instance of <COD>EdifactMessageSet</CODE>. This holds the definitions of segments and messages. | |||
| Because schema parsing may be a relatively expensive operation (depending on the format), these schema definitions are | Because schema parsing may be a relatively expensive operation (depending on the format), these schema definitions are | ||
| cached both in memory and as an option also in external files ( | cached both in memory and as an option also in external files (schema cache folder). | ||
| Assuming, a schema definition is present in SEF format in the file "~expecco/definitions/un/d10a/sef/INVOIC.SEF", | Assuming, a schema definition is present in SEF format in the file "<CODE>~expecco/definitions/un/d10a/sef/INVOIC.SEF</CODE>", | ||
| read it with: | read it with: | ||
| Zeile 273: | Zeile 443: | ||
| For standard definitions, a more convenient automatic parsing is possible, | For standard definitions, a more convenient automatic parsing is possible, | ||
| by asking the utility class "EdifactVersion" for a particular schema. | by asking the utility class "<CODE>EdifactVersion</CODE>" for a particular schema. | ||
| Give it the controlling agency, version and release, as in: | Give it the controlling agency, version and release, as in: | ||
|     messageSet := EdifactVersion controllingAgency:'UN' version:'03A' release:'D'. |     messageSet := EdifactVersion controllingAgency:'UN' version:'03A' release:'D'. | ||
| Notice that this is also the proper way to get an incoming message's schema definition and done by the "EdifactMessage >> messageSchema method). | Notice that this is also the proper way to get an incoming message's schema definition and done by the "<CODE>EdifactMessage >> messageSchema</CODE>" method. This will automatically extract the correct schema definition (from the message identifier) and load it via an appropriate parser as required. | ||
| This is also the mechanism used when a schema definition is needed by the XPath accessors or when asking for the segment group structure. | |||
| ===== MessageSet API ===== | ===== MessageSet API ===== | ||
| A message set object contains the definitions of one or more messages (as described in the schema file) and of all required ( | A message set object contains the definitions of one or more messages (as described in the schema file) and of all required (e.g. referred to) segments. | ||
| You can fetch individual messageSchemas with: | You can fetch individual messageSchemas with: | ||
| * aMessageSchema messageSchemaAt:messageType | * ''aMessageSchema'' '''messageSchemaAt:'''''messageType''<br>where messageType is one of 'INVOIC', 'ORDERS', etc. | ||
| and segment schemas with: | and segment schemas with: | ||
| * aMessageSchema segmentSchemaAt:segmentID, where segmentID is one of 'BGM', 'DTM' etc. | |||
| * ''aMessageSchema'' '''segmentSchemaAt:'''''segmentID''<br>where segmentID is one of 'BGM', 'DTM' etc. | |||
| MessageSets are organized in a hierarchy, and missing schemas are also searched in a messageSet's baseSchema.  | |||
| and element schemas with: | |||
| * ''aMessageSchema'' '''elementSchemaAt:'''''elementName''<br>where elementName is one of 'C107', '4063' etc. | |||
| MessageSets are organized in a hierarchy, and missing schemas are also searched in a messageSet's baseSchema. | |||
| Typically, messageSets for custom applications | Typically, messageSets for custom applications | ||
| have the more general standard UN messageSet as their baseSchema. | have the more general standard UN messageSet as their baseSchema. | ||
| Zeile 299: | Zeile 474: | ||
| ===== Accessing fields with Schema Information Present ===== | ===== Accessing fields with Schema Information Present ===== | ||
| if a  | if a schema is known, fields and segments and groups of segments can be accessed by name. | ||
| Assuming the standard schema for the ORDERS message, the above message has the following group schema: | Assuming the standard schema for the ORDERS message, the above message has the following group schema: | ||
| <CODE><PRE> | <CODE><PRE> | ||
|        UNA:+.? ' |        UNA:+.? ' | ||
|        UNB+UNOC:3+Senderkennung+Empfaengerkennung+060620:0931+1++1234567' |        UNB+UNOC:3+Senderkennung+Empfaengerkennung+060620:0931+1++1234567' | ||
|        UNH+1+ORDERS:D:96A:UN' |        UNH+1+ORDERS:D:96A:UN' | ||
|        BGM+220+B10001' |        BGM+220+B10001' | ||
|        DTM+4:20060620:102' |        DTM+4:20060620:102' | ||
| SG2    NAD+BY+++Bestellername+Strasse+Stadt++23436+xx' | SG2      NAD+BY+++Bestellername+Strasse+Stadt++23436+xx' | ||
| SG25   LIN+1++Produkt Schrauben:SA' | SG25       LIN+1++Produkt Schrauben:SA' | ||
| SG25   QTY+1:1000' | SG25       QTY+1:1000' | ||
|        UNS+S' |        UNS+S' | ||
|        CNT+2:1' |        CNT+2:1' | ||
| Zeile 316: | Zeile 491: | ||
|        UNZ+1+1234567' |        UNZ+1+1234567' | ||
| </PRE></CODE> | </PRE></CODE> | ||
| e.g. the NAD belongs to SG2 and the LIN+QTY belong to SG25. | |||
| Without proper knowledge of which segment group a segment belongs to, it is usually hard to impossible to correctly interpret | Without proper knowledge of which segment group a segment belongs to, it is usually hard to impossible to correctly interpret | ||
| the values inside the message (just consider the fact, that there are normally multiple NAD, LIN, QTY etc. segments present). | the values inside the message (just consider the fact, that there are normally multiple NAD, LIN, QTY etc. segments present). | ||
| ===== Accessing fields with XPath like  | ===== Accessing fields with XPath like Accessors ===== | ||
| When a message knows its schema, you can access elements by  | When a message knows its schema, you can access elements by XPath: | ||
| * message dataElementStringAt:'/SG25/QTY/ | * ''message'' '''dataElementStringAt:''' '/SG25/QTY/ | ||
| or, as a convenient shortcut: | or, as a convenient shortcut: | ||
| * message xPathGet:'/SG25/QTY' | * ''message'' '''xPathGet:''' '/SG25/QTY' | ||
| Many of the XPath search and filter functions are possible, for example, to find the second quantity in a message which contains multiple | Many of the XPath search and filter functions are possible, for example, to find the second quantity in a message which contains multiple | ||
| instances of SG25, use: | instances of SG25, use: | ||
| * message xPathGet:'/SG25[2]/QTY' | * ''message'' '''xPathGet:''' '/SG25[2]/QTY' | ||
| or, to find all fields which contain a particular value, use: | or, to find all fields which contain a particular value, use: | ||
| * message xPathGet:'//QTY[contains(text(),"100.0")]' | * ''message'' '''xPathGet:''' '//QTY[contains(text(),"100.0")]' | ||
| The same scheme can be used to change fields, when EDIFACT objects are constructed: | |||
| * ''message'' '''xPathSet:''' '/SG25[2]/QTY/6060' '''value:''' 100 | |||
| === Edifact Plugin Library for Expecco === | === Edifact Plugin Library for Expecco === | ||
| The plugin includes a library of action blocks, to call the above described functions. | |||
| For example, the following activity decodes an EDIFACT message and extracts a particular field by XPath from it: | |||
| [[Datei:EdifactExtract.png]] | |||
| where the attachment contains an INVOIC as EDIFACT: | |||
| <PRE> | |||
| UNA:+.? ' | |||
| UNB+UNOC:3+9900111111112:500+9900111111113:500+150407:0813+000011606231' | |||
| UNH+000610018555+INVOIC:D:06A:UN:2.6a' | |||
| BGM+457+PRN780101466824+9' | |||
| DTM+137:20150405:102' | |||
| DTM+9:20150405:102' | |||
| DTM+155:20120731:102' | |||
| DTM+156:20121127:102' | |||
| IMD++JVR' | |||
| RFF+Z13:31004' | |||
| RFF+OI:PRN771200331164' | |||
| DTM+171:20121204:102' | |||
| NAD+MS+9900111111112::293++VNB:::::Z02+Musterstraße::42+Musterhausen++12345+DE' | |||
| RFF+VA:DE814588308' | |||
| NAD+MR+9900111111113::293++Lieferant:Kreditoren- und Energiedatenmanagem::::Z02+Musterstraße::42+Musterhausen++12345+DE' | |||
| NAD+DP++++Musterstr.::41+Musterhausen++12345+DE' | |||
| LOC+172+DE0001231234500000000000004700823' | |||
| CUX+2:EUR:4' | |||
| PYT+3' | |||
| DTM+265:20121220:102' | |||
| UNS+S' | |||
| MOA+77:-53.08' | |||
| MOA+113:-26' | |||
| MOA+9:-27.08' | |||
| TAX+7+VAT+++:::0+O' | |||
| MOA+125:-53.08' | |||
| MOA+161:0' | |||
| MOA+113:-26' | |||
| MOA+115:0' | |||
| UNT+28+000610018555' | |||
| UNZ+1+000011606231' | |||
| </PRE> | |||
| The INVOIC's segment structure is: | |||
| [[Datei:EDISegmentStructure1.png]] | |||
| <hr> | <hr> | ||
| Zeile 349: | Zeile 571: | ||
| ==== Edifact Plugin Library for Expecco ==== | ==== Edifact Plugin Library for Expecco ==== | ||
| The library contains blocks for: | |||
| * EDIFACT document decoding and encoding | |||
| <hr> | |||
| * message, segment, field extraction, replacement and adding | |||
| * analysis and verification against a schema definition, grouping of segment groups | |||
| Most of these action blocks are simple wrappers which call into the above described API. | |||
| ===== Message Decoding ===== | |||
| * Decode Edifact Document<br>takes a textual edifact document, parses it and generates a structured edifact document object consisting of edifact message objects. | |||
| * Decode and Enumerate Edifact Messages<br>takes a textual edifact message, parses it and generates multiple edifact message objects at its output (streaming). | |||
| * Decode Single Edifact Message<br>takes a single textual edifact message, parses it and generates a structured edifact message object consisting of edifact segment objects. | |||
| * Decode Document from XML/Edifact<br>takes a single textual XML/Edifact message, parses it and generates a structured edifact document object consisting of edifact message objects. | |||
| ===== Message Encoding ===== | |||
| * Encode to Edifact<br>takes a structured edifact document object and generates a textual edifact representation. | |||
| * Encode to simple XML<br>takes a structured edifact document object and generates a textual XML representation (simple XML format). | |||
| * Encode to UN XML<br>takes a structured edifact document object and generates a textual XML representation (UN XML format). | |||
| * Encode to XML/Edifact<br>takes a structured edifact document object and generates a textual XML representation (XML/Edifact format). | |||
| * Encode to XML/Edifact V2<br>takes a structured edifact document object and generates a textual XML representation (XML/Edifact version 2 format). | |||
| ===== Document Extraction ===== | |||
| * Get AdviceSegment<br>extracts the advice segment from a document object. | |||
| * Get HeaderSegment<br>extracts the header segment from a document object. | |||
| * Get TrailerSegment<br>extracts the trailer segment from a document object. | |||
| * Extract Messages<br>extracts all messages from a document object. | |||
| * Extract Single Message<br>extracts a single message from a document object. | |||
| * Get Header Fields<br>extracts specific fields from a document object. | |||
| ===== Document Construction ===== | |||
| * Create<br>create a new empty document object. | |||
| * Set Header and Trailer Segments<br>adds segments to a document object. | |||
| * Create from Messages<br>create a new document object. | |||
| * Create from Single Messages<br>create a new document object. | |||
| ===== Message Extraction ===== | |||
| * Get HeaderSegment<br>extracts the header segment from a message object. | |||
| * Get TrailerSegment<br>extracts the trailer segment from a message object. | |||
| * Get Segment by Index<br>extracts any segment from a message object. | |||
| * Get MessageType<br>extracts the message type of a message object. | |||
| * Assert Has MessageType<br>verifies the message type of a message object. | |||
| ===== Segment and Segment Group Extraction ===== | |||
| * Get Segment Tag<br>extracts the segment tag from a segment object. | |||
| * Assert Has Tag<br>verifies the segment tag of a segment object. | |||
| * Get by XPath<br>extracts a segment given an XPath from an edifact object. | |||
| ===== Field Extraction ===== | |||
| * Get Data Element by Index<br>extracts a data element from a segment object. | |||
| * Get Composite Data Element by Index<br>extracts a composite data element from a segment object. | |||
| * Get DTM Fields<br>extracts Date time fields from a segment object. | |||
| * Get by XPath<br>extracts a data element given an XPath locator from an edifact object. | |||
| * Get Multiple by XPath<br>extracts multiple data elements given an XPath locator from an edifact object. | |||
| * Get All Matching by XPath<br>extracts multiple data elements given an XPath locator from an edifact object. | |||
| ===== Field Modification ===== | |||
| * Set by XPath<br>change a data element given an XPath locator in an edifact object. | |||
| ===== Segment Insertion ===== | |||
| ===== Message Insertion ===== | |||
| ===== Misc ===== | |||
| * DTM Strings to DateTime<br>convert date and time. | |||
| * DateTime to DTM Strings<br>convert date and time. | |||
| ---- | |||
| Back to [[Online_Documentation#Library_and_Plugin_Overview | Online Documentation]]. | Back to [[Online_Documentation#Library_and_Plugin_Overview | Online Documentation]]. | ||
Aktuelle Version vom 23. Oktober 2020, 12:56 Uhr
Inhaltsverzeichnis
- 1 Edi/Edifact Plugin
- 2 EDIFACT MessageBrowser Plugin
- 3 EDIFACT Plugin Components
- 3.1 EDIFACT Smalltalk Class Library
- 3.1.1 EDIFACT Message Decoder
- 3.1.2 EDIFACT Message Encoder
- 3.1.3 EDIFACT Object Representation Framework
- 3.1.4 EDIFACT Schema Definition Framework
- 3.1.5 EDIFACT Schema Definition Readers
- 3.1.6 Smalltalk Library API
- 3.1.6.1 Decoding a Message
- 3.1.6.2 Encoding a Message
- 3.1.6.3 Accessing Document and Message Components
- 3.1.6.3.1 EdifactDocument API
- 3.1.6.3.2 EdifactMessage API
- 3.1.6.3.3 EdifactSegment API
- 3.1.6.3.4 Well-known Segment Types
- 3.1.6.3.5 UNB (Interchange Header) Segment API
- 3.1.6.3.6 UNH (Message Header) Segment API
- 3.1.6.3.7 UNT (Message Trailer) Segment API
- 3.1.6.3.8 UNZ (Interchange Trailer) Segment API
- 3.1.6.3.9 DTM (Date Time) Segment API
- 3.1.6.3.10 NAD (Name and Address) Segment API
- 3.1.6.3.11 QTY (Quantity) Segment API
- 3.1.6.3.12 MOA (Monetary Amount) Segment API
 
- 3.1.6.4 Getting a Schema Definition
- 3.1.6.5 MessageSet API
- 3.1.6.6 Accessing fields with Schema Information Present
- 3.1.6.7 Accessing fields with XPath like Accessors
 
 
- 3.2 Edifact Plugin Library for Expecco
 
- 3.1 EDIFACT Smalltalk Class Library
Edi/Edifact Plugin[Bearbeiten]
The Edifact Plugin provides extensive support for EDIFACT message processing and adds a hierarchical structure editor to the GUIBrowser user interface. There, EDIFACT messages can be interactively edited and action sequences constructed to operate on it.
Background Information[Bearbeiten]
EDIFACT is an international standard for the exchange of business transaction messages, such as invoices, orders, quotes etc. EDIFACT transactions are heavily used in B2B communications, and supported by SAP, Oracle, IBM and other business processing systems.
EDIFACT messages consist of a number of records (called "segments"), which themselves contain a number of data elements. Both non-composite and composite data elements are possible.
EDIFACT messages are surrounded by a header- and trailer segment.
An EDIFACT transaction may contain multiple individual messages, and is itself surrounded by a transaction header segment and a transaction trailer segment.
For comprehensive documentation and a full specification, see United Nations Economic Commission for Europe in general, and Introducing UN/EDIFACT in particular.
EDIFACT Messages[Bearbeiten]
UN/EDIFACT defines the overall format and layout of EDIFACT messages. Standard specifications are typically published twice a year. New revisions typically add additional data elements, additional segment types and messages. New revisions are typically backward compatible. That means that previous definitions are seldom (if ever) obsoleted.
All messages and segments contain a tag (or message type) field, which allows for the type of message/segment to be identified. Thus, in theory, it is possible to extract field values from arbitrary messages, even if the concrete message layout is unknown.
However, the semantic interpretation of a particular datum highly depends on its position inside the message. For example, a DTM-segment's datetime value can be extracted easily, but its interpretation (delivery date, invoice date, quotation date etc.) is only possible if the overall message structure is known.
Message Subsets[Bearbeiten]
The definitions as published by UN/Unice define the overall structure of messages and segments. However, most concrete application do not need/use/support the full set of possible segments in a message. Many segments are specified as optional and not supported/expected by many concrete users.
For this, typical B2B transaction systems expect and handle only subsets of the overall message set and subsets of the possible segment set within those messages.
Meta Descriptions[Bearbeiten]
In order for a correct semantic interpretation, modification and verification of data elements, a meta description is required, which specifies the format of a message (segment order and grouping) and of data elements (alphanumeric, numeric, minimum/maximum size and value etc.).
Such specifications are published by UN/Unice for the overall framework, and by companies for the B2B partners. Specifications exist in both formal (machine readable) formats and in non-formal (human readable) formats. Formal specification formats are SEF (Structured EDIFACT Format Specification) and XSD (XML Schema definitions). Non-formal specification are PDF, HTML and Text formats.
The biggest challenges in dealing with EDIFACT is the trouble to get formal (machine readable) specifications.
SEF format files provide all required information and would be perfect for this task - however, SEF seems to be obsolete and SEF specifications for newer versions are hard to find (which is a pity). SEF files are not available from UN/Unice.
XSD seems to be the way to go, but official XSD specifications are rare and not published by UN.
DFDL is a new XML-based definition standard, which is very promising in theory. However, it is relatively new. Only a small subset of EDIFACT messages is formally specified in this format.
Official documents from UN/Unice include PDF, HTML and Text files. Although being informal, existing HTML and text files seem to follow a common layout and structure.
EDIFACT MessageBrowser Plugin[Bearbeiten]
This plugin adds a structured EDIFACT message inspector and editor to expecco. It provides an interface very similar to the GUIBrowser's interface, showing the hierarchical structure of EDIFACT messages. It also provides a UI to interactively compose action sequences to analyze, modify and test an EDIFACT message.
EDIFACT Plugin Components[Bearbeiten]
EDIFACT Smalltalk Class Library[Bearbeiten]
This library contains all real functionality of the EDIFACT plugin. The expecco action library (edifact.ets) contains interfacing blocks to the functions found there.
EDIFACT Message Decoder[Bearbeiten]
Includes a general message/segment/data decoder, which is able to read arbitrary (unknown) messages. Such textual messages (wire data) is decoded and presented as a hierarchy of objects, which model the message, segments and elements. The decoder can decode arbitrary messages - i.e. a meta description is not required for basic decoding.
EDIFACT Message Encoder[Bearbeiten]
This encodes a message object back into the external EDIFACT wire representation. It cares for the character set, separator definitions etc. In general, the original input is regenerated (with minor differences) when the a decoded EDIFACT message is encoded back to wire data.
EDIFACT Object Representation Framework[Bearbeiten]
These classes model EDIFACT objects as messages, segments, composite- and non-composite data elements. Without a meta description of the message set, data elements can by accessed by numeric indices (segment#, element#). With a meta description, elements are accessible by more user friendly names, such as the official UN standard field names (C009, C123, 4056 etc), or user assigned application field names (deliveryDate, deliveryLocation, street, city etc.)
EDIFACT Schema Definition Framework[Bearbeiten]
This set of classes represent a meta description of EDIFACT objects. They can be hand-written, dynamically constructed or read in from external specification files in various formats.
EDIFACT Schema Definition Readers[Bearbeiten]
These classes read external meta specifications in various formats and create a schema object. Schema objects are needed to correctly interpret element data, to access elements by user friendly names, and to create new messages which have the correct segment and segment group structure. Readers exist for the following formats:
- XSD
- DFDL
- SEF
- HTML (as in UN docs)
- Text (as in UN docs)
- bots (import specs written in the python programming language, written for the bots processor)
- internal private format
Smalltalk Library API[Bearbeiten]
The EDIFACT class library contains functions to be called from elementary code (and also code which is called by the blocks contained in the EDIFACT block library, which is described below).
Included are parsers for various formats (regular EDIFACT and XML-EDIFACT), generators for those, parsers for meta-definitions and data objects to represent EDIFACT messages (segments, groups, fields etc.)
Decoding a Message[Bearbeiten]
given the following EDIFACT message in wire data form:
UNA:+.? ' UNB+UNOC:3+Senderkennung+Empfaengerkennung+060620:0931+1++1234567' UNH+1+ORDERS:D:96A:UN' BGM+220+B10001' DTM+4:20060620:102' NAD+BY+++Bestellername+Strasse+Stadt++23436+xx' LIN+1++Produkt Schrauben:SA' QTY+1:1000' UNS+S' CNT+2:1' UNT+9+1' UNZ+1+1234567'
assuming, the above message (as String) is held in the "msg" variable, it can be decoded with:
document := EdifactDecoder decode:msg.
This returns a structured (hierarchical) message object in "document", which consists of segments and data fields.
Encoding a Message[Bearbeiten]
for the opposite direction, a number of EdifactCoder classes exist, which take a decoded or constructed EDIFACT message and generate an external representation from it:
EdifactCoder encode: document
generates a standard EDIFACT representation (the above).
For XML output, various different formats are supported, which use different tags for the elements.
For example,
EdifactXMLCoder encode: document
generates the following XML representation:
 <?xml version="1.0" encoding="UTF-8"?>
 <edifact>
   <UNB>
       <element>
           <value>UNOC</value>
           <value>3</value>
       </element>
       <element>
           <value>Senderkennung</value>
       </element>
       <element>
           <value>Empfaengerkennung</value>
       </element>
       <element>
           <value>060620</value>
           <value>0931</value>
       </element>
       ...
   </UNB>
   ...
   <CNT>
       <element>
           <value>2</value>
           <value>1</value>
       </element>
   </CNT>
   <UNT>
       <element>
           <value>9</value>
       </element>
       <element>
           <value>1</value>
       </element>
   </UNT>
   <UNZ>
       <element>
           <value>1</value>
       </element>
       <element>
           <value>1234567</value>
       </element>
   </UNZ>
 </edifact>
whereas:
EdifactXMLEdifactCoder encode: document
would generate:
 <?xml version="1.0" encoding="UTF-8"?>
 <EDIFACT>
   <S_UNB>
       <C_S001>
           <D_0001>UNOC</D_0001>
           <D_0002>3</D_0002>
       </C_S001>
       <C_S002>
           <D_0004>Senderkennung</D_0004>
       </C_S002>
       <C_S003>
           <D_0010>Empfaengerkennung</D_0010>
       </C_S003>
       <C_S004>
           <D_0017>060620</D_0017>
           <D_0019>0931</D_0019>
       </C_S004>
       <D_0020>1</D_0020>
       <C_S005></C_S005>
       <D_0026>1234567</D_0026>
   </S_UNB>
   <ORDERS>
       <S_UNH>
           <D_0062>1</D_0062>
           <C_S009>
               <D_0065>ORDERS</D_0065>
               <D_0052>D</D_0052>
               <D_0054>96A</D_0054>
               <D_0051>UN</D_0051>
           </C_S009>
       </S_UNH>
       <S_BGM>
           <C_C002>
               <D_1001>220</D_1001>
           </C_C002>
           <D_1004>B10001</D_1004>
       </S_BGM>
       <S_DTM>
           <C_C507>
               <D_2005>4</D_2005>
               <D_2380>20060620</D_2380>
               <D_2379>102</D_2379>
           </C_C507>
       </S_DTM>
       <S_NAD>
           <D_3035>BY</D_3035>
           <C_C082></C_C082>
           <C_C058></C_C058>
           <C_C080>
               <D_3036>Bestellername</D_3036>
           </C_C080>
           <C_C059>
               <D_3042>Strasse</D_3042>
           </C_C059>
           <D_3164>Stadt</D_3164>
           <D_3229></D_3229>
           <D_3251>23436</D_3251>
           <D_3207>xx</D_3207>
       </S_NAD>
       <S_LIN>
           <D_1082>1</D_1082>
           <D_1229></D_1229>
           <C_C212>
               <D_7140>Produkt Schrauben</D_7140>
               <D_7143>SA</D_7143>
           </C_C212>
       </S_LIN>
       <S_QTY>
           <C_C186>
               <D_6063>1</D_6063>
               <D_6060>1000</D_6060>
           </C_C186>
       </S_QTY>
       <S_UNS>
           <D_0081>S</D_0081>
       </S_UNS>
       <S_CNT>
           <C_C270>
               <D_6069>2</D_6069>
               <D_6066>1</D_6066>
           </C_C270>
       </S_CNT>
       <S_UNT>
           <D_0074>9</D_0074>
           <D_0062>1</D_0062>
       </S_UNT>
   </ORDERS>
   <S_UNZ>
       <D_0036>1</D_0036>
       <D_0020>1234567</D_0020>
   </S_UNZ>
 </EDIFACT>
many more (customizable) formats are supported; please contact eXept for details.
Accessing Document and Message Components[Bearbeiten]
EdifactDocument API[Bearbeiten]
"document" will be an instance of EdifactDocument. It represents a single transmission or document (possibly containing multiple messages). It supports the API:
- aDocument numberOfMessages
 - returns the number of individual messages in the transaction
- aDocument messages
 - a collection of messages contained in the transaction
- aDocument adviceSegment
 - returns the transmission's advice segment (the 'UNA' segment)
- aDocument headerSegment
 - returns the transaction's header segment (the 'UNB' segment)
- aDocument trailerSegment
 - returns the transaction's trailer segment (the 'UNZ' segment)
Individual messages are fetched as instances of EdifactMessage via "document messages at: index" or enumerated with "document messages do:[:each | ...]".
Examples:
bgmSegment := document messages first segments first. dtmSegmeent := document messages first segments at:2.
EdifactMessage API[Bearbeiten]
EdifactMessage is a single message inside an edifactDocument (or transmission). It consists of segments, and supports the API:
- aMessage segments
 - a collection of individual segments inside the message (excludes header and trailer). This is a "raw" (e.g. flat) list of segments, as present in the original message. No segment group information is reflected in this list.
- aMessage groupedSegments
 - a hierarchical collection of segments and segment groups (excludes header and trailer). This is a hierarchical list where group-segments are founder under segment group objects.
- aMessage headerSegment
 - returns the message header segment (the 'UNH' segment)
- aMessage trailerSegment
 - returns the message trailer segment (the 'UNT' segment)
- aMessage messageType
 - extracts the message type from the message header as a string. In the above example, this would be 'ORDERS'.
 The list of version d14b message types is found at [1] and [2].
- aMessage controllingAgency
 - extracts the standards controlling agency which defined the message structure. In the above, this would be 'UN'.
- aMessage version
 - extracts the message structure standard version number. In the above message, this is '96A'.
- aMessage release
 - extracts the message structure standard release number. In the above message, this is 'D'.
- aMessage messageSchema
 - return the message's schema definition (see below)
- aMessage edifactVersion
 - return the messageSet which contains the message and segment schemas (see below)
The "controllingAgency", "version" and "release" fields are required to determine which meta schema definition is to be used (see below for details)
EdifactSegment API[Bearbeiten]
Segments as returned by any of the above. They consist of a tag (segment type) and a number of data elements. They understand the following API:
- aSegment tag
 - the segments's id (e.g. 'UNA', 'DTM', 'BGM' etc.) The list of version d14b tags is found at [3].
- aSegment dataElementAt: index
 - return a possibly composite data element. This is a wrapper object, which holds on both the schema definition (if known) and the data value.
- aSegment dataElementStringAt: index
 - return the datum as a string. If the element is not present, an empty string is returned.
- aSegment dataElementValueAt: index
 - return the datum as a string or number or nil, if the element is not present.
- aSegment dataElementAt: index1 at: index2
 - similar to the above, but return a composite element's subcomponent
- aSegment dataElementStringAt: index1 at: index2
 - similar to the above, but return a composite element's subcomponent's string
- aSegment dataElementValueAt: index1 at: index2
 - similar to the above, but return a composite element's subcomponent' value
- aSegment dataElementAt: index put: newElement
 -
- aSegment dataElementStringAt: index put: newString
 -
- aSegment dataElementValueAt: index put: newObject
 -
- aSegment dataElementAt: index1 at: index2 put: newElement
 -
- aSegment dataElementStringAt: index1 at: index2 put: newString
 -
- aSegment dataElementValueAt: index1 at: index2 put: newObject
 -
Examples:
   dtmSegment id -> 'DTM'
   (dtmSegment dataElementStringAt:1) -> '4:20060620:102'
   (dtmSegment dataElementValueAt:1) -> #('4' '20060620' '102')
   (dtmSegment dataElementStringAt:1 at:2) -> '20060620'
Well-known Segment Types[Bearbeiten]
Although any data element can be accessed either by an index or its UN-standard name (e.g. C107 or 4063), for some well-known, often used segment types, additional user friendly accessors are provided. The idea is to make program code more readable, by using human readable field names, instead of the more technical UN-standard names.
UNB (Interchange Header) Segment API[Bearbeiten]
- unbSegment interchangeRecipient
- unbSegment interchangeSender
- unbSegment processingPriorityCode
- unbSegment testIndicator
- -- more to be documented --
UNH (Message Header) Segment API[Bearbeiten]
- unhSegment messageIdentifier
 - the contents of the S009 element as a composite string (with colons)
- unhSegment messageType
 - the contents of the S009:0065 element
- unhSegment messageVersionNumber
 - the contents of the S009:0052 element
- unhSegment messageReleaseNumber
 - the contents of the S009:0054 element
- unhSegment controllingAgency
 - the contents of the S009:0051 element
- unhSegment associationAssignedCode
 - the contents of the S009:0057 element
- -- more to be documented --
UNT (Message Trailer) Segment API[Bearbeiten]
- untSegment messageReferenceNumber
 -
- -- more to be documented --
UNZ (Interchange Trailer) Segment API[Bearbeiten]
- unzSegment numberOfMessages
 - the number of messages in the transmission (document)
- unzSegment interchangeControlReference
 -
- -- more to be documented --
DTM (Date Time) Segment API[Bearbeiten]
- dtmSegment dateTimeString
 - retrieves the 2nd component, which is the date time as a string
- dtmSegment dateTimeFormat
 - retrieves the 3rd component, which specifies the format of the dateTimeString
- dtmSegment dateTime
 - converts the dateTimeString as specified by dateTimeFormat and returns a Date, Time, Timestamp, TimeDuration or similar object.
NAD (Name and Address) Segment API[Bearbeiten]
- nadSegment cityName
 - retrieves the city component
- nadSegment countryNameCode
 -
- nadSegment nameAndAddress
 -
- nadSegment partyName
 -
- nadSegment postalIDCode
 -
- nadSegment street
 - retrieves the street component
- - to be documented -
QTY (Quantity) Segment API[Bearbeiten]
- qtySegment quantity
 - the contents of the 6060 element as a string
- qtySegment quantityDetails
 - the contents of the C186 element as a composite string (with colons)
- qtySegment quantityTypeCodeQualifier
 - the contents of the 6063 element
- -- more to be documented -
MOA (Monetary Amount) Segment API[Bearbeiten]
- moaSegment amount
 -
- moaSegment currencyIdCode
 -
- moaSegment currencyTypeCodeQualifier
 -
- moaSegment statusDescriptionCode
 -
- moaSegment typeCode
 -
- -- more to be documented -
Getting a Schema Definition[Bearbeiten]
Schema definitions are normally automatically fetched as required, whenever a segment- or messageSchema is needed. For example, when asking a message for its groupedSegments, or when asking a segment for a named field, the message is consulted for its messageIdentifier, and a corresponding schema definition is tried to be loaded from a folder containing the schema definition files.
However, you can also import schemas manually via the schema parsers. These parsers read specifications in various formats and return an instance of <COD>EdifactMessageSet. This holds the definitions of segments and messages. Because schema parsing may be a relatively expensive operation (depending on the format), these schema definitions are cached both in memory and as an option also in external files (schema cache folder).
Assuming, a schema definition is present in SEF format in the file "~expecco/definitions/un/d10a/sef/INVOIC.SEF",
read it with:
messageSet := EdifactSEFParser parseFile: '~expecco/definitions/un/d10a/sef/INVOIC.SEF'.
For XSD schema definitions use the EdifactXSDParser, as in:
messageSet := EdifactXSDParser parseFile: '~expecco/definitions/un/d10a/xsd/INVOIC.xsd'.
The expecco plugin contains definition files for all UN versions d93a to d14b in the "definitions" subfolder of the plugin.
For standard definitions, a more convenient automatic parsing is possible,
by asking the utility class "EdifactVersion" for a particular schema.
Give it the controlling agency, version and release, as in:
messageSet := EdifactVersion controllingAgency:'UN' version:'03A' release:'D'.
Notice that this is also the proper way to get an incoming message's schema definition and done by the "EdifactMessage >> messageSchema" method. This will automatically extract the correct schema definition (from the message identifier) and load it via an appropriate parser as required.
This is also the mechanism used when a schema definition is needed by the XPath accessors or when asking for the segment group structure.
MessageSet API[Bearbeiten]
A message set object contains the definitions of one or more messages (as described in the schema file) and of all required (e.g. referred to) segments. You can fetch individual messageSchemas with:
- aMessageSchema messageSchemaAt:messageType
 where messageType is one of 'INVOIC', 'ORDERS', etc.
and segment schemas with:
- aMessageSchema segmentSchemaAt:segmentID
 where segmentID is one of 'BGM', 'DTM' etc.
and element schemas with:
- aMessageSchema elementSchemaAt:elementName
 where elementName is one of 'C107', '4063' etc.
MessageSets are organized in a hierarchy, and missing schemas are also searched in a messageSet's baseSchema. Typically, messageSets for custom applications have the more general standard UN messageSet as their baseSchema.
MessageSets are cached internally, to prevent parsing overhead.
Accessing fields with Schema Information Present[Bearbeiten]
if a schema is known, fields and segments and groups of segments can be accessed by name. Assuming the standard schema for the ORDERS message, the above message has the following group schema:
       UNA:+.? '
       UNB+UNOC:3+Senderkennung+Empfaengerkennung+060620:0931+1++1234567'
       UNH+1+ORDERS:D:96A:UN'
       BGM+220+B10001'
       DTM+4:20060620:102'
SG2      NAD+BY+++Bestellername+Strasse+Stadt++23436+xx'
SG25       LIN+1++Produkt Schrauben:SA'
SG25       QTY+1:1000'
       UNS+S'
       CNT+2:1'
       UNT+9+1'
       UNZ+1+1234567'
e.g. the NAD belongs to SG2 and the LIN+QTY belong to SG25. Without proper knowledge of which segment group a segment belongs to, it is usually hard to impossible to correctly interpret the values inside the message (just consider the fact, that there are normally multiple NAD, LIN, QTY etc. segments present).
Accessing fields with XPath like Accessors[Bearbeiten]
When a message knows its schema, you can access elements by XPath:
- message dataElementStringAt: '/SG25/QTY/
or, as a convenient shortcut:
- message xPathGet: '/SG25/QTY'
Many of the XPath search and filter functions are possible, for example, to find the second quantity in a message which contains multiple instances of SG25, use:
- message xPathGet: '/SG25[2]/QTY'
or, to find all fields which contain a particular value, use:
- message xPathGet: '//QTY[contains(text(),"100.0")]'
The same scheme can be used to change fields, when EDIFACT objects are constructed:
- message xPathSet: '/SG25[2]/QTY/6060' value: 100
Edifact Plugin Library for Expecco[Bearbeiten]
The plugin includes a library of action blocks, to call the above described functions. For example, the following activity decodes an EDIFACT message and extracts a particular field by XPath from it:
where the attachment contains an INVOIC as EDIFACT:
UNA:+.? ' UNB+UNOC:3+9900111111112:500+9900111111113:500+150407:0813+000011606231' UNH+000610018555+INVOIC:D:06A:UN:2.6a' BGM+457+PRN780101466824+9' DTM+137:20150405:102' DTM+9:20150405:102' DTM+155:20120731:102' DTM+156:20121127:102' IMD++JVR' RFF+Z13:31004' RFF+OI:PRN771200331164' DTM+171:20121204:102' NAD+MS+9900111111112::293++VNB:::::Z02+Musterstraße::42+Musterhausen++12345+DE' RFF+VA:DE814588308' NAD+MR+9900111111113::293++Lieferant:Kreditoren- und Energiedatenmanagem::::Z02+Musterstraße::42+Musterhausen++12345+DE' NAD+DP++++Musterstr.::41+Musterhausen++12345+DE' LOC+172+DE0001231234500000000000004700823' CUX+2:EUR:4' PYT+3' DTM+265:20121220:102' UNS+S' MOA+77:-53.08' MOA+113:-26' MOA+9:-27.08' TAX+7+VAT+++:::0+O' MOA+125:-53.08' MOA+161:0' MOA+113:-26' MOA+115:0' UNT+28+000610018555' UNZ+1+000011606231'
The INVOIC's segment structure is:
Back to Online Documentation.
Edifact Plugin Library for Expecco[Bearbeiten]
The library contains blocks for:
- EDIFACT document decoding and encoding
- message, segment, field extraction, replacement and adding
- analysis and verification against a schema definition, grouping of segment groups
Most of these action blocks are simple wrappers which call into the above described API.
Message Decoding[Bearbeiten]
- Decode Edifact Document
 takes a textual edifact document, parses it and generates a structured edifact document object consisting of edifact message objects.
- Decode and Enumerate Edifact Messages
 takes a textual edifact message, parses it and generates multiple edifact message objects at its output (streaming).
- Decode Single Edifact Message
 takes a single textual edifact message, parses it and generates a structured edifact message object consisting of edifact segment objects.
- Decode Document from XML/Edifact
 takes a single textual XML/Edifact message, parses it and generates a structured edifact document object consisting of edifact message objects.
Message Encoding[Bearbeiten]
- Encode to Edifact
 takes a structured edifact document object and generates a textual edifact representation.
- Encode to simple XML
 takes a structured edifact document object and generates a textual XML representation (simple XML format).
- Encode to UN XML
 takes a structured edifact document object and generates a textual XML representation (UN XML format).
- Encode to XML/Edifact
 takes a structured edifact document object and generates a textual XML representation (XML/Edifact format).
- Encode to XML/Edifact V2
 takes a structured edifact document object and generates a textual XML representation (XML/Edifact version 2 format).
Document Extraction[Bearbeiten]
- Get AdviceSegment
 extracts the advice segment from a document object.
- Get HeaderSegment
 extracts the header segment from a document object.
- Get TrailerSegment
 extracts the trailer segment from a document object.
- Extract Messages
 extracts all messages from a document object.
- Extract Single Message
 extracts a single message from a document object.
- Get Header Fields
 extracts specific fields from a document object.
Document Construction[Bearbeiten]
- Create
 create a new empty document object.
- Set Header and Trailer Segments
 adds segments to a document object.
- Create from Messages
 create a new document object.
- Create from Single Messages
 create a new document object.
Message Extraction[Bearbeiten]
- Get HeaderSegment
 extracts the header segment from a message object.
- Get TrailerSegment
 extracts the trailer segment from a message object.
- Get Segment by Index
 extracts any segment from a message object.
- Get MessageType
 extracts the message type of a message object.
- Assert Has MessageType
 verifies the message type of a message object.
Segment and Segment Group Extraction[Bearbeiten]
- Get Segment Tag
 extracts the segment tag from a segment object.
- Assert Has Tag
 verifies the segment tag of a segment object.
- Get by XPath
 extracts a segment given an XPath from an edifact object.
Field Extraction[Bearbeiten]
- Get Data Element by Index
 extracts a data element from a segment object.
- Get Composite Data Element by Index
 extracts a composite data element from a segment object.
- Get DTM Fields
 extracts Date time fields from a segment object.
- Get by XPath
 extracts a data element given an XPath locator from an edifact object.
- Get Multiple by XPath
 extracts multiple data elements given an XPath locator from an edifact object.
- Get All Matching by XPath
 extracts multiple data elements given an XPath locator from an edifact object.
Field Modification[Bearbeiten]
- Set by XPath
 change a data element given an XPath locator in an edifact object.
Segment Insertion[Bearbeiten]
Message Insertion[Bearbeiten]
Misc[Bearbeiten]
- DTM Strings to DateTime
 convert date and time.
- DateTime to DTM Strings
 convert date and time.
Back to Online Documentation.


