Following a rigorous methodology is key to delivering customer satisfaction and expanding analytics use cases across the business.
The usage of B2B Data Transformation to process the documents in any format can be optimized by an advanced concept called Chaining. Data Transformation chaining involves the use of more than one Data Transformation component like Parser, Mapper, Serializer, etc. or one or more Data Transformation service to achieve the desired objectives.
Data Transformation chaining philosophy revolves around the use of one or more small logical components, each of which accomplish a specific task to achieve a more complex objective. This methodology encourages using reusable components and breaking down work into smaller chunks to enable easier development and maintenance.
As an example, organizations get data from different sources and store the data in warehouses to generate the big picture about some specific business function. Each source system may stipulate using date in a specific format. It becomes necessary to convert the date, from varied systems and bring them to a standard format to be stored in the warehouse. The same issue would exist in various departments and various projects. Using Data Transformation, an organization can build a reusable date processing component which would accept date in any format, analyze such format and convert the date into a standard format. Different processes can then use the reusable component in their projects (Parser projects/Mapper projects /Serializer projects) and ensure consistency in the date formats that go into the warehouse. The process of using multiple Data Transformation segments in a project to achieve a complete file transformation is called Data Transformation Chaining.
The advantages of Data Transformation Chaining are summarized below:
There is no need to worry about the port sizes at the PowerCenter level as any string limitation is taken care of by Data Transformation internally. Only the Input and Output port sizes need to be defined.
Chaining in a Data Transformation project is achieved by the use of any of the following “Actions”
Scope of data for parsing, mapping or serialization would be the amount of data available from the main component (parser, mapper or serializer) to the secondary component that is being invoked. This can be categorized as follows:
In above code snippet, the input for the “Secondary Serializer” is implicit. The input value that will be used as the source in this case would be the output of the repeating group – “MyRepeatGroup”. Hence the value stored in the complex variable “$Inputs” would be the source. The secondary serializer would be invoked repeatedly for each occurrence of the repeating group.
In the above code snippet, the input for the “Full Serializer” is implicit. The input value that will be used as the source in this case would be the output of the “MainParser”. Hence the whole content of the xml from the “MainParser” is fed as the source document for the serializer.
Explicit Scope: The secondary component provides an explicit reference to a data holder which serves as the source to the secondary component. This method is generally used for parsing additional input files that may be cross referenced during the different stages of data transformation in a file from one format to another format.
In the above code snippet, the component “RunParser” invokes “My_Parser” and uses the value contained in the variable - “v_Temp”.
Note: Similar approach applies for “RunMapper” and “RunSerializer” components as well.
Success
Link Copied to Clipboard