Quantcast
Channel: Pentaho Community Forums
Viewing all articles
Browse latest Browse all 16689

Error processing an XML using "GetDataFromXML" step when xml contains multiple CDATA

$
0
0
Hi,

Can anyone suggest me a way to process an XML which contains multiple CDATA in it. One such problematic record set is given below:

<fields-list>
<field id="id001" type="TEXT" name="Username">
<field-value>
<value><![CDATA[<city><![CDATA[<name>Chennai</name>]]></city>]]>
</value>
</field-value>
</field>
</fields-list>


Note: As per the standard XML convention, if the closing delimiter of inner CDATA "]]>" symbol is replaced by "]]&gt;" then everything works fine. However the challenge is I cannot just find and replace "]]>" with "]]&gt;" as this the <value> tag content is populated by user input on a text-area field which in turn is fitted into this XML template. So the input keyed in by user can prone to be a buggy input when compared against the XML standards.

Having said that "get data from xml" does not allow me to fetch the content in the value tag when such an input is provided by the user. Also even when the "Validate xml?" option is turned off, still the validation is performed by this pentaho step and it is resulting into error due to this buggy format.

Can anyone assist on how to address this issue ?


Many Thanks

Viewing all articles
Browse latest Browse all 16689

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>