Hitachi Vantara Pentaho Customer Portal

Best Practices - Big Data - Parsing XML on PDI

Your feedback is important to us!  Email us how we can improve these documents.

Software Version
Pentaho  5.4, 6.x, 7.x
Hadoop Cloudera 5.x
HortonWorks 2.x


We have collected a set of best practice recommendations for different strategies to process and parse XML files stored in a Hadoop cluster. 

Some of the things discussed here include selecting the best method for parsing based on your use case and implementation details for different methods.


   -  Best Practices - Big Data - XML Parsing in PDI

Have more questions? Submit a request


Powered by Zendesk