Your feedback is important to us! Email us how we can improve these documents.
We have collected a set of best practice recommendations for different strategies to process and parse XML files stored in a Hadoop cluster.
Keep these Pentaho Architecture principles in mind while you are working through this document:
- Architecture is important, above all else.
- Platforms are always evolving: sometimes you will have to think creatively.
Some of the things discussed here include selecting the best method for parsing based on your use case and implementation details for different methods.