Your feedback is important to us! Email us to let us know how we can improve these documents.
We have collected a library of best practices, presentations, and videos on realtime data processing on big data with Pentaho Data Integration (PDI).
Our intended audience is solution architects and designers, or anyone with a background in realtime ingestion or in messaging systems such as Java Message Service (JMS), RabbitMQ, or WebSphere MQ.
These materials cover the following software versions:
|Pentaho Business Analytics Suite|8.0|
Here are a couple of downloadable resources for Realtime Data Processing with PDI:
|Guidelines - Realtime Data Processing with PDI|
|Presentation - Realtime Data Processing with PDI|
Realtime Data Processing - 8.0 - Video
|Streaming Kafka Producer 101|
|Streaming Kafka Consumer 101|
|Streaming Kafka Consumer 102 (AEL Spark)|
The following articles were either featured in a webinar or cover closely related topics that you may find of interest:
Pentaho Documentation, Best Practices, and How Tos:
- Kafka and Streaming Ingestion on PDI
- Pentaho Components Reference
- Pentaho Data Integration Best Practices and How Tos
- Apache Kafka
- Apache Spark Streaming
- Matt Casters' Blog (pre-Pentaho 8):