Hitachi Vantara Pentaho Customer Portal

Big-Data-Plugin Version 1.3.3.1 for Pentaho BA-Server 4.8.1.x and PDI 4.4.1.x

This release of the Pentaho Big Data Plugin adds support for new Hadoop distributions, enhances MongoDB and Cassandra functionality, and adds support for reading and writing data to Splunk.

  • Hadoop distributions:

    The Pentaho big data abstraction layer now supports the following new Hadoop distributions. For a complete list of currently supported distributions, see the support matrix.
    • Cloudera CDH 4.2.1
    • Hortonworks HDP 1.3
    • Intel IHD 2.3
    • MapR 2.1.x
  • New MongoDB Capabilities

    • Read/Write with full support for Replica Sets, ensuring redundancy, backup, and automatic failover
    • Full support for Tag Sets and Preference Modes, enabling operations targeted at specific members
    • Automatic sampling of query result sets and schema generation, relieving the user of the tedious data mapping between MongoDB documents and the ETL transform
    • Full support for the MongoDB aggregation framework, providing a means to calculate aggregated values without having to use map-reduce
    • Improved user experience and performance of JSON parsing
    • Support for MongoDB 2.4
  • New Cassandra Capabilities

    • Added support for CQL-3 while continuing support for CQL-2
    • Ability to use composite keys when writing to Cassandra
    • Query preview capability
    • Ability to set the Time To Live for data being written to Cassandra
    • Support for Cassandra 1.2.4
  • Splunk connectors:

    Pentaho’s new adaptor (the Splunk step) allows reading and writing data to Splunk, one of the most popular software applications for searching, monitoring, and analyzing machine-generated big data.
  • Instaview Templates

    New Instaview templates have been added for Impala, MongoDB, and Splunk.

 

Prerequisites

This plugin is based on BA-Suite 4.8.1.x GA and PDI 4.4.1.x GA.

Before you can install this plugin, you must upgrade to the versions mentioned above.

 

Upgrading the Big Data Plugin

Before you begin, please make sure you follow these instructions completely.
When asked to move files and folders out of the software folder structure, please
do exactly as requested: for example, move them to /tmp or to your home folder.
Failure to follow these directions can prevent the software from starting properly, or
make it appear that the upgrade was not applied.

 

PDI-Client

1- Stop the Spoon client if it's running.

2- Move the following folder out of the data-integration folder structure:
data-integration/plugins/pentaho-big-data-plugin

3- Move the following files out of the data-integration folder structure if they exist:
data-integration/libext/JDBC/pentaho-hadoop-hive-jdbc-shim-1.3.0.jar
data-integration/libext/JDBC/pentaho-hadoop-hive-jdbc-shim-1.3.1.jar
data-integration/libext/JDBC/pentaho-hadoop-hive-jdbc-shim-1.3.2.jar

4- Unzip the file pentaho-big-data-plugin-shimtastic-1.3.3.1.zip into the data-integration/plugins folder.

5- Optionally, remove irrelevant folders under data-integration/plugins/pentaho-big-data-plugin/hadoop-configurations.

6- Copy the file pentaho-hadoop-hive-jdbc-shim-1.3.3.jar into the folder 
data-integration/libext/JDBC

7- Unzip the file pentaho-instaview-templates-shimtastic-1.3.3.zip to the following directory:

    data-integration/plugins/spoon/agile-bi/platform/pentaho-solutions/system/instaview/templates/Big Data

Done!
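
Steps 2 and 3 above can be scripted. The following is a minimal Python sketch, not a Pentaho-provided tool: the function name move_old_artifacts and the choice of backup location are assumptions made for illustration. It refuses a backup location inside the install folder, because the instructions require the old files to leave the software folder structure entirely.

```python
import shutil
from pathlib import Path

# Old shim versions named in step 3 of the PDI-Client procedure.
OLD_SHIM_VERSIONS = ("1.3.0", "1.3.1", "1.3.2")


def move_old_artifacts(install_root, backup_dir):
    """Move the old plugin folder and old shim jars out of install_root.

    install_root is the data-integration folder; backup_dir is any folder
    outside it (for example under /tmp or your home folder).
    Returns the list of new locations of the items that were moved.
    """
    install_root = Path(install_root).resolve()
    backup_dir = Path(backup_dir).resolve()

    # The backup location must be outside the software folder structure.
    if backup_dir == install_root or install_root in backup_dir.parents:
        raise ValueError("backup_dir must be outside the install folder")
    backup_dir.mkdir(parents=True, exist_ok=True)

    candidates = [install_root / "plugins" / "pentaho-big-data-plugin"]
    candidates += [
        install_root / "libext" / "JDBC" / f"pentaho-hadoop-hive-jdbc-shim-{v}.jar"
        for v in OLD_SHIM_VERSIONS
    ]

    moved = []
    for path in candidates:
        if path.exists():  # the jars are moved only "if they exist"
            target = backup_dir / path.name
            shutil.move(str(path), str(target))
            moved.append(target)
    return moved
```

A call such as move_old_artifacts("data-integration", "/tmp/pdi-backup") would cover steps 2 and 3 in one go; use an empty backup folder so that nothing already stored there collides with the moved items.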

 

PDI-Server

1- Stop DI Server if it's running.

2- Move the following folder out of the DI Server folder structure:
data-integration-server/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin

3- Move the following files out of the DI Server folder structure if they exist:
data-integration-server/tomcat/webapps/pentaho-di/WEB-INF/lib/pentaho-hadoop-hive-jdbc-shim-1.3.0.jar
data-integration-server/tomcat/webapps/pentaho-di/WEB-INF/lib/pentaho-hadoop-hive-jdbc-shim-1.3.1.jar
data-integration-server/tomcat/webapps/pentaho-di/WEB-INF/lib/pentaho-hadoop-hive-jdbc-shim-1.3.2.jar

4- Unzip the file pentaho-big-data-plugin-shimtastic-1.3.3.1.zip into the data-integration-server/pentaho-solutions/system/kettle/plugins folder.

5- Optionally, remove irrelevant folders under data-integration-server/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin/hadoop-configurations.

6- Copy the file pentaho-hadoop-hive-jdbc-shim-1.3.3.jar into the folder 
data-integration-server/tomcat/webapps/pentaho-di/WEB-INF/lib

Done!
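
Steps 4 and 6 (unzip the plugin, then copy the new shim jar) can be sketched in Python as follows. This is an illustration under assumptions, not an official installer: the function name install_plugin is hypothetical, and the sketch assumes the zip file contains a top-level pentaho-big-data-plugin folder, which is what the hadoop-configurations path in step 5 implies.

```python
import shutil
import zipfile
from pathlib import Path

PLUGIN_FOLDER = "pentaho-big-data-plugin"


def install_plugin(plugin_zip, plugins_dir, shim_jar, jdbc_dir):
    """Unzip the plugin into plugins_dir and copy the shim jar into jdbc_dir.

    Returns the path of the copied shim jar.
    """
    plugins_dir = Path(plugins_dir)
    jdbc_dir = Path(jdbc_dir)

    # The old plugin folder must already have been moved away (step 2);
    # extracting over it would mix old and new files.
    if (plugins_dir / PLUGIN_FOLDER).exists():
        raise FileExistsError(f"move the old {PLUGIN_FOLDER} folder out first")
    if not jdbc_dir.is_dir():
        raise NotADirectoryError(f"JDBC library folder not found: {jdbc_dir}")

    # Step 4: extract the archive so that its contents land in plugins_dir.
    with zipfile.ZipFile(plugin_zip) as archive:
        archive.extractall(plugins_dir)

    # Step 6: copy the new shim jar next to the other JDBC libraries.
    return Path(shutil.copy2(shim_jar, jdbc_dir))
```

For the DI Server, plugins_dir would be data-integration-server/pentaho-solutions/system/kettle/plugins and jdbc_dir would be data-integration-server/tomcat/webapps/pentaho-di/WEB-INF/lib.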

 

BI Server

1- Stop BI Server if it's running.

2- Move the following folder out of the BI Server folder structure:
pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin

3- Move the following files out of the BI Server folder structure if they exist:
tomcat/webapps/pentaho/WEB-INF/lib/pentaho-hadoop-hive-jdbc-shim-1.3.0.jar
tomcat/webapps/pentaho/WEB-INF/lib/pentaho-hadoop-hive-jdbc-shim-1.3.1.jar
tomcat/webapps/pentaho/WEB-INF/lib/pentaho-hadoop-hive-jdbc-shim-1.3.2.jar

4- Unzip the file pentaho-big-data-plugin-shimtastic-1.3.3.1.zip into the pentaho-solutions/system/kettle/plugins folder.

5- Optionally, remove irrelevant folders under pentaho-big-data-plugin/hadoop-configurations.

6- Copy the file pentaho-hadoop-hive-jdbc-shim-1.3.3.jar into the folder
tomcat/webapps/pentaho/WEB-INF/lib

Done!
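
The optional step 5 (removing irrelevant folders under hadoop-configurations) could be sketched as below. The function name prune_hadoop_configurations is hypothetical, and the sketch moves unwanted folders to a separate location instead of deleting them, which is a cautious design choice and not something the instructions require. The folder names you pass in keep must match whatever folders your installation actually contains.

```python
import shutil
from pathlib import Path


def prune_hadoop_configurations(config_root, keep, discard_dir):
    """Move every configuration folder not listed in keep into discard_dir.

    config_root is the hadoop-configurations folder of the plugin.
    Returns the sorted names of the folders that were moved away.
    """
    config_root = Path(config_root)
    discard_dir = Path(discard_dir)
    keep = set(keep)

    present = {p.name for p in config_root.iterdir() if p.is_dir()}
    missing = keep - present
    if missing:
        # Guard against a typo that would otherwise remove every folder.
        raise ValueError(f"folders to keep not found: {sorted(missing)}")

    discard_dir.mkdir(parents=True, exist_ok=True)
    removed = []
    for name in sorted(present - keep):
        shutil.move(str(config_root / name), str(discard_dir / name))
        removed.append(name)
    return removed
```

Checking that every name in keep exists before moving anything means a misspelled folder name stops the run instead of emptying the hadoop-configurations folder.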

Report Designer

1- Stop Report Designer if it's running.

2- Move the following folder out of the Report Designer folder structure:
report-designer/plugins/pentaho-big-data-plugin

3- Move the following files out of the Report Designer folder structure if they exist:
report-designer/lib/jdbc/pentaho-hadoop-hive-jdbc-shim-1.3.0.jar
report-designer/lib/jdbc/pentaho-hadoop-hive-jdbc-shim-1.3.1.jar
report-designer/lib/jdbc/pentaho-hadoop-hive-jdbc-shim-1.3.2.jar

4- Unzip the file pentaho-big-data-plugin-shimtastic-1.3.3.1.zip into the report-designer/plugins folder.

5- Optionally, remove irrelevant folders under pentaho-big-data-plugin/hadoop-configurations.

6- Copy the file pentaho-hadoop-hive-jdbc-shim-1.3.3.jar into the folder
report-designer/lib/jdbc

Done!
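
Because a partial upgrade can look as if it was never applied, it may help to check the result before restarting. The following Python sketch only inspects the file system; the function name check_upgrade is an assumption for illustration, and the checks mirror the steps above (plugin folder present, old shim jars gone, new shim jar in place).

```python
from pathlib import Path

OLD_SHIM_VERSIONS = ("1.3.0", "1.3.1", "1.3.2")
NEW_SHIM_JAR = "pentaho-hadoop-hive-jdbc-shim-1.3.3.jar"


def check_upgrade(plugins_dir, jdbc_dir):
    """Return a list of human-readable problems; an empty list means none found."""
    plugins_dir = Path(plugins_dir)
    jdbc_dir = Path(jdbc_dir)
    problems = []

    # Step 4 should have created the plugin folder.
    if not (plugins_dir / "pentaho-big-data-plugin").is_dir():
        problems.append("plugin folder missing: pentaho-big-data-plugin")

    # Step 3 should have removed every old shim jar.
    for version in OLD_SHIM_VERSIONS:
        old_jar = jdbc_dir / f"pentaho-hadoop-hive-jdbc-shim-{version}.jar"
        if old_jar.exists():
            problems.append(f"old shim jar still present: {old_jar.name}")

    # Step 6 should have added the new shim jar.
    if not (jdbc_dir / NEW_SHIM_JAR).is_file():
        problems.append(f"new shim jar missing: {NEW_SHIM_JAR}")

    return problems
```

For Report Designer this would be called as check_upgrade("report-designer/plugins", "report-designer/lib/jdbc"). The check says nothing about whether the software starts; it only confirms the files are where the procedure puts them.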

 

Metadata Editor

1- Stop the Metadata Editor if it's running.

2- Move the following folder out of the Metadata Editor folder structure:
plugins/pentaho-big-data-plugin

3- Move the following files out of the Metadata Editor folder structure if they exist:
libext/JDBC/pentaho-hadoop-hive-jdbc-shim-1.3.0.jar
libext/JDBC/pentaho-hadoop-hive-jdbc-shim-1.3.1.jar
libext/JDBC/pentaho-hadoop-hive-jdbc-shim-1.3.2.jar

4- Unzip the file pentaho-big-data-plugin-shimtastic-1.3.3.1.zip into the plugins folder.

5- Optionally, remove irrelevant folders under pentaho-big-data-plugin/hadoop-configurations.

6- Copy the file pentaho-hadoop-hive-jdbc-shim-1.3.3.jar into the folder
libext/JDBC

Done!
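
The five procedures above differ only in two locations: the folder that holds pentaho-big-data-plugin and the folder that holds the shim jar. As a summary, here is a small Python sketch that records those locations in one table. The names Layout, LAYOUTS and old_artifacts are hypothetical; the paths are copied from the steps above and are relative, exactly as written there (the BI Server and Metadata Editor paths are relative to the product's own folder).

```python
from pathlib import PurePosixPath
from typing import NamedTuple

OLD_SHIM_VERSIONS = ("1.3.0", "1.3.1", "1.3.2")


class Layout(NamedTuple):
    plugins_dir: PurePosixPath  # folder containing pentaho-big-data-plugin
    jdbc_dir: PurePosixPath     # folder containing the shim jar


LAYOUTS = {
    "PDI Client": Layout(
        PurePosixPath("data-integration/plugins"),
        PurePosixPath("data-integration/libext/JDBC"),
    ),
    "DI Server": Layout(
        PurePosixPath("data-integration-server/pentaho-solutions/system/kettle/plugins"),
        PurePosixPath("data-integration-server/tomcat/webapps/pentaho-di/WEB-INF/lib"),
    ),
    "BI Server": Layout(
        PurePosixPath("pentaho-solutions/system/kettle/plugins"),
        PurePosixPath("tomcat/webapps/pentaho/WEB-INF/lib"),
    ),
    "Report Designer": Layout(
        PurePosixPath("report-designer/plugins"),
        PurePosixPath("report-designer/lib/jdbc"),
    ),
    "Metadata Editor": Layout(
        PurePosixPath("plugins"),
        PurePosixPath("libext/JDBC"),
    ),
}


def old_artifacts(component):
    """List the folder and jars that steps 2 and 3 move out for a component."""
    layout = LAYOUTS[component]
    paths = [layout.plugins_dir / "pentaho-big-data-plugin"]
    paths += [
        layout.jdbc_dir / f"pentaho-hadoop-hive-jdbc-shim-{version}.jar"
        for version in OLD_SHIM_VERSIONS
    ]
    return paths
```

Calling old_artifacts("Metadata Editor") returns the plugin folder followed by the three old shim jar paths listed in steps 2 and 3 of that procedure.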

 

 

 

Download: pentaho-big-data-1331
