Preparing Apache Hadoop Hive user data sources for Campaign
Follow these steps to enable Hive-based Apache Hadoop data sources for use in HCL® Campaign.
About this task
Campaign supports Apache Hadoop Hive for customer tables only, not system tables. For details about supported versions, see the Recommended Software Environments and Minimum System Requirements.
| Task | Description |
|---|---|
| A. Install a Hive ODBC driver | You can install the DataDirect Hive ODBC Driver from Progress Software, the Cloudera Hive ODBC driver from Cloudera, Inc., or the Hortonworks Hive ODBC driver from Hortonworks, Inc. |
| B. Configure the Hive ODBC driver | Configuration includes modifying .ini files and setting path values and environment variables. Be sure to follow the appropriate instructions for the driver that you installed. |
| C. Map existing HBase tables to Hive (OPTIONAL) | This step is required only if you have existing tables that were created in Apache HBase. |
| D. Import and configure the BigDataODBCHiveTemplate data source template in Campaign | Use the configTool utility to import the template BigDataODBCHive.xml into Campaign. Then go to Campaign\|partitions\|partition[n]\|dataSources and create a data source based on the imported BigDataODBCHiveTemplate. |
| E. Configure SSH on the Campaign listener server | To enable data file transfers between the Campaign listener (analytic) server and the Hive-based Hadoop big data instance, you must configure SCP and SSH seamless login. |
| F. Map the Hive data source in Campaign | Mapping user tables is the process of making external data sources accessible in Campaign. |
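
The SCP and SSH seamless login requirement in the table above can be checked before you run any flowcharts. The following Python sketch is one possible way to do that and is not part of the product: it runs the standard OpenSSH client with `BatchMode=yes`, so the connection fails instead of prompting for a password when key-based login is not in place. The function names, the `campaign` account, and the `hive-node.example.com` host are hypothetical placeholders, and the sketch assumes that the `ssh` client is on the PATH of the listener server.

```python
import subprocess


def build_ssh_check_command(user, host, timeout_seconds=5):
    """Build an ssh command that fails instead of prompting for a password."""
    return [
        "ssh",
        "-o", "BatchMode=yes",  # never prompt; fail if key-based login is not set up
        "-o", f"ConnectTimeout={timeout_seconds}",
        f"{user}@{host}",
        "true",  # trivial remote command; only the exit status matters
    ]


def has_passwordless_ssh(user, host, timeout_seconds=5):
    """Return True if non-interactive SSH login to user@host succeeds."""
    command = build_ssh_check_command(user, host, timeout_seconds)
    try:
        result = subprocess.run(
            command,
            capture_output=True,
            timeout=timeout_seconds + 5,  # hard stop if the client hangs
        )
    except (OSError, subprocess.TimeoutExpired):
        # ssh is not installed, could not be started, or did not finish in time
        return False
    return result.returncode == 0
```

For example, calling `has_passwordless_ssh("campaign", "hive-node.example.com")` as the operating system user that runs the Campaign listener returns `False` if that user would be prompted for a password. In that case, revisit the SSH key setup before you map tables.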