Configuring a Kerberos-secured connection to Hive
Hive is one of the many databases that can be added to the list of data sources available for Talend Data Preparation.
The section Adding a new database type explains how to add new JDBC drivers to enrich the list of databases available from Talend Data Preparation. However, this specific example focuses on how to configure a direct connection from your Hive database to Talend Data Preparation. An additional configuration step allows you to secure this connection with Kerberos.
Before you begin
Procedure
Results
In Talend Data Preparation, the Hive database is now available in the database dataset import form, in the Database type drop-down list.
When exporting a preparation made on data stored on your Hive database, you can choose to process the data on the Talend Data Preparation server.
For more information on how to import data from a database, see Adding a dataset from a database.