Cloudera Data Hub Upgrade
Introduction
This document describes the steps for upgrading Gluent Data Platform on Cloudera Data Hub.
Upgrade Software Installation
Important
The software installation must be upgraded on all servers where Gluent Data Platform is installed.
Perform the following actions as the owner of the existing Gluent Data Platform installation.
Back Up Existing Software
Source the Gluent Data Platform environment file, copy the configuration files to the parent directory, and rename the existing installation directory:
$ . <OFFLOAD_HOME>/conf/offload.env
$ cd $OFFLOAD_HOME/..
$ for f in offload/conf/{offload.env,datad.conf,datad.properties,*.xml,*.key}; do [ ! -f $f ] || cp $f .; done
$ mv offload offload.$(date +%Y.%m.%d_%H.%M.%S)
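As a sanity check after the backup, the saved configuration files can be compared with the originals in the renamed directory. The following is a minimal sketch, not something shipped with Gluent Data Platform: `verify_conf_backup` is a hypothetical helper name, and it assumes the base directory and the renamed installation directory are passed as arguments.

```shell
# Hypothetical helper (not part of Gluent Data Platform).
# Compares each configuration file found in the renamed installation's conf
# directory with the copy saved in the base directory.
# Usage: verify_conf_backup <base_dir> <renamed_offload_dir>
verify_conf_backup() {
    base_dir=$1
    old_home=$2
    status=0
    for f in "$old_home"/conf/offload.env "$old_home"/conf/datad.conf \
             "$old_home"/conf/datad.properties "$old_home"/conf/*.xml \
             "$old_home"/conf/*.key; do
        # Skip patterns that matched nothing and files that do not exist
        [ -f "$f" ] || continue
        name=$(basename "$f")
        # cmp -s fails if the saved copy is absent or differs
        if cmp -s "$f" "$base_dir/$name"; then
            echo "OK $name"
        else
            echo "MISMATCH $name"
            status=1
        fi
    done
    return $status
}
```

Run from the base directory, a call such as `verify_conf_backup . offload.<timestamp>` would report each file, where `offload.<timestamp>` is the directory produced by the `mv` command above. A non-zero return status indicates that at least one saved copy is absent or differs.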
Unpack New Software
Unpack the install tarball (gluent_offload_<version>.tar.bz2):
Note
When unpacking, an offload directory will be created if it does not exist. The offload directory is referred to as <OFFLOAD_HOME> and an environment variable ($OFFLOAD_HOME) will be set when offload.env is sourced.
$ cd <Gluent Data Platform Base Directory>
$ tar xpf <Gluent Data Platform Installation Media Directory>/gluent_offload_<version>.tar.bz2
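Before unpacking, it can be useful to list the top-level entries of the tarball and confirm that it will create the offload directory. This is a sketch only: `tarball_top_level` is a hypothetical helper, and it assumes a tar implementation that detects bzip2 compression automatically when reading (GNU tar and bsdtar do).

```shell
# Hypothetical helper: print the distinct top-level entries of an archive.
# Assumes tar auto-detects compression on read; if it does not,
# "tar tjf" can be used for a .tar.bz2 file instead.
# Usage: tarball_top_level <path_to_tarball>
tarball_top_level() {
    tar tf "$1" | cut -d/ -f1 | sort -u
}
```

Given the note above, the output for the install tarball would be expected to include `offload`.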
Restore Configuration Files
Copy the backed-up configuration files into the new $OFFLOAD_HOME:
$ for f in {offload.env,datad.conf,datad.properties,*.xml,*.key}; do [ ! -f $f ] || cp $f offload/conf; done
Upgrade User-Defined Functions
If Gluent Data Platform has been installed on a server in addition to the Oracle Database server, the connect command to create the user-defined functions (UDFs) detailed below should be run from that server. Otherwise, run this command using the Gluent Data Platform installation on an Oracle Database server.
Tip
By default, UDFs are created in the default Impala database. This database can be changed by specifying the database name in the OFFLOAD_UDF_DB parameter in offload.env.
The storage location of the library that is referenced by the Gluent UDFs is determined by the values of parameters in offload.env. See Integrating with Cloud Storage. Ad hoc overrides to a different cloud or HDFS location are available with the --offload-fs-scheme, --offload-fs-container, --offload-fs-prefix and --hdfs-home parameters of the connect --install-udfs command.
To create the UDFs, run the supplied connect command with the --install-udfs option:
$ cd $OFFLOAD_HOME/bin
$ . ../conf/offload.env
$ ./connect --install-udfs
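When an ad hoc storage override is needed, the override options described in the tip above are added to the same command. The sketch below only assembles and prints such a command line so it can be reviewed first. `udf_install_command` is a hypothetical helper, the `=value` form is assumed by analogy with the `--sql-file` usage in this document, and the scheme, container and prefix values shown are placeholders rather than recommended settings.

```shell
# Hypothetical helper: print a connect command line with ad hoc storage
# overrides. It does not run connect.
# Usage: udf_install_command <fs_scheme> <fs_container> <fs_prefix>
udf_install_command() {
    echo "./connect --install-udfs" \
        "--offload-fs-scheme=$1" \
        "--offload-fs-container=$2" \
        "--offload-fs-prefix=$3"
}

# Placeholder values for illustration only
udf_install_command s3a my-bucket gluent
```

The printed command would then be run from $OFFLOAD_HOME/bin after sourcing offload.env, as in the standard invocation above.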
Note
In systems using Sentry to control authorization, the ALL ON SERVER/CREATE ON SERVER privilege will be required in order to install UDFs. The privilege can be safely removed once this task is complete.
In systems using Ranger to control authorization, appropriate Ranger permissions are required in order to install UDFs. See Ranger Privileges.
If the user that Gluent Data Platform uses to authenticate to Impala cannot be granted the privileges needed to create UDFs, even temporarily, a script can be generated for execution by a system administrator. Use the --sql-file option to specify a file to which the commands are written instead of being executed:
$ cd $OFFLOAD_HOME/bin
$ . ../conf/offload.env
$ ./connect --install-udfs --sql-file=/tmp/gluent_udfs.sql
The /tmp/gluent_udfs.sql file can then be run by an Impala user with the required Sentry or Ranger privileges.
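One way an administrator might run the generated file is with impala-shell, using its `-i` (impalad address) and `-f` (query file) options. The wrapper below is a sketch under assumptions: `run_udf_script` and the `DRY_RUN` variable are hypothetical names, and clusters secured with Kerberos or TLS will typically need further impala-shell options such as `-k` or `--ssl`.

```shell
# Hypothetical wrapper for running the generated UDF script.
# Usage: run_udf_script <impalad_host:port> <sql_file>
# Set DRY_RUN=1 to print the impala-shell command instead of running it.
run_udf_script() {
    impalad=$1
    sql_file=$2
    # Refuse to continue if the script is missing or empty
    if [ ! -s "$sql_file" ]; then
        echo "SQL file not found or empty: $sql_file" >&2
        return 1
    fi
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "impala-shell -i $impalad -f $sql_file"
    else
        impala-shell -i "$impalad" -f "$sql_file"
    fi
}
```

Reviewing the contents of the file before running it is advisable, since the statements are executed with elevated privileges.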