Incremental Update

Documentation Conventions

  • Commands and keywords are in this font.

  • $OFFLOAD_HOME is set when the environment file (offload.env) is sourced, unless already set, and refers to the directory named offload that is created when the software is unpacked. This is also referred to as <OFFLOAD_HOME> in sections of this guide where the environment file has not been created/sourced.

  • Third party vendor product names might be aliased or shortened for simplicity. See Third Party Vendor Products for cross-references to full product names and trademarks.

Introduction

Incremental Update is a data synchronization feature of Gluent Offload Engine that enables offloaded data to be kept up-to-date with any data modifications (DML) that occur in the RDBMS after the initial offload. It is primarily designed to enable users to fully-offload tables such as dimensions to Hadoop - typically these tables continue to be modified after offload and therefore need to be frequently synchronized. Incremental Update can also be used to make updates to offloaded data that no longer exists in the RDBMS (for example, to modify data in partitions that were offloaded to Hadoop and since dropped from the RDBMS). The use-cases and options for Incremental Update are described in this guide.

Note

Incremental Update is currently supported for use with Hadoop-based backends only (such as Cloudera Data Hub). Therefore, this guide describes interactions with the backend specifically in terms of Hadoop.

When to Use Incremental Update

Incremental Update is designed to support two primary use-cases:

  1. Full offloading of dimensions that are subject to regular updates after offload

  2. Correcting (or “patching”) offloaded data at any time after offload, even if the corresponding data is no longer present in the RDBMS

There are many other ways that Incremental Update can be used, but it is important to note that it is intended to be used for relatively low-volume changes. It is not designed to be a high-volume, low-latency replication tool for tables with significant rates of change. For example, it is not designed to replace Partition-Based Offload (also known as Incremental Partition Append), although it can be used to change data in previously offloaded partitions. The use-cases and supporting options will become clearer throughout this document.

How Incremental Update Works

Incremental Update is managed by Gluent Offload Engine and can be enabled for tables that have already been offloaded or as part of an initial offload. When Incremental Update is enabled, additional database objects are created (in both RDBMS and Hadoop) to track and store changes to the offloaded data. Changes are periodically synchronized to Hadoop and queried with the base offloaded data via a dedicated view, until they are eventually merged with the original offloaded data.

Incremental Update Phases

There are three phases to Incremental Update:

Phase 1: Capture

The first phase of Incremental Update is capture: that is, the identification and recording of DML changes to an offloaded table. When enabling Incremental Update, an extraction method must be specified. This tells Gluent Offload Engine which RDBMS objects need to be created to track and capture future DML changes to the application table that is to be (or has already been) offloaded. There are six extraction methods: some apply to all offloaded table types and some are more specialized.

Incremental Update also requires a primary key to ensure that changes are applied to the correct records in Hadoop. If the offloaded table does not have a primary key, a set of columns that can be used as a logical key must be provided when enabling the feature. The primary or notional key is used in the setup of some of the additional RDBMS and Hadoop objects that are created when a table is Incremental Update-enabled.

The majority of the extraction methods require the creation of additional RDBMS objects to track and store changes. Most methods create a changelog table, to store change records in the RDBMS prior to transfer to Hadoop (Phase 2), and a changelog sequence, to journal the DML changes and ensure they are applied in the correct order later on. Depending on the method, additional objects (RDBMS triggers, RDBMS view) are also created.

See also

The Choosing an Extraction Method section assists with selecting a suitable extraction method.

Phase 2: Extraction

The second phase of Incremental Update is extraction: that is, extracting changes from the RDBMS, copying them to Hadoop and applying them to the existing offloaded data. Extractions are run using Offload and can be executed as frequently or infrequently as required, depending on factors such as the workload of the RDBMS application and the latency demands (that is, the acceptable delay between DML changes occurring in the RDBMS and then appearing in Hadoop).

When enabling Incremental Update, all extraction methods create the same Hadoop objects: a delta table and two views known as delta views (one internal, one user-facing). The original offloaded table is renamed to an internal format and is known as a base table and the user-facing delta view is named as the original table. This view is the object that users always query to get the latest state of the data (base + changes) once Incremental Update is in use.

During extraction, change records are stored in the delta table and removed from the RDBMS changelog table (where applicable). The delta table contents are physically merged into the offloaded base table data periodically by a process known as compaction (Phase 3).

Phase 3: Compaction

The final phase of Incremental Update is compaction: a process by which the change records being staged in the delta table are physically merged with the data in the base table. Until a compaction takes place, the delta view (used for all queries) has to join the base table with the delta table to merge the change records with the base data. Extractions continually load data into the delta table and as the volume increases, so does the work that the delta view has to perform. Compaction provides a physical reset, whereby a new base table (or new partitions in the base table, depending on the chosen method) are created out of the existing base and delta tables and the delta table is cleared down.

See also

More details on compaction, together with examples, are provided later in Merging Changes (Running a Compaction).

Note

The delta table, delta view and compaction process are needed because Hadoop does not support updates to existing data. The database objects that Gluent Offload Engine creates provide the latest-state view of the data (i.e. the same results that would be available if the change records were actually applied as DML). The compaction process optimizes data access by clearing down the delta table and therefore reducing the work that the delta view needs to do to join the base and delta tables.

Enabling Incremental Update

Incremental Update is enabled per table with the offload command. It can be enabled for a table at the time of an initial offload or at a later stage if required (the command is the same). It is recommended that Incremental Update is enabled for a table only when it is necessary and not used as a default option for all offloaded tables.

The following examples demonstrate how to enable Incremental Update for a table.

Example 1: Enable Incremental Update

$ . $OFFLOAD_HOME/conf/offload.env

$ $OFFLOAD_HOME/bin/offload -t SH.PRODUCTS -x --incremental-updates-enabled --incremental-extraction-method=CHANGELOG

This example chooses the CHANGELOG extraction method as an example of how to use the --incremental-extraction-method option. To choose an alternative extraction method, simply substitute CHANGELOG with the relevant value. Information on all extraction methods directly follows the examples.

Note

This example includes the sourcing of the environment file for an Offload session. All other examples in this guide will assume this step has occurred.

Example 2: Enable Incremental Update (No Primary Key)

As described earlier, if the table being offloaded does not have a primary key, the notional key to use must be provided, as the following example demonstrates.

$ $OFFLOAD_HOME/bin/offload -t SH.PRODUCTS -x \
     --incremental-updates-enabled \
     --incremental-extraction-method=CHANGELOG \
     --incremental-key-columns=PROD_ID

Failure to provide an Incremental Update key will result in an exception and the offload will terminate before any work is done.

Choosing an Extraction Method

There are six extraction methods: three basic (common) and three specialist. Choosing the correct extraction method for the offload use-case is important, as it can ensure the best performance and minimize the impact on the system. It is also important to recognize that the specialist extraction methods require an understanding of the implications of their use and care while using. This is covered below.

The following tables summarize the six methods, when to use them, which DML operations they support and which offload type they apply to.

Table 1: Incremental Update Extraction Methods

Extraction Method

Basic

Supported DML Capture

Supported Offload Types 1

When To Use

INSERT

UPDATE

DELETE

MERGE

Full (100/0)

Full (100/10)

Incremental (90/10)

ORA_ROWSCN

Yes

Yes

Yes

No

No

Yes

Yes

No

Changes are INSERT/UPDATE only and the RDBMS table has ROWDEPENDENCIES set 2

CHANGELOG

Yes

Yes

Yes

Yes

Yes

Yes

Yes

No

The RDBMS table undergoes changes of all DML types

UPDATABLE

Yes

Yes

Yes

Yes

No

Yes

Yes

Yes

Changes need to be made to offloaded data in Hadoop 3, for example if:

* the offloaded data is no longer in the RDBMS

* a trigger on the application table (i.e. the CHANGELOG method) is undesirable. In this case, any DMLs on the RDBMS data must be repeated for the equivalent offloaded Hadoop data

UPDATABLE_CHANGELOG

No

Yes

Yes

Yes

CHANGELOG only

Yes

Yes

No

Changes need to be made to offloaded data that both does and doesn’t still exist in the RDBMS:

* CHANGELOG captures changes to offloaded data that still exists in the RDBMS

* UPDATABLE captures changes to offloaded data that no longer exists in the RDBMS

CHANGELOG_INSERT

No

Yes

No

No

No

Yes

Yes

No

The RDBMS table is modified with INSERT only 4

UPDATABLE_INSERT

No

Yes

No

No

No

Yes

Yes

Yes

Data sometimes needs to be added to partitions that have been offloaded to Hadoop but no longer exist in the RDBMS. The data cannot be inserted into the RDBMS table

1

Offload type refers to the percentage of a table’s data that is offloaded. There are two offload types (INCREMENTAL and FULL) that can be used to support three offload use-cases (known as 90/10 (INCREMENTAL) 100/0 (FULL), 100/10 (FULL)). See the Glossary and Offload Type for more details.

2

The ORA_ROWSCN extraction method can only be used for Oracle Database tables that have the ROWDEPENDENCIES property set. This enables DML changes to be identified at row-level, rather than block-level, and prevents the transfer of too many unchanged records to Hadoop.

3

Incremental Update operations using any of the UPDATABLE* extraction methods is known as data patching. It is generally intended to enable corrections and updates to offloaded data that may or may not exist in the RDBMS. It is also important to note that data patching is performed using hybrid objects in the RDBMS, despite it affecting data in Hadoop.

4

The *INSERT extraction methods are lightweight and cater purely for insert-only workloads. Using this method asserts that no other DML operations are expected and these will not be captured. These methods also have an implication for In-Place Compaction that is described later.

Table 2: Incremental Update Extraction Methods and Database Objects

The following table describes the six extraction methods and the additional RDBMS and Hadoop objects they create.

Extraction Method

Additional Hybrid Objects

RDBMS (Hybrid Schema)

Hadoop

Object Type

Naming Format

Object Type

Naming Format 5

ORA_ROWSCN

None

None

Base Table 6

<SCHEMA>.<TABLE>_DATA

Delta Table

<SCHEMA>_LOAD.<TABLE>_CHANGES

Delta View (User)

<SCHEMA>.<TABLE> 6

Delta View (Internal)

<SCHEMA>.<TABLE>_INTERNAL_USER_VIEW_<N>

CHANGELOG

Changelog Table

<SCHEMA>_H.<TABLE>_LOG

As ORA_ROWSCN

Changelog Sequence

<SCHEMA>_H.<TABLE>_SEQ

Changelog Trigger 7

<SCHEMA>_H.<TABLE>

UPDATABLE

Changelog Table

<SCHEMA>_H.<TABLE>_LOG

As ORA_ROWSCN

Changelog Sequence

<SCHEMA>_H.<TABLE>_SEQ

Updatable View

<SCHEMA>_H.<TABLE>_UV

Updatable Trigger 8

<SCHEMA>_H.<TABLE>_UV

UPDATABLE_CHANGELOG

As CHANGELOG and UPDATABLE

As ORA_ROWSCN

CHANGELOG_INSERT

As CHANGELOG

As ORA_ROWSCN

UPDATABLE_INSERT

As UPDATABLE

As ORA_ROWSCN

5

Objects created to support Incremental Update are named according to the name of the application SCHEMA.TABLE, according to the suffixes shown in the above table. If a suffix extends an object name beyond the supported character limits of the database platform, a hash is used instead to keep the derived name within that limit. The --incremental-changelog-table option can be used to override the name given to the changelog table.

6(1,2)

When Incremental Update is enabled, the base table (i.e. the offloaded version of the RDBMS table stored in Hadoop) is renamed and the user-facing delta view takes on the original offloaded table name. This ensures that existing queries do not need to be modified and they continue to see the latest state of the data once Incremental Update is enabled.

7

The CHANGELOG, UPDATABLE_CHANGELOG and CHANGELOG_INSERT extraction methods create a trigger on the application table. The trigger is created in the Hybrid Schema and fires before insert/update/delete for each row modification to the application table.

8

The UPDATABLE, UPDATABLE_CHANGELOG and UPDATABLE_INSERT extraction methods create a trigger on a specialist updatable view in the hybrid schema. Any modifications to the application table are not captured by this trigger.

Capturing Changes with Different Extraction Methods

The extraction methods capture data modifications in different ways, based on their design and the objects that are created for change tracking.

Capturing Changes with ORA_ROWSCN

With the ORA_ROWSCN extraction method, INSERT and UPDATE changes are automatically captured by Oracle Database’s multiversion read consistency model, so there are no specific user-actions required to ensure that data modifications can be extracted to Hadoop. As noted previously, to ensure the accuracy of this data capture mechanism, the RDBMS table must have been created with ROWDEPENDENCIES. This ensures that changes are tracked at row-level, rather than block-level, for the table. This method does not support DELETE operations.

Capturing Changes with CHANGELOG

With the CHANGELOG extraction method, changes are automatically tracked by a trigger (changelog trigger) that is created on the application table when Incremental Update is enabled. The trigger is row-level and fires before any DML operations on the target table, recording all modifications in a changelog table in the hybrid schema. No specific user-actions are required to capture changes with this method.

Capturing Changes with UPDATABLE

With the UPDATABLE extraction method, changes are tracked by a specialist trigger and view (updatable trigger and updatable view) that are created in the hybrid schema when Incremental Update is enabled. The primary use-case is to modify data that exists in Hadoop but no longer exists in the RDBMS, known as data patching. data patching is executed on the RDBMS, using the hybrid objects as described, and is one of only two extraction methods that is supported for tables that are offloaded with Partition-Based Offload.

To capture changes for eventual extraction to Hadoop, DML statements are made against the updatable view in the hybrid schema. This view is usually suffixed with _UV. When a DML statement is executed against the updatable view, a trigger of the same name fires and captures the modifications and stores them in the changelog table.

Example 3: Capturing Changes with UPDATABLE

In the following example, suppose that most of the partitioned SH.SALES table has been offloaded to Hadoop and the offloaded partitions have been dropped from the RDBMS. It is later discovered that one of the offloaded partitions has incorrect sales amounts and they need to be corrected. The partition is no longer in the RDBMS and therefore needs to be corrected in Hadoop. The updatable view is used as the target for this data:

SQL> UPDATE sh_h.sales_uv
     SET    amount_sold = amount_sold * 1.1
     WHERE  time_id BETWEEN DATE '2017-01-01' AND DATE '2017-01-31';

SQL> COMMIT;

The updatable view will retrieve the required data from Hadoop and the updatable trigger will fire on the update and modify the values as required. The updated rows will be stored in the changelog table in the RDBMS. The changes will not be visible in Hadoop until the next extraction takes place.

Important

Care must be taken when using the UPDATABLE extraction method on tables that are offloaded periodically with Partition-Based Offload. In such cases, users must not use the updatable view to modify data in partitions that have not yet been offloaded. Doing so will prevent the remainder of the partition from being offloaded at the next Partition-Based Offload operation.

Capturing Changes with UPDATABLE_CHANGELOG

As its name suggests, the UPDATABLE_CHANGELOG extraction method covers both the CHANGELOG and UPDATABLE use-cases (remembering that CHANGELOG is primarily used for propagating regular RDBMS changes to Hadoop and UPDATABLE is primarily used for infrequently patching offloaded Hadoop data). Changes to the original RDBMS application table are automatically tracked by the CHANGELOG objects and require no user actions, while changes made to the updatable view are tracked by the updatable trigger (see Capturing Changes with UPDATABLE for an example). Both capture mechanisms store the change records in the same changelog table.

Important

When using the UPDATABLE_CHANGELOG extraction method, the target data for DMLs must not overlap. The UPDATABLE part of this extraction method is purely for patching data that exists in Hadoop and must never be used to repeat DMLs that have already been captured by the changelog trigger. Doing so would risk incorrect change records being propagated to Hadoop. Any changes to the RDBMS application table will automatically be captured by the changelog trigger so must not be repeated on the updatable view.

Capturing Changes with CHANGELOG_INSERT

The CHANGELOG_INSERT extraction method is an optimization to the CHANGELOG method for use with insert-only workloads. With this method, the changelog trigger captures inserts into the RDBMS application table and stores them in the changelog table ready for extraction to Hadoop. No specific user-actions are required to capture the changes, but note that UPDATE, DELETE and MERGE operations will not be captured by this method.

The optimization with this extraction method is realized in the delta view. In this case, the delta view performs a lightweight UNION ALL operation between the delta table and the base table, unlike the more general extraction methods that setup a more complex delta view that merges the delta table with the base table. Because the delta view does not merge the change records with the base data when using CHANGELOG_INSERT, the application table must have a primary key to prevent duplicate keys. Users can override this with the --incremental-no-verify-primary-key option, but in doing so must guarantee that the application does not insert duplicate data. It is recommended that this option is not used.

The CHANGELOG_INSERT optimization also has implications for the downtime needed for in-place compaction. See In-Place Compaction for details.

Capturing Changes with UPDATABLE_INSERT

The UPDATABLE_INSERT extraction method is an optimization to the UPDATABLE method for use with insert-only data patching requirements. With this method, data to be added to the offloaded table in Hadoop is inserted into the updatable view in the hybrid schema and the updatable trigger captures and stores the new records in the changelog table, ready for extraction to Hadoop.

The optimization with this extraction method is realized in the delta view. In this case, the delta view performs a lightweight UNION ALL operation between the delta table and the base table, unlike the more general extraction methods that setup a more complex delta view that merges the delta table with the base table. Because the delta view does not merge the change records with the base data when using UPDATABLE_INSERT, the application table must have a primary key to prevent duplicate keys. Users can override this with the --incremental-no-verify-primary-key option, but in doing so must guarantee that the application does not insert duplicate data into the updatable view. It is recommended that this option is not used.

The UPDATABLE_INSERT optimization also has implications for the downtime needed for in-place compaction. See In-Place Compaction for details.

Example 4: Capturing Changes with UPDATABLE_INSERT

In the following example, suppose that most of the partitioned SH.SALES table has been offloaded to Hadoop and the offloaded partitions have been dropped from the RDBMS. It is later discovered that one of the offloaded partitions has missing sales information that needs to be added. The partition is no longer in the RDBMS and therefore needs to be added to Hadoop. The updatable view is used as the target for this data:

SQL> -- Add individual rows
SQL> INSERT INTO sh_h.sales_uv (...columns...) VALUES (...values...);

SQL> -- Add results of a query
SQL> INSERT INTO sh_h.sales_uv (...columns...) SELECT (...expressions...) FROM ...;

SQL> COMMIT;

The updatable trigger will fire on the inserts and the new rows will be stored in the changelog table in the RDBMS. The changes will not be visible in Hadoop until the next extraction takes place.

Important

Care must be taken when using the UPDATABLE extraction method on tables that are offloaded periodically with Partition-Based Offload. In such cases, users must not use the updatable view to add data to partitions that have not yet been offloaded. Doing so will prevent the remainder of the partition from being offloaded at the next Partition-Based Offload operation.

Note

The UPDATABLE_INSERT optimization has implications for the downtime needed for in-place compaction. See In-Place Compaction for details.

Extracting Changes to Hadoop (Running an Extraction)

Changes that are captured in the RDBMS are not visible to queries until they have been extracted and transferred to Hadoop. While in most cases the capture of changes is automatic, the extraction to Hadoop must either be scheduled or run manually. It is recommended that extractions are scheduled to run frequently to prevent a large backlog of changes. Ultimately, this will depend on the rate of change in the RDBMS and the latency requirements of those parts of the application that use the offloaded data in Hadoop. Changes continue to be captured while an extraction process is running.

Example 5: Running an Extraction

Extractions are executed with an option to offload as follows:

$ $OFFLOAD_HOME/bin/offload -t SH.PRODUCTS -x --incremental-run-extraction

This command will transfer the outstanding change records to the delta table and delete them from the changelog table where applicable.

Example 6: Running an Extraction with Non-Default Threshold

The extraction process is governed by a minimum threshold that determines how many change records must exist before the extraction transfers them (see --incremental-delta-threshold for details). If the number of change records is less than the threshold, the offload command will log this fact and execute silently. The default threshold can be overridden with the --incremental-delta-threshold option. The following example demonstrates how to use this option to extract changes when at least 500 have been detected:

$ $OFFLOAD_HOME/bin/offload -t SH.PRODUCTS -x --incremental-run-extraction --incremental-delta-threshold=500

Merging Changes (Running a Compaction)

Changes that are extracted to Hadoop and stored in the delta table need to be periodically merged with the base table data in a process known as compaction. Until a compaction occurs, the delta table and base table data is merged “on-the-fly” by the delta view; the performance of which degrades over time (i.e. as changes from multiple extractions start to build up). During compaction, a new base table or set of partitions in the base table (depending on the chosen compaction method) are created from the delta view and the previous state is discarded. Compaction is the most complex part of the Incremental Update process, in terms of scheduling (see Scheduling a Compaction) and compaction methods (see Choosing a Compaction Method).

Compaction is required to reduce the impact of having to merge the delta table and base table in the delta view to present the latest state of the data to queries. As this merge is essentially a join between the two tables, it follows that the higher the volume of changes stored in the delta table, the more CPU and I/O resources are needed to merge the data and this is a cost borne by every query that uses the delta view. When the delta volume is relatively low, it is more likely that the join will involve broadcasting the delta set (an efficient mechanism when joining small tables with large tables in Hadoop). As the delta volume increases, it can cross the Hadoop system’s particular threshold for turning broadcast joins into less-efficient alternatives and the performance of the delta view can degrade.

Scheduling a Compaction

Compactions do not need to be scheduled as frequently as extractions. Extractions can run as frequently as required, but compactions should be scheduled to run infrequently (depending on the rate of change) and during quieter system periods or maintenance windows. In this case “infrequently” can be taken to mean a short while before the delta table grows to a size that demonstrates a degradation in performance of the delta view.

By way of an example, consider a dimension table in an RDBMS data warehouse that is fully offloaded to Hadoop with Incremental Update enabled. The dimension is updated throughout each online day, changing approximately 1% of the table’s rows. The Hadoop latency requirements are to have the changes replicated no more than 5 minutes later than the RDBMS. In this example, a suggested approach would be to use the CHANGELOG extraction method for capturing changes; to schedule an extraction to run every 1-2 minutes with an appropriate delta threshold; and to schedule a compaction to run once per week during a quiet system period or maintenance window.

There are three compaction methods available and depending on which method is chosen determines the potential system and application impact of the compaction process (see Choosing a Compaction Method).

Acknowledging Change Ingestion Diversion

Each compaction has many steps to perform and many metadata operations to record. Most of these steps are non-invasive but there is a point beyond which compaction will not proceed until it knows it is safe to do so. This tipping point is the acknowledgment of change ingestion diversion. To understand why this is necessary requires a short explanation of the structure of the delta table and the interaction between extractions and compactions.

The delta table is partitioned into “control partitions”, also known as “change ingestion locations”. Whenever an extraction runs, it checks to see if a compaction is running and if not, the change records continue to be written to the active change ingestion location (i.e. the active control partition). Multiple extractions can write to the same change ingestion location over time. However, if a compaction is underway when an extraction runs, the extraction process creates a new control partition in the delta table and makes this the active change ingestion location for the change records. The previous control partition will not receive any more change records.

When a compaction runs, it performs several tasks before reaching the step where it requires an acknowledgment that change ingestion diversion (i.e. the creation of a new control partition by an extraction process) has taken place. Depending on how compaction is executed (examples follow directly), it will wait on this step until another extraction is executed to perform the change ingestion diversion as described. Once change ingestion diversion is acknowledged, compaction can continue with the remainder of the steps required to compact the data in the static control partition with the base table. This process ensures that no new changes are lost due to extractions running while compaction takes place.

There are two ways to schedule a compaction to manage this acknowledgment step:

Single-Session Compaction

To schedule a single-session compaction process, a sequence of offload commands need to be used to:

  1. Run a compaction with an option to terminate when it reaches the change ingestion diversion acknowledgment step

  2. Run an extraction to create a new change ingestion location (i.e. new control partition)

  3. Run a second compaction to pick up from where the first compaction terminated and complete the process

Example 7 below demonstrates the Incremental Update options required to make this happen.

Example 7: Single-Session Compaction
# Start a compaction
$ $OFFLOAD_HOME/bin/offload -t SH.SALES -x --incremental-run-compaction --incremental-terminate-compaction

# Run an extraction
$ $OFFLOAD_HOME/bin/offload -t SH.SALES -x --incremental-run-extraction

# Run a second compaction
$ $OFFLOAD_HOME/bin/offload -t SH.SALES -x --incremental-run-compaction

Note

The --incremental-terminate-compaction option tells the compaction process to terminate at the acknowledgment stage. It will exit cleanly with a success status code of zero.

Multi-Session Compaction

A multi-session compaction assumes that extractions are already scheduled to run frequently. In this case, only a single offload command is required as shown in Example 8 below. The compaction will reach the change ingestion diversion acknowledgment step and wait until the next extraction executes from another session. The offload log file for the compaction (and stdout when using the -v option) will show that the process is waiting. Once the extraction is done and change diversion is acknowledged, the compaction process continues to completion.

Example 8: Multi-Session Compaction
$ $OFFLOAD_HOME/bin/offload -t SH.SALES -x --incremental-run-compaction

Compaction and Idempotency

Despite having many component steps, each step in the compaction process is idempotent. A compaction process can therefore be restarted after any point of termination, irrespective of which step the termination occurred and whether it was due to failure or a controlled termination by a single-session compaction.

Waiting for Queries to Complete

To minimize its impact on users and applications, the compaction process is designed to be as online an activity as possible. In most cases, the disruptive part of a compaction involves switching the delta views to a new target. These switches are quick and lightweight metadata operations (see In-Place Compaction and Insert Extraction Methods for an important exception). A compaction will wait for a specified period of time for user queries to complete before performing any disruptive operations. By default, this is 15 minutes and can be overridden with the --incremental-wait-time option as shown in example 9 below.

Example 9: Waiting for User Queries During Compaction

In the following example, the query wait time is set to 30 minutes.

$ $OFFLOAD_HOME/bin/offload -t SH.SALES -x --incremental-run-compaction --incremental-wait-time=30

The compaction process itself does not detect any running queries, so the query wait time should be set to a large enough value to comfortably allow application queries to complete. However, it might not always be possible to prevent queries from running when the disruptive compaction steps occur. Setting --incremental-wait-time to 0 will prevent the compaction process from waiting on user queries.

Delta Thresholds

The extraction option --incremental-delta-threshold also applies to compaction. A compaction will terminate immediately (with success but without doing any work) if the number of change records in the delta table are below the threshold. Specifying a suitable threshold with this option will prevent unnecessary compactions. See Example 6: Running an Extraction with Non-Default Threshold for an example of how to use the --incremental-delta-threshold option.

Compaction and Concurrency

More than one table can be compacted at the same time, but each table must only have one compaction process running at any time. A lightweight locking mechanism is used to ensure this, but the lock is node-dependent. This means that to prevent more than one compaction from accidentally being executed against the same table, all compactions should be executed from the same node of the Hadoop cluster or RDBMS server (i.e. wherever Gluent Data Platform is installed and executed). This lock prevents a second compaction for a table from starting when one is already in progress (i.e. to protect against scheduling errors or aggressive scheduling).

Choosing a Compaction Method

Examples 8 and 9 demonstrate the commands needed to run a compaction process for a table, using a default compaction method. There are three compaction methods to choose from:

Choosing a compaction method is based on a number of factors, such as how busy the system is, SLAs, whether compaction needs to be run online or can be an offline operation and so on.

In-Place Compaction

In-place compaction is the default compaction method (used by examples 8 and 9 above). An in-place compaction processes only those partitions that have records with changes in the delta table. Each partition with change records is processed sequentially and the original partition in the base table is replaced with a new compacted set of data. This is a two-step process and for each affected partition:

  1. A compacted copy of the changed partition data is created in the load database

  2. The compacted partition data is written to the base table with an INSERT OVERWRITE statement as an atomic operation

When all partitions have been compacted, new delta views are created to reference the new control partition in the delta table and the previous control partition is removed.

This method is recommended when the number of changed partitions is low or when the base table is too large to create a full copy in a reasonable amount of time or within the resource constraints of the system. In most cases, the in-place compaction method will be quicker than the alternatives and will use fewer system resources to complete.

In-Place Compaction and Insert Extraction Methods

Special attention must be given when using in-place compaction for a table that has Incremental Update enabled with either of the CHANGELOG_INSERT or UPDATABLE_INSERT extraction methods. With these methods, the delta view is non-merging and simply appends the delta table change records to the base table with UNION ALL. This means that for the duration of an in-place compaction, the delta view results will be inconsistent (as each partition is processed, the change records will be duplicated until compaction completes). For this reason, in-place compaction with either of the *_INSERT extraction methods should be considered an offline activity and queries of the delta view should be avoided while compaction is running.

Full Compaction

With the full compaction method, a new base table is created as a full copy of the delta view results (i.e. using Create Table As Select or CTAS) and a new internal delta view is created to reference the new base table and active control partition in the delta table. The final steps are to divert the user-facing delta view to the new objects and remove the previous control partition and base table, minimizing the disruption to the application. A full compaction is enabled by appending the --incremental-full-compaction option to the offload compaction command.

A full compaction is recommended when a significant number/proportion of partitions in the base table have corresponding changes in the delta table and the base table itself can be replicated comfortably within the CPU and storage resources of the Hadoop cluster in a single CTAS operation. It is also recommended as an alternative to an in-place compaction when using either of the *_INSERT extraction methods and a minimal change window is available (see In-Place Compaction and Insert Extraction Methods for details).

This method is not recommended when creating a full copy of the base table would impact other users of the system or risk over-stretching the available resources. If this is an issue, a Partition-Wise Full Compaction would be a better method to use.

Partition-Wise Full Compaction

Like the full compaction, a partition-wise full compaction creates a new base table from the delta view, but does so one partition at a time. Being a full-table compaction, this method requires sufficient free storage to temporarily maintain two base table copies, but has less impact on system resources such as CPU and hence less impact on other users of the cluster. However, a partition-wise full compaction will take longer than a simple full compaction. A partition-wise full compaction is selected by appending the --incremental-partition-wise-full-compaction option to the offload compaction command.

A partition-wise full compaction is recommended when a full compaction is a better option than in-place compaction, but would be too disruptive to other users of the system during the CTAS phase (i.e. when creating a new base table from the delta view).

Saving Pre-Compaction State

Compactions are destructive operations. With either of the full compaction methods, the previous version of the base table is dropped, the previous control partition in the delta table is dropped, the previous internal delta view is dropped and the user-facing delta view is redirected to the new compacted objects. With an in-place compaction, previous versions of partitions are overwritten with new compacted data and the previous control partition in the delta table is dropped.

There are several mechanisms available to save the pre-compaction state of a table:

Saving Pre-Compaction State with Backups

All production systems should have automated backup solutions in place. Pre-compaction state should automatically be captured by whatever backup solution is in use for the Hadoop cluster.

Note

If a table is offloaded to cloud storage (a storage option provided by offload), rather than HDFS, it is essential that timely backups are available prior to running compactions if the previous state of the table is to be retained for any reason. HDFS mechanisms described below do not apply to cloud storage.

Saving Pre-Compaction State with HDFS Snapshots

HDFS snapshots can be integrated with offload to automatically create a snapshot at the start of a compaction process. To enable HDFS snapshots in this way, they must be configured in the offload.env configuration file by setting the HDFS_SNAPSHOT_PATH variable (and HDFS_SNAPSHOT_SUDO_COMMAND if applicable). The HDFS_SNAPSHOT_PATH location must be a snapshottable directory and snapshots will need to be enabled by an HDFS Administrator. When changing the value of HDFS_SNAPSHOT_PATH from its default, the new value must remain a parent directory of HDFS_DATA.

If for any reason HDFS snapshots are configured but unable to be taken, the compaction process will fail with an exception. To continue with the compaction but without attempting to take an HDFS snapshot, the --incremental-run-compaction-without-snapshot option can be appended to the offload command being used.

When the HDFS_SNAPSHOT_PATH variable is unset, HDFS snapshots are disabled and will not be taken.

Note

This option does not apply to tables that have been offloaded to cloud storage.

Saving Pre-Compaction State with HDFS Trash

When HDFS trash is enabled for a cluster, the pre-compaction version of files for an offloaded table will be saved to trash by each destructive operation performed by a compaction. This includes the dropping of the previous base table by full or partition-wise full compactions, the dropped control partition in the delta table and even the previous versions of partitions replaced by INSERT OVERWRITE operations during an in-place compaction. Previous versions of views will not, however, be put into HDFS trash.

Note

This mechanism does not apply to tables that have been offloaded to cloud storage.

Saving Pre-Compaction State with Compaction Options

When running a compaction, all prior versions of database objects can be retained by appending the --incremental-retain-obsolete-objects option to the offload command. For both of the full compaction methods, this means that the previous base table is retained and for in-place compactions, previous partitions are copied to the load database before being overwritten with new compacted data. Regardless of compaction method, the previous control partition in the delta table is preserved and prior versions of views are retained.

None of these retained objects are used or referenced by Gluent Query Engine when querying the new version of the compacted offloaded table, but it does mean that a previous state of data is available for ad hoc queries by Hadoop database users or the objects can be used as a temporary backup for a table undergoing compaction.

Warning

When using the --incremental-retain-obsolete-objects option, Gluent Offload Engine no longer manages the previous artifacts or associated storage and any links between the offloaded table and the prior database artifacts are removed. This means that users must manage the removal of these obsolete objects when required. In some circumstances, retaining obsolete objects can cause the re-offloading of entire tables to fail (i.e. when an offloaded table is completely re-offloaded with the --reset-backend-table option), although resetting a table in this way should be a rare occurrence.

Disabling Incremental Update

Incremental Update can be disabled for a table with the offload command as shown in Example 10 below.

Example 10: Disable Incremental Update

$ $OFFLOAD_HOME/bin/offload -t SH.PRODUCTS -x --incremental-updates-disabled

Incremental Update can only be disabled if there are no changes to be applied (i.e. no change records in the delta table or changelog table, where applicable). If there are outstanding changes to be applied, they must either be extracted and compacted or manually discarded (if they are not required).

When Incremental Update is disabled, all supporting RDBMS and Hadoop objects are dropped and the base table is renamed to match the original offloaded table, enabling queries to continue to work transparently.

Incremental Update Options

All Incremental Update options for the offload command are available in the Incremental Update Parameters section of the Reference documentation.

Using Incremental Update

This section provides some guidance on how to integrate Incremental Update with existing ETL and offload operations.

Using Incremental Update for Fully Offloaded Tables

For any of the following types of fully-offloaded table, Incremental Update can be used when the RDBMS table is subject to further DML changes:

  • Heap table

  • HASH/REFERENCE/SYSTEM partitioned table

  • LIST/RANGE/INTERVAL partitioned table where no new partitions are to be added (the partition structure is static)

Refer to Table 1: Incremental Update Extraction Methods for details of each of the extraction methods, but in most cases, CHANGELOG will suffice.

Using Incremental Update with Partition-Based Offload Tables

Using Incremental Update with Partition-Based Offload tables is more restrictive and requires greater consideration than for fully-offloaded tables. For example, the following considerations help to determine how, when or whether Incremental Update can be used:

  • How new partitions are created (e.g. added, split, intervals)

  • How partitions are populated (e.g. inserts, bulk loaders, partition exchange loading)

  • Volume of data being loaded into each new partition (low/high relative to system capabilities)

  • How and when partitions are modified (e.g. before offloading, after offloading)

For performance reasons, it is important to retain Partition-Based Offload offloading wherever possible, as this is the high-volume, high-throughput offloading method that best supports large partitioned tables. However, there are certain scenarios that require offloaded partitions to be updated and Incremental Update can be used in most cases. Table 3 below summarizes the various loading scenarios for such tables and suggests when Incremental Update might be useful.

Table 3: Partition-Based Offload Scenarios

Partition ETL Method

Partition Updates

Offload Type 9

Offload Strategy (Partition)

Offload Strategy (Updates)

Extraction Methods 10

Insert / Bulk Load

Before offload

Any

Partition-Based Offload

N/A

N/A

After offload

Any 11

Partition-Based Offload

Incremental Update

UPDATABLE

UPDATABLE_INSERT

Exchange Partition

Before offload

Any

Partition-Based Offload

N/A

N/A

After offload

INCREMENTAL

Partition-Based Offload

Incremental Update

UPDATABLE

UPDATABLE_INSERT

FULL

Partition-Based Offload

Incremental Update

Any

9

Offload type refers to the percentage of a table’s data that is offloaded. There are two offload types (INCREMENTAL and FULL) that can be used to support three offload use-cases (known as 90/10 (INCREMENTAL) 100/0 (FULL), 100/10 (FULL)). See the Glossary and Offload Type for more details.

10

See Table 1: Incremental Update Extraction Methods for details of each of the extraction methods and which ones are suited to the range of DML operations that are expected.

11

When a partitioned table is offloaded with the FULL offload type, it is possible to use extraction methods such as CHANGELOG and ORA_ROWSCN, but it is not recommended. Using either of these extraction methods will track all DML into the partitioned table, including the initial bulk loads. Incremental Update is not designed with large partitioned tables in mind and does not provide the level of performance required to offload large partitions of data. Instead, the initial partition offload should be performed with Partition-Based Offload and any later DML replayed using a suitable UPDATABLE* data patching method.

Important

When adopting an offload strategy that combines Partition-Based Offload with Incremental Update, it is essential that partitions are offloaded before any changes are captured and extracted by Incremental Update. If DML changes are captured and extracted before the partitions are offloaded, this will prevent Partition-Based Offload from offloading those partitions. The application must guarantee not to capture updates to partitions until after their initial offload. For this reason, it is almost always safest to use an UPDATABLE* extraction method to provide greater control over the timing of extractions in relation to Partition-Based Offload offloads.

Note

Similar considerations apply to partitioned and non-partitioned tables that are offloaded with Incremental Data Append (i.e. Predicate-Based Offload).

Incremental Update Status

Incremental Update creates several database objects and has several phases of operations, making it more complex than a regular offload. Gluent Data Platform provides two additional utilities for displaying the Incremental Update status of an offloaded table and listing the supporting database objects:

  1. display_incremental_status (see Displaying Incremental Update Status)

  2. offload_status_report (see Offload Status Reporting)

Displaying Incremental Update Status

The display_incremental_status utility describes the latest Incremental Update status for a table, including compaction state, change ingestion location (control partition) and supporting objects. Example 11 below demonstrates how to use this utility and provides a sample of its output.

Example 11: Displaying Incremental Update Status

$ $OFFLOAD_HOME/bin/display_incremental_status -t SH.PRODUCTS

Connecting to Oracle as gluent_adm (localhost:1521/ORA11204)
Connecting to HiveServer2 (localhost:21050) - Kerberos service impala
HDFS operations will use WebHDFS (localhost:50070) using Kerberos

=============================
| CHANGE EXTRACTION STATUS: |
=============================

Max change SCN extracted:     87716237
Most recent extraction time:  2018-11-09 12:11:41
Extraction method:            CHANGELOG

============================
| CHANGE INGESTION STATUS: |
============================

Current requested change ingestion location:     /user/gluent/offload/sh_load.db/products_changes/gl_control_partition=0
As of:                                           2018-11-09 12:01:48
Current acknowledged change ingestion location:  /user/gluent/offload/sh_load.db/products_changes/gl_control_partition=0
As of:                                           2018-11-09 12:01:48

======================
| COMPACTION STATUS: |
======================

(There is currently no ongoing compaction)
Last full compaction run completed at: (No compactions have been completed so far)

=====================
| MISC INFORMATION: |
=====================

Base table:   sh.products_data
Merge view:   sh.products_internal_user_view_0
User view:    sh.products
Delta table:  sh_load.products_changes
Id columns:   PROD_ID

========================
| OFFLOAD INFORMATION: |
========================

Hybrid view:    sh_h.products
RDBMS table:    sh.products
Hadoop target:  sh.products
Offload type:   FULL

This example shows the latest extraction time and ingestion location (control partition in the delta table) for changes, but no compactions have taken place for this table yet.

Offload Status Reporting

In addition to the display_incremental_status, Gluent data Platform provides the more general Offload Status Report. This reports a wide range of information on offloaded tables at database, schema or table level, including the RDBMS and Hadoop objects that are created for Incremental Update. Example 12 below demonstrates how to use this report and provides a relevant sample of its output.

Example 12: Running an Offload Status Report

$ $OFFLOAD_HOME/bin/offload_status_report --schema=SH --table=PRODUCTS --output-level=detail

Offload Status Report v3.0.0
Log file: /u01/app/gluent/offload/log/offload_status_report_2018-11-09T12:32:45.116622.log

Find Tables For Report
Done

Fetch Rdbms And Offload Data
Done

Process Report Data
Done

Generate Report
Report saved to /u01/app/gluent/offload/log/Gluent_Offload_Status_Report_ora11204_2018-11-09_12-32-45.txt
Done

The Offload Status Report saves a text report by default, as reported by the utility. A snippet of the report, relevant to Incremental Update, is as follows:

Incremental Update Objects (RDBMS)
----------------------------------

    Incremental Update Object Type Object Name                                                  Object Type
    ------------------------------ ------------------------------------------------------------ ------------------------
    Changelog trigger              SH_H.PRODUCTS                                                TRIGGER
    Changelog table                SH_H.PRODUCTS_LOG                                            TABLE
    Changelog sequence             SH_H.PRODUCTS_LOG_SEQ                                        SEQUENCE

Incremental Update Objects (Offloaded)
--------------------------------------

    Incremental Update Object Type Object Name                                                  Object Type
    ------------------------------ ------------------------------------------------------------ ------------------------
    Delta view                     SH.PRODUCTS                                                  VIEW
    Delta table                    SH_LOAD.PRODUCTS_CHANGES                                     TABLE
    Base table                     SH.PRODUCTS_DATA                                             TABLE
    Internal delta view            SH.PRODUCTS_INTERNAL_USER_VIEW_0                             VIEW

See also

The Offload Status Report can also produce rich HTML reports, in which the Incremental Update objects are displayed in a dedicated tab of a pop-up describe window for each table in the report. See Offload Status Report for more information.

Documentation Feedback

Send feedback on this documentation to: feedback@gluent.com