Skip to content

Cleaning Up Execution Data

As part of the Hyperscale execution run, the system will create data files (unload service), split files and masked files (masking service) on the file server. As the data size can be large (3 times of source data) and include sensitive information, therefore, it is important to clean up this data. Additionally, unload service, masking service, and load service will also store transient internal data for the execution while running it. This data is also not required once execution is completed and should be cleaned. Following are the three ways this data will be/can be cleaned.

  1. Using retain_execution_data

    While setting up a Hyperscale Job (POST /jobs), you can set value for retain_execution_data property to intimate system when it should clean up data automatically based on the table below.

    EXECUTION_STATUS RETAIN_EXECUTION_DATA CLEAN UP AUTOMATICALLY?
    NA(SUCCESS/FAILED) NO YES
    SUCCESS ON_ERROR YES
    FAILED ON_ERROR NO
    NA(SUCCESS/FAILED) ALWAYS NO
  2. Manual Clean Up

    Hyperscale exposes a delete API (DELETE /executions/{id}) to manually cleanup data for an execution, if it’s not already cleaned.

  3. Start a New Execution

    While starting a new execution, Hyperscale will first validate if previous execution data is cleaned. If it’s not cleaned, then Hyperscale will trigger cleanup before starting new execution.