Prepare

The second phase of the process transforms the original data, reducing the computational resources needed in the subsequent steps. Based on the previously configured connector parameters, this phase performs:
  • Deleting header lines

  • Deleting unnecessary columns

  • Transforming ID columns with suffix identifiers

  • Replacing terms

  • Deleting the original file
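
The steps listed above can be sketched roughly as follows. This is a minimal illustration, not the project's actual implementation: the function name `prepare_file`, all of its parameters, the CSV input format, and the order in which the steps run are assumptions made for the example. In particular, "transforming ID columns with suffix identifiers" is read here as appending a fixed suffix to every value of an ID column, which is only one possible interpretation.

```python
import csv
import os


def prepare_file(source, target, header_lines=0, keep_columns=None,
                 id_suffixes=None, term_map=None, delimiter=",",
                 delete_source=False):
    """Write a reduced copy of `source` to `target`; return rows written.

    Hypothetical helper: every name and parameter here is illustrative.
    """
    id_suffixes = id_suffixes or {}      # column index -> suffix to append
    term_map = term_map or {}            # exact cell value -> replacement
    written = 0
    with open(source, newline="", encoding="utf-8") as src, \
            open(target, "w", newline="", encoding="utf-8") as dst:
        reader = csv.reader(src, delimiter=delimiter)
        writer = csv.writer(dst, delimiter=delimiter)
        for line_no, row in enumerate(reader):
            if line_no < header_lines:           # delete header lines
                continue
            # keep_columns=None means "keep every column"
            indices = keep_columns if keep_columns is not None else range(len(row))
            out = []
            for idx in indices:                  # delete unnecessary columns
                value = term_map.get(row[idx], row[idx])   # replace terms
                if idx in id_suffixes:           # tag ID columns with a suffix
                    value = f"{value}{id_suffixes[idx]}"
                out.append(value)
            writer.writerow(out)
            written += 1
    if delete_source:                            # delete the original file
        os.remove(source)
    return written
```

Streaming the file row by row keeps memory use flat regardless of file size, which fits the stated goal of reducing the resources needed later. For example, `prepare_file("in.csv", "out.csv", header_lines=1, keep_columns=[0, 2])` would skip one header line and keep only the first and third columns.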

The output will be a new temporary file for consumption in the next phase::

# python manage.py etl --prepare {all or connector}

This starts the data preparation phase for all connectors or for a single specified connector. The source file must be present in the PSA; otherwise, the system displays a warning.
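
The check described above might look like the sketch below. It is not the project's code: the function `select_prepare_targets`, the directory layout `<psa_dir>/<connector>/<connector>.csv`, and the warning text are all hypothetical placeholders, since the real file naming and the real message are not given here.

```python
import warnings
from pathlib import Path


def select_prepare_targets(requested, connectors, psa_dir):
    """Return the connectors whose source file exists; warn about the rest.

    `requested` is either the literal string "all" or one connector name.
    """
    names = list(connectors) if requested == "all" else [requested]
    ready = []
    for name in names:
        # Hypothetical PSA layout: one folder and one CSV per connector.
        source = Path(psa_dir) / name / f"{name}.csv"
        if source.is_file():
            ready.append(name)
        else:
            # Placeholder message; the system's actual warning may differ.
            warnings.warn(f"{name}: no file found in PSA, skipping prepare")
    return ready
```

Warning and skipping, rather than raising an error, lets an `all` run continue with the connectors that do have data available.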

The reset option resets the control state, for all connectors or for a specific one, in the preparation phase and in the two phases that follow it.