Tecton can use Amazon Redshift as a source of batch data for Feature Materialization.
To set up Tecton with Redshift, you need the following:
- A notebook connection to Databricks or EMR.
- A Redshift Cluster Endpoint. The Redshift cluster must be configured for access over the public internet. We recommend using IP whitelisting to ensure only Tecton can access your Redshift Cluster (your Tecton deployment specialist can provide you with IP ranges).
- A Redshift username and password. We recommend that you create a new user in Redshift configured to give Tecton read-only access to Redshift.
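The read-only user recommended above could be provisioned with a few SQL statements run by a Redshift administrator. The following is a minimal sketch, not a Tecton-provided tool: the helper name read_only_user_statements, the user name tecton_reader, the example password, and the public schema are all hypothetical, and the exact grants you need depend on where your tables live.

```python
import re

# Conservative pattern for unquoted SQL identifiers (user and schema names).
_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*$")


def read_only_user_statements(user, password, schema="public"):
    """Return SQL statements that create a user with SELECT-only access to one schema."""
    for name in (user, schema):
        if not _IDENTIFIER.match(name):
            raise ValueError(f"unsafe identifier: {name!r}")
    # Reject quotes outright instead of trying to escape them.
    if "'" in password or '"' in password:
        raise ValueError("password must not contain quotation marks")
    return [
        f"CREATE USER {user} PASSWORD '{password}';",
        f"GRANT USAGE ON SCHEMA {schema} TO {user};",
        f"GRANT SELECT ON ALL TABLES IN SCHEMA {schema} TO {user};",
    ]


# Hypothetical user name and password, for illustration only.
statements = read_only_user_statements("tecton_reader", "Example-Passw0rd")
for statement in statements:
    print(statement)
```

The helper only builds the statements. Running them against the cluster, and choosing which schemas to grant, is left to your own tooling.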
Setting Up the Connection
Using the prerequisite information above, your Tecton deployment specialist configures Tecton to read data from Redshift.
To verify the connection, add a Redshift-backed Data Source. Do the following:
Add a RedshiftDSConfig Data Source Config object to the Feature Repository, as shown here:
transactions_redshift_batch_ds = RedshiftDSConfig(
    endpoint=REDSHIFT_ENDPOINT,
    table=REDSHIFT_TABLE,
)
Once the change is applied, the Data Source is added to Tecton. A misconfiguration results in an error message instead.
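The config above relies on two placeholder constants, REDSHIFT_ENDPOINT and REDSHIFT_TABLE. As a rough sketch, they might be defined and sanity-checked before use as shown below. The host:port/database endpoint format, the example cluster hostname, the table name, and the parse_endpoint helper are assumptions made for illustration, so confirm the expected format with your Tecton deployment specialist.

```python
from typing import NamedTuple


class RedshiftEndpoint(NamedTuple):
    host: str
    port: int
    database: str


def parse_endpoint(endpoint):
    """Split 'host:port/database' into its parts, raising ValueError if malformed."""
    host_port, slash, database = endpoint.partition("/")
    host, colon, port = host_port.rpartition(":")
    if not (slash and colon and host and database and port.isdigit()):
        raise ValueError(f"expected 'host:port/database', got {endpoint!r}")
    return RedshiftEndpoint(host, int(port), database)


# Hypothetical values; substitute your own cluster endpoint and table.
REDSHIFT_ENDPOINT = "example-cluster.abc123.us-west-2.redshift.amazonaws.com:5439/dev"
REDSHIFT_TABLE = "transactions"

parsed = parse_endpoint(REDSHIFT_ENDPOINT)
print(parsed.host, parsed.port, parsed.database)
```

Checking the string locally catches simple typos, such as a missing port or database name, before the misconfiguration surfaces as an error from Tecton.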