Skip to content

Stream Window Aggregate Feature View in Continuous Mode

For applications that require the most up-to-date feature data, continuous processing for Stream Window Aggregate Feature Views can update feature values in less than a second after the event is available in the stream data source. With continuous mode, Tecton will process each event as it arrives, rather than waiting for the slide period to complete.

Please see the Stream Window Aggregate Feature View Overview documentation for details on how a Stream Window Aggregate Feature View Overview Feature View works.

Feature Definition Example

  • Set aggregation_slide_period=continuous to enable continuous event processing.
  • Optionally set instance_ability=on_demand within the stream_cluster_config. Spot Instances may lead to feature processing delays due to spot termination or looking for an available instance, so On-demand Instances will deliver more consistent performance.

This example Feature View shows how to configure the annotation parameters.

Stream Data Source Configuration If your stream data source is Kinesis, we suggest lowering the default buffering to avoid delays in event processing.

Here are some suggested parameter values for a KinesisDSConfig:

  • For Databricks users:

    • maxFetchDuration="200ms"
    • maxFetchRate="2"
    • minFetchPeriod="200ms"
  • For EMR users:

    • kinesis.executor.idleTimeBetweenReadsInMs ="200"
    • kinesis.executor.maxFetchTimeInMs = "200"

This example data source shows how to configure the stream options on a KinesisDSConfig with Databricks.

Usage Example

See how to use a Stream Window Aggregate Feature View in a notebook here.