In Tecton, features are defined as a view on registered Data Sources or other Feature Views. Feature Views are the core abstraction that enables:
- Using one feature definition for both training and serving.
- Reusing features across models.
- Managing feature lineage and versioning.
- Orchestrating compute and storage of features.
A Feature View contains all information required to manage one or more related features, including:
- Transformation: A Feature View takes in one or more input sources and runs transformations to compute features. Sources can be Tecton Data Sources or in some cases, other Feature Views.
- Entities: The business entities that the features are attributes of such as Customer or Product. The entities dictate the join keys for the Feature View.
- Configuration: Materialization configuration for defining the orchestration and serving of features, as well as monitoring configuration.
- Metadata: Optional metadata about the features used for organization and discovery. This can include things like descriptions and tags.
The transformation and entities for a Feature View define the semantics of what the feature values truly represent. Changes to a Feature View's transformation or entities are therefore considered destructive and will result in the rematerialization of feature values.
Concept: Feature Views in a Feature Store
Types of Feature Views
There are 3 types of Feature Views:
- Batch Feature Views run transformations on one or more Batch Sources and can materialize feature data to the Online and/or Offline Feature Store on a schedule.
- Stream Feature Views transform features in near-real-time against a Stream Source, or Push Data Source, and can materialize data to the Online and Offline Feature Store.
- On-Demand Feature Views run transformations at request time based on data from a Request Source, Batch Feature View, or Stream Feature View.
|Batch Feature View
|Batch Data Source
|python, pandas, pyspark, spark_sql, snowpark, snowflake_sql
|Stream Feature View
|Stream Data Source, Push Data Source
|python, pandas, pyspark, spark_sql
|On-Demand Feature View
|Request Data Source, Batch Feature View, Stream Feature View
Defining a Feature View
A Feature View is defined using an decorator over a function that represents a pipeline of Transformations.
Below, we'll describe the high-level components of defining a Feature View. See the individual Feature View type sections for more details and examples.
# Feature View type
# Pipeline attributes
# Materialization and serving configuration
# Feature View name
intermediate_data = my_transformation(input_data)
output_data = my_transformation_two(intermediate_data)
See the API reference for the specific parameters available for each type of Feature View.
Registering a Feature View
Feature Views are registered by adding a decorator (e.g.
to a Python function. The decorator supports several parameters to configure the
The default name of the Feature View registered with Tecton will be the name of
the function. If needed, the name can be explicitly set using the
The function inputs are retrieved from the specified
sources in corresponding
order. Tecton will use the function pipeline definition to construct, register,
and execute the specified graph of transformations.
Defining the Feature View Function
A Feature View defines one transformation function that is executed when the Feature View runs. It runs against data retrieved from external data sources. For details, see the Transformations section.
Interacting with Feature Views
The Tecton SDK provides a set of methods that allow you to interactively test features or generate training data in any Python environment, such as a Python notebook. See Reading Feature Data for details.