FeaturesAutoTSConfig#
- class datarobotx.FeaturesAutoTSConfig(feature_derivation_window_start=None, feature_derivation_window_end=None, differencing_method=None, exponentially_weighted_moving_alpha=None, default_to_known_in_advance=None, default_to_do_not_derive=None, calendar_id=None, aggregation_type=None, cross_series_group_by_columns=None, segmentation_task_id=None, treat_as_exponential=None, use_supervised_feature_reduction=None, use_cross_series_features=None, periodicities=None, feature_settings=None)#
Time series featurization configuration.
Parameters that default to ‘None’ (or are omitted by the user) are overridden to server-side defaults at runtime. Consult the DataRobot REST API and GUI documentation for additional information on each parameter.
- Parameters:
feature_derivation_window_start (int) – For time series projects only. How many timeUnits of the datetimePartitionColumn into the past relative to the forecast point the feature derivation window should begin.
feature_derivation_window_end (int) – For time series projects only. How many timeUnits of the datetimePartitionColumn into the past relative to the forecast point the feature derivation window should end.
differencing_method ({'auto', 'none', 'simple', 'seasonal'}) – For time series projects only. Used to specify which differencing method to apply if the data is stationary. For classification problems simple and seasonal are not allowed. Parameter periodicities must be specified if seasonal is chosen. Defaults to auto.
exponentially_weighted_moving_alpha (float) – Discount factor (alpha) used for exponentially weighted moving features
default_to_known_in_advance (bool) – For time series projects only. Sets whether all features default to being treated as known in advance features, which are features that are known into the future. Features marked as known in advance must be specified into the future when making predictions. The default is false, all features are not known in advance. Individual features can be set to a value different than the default using the featureSettings parameter. See the Time Series Overview for more context.
default_to_do_not_derive (bool) – For time series projects only. Sets whether all features default to being treated as do-not-derive features, excluding them from feature derivation. Individual features can be set to a value different than the default by using the featureSettings parameter.
calendar_id (str) – The ID of the calendar to be used in this project.
aggregation_type ({'total', 'average'}) – For multiseries projects only. The aggregation type to apply when creating cross-series features.
cross_series_group_by_columns (list of str) – For multiseries projects with cross-series features enabled only. List of columns (currently of length 1). Setting that indicates how to further split series into related groups. For example, if every series is sales of an individual product, the series group-by could be the product category with values like “men’s clothing”, “sports equipment”, etc.
segmentation_task_id (str) – Specifies the SegmentationTask that will be used for dividing the project up into multiple segmented projects.
treat_as_exponential ({'auto', 'never', 'always'}) – For time series projects only. Used to specify whether to treat data as exponential trend and apply transformations like log-transform. For classification problems always is not allowed.
use_supervised_feature_reduction (bool) – When true, during feature generation DataRobot runs a supervised algorithm feature lists using only qualifying features. Setting false can severely impact autopilot duration, especially for datasets with many features.
use_cross_series_features (bool) – Indicating if user wants to use cross-series features.
periodicities (list of FeaturesTSPeriodicityConfig) – A list of periodicities for time series projects only. For classification problems periodicities are not allowed. If this is provided, parameter ‘differencing_method’ will default to ‘seasonal’ if not provided or ‘auto’.
feature_settings (list of FeaturesTSFeatureSettingConfig) – An array specifying per feature settings. Features can be left unspecified.
See also
DRConfig
Configuration object for DataRobot project and autopilot settings, also includes detailed examples of usage
Attributes:
For multiseries projects only.
The ID of the calendar to be used in this project.
For multiseries projects with cross-series features enabled only.
For time series projects only.
For time series projects only.
For time series projects only.
Discount factor (alpha) used for exponentially weighted moving features.
For time series projects only.
For time series projects only.
An array specifying per feature settings.
A list of periodicities for time series projects only.
Specifies the SegmentationTask that will be used for dividing the project up into multiple segmented projects.
For time series projects only.
Indicating if user wants to use cross-series features.
When true, during feature generation DataRobot runs a supervised algorithm feature lists using only qualifying features.
Inherited methods:
keys
()- rtype:
to_dict
()Return configuration as a dict.
- property aggregation_type: str#
For multiseries projects only. The aggregation type to apply when creating cross-series features.
Notes
aggregation_type : {‘total’, ‘average’}
- property calendar_id: str#
The ID of the calendar to be used in this project.
Notes
calendar_id : str
- property cross_series_group_by_columns: List[str]#
For multiseries projects with cross-series features enabled only. List of columns (currently of length 1). Setting that indicates how to further split series into related groups. For example, if every series is sales of an individual product, the series group-by could be the product category with values like “men’s clothing”, “sports equipment”, etc.
Notes
cross_series_group_by_columns : list of str
- property default_to_do_not_derive: bool#
For time series projects only. Sets whether all features default to being treated as do-not-derive features, excluding them from feature derivation. Individual features can be set to a value different than the default by using the featureSettings parameter.
Notes
default_to_do_not_derive : bool
- property default_to_known_in_advance: bool#
For time series projects only. Sets whether all features default to being treated as known in advance features, which are features that are known into the future. Features marked as known in advance must be specified into the future when making predictions. The default is false, all features are not known in advance. Individual features can be set to a value different than the default using the featureSettings parameter. See the Time Series Overview for more context.
Notes
default_to_known_in_advance : bool
- property differencing_method: str#
For time series projects only. Used to specify which differencing method to apply if the data is stationary. For classification problems simple and seasonal are not allowed. Parameter periodicities must be specified if seasonal is chosen. Defaults to auto.
Notes
differencing_method : {‘auto’, ‘none’, ‘simple’, ‘seasonal’}
- property exponentially_weighted_moving_alpha: float#
Discount factor (alpha) used for exponentially weighted moving features.
Notes
exponentially_weighted_moving_alpha : float
- property feature_derivation_window_end: int#
For time series projects only. How many timeUnits of the datetimePartitionColumn into the past relative to the forecast point the feature derivation window should end.
Notes
feature_derivation_window_end : int
- property feature_derivation_window_start: int#
For time series projects only. How many timeUnits of the datetimePartitionColumn into the past relative to the forecast point the feature derivation window should begin.
Notes
feature_derivation_window_start : int
- property feature_settings: List[FeaturesTSFeatureSettingConfig]#
An array specifying per feature settings. Features can be left unspecified.
Notes
feature_settings : list of FeaturesTSFeatureSettingConfig
- property periodicities: List[FeaturesTSPeriodicityConfig]#
A list of periodicities for time series projects only. For classification problems periodicities are not allowed. If this is provided, parameter ‘differencing_method’ will default to ‘seasonal’ if not provided or ‘auto’.
Notes
periodicities : list of FeaturesTSPeriodicityConfig
- property segmentation_task_id: str#
Specifies the SegmentationTask that will be used for dividing the project up into multiple segmented projects.
Notes
segmentation_task_id : str
- property treat_as_exponential: str#
For time series projects only. Used to specify whether to treat data as exponential trend and apply transformations like log-transform. For classification problems always is not allowed.
Notes
treat_as_exponential : {‘auto’, ‘never’, ‘always’}
- property use_cross_series_features: bool#
Indicating if user wants to use cross-series features.
Notes
use_cross_series_features : bool
- property use_supervised_feature_reduction: bool#
When true, during feature generation DataRobot runs a supervised algorithm feature lists using only qualifying features. Setting false can severely impact autopilot duration, especially for datasets with many features.
Notes
use_supervised_feature_reduction : bool