Assets

Metadata to record with the corresponding AssetMaterialization event.

Type: Optional[RawMetadataMapping]

check_results

Check results to record with the corresponding AssetMaterialization event.

Type: Optional[Sequence[AssetCheckResult]]

data_version

The data version of the asset that was observed.

Type: Optional[DataVersion]

class dagster.AssetSpec

Tags to record with the corresponding AssetMaterialization event.

Type: Optional[Mapping[str, str]]

Specifies the core attributes of an asset, except for the function that materializes or observes it.

An asset spec plus any materialization or observation function for the asset constitutes an “asset definition”.

The unique identifier for this asset.

Type: AssetKey

deps

The asset keys for the upstream assets that materializing this asset depends on.

Type: Optional[AbstractSet[AssetKey]]

description

Human-readable description of this asset.

Type: Optional[str]

A dict of static metadata for this asset. For example, users can provide information about the database table this asset corresponds to.

Type: Optional[Dict[str, Any]]

skippable

Whether this asset can be omitted during materialization, causing downstream dependencies to skip.

Type: bool

group_name

A string name used to organize multiple assets into groups. If not provided, the name “default” is used.

Type: Optional[str]

code_version

The version of the code for this specific asset, overriding the code version of the materialization function

Type: Optional[str]

backfill_policy

BackfillPolicy to apply to the specified asset.

Type: Optional[BackfillPolicy]

owners

A list of strings representing owners of the asset. Each string can be a user’s email address, or a team name prefixed with team:, e.g. team:finops.

Type: Optional[Sequence[str]]

class dagster.AssetsDefinition

Tags for filtering and organizing. These tags are not attached to runs of the asset.

Type: Optional[Mapping[str, str]]

kinds: (Optional[Set[str]]): A list of strings representing the kinds of the asset. These will be made visible in the Dagster UI.

partitions_def

Defines the set of partition keys that compose the asset.

Type: Optional[PartitionsDefinition]

with_io_manager_key

Returns a copy of this AssetSpec with an extra metadata value that dictates which I/O manager to use to load the contents of this asset in downstream computations.

Parameters: io_manager_key (str) – The I/O manager key. This will be used as the value for the “dagster/io_manager_key” metadata key.Returns: AssetSpec

Defines a set of assets that are produced by the same op or graph.

AssetsDefinitions are typically not instantiated directly, but rather produced using the @asset@asset or @multi_asset@multi_asset decorators.

static from_graph

Constructs an AssetsDefinition from a GraphDefinition.

Parameters:

graph_def (GraphDefinitionGraphDefinition) – The GraphDefinition that is an asset.
keys_by_input_name (Optional[Mapping[str, AssetKeyAssetKey]]) – A mapping of the input
keys_by_output_name (Optional[Mapping[str, AssetKeyAssetKey]]) – A mapping of the output
key_prefix (Optional[Union[str, Sequence[str]]]) – If provided, key_prefix will be prepended
internal_asset_deps (Optional[Mapping[str, Set[AssetKeyAssetKey]]]) – By default, it is assumed
partitions_def (Optional[PartitionsDefinitionPartitionsDefinition]) – Defines the set of partition keys that
partition_mappings (Optional[Mapping[str, PartitionMappingPartitionMapping]]) – Defines how to map partition
resource_defs (Optional[Mapping[str, ResourceDefinitionResourceDefinition]]) – experimental
group_name (Optional[str]) – A group name for the constructed asset. Assets without a
group_names_by_output_name (Optional[Mapping[str, Optional[str]]]) – Defines a group name to be
descriptions_by_output_name (Optional[Mapping[str, Optional[str]]]) – Defines a description to be
metadata_by_output_name (Optional[Mapping[str, Optional[RawMetadataMapping]]]) – Defines metadata to
tags_by_output_name (Optional[Mapping[str, Optional[Mapping[str, str]]]]) – Defines
freshness_policies_by_output_name (Optional[Mapping[str, Optional[FreshnessPolicy]]]) – Defines a
automation_conditions_by_output_name (Optional[Mapping[str, Optional[AutomationConditionAutomationCondition]]]) – Defines an
backfill_policy (Optional[BackfillPolicyBackfillPolicy]) – Defines this asset’s BackfillPolicy
owners_by_key (Optional[Mapping[AssetKeyAssetKey, Sequence[str]]]) – Defines

static from_op

Constructs an AssetsDefinition from an OpDefinition.

Parameters:

op_def (OpDefinitionOpDefinition) – The OpDefinition that is an asset.
keys_by_input_name (Optional[Mapping[str, AssetKeyAssetKey]]) – A mapping of the input
keys_by_output_name (Optional[Mapping[str, AssetKeyAssetKey]]) – A mapping of the output
key_prefix (Optional[Union[str, Sequence[str]]]) – If provided, key_prefix will be prepended
internal_asset_deps (Optional[Mapping[str, Set[AssetKeyAssetKey]]]) – By default, it is assumed
partitions_def (Optional[PartitionsDefinitionPartitionsDefinition]) – Defines the set of partition keys that
partition_mappings (Optional[Mapping[str, PartitionMappingPartitionMapping]]) – Defines how to map partition
group_name (Optional[str]) – A group name for the constructed asset. Assets without a
group_names_by_output_name (Optional[Mapping[str, Optional[str]]]) – Defines a group name to be
descriptions_by_output_name (Optional[Mapping[str, Optional[str]]]) – Defines a description to be
metadata_by_output_name (Optional[Mapping[str, Optional[RawMetadataMapping]]]) – Defines metadata to
tags_by_output_name (Optional[Mapping[str, Optional[Mapping[str, str]]]]) – Defines
freshness_policies_by_output_name (Optional[Mapping[str, Optional[FreshnessPolicy]]]) – Defines a
automation_conditions_by_output_name (Optional[Mapping[str, Optional[AutomationConditionAutomationCondition]]]) – Defines an
backfill_policy (Optional[BackfillPolicyBackfillPolicy]) – Defines this asset’s BackfillPolicy

get_asset_spec

Returns a representation of this asset as an AssetSpecAssetSpec.

If this is a multi-asset, the “key” argument allows selecting which asset to return the spec for.

Parameters: key (Optional[AssetKeyAssetKey]) – If this is a multi-asset, select which asset to return its AssetSpec. If not a multi-asset, this can be left as None.Returns: AssetSpec

get_partition_mapping: Returns the partition mapping between keys in this AssetsDefinition and a given input asset key (if any).

to_source_asset

Returns a representation of this asset as a SourceAssetSourceAsset.

If this is a multi-asset, the “key” argument allows selecting which asset to return a SourceAsset representation of.

Parameters: key (Optional[Union[str, Sequence[str], AssetKeyAssetKey]]]) – If this is a multi-asset, select which asset to return a SourceAsset representation of. If not a multi-asset, this can be left as None.Returns: SourceAsset

to_source_assets

Returns a SourceAsset for each asset in this definition.

Each produced SourceAsset will have the same key, metadata, io_manager_key, etc. as the corresponding asset

property asset_deps: Maps assets that are produced by this definition to assets that they depend on. The dependencies can be either “internal”, meaning that they refer to other assets that are produced by this definition, or “external”, meaning that they refer to assets that aren’t produced by this definition.

property can_subset

If True, indicates that this AssetsDefinition may materialize any subset of its asset keys in a given computation (as opposed to being required to materialize all asset keys).

Type: bool

property check_specs

Returns the asset check specs defined on this AssetsDefinition, i.e. the checks that can be executed while materializing the assets.

Return type: Iterable[AssetsCheckSpec]

property dependency_keys

The asset keys which are upstream of any asset included in this AssetsDefinition.

Type: Iterable[AssetKey]

property descriptions_by_key

Returns a mapping from the asset keys in this AssetsDefinition to the descriptions assigned to them. If there is no assigned description for a given AssetKey, it will not be present in this dictionary.

Type: Mapping[AssetKey, str]

property group_names_by_key

Returns a mapping from the asset keys in this AssetsDefinition to the group names assigned to them. If there is no assigned group name for a given AssetKey, it will not be present in this dictionary.

Type: Mapping[AssetKey, str]

property key

The asset key associated with this AssetsDefinition. If this AssetsDefinition has more than one asset key, this will produce an error.

Type: AssetKey

property keys

The asset keys associated with this AssetsDefinition.

Type: AbstractSet[AssetKey]

property node_def

Returns the OpDefinition or GraphDefinition that is used to materialize the assets in this AssetsDefinition.

Type: NodeDefinition

property op

Returns the OpDefinition that is used to materialize the assets in this AssetsDefinition.

Type: OpDefinition

property partitions_def

The PartitionsDefinition for this AssetsDefinition (if any).

Type: Optional[PartitionsDefinition]

property required_resource_keys

The set of keys for resources that must be provided to this AssetsDefinition.

Type: Set[str]

property resource_defs

A mapping from resource name to ResourceDefinition for the resources bound to this AssetsDefinition.

Type: Mapping[str, ResourceDefinition]

class dagster.AssetKey

Object representing the structure of an asset key. Takes in a sanitized string, list of strings, or tuple of strings.

Example usage:

from dagster import AssetKey

AssetKey("asset1")
AssetKey(["asset1"]) # same as the above
AssetKey(["prefix", "asset1"])
AssetKey(["prefix", "subprefix", "asset1"])

Parameters: path (Union[str, Sequence[str]]) – String, list of strings, or tuple of strings. A list of strings represent the hierarchical structure of the asset_key.

property path

Graph-backed asset definitions

Refer to the Graph-backed asset documentation for more information.

@dagster.graph_asset

Creates a software-defined asset that’s computed using a graph of ops.

This decorator is meant to decorate a function that composes a set of ops or graphs to define the dependencies between them.

Parameters:

name (Optional[str]) – The name of the asset. If not provided, defaults to the name of the
description (Optional[str]) – A human-readable description of the asset.
ins (Optional[Mapping[str, AssetInAssetIn]]) – A dictionary that maps input names to information
config (Optional[Union[ConfigMappingConfigMapping], Mapping[str, Any]) –

Describes how the graph underlying the asset is configured at runtime.

If a ConfigMappingConfigMapping object is provided, then the graph takes on the config schema of this object. The mapping will be applied at runtime to generate the config for the graph’s constituent nodes.

If a dictionary is provided, then it will be used as the default run config for the graph. This means it must conform to the config schema of the underlying nodes. Note that the values provided will be viewable and editable in the Dagster UI, so be careful with secrets.
key_prefix (Optional[Union[str, Sequence[str]]]) – If provided, the asset’s key is the
group_name (Optional[str]) – A string name used to organize multiple assets into groups. If
partitions_def (Optional[PartitionsDefinitionPartitionsDefinition]) – Defines the set of partition keys that
metadata (Optional[RawMetadataMapping]) – Dictionary of metadata to be associated with
tags (Optional[Mapping[str, str]]) – (Experimental) Tags for filtering and organizing. These tags are not
owners (Optional[Sequence[str]]) – experimentalteam:,
kinds (Optional[Set[str]]) – A list of strings representing the kinds of the asset. These
automation_condition (Optional[AutomationConditionAutomationCondition]) – The AutomationCondition to use
backfill_policy (Optional[BackfillPolicyBackfillPolicy]) – The BackfillPolicy to use for this asset.
code_version (Optional[str]) – Version of the code that generates this asset. In
key (Optional[CoeercibleToAssetKey]) – The key for this asset. If provided, cannot specify key_prefix or name.

Examples:

@op
def fetch_files_from_slack(context) -> pd.DataFrame:
    ...

@op
def store_files(files) -> None:
    files.to_sql(name="slack_files", con=create_db_connection())

@graph_asset
def slack_files_table():
    return store_files(fetch_files_from_slack())

@dagster.graph_multi_asset

Create a combined definition of multiple assets that are computed using the same graph of ops, and the same upstream assets.

Each argument to the decorated function references an upstream asset that this asset depends on. The name of the argument designates the name of the upstream asset.

Parameters:

name (Optional[str]) – The name of the graph.
outs – (Optional[Dict[str, AssetOut]]): The AssetOuts representing the produced assets.
ins (Optional[Mapping[str, AssetInAssetIn]]) – A dictionary that maps input names to information
partitions_def (Optional[PartitionsDefinitionPartitionsDefinition]) – Defines the set of partition keys that
backfill_policy (Optional[BackfillPolicyBackfillPolicy]) – The backfill policy for the asset.
group_name (Optional[str]) – A string name used to organize multiple assets into groups. This
can_subset (bool) – Whether this asset’s computation can emit a subset of the asset
config (Optional[Union[ConfigMappingConfigMapping], Mapping[str, Any]) –

Describes how the graph underlying the asset is configured at runtime.

If a ConfigMappingConfigMapping object is provided, then the graph takes on the config schema of this object. The mapping will be applied at runtime to generate the config for the graph’s constituent nodes.

If a dictionary is provided, then it will be used as the default run config for the graph. This means it must conform to the config schema of the underlying nodes. Note that the values provided will be viewable and editable in the Dagster UI, so be careful with secrets.

If no value is provided, then the config schema for the graph is the default (derived

Multi-asset definitions

Refer to the Multi-asset documentation for more information.

@dagster.multi_asset

Create a combined definition of multiple assets that are computed using the same op and same upstream assets.

Each argument to the decorated function references an upstream asset that this asset depends on. The name of the argument designates the name of the upstream asset.

You can set I/O managers keys, auto-materialize policies, freshness policies, group names, etc. on an individual asset within the multi-asset by attaching them to the AssetOutAssetOut corresponding to that asset in the outs parameter.

Parameters:

name (Optional[str]) – The name of the op.
outs – (Optional[Dict[str, AssetOut]]): The AssetOuts representing the assets materialized by
ins (Optional[Mapping[str, AssetInAssetIn]]) – A dictionary that maps input names to information
deps (Optional[Sequence[Union[AssetsDefinitionAssetsDefinition, SourceAssetSourceAsset, AssetKeyAssetKey, str]]]) – The assets that are upstream dependencies, but do not correspond to a parameter of the
config_schema (Optional[ConfigSchemaConfigSchema) – The configuration schema for the asset’s underlying
required_resource_keys (Optional[Set[str]]) – Set of resource handles required by the underlying op.
internal_asset_deps (Optional[Mapping[str, Set[AssetKeyAssetKey]]]) – By default, it is assumed
partitions_def (Optional[PartitionsDefinitionPartitionsDefinition]) – Defines the set of partition keys that
backfill_policy (Optional[BackfillPolicyBackfillPolicy]) – The backfill policy for the op that computes the asset.
op_tags (Optional[Dict[str, Any]]) – A dictionary of tags for the op that computes the asset.
can_subset (bool) – If this asset’s computation can emit a subset of the asset
resource_defs (Optional[Mapping[str, object]]) – experimental
group_name (Optional[str]) – A string name used to organize multiple assets into groups. This
retry_policy (Optional[RetryPolicyRetryPolicy]) – The retry policy for the op that computes the asset.
code_version (Optional[str]) – Version of the code encapsulated by the multi-asset. If set,
specs (Optional[Sequence[AssetSpecAssetSpec]]) – The specifications for the assets materialized
check_specs (Optional[Sequence[AssetCheckSpecAssetCheckSpec]]) – Specs for asset checks that
non_argument_deps (Optional[Union[Set[AssetKeyAssetKey], Set[str]]]) – deprecateddeps instead.) Deprecated, use deps instead.

Examples:

@multi_asset(
    specs=[
        AssetSpec("asset1", deps=["asset0"]),
        AssetSpec("asset2", deps=["asset0"]),
    ]
)
def my_function():
    asset0_value = load(path="asset0")
    asset1_result, asset2_result = do_some_transformation(asset0_value)
    write(asset1_result, path="asset1")
    write(asset2_result, path="asset2")

# Or use IO managers to handle I/O:
@multi_asset(
    outs=\{
        "asset1": AssetOut(),
        "asset2": AssetOut(),
    }
)
def my_function(asset0):
    asset1_value = do_some_transformation(asset0)
    asset2_value = do_some_other_transformation(asset0)
    return asset1_value, asset2_value

@dagster.multi_observable_source_asset

experimental

This API may break in future versions, even between dot releases.

Defines a set of assets that can be observed together with the same function.

Parameters:

name (Optional[str]) – The name of the op.
required_resource_keys (Optional[Set[str]]) – Set of resource handles required by the
partitions_def (Optional[PartitionsDefinitionPartitionsDefinition]) – Defines the set of partition keys that
can_subset (bool) – If this asset’s computation can emit a subset of the asset
resource_defs (Optional[Mapping[str, object]]) – (Experimental) A mapping of resource keys to resources. These resources
group_name (Optional[str]) – A string name used to organize multiple assets into groups. This
specs (Optional[Sequence[AssetSpecAssetSpec]]) – (Experimental) The specifications for the assets
check_specs (Optional[Sequence[AssetCheckSpecAssetCheckSpec]]) – (Experimental) Specs for asset checks that

Examples:

@multi_observable_source_asset(
    specs=[AssetSpec("asset1"), AssetSpec("asset2")],
)
def my_function():
    yield ObserveResult(asset_key="asset1", metadata=\{"foo": "bar"})
    yield ObserveResult(asset_key="asset2", metadata=\{"baz": "qux"})

class dagster.AssetOut

Defines one of the assets produced by a @multi_asset@multi_asset.

key_prefix

If provided, the asset’s key is the concatenation of the key_prefix and the asset’s name. When using @multi_asset, the asset name defaults to the key of the “outs” dictionary Only one of the “key_prefix” and “key” arguments should be provided.

Type: Optional[Union[str, Sequence[str]]]

The asset’s key. Only one of the “key_prefix” and “key” arguments should be provided.

Type: Optional[Union[str, Sequence[str], AssetKey]]

dagster_type

The type of this output. Should only be set if the correct type can not be inferred directly from the type signature of the decorated function.

Type: Optional[Union[Type, DagsterType]]]

description

Human-readable description of the output.

Type: Optional[str]

is_required

Whether the presence of this field is required. (default: True)

Type: bool

io_manager_key

The resource key of the IO manager used for this output. (default: “io_manager”).

Type: Optional[str]

A dict of the metadata for the output. For example, users can provide a file path if the data object will be stored in a filesystem, or provide information of a database table when it is going to load the data into the table.

Type: Optional[Dict[str, Any]]

group_name

A string name used to organize multiple assets into groups. If not provided, the name “default” is used.

Type: Optional[str]

code_version

The version of the code that generates this asset.

Type: Optional[str]

freshness_policy

(Deprecated) A policy which indicates how up to date this asset is intended to be.

Type: Optional[FreshnessPolicy]

automation_condition

AutomationCondition to apply to the specified asset.

Type: Optional[AutomationCondition]

backfill_policy

BackfillPolicy to apply to the specified asset.

Type: Optional[BackfillPolicy]

owners

A list of strings representing owners of the asset. Each string can be a user’s email address, or a team name prefixed with team:, e.g. team:finops.

Type: Optional[Sequence[str]]

class dagster.SourceAsset

Tags for filtering and organizing. These tags are not attached to runs of the asset.

Type: Optional[Mapping[str, str]]

Source assets

Refer to the External asset dependencies documentation for more information.

deprecated

This API will be removed in version 2.0.0. Use AssetSpec instead. If using the SourceAsset io_manager_key property, use AssetSpec(...).with_io_manager_key(...)..

A SourceAsset represents an asset that will be loaded by (but not updated by) Dagster.

The key of the asset.

Type: Union[AssetKey, Sequence[str], str]

auto_observe_interval_minutes

Metadata associated with the asset.

Type: Mapping[str, MetadataValue]

io_manager_key

The key for the IOManager that will be used to load the contents of the asset when it’s used as an input to other assets inside a job.

Type: Optional[str]

io_manager_def

(Experimental) The definition of the IOManager that will be used to load the contents of the asset when it’s used as an input to other assets inside a job.

Type: Optional[IOManagerDefinition]

resource_defs

(Experimental) resource definitions that may be required by the dagster.IOManagerDefinitiondagster.IOManagerDefinition provided in the io_manager_def argument.

Type: Optional[Mapping[str, ResourceDefinition]]

description

The description of the asset.

Type: Optional[str]

partitions_def

Defines the set of partition keys that compose the asset.

Type: Optional[PartitionsDefinition]

observe_fn: Type: Optional[SourceAssetObserveFunction]

op_tags

A dictionary of tags for the op that computes the asset. Frameworks may expect and require certain metadata to be attached to a op. Values that are not strings will be json encoded and must meet the criteria that json.loads(json.dumps(value)) == value.

Type: Optional[Dict[str, Any]]

While the asset daemon is turned on, a run of the observation function for this asset will be launched at this interval. observe_fn must be provided.

Type: Optional[float]

freshness_policy

A constraint telling Dagster how often this asset is intended to be updated with respect to its root data.

Type: FreshnessPolicy

property is_observable

Tags for filtering and organizing. These tags are not attached to runs of the asset.

Type: Optional[Mapping[str, str]]

Whether the asset is observable.

Type: bool

property op

The OpDefinition associated with the observation function of an observable source asset.

Throws an error if the asset is not observable.

Type: OpDefinition

@dagster.observable_source_asset

experimental

This API may break in future versions, even between dot releases.

Create a SourceAsset with an associated observation function.

The observation function of a source asset is wrapped inside of an op and can be executed as part of a job. Each execution generates an AssetObservation event associated with the source asset. The source asset observation function should return a DataVersion, a ~dagster.DataVersionsByPartition, or an ObserveResultObserveResult.

Parameters:

name (Optional[str]) – The name of the source asset. If not provided, defaults to the name of the
key_prefix (Optional[Union[str, Sequence[str]]]) – If provided, the source asset’s key is the
metadata (Mapping[str, RawMetadataValue]) – Metadata associated with the asset.
io_manager_key (Optional[str]) – The key for the IOManager that will be used to load the contents of
io_manager_def (Optional[IOManagerDefinitionIOManagerDefinition]) – (Experimental) The definition of the IOManager that will be used to load the contents of
description (Optional[str]) – The description of the asset.
group_name (Optional[str]) – A string name used to organize multiple assets into groups. If not provided,
required_resource_keys (Optional[Set[str]]) – Set of resource keys required by the observe op.
resource_defs (Optional[Mapping[str, ResourceDefinitionResourceDefinition]]) – (Experimental) resource
partitions_def (Optional[PartitionsDefinitionPartitionsDefinition]) – Defines the set of partition keys that
op_tags (Optional[Dict[str, Any]]) – A dictionary of tags for the op that computes the asset.
tags (Optional[Mapping[str, str]]) – Tags for filtering and organizing. These tags are not
observe_fn (Optional[SourceAssetObserveFunction]) – Observation function for the source asset.
automation_condition (Optional[AutomationConditionAutomationCondition]) – A condition describing when Dagster

class dagster.ObserveResult

experimental

This API may break in future versions, even between dot releases.

An object representing a successful observation of an asset. These can be returned from an @observable_source_asset decorated function to pass metadata.

asset_key

The asset key. Optional to include.

Type: Optional[AssetKey]

Metadata to record with the corresponding AssetObservation event.

Type: Optional[RawMetadataMapping]

check_results

Check results to record with the corresponding AssetObservation event.

Type: Optional[Sequence[AssetCheckResult]]

data_version

The data version of the asset that was observed.

Type: Optional[DataVersion]

class dagster.AssetDep

Tags to record with the corresponding AssetObservation event.

Type: Optional[Mapping[str, str]]

Dependencies

Specifies a dependency on an upstream asset.

asset

The upstream asset to depend on.

Type: Union[AssetKey, str, AssetSpec, AssetsDefinition, SourceAsset]

partition_mapping

Defines what partitions to depend on in the upstream asset. If not provided and the upstream asset is partitioned, defaults to the default partition mapping for the partitions definition, which is typically maps partition keys to the same partition keys in upstream assets.

Type: Optional[PartitionMapping]

Examples:

upstream_asset = AssetSpec("upstream_asset")
downstream_asset = AssetSpec(
    "downstream_asset",
    deps=[
        AssetDep(
            upstream_asset,
            partition_mapping=TimeWindowPartitionMapping(start_offset=-1, end_offset=-1)
        )
    ]
)

class dagster.AssetIn

Defines an asset dependency.

key_prefix

If provided, the asset’s key is the concatenation of the key_prefix and the input name. Only one of the “key_prefix” and “key” arguments should be provided.

Type: Optional[Union[str, Sequence[str]]]

The asset’s key. Only one of the “key_prefix” and “key” arguments should be provided.

Type: Optional[Union[str, Sequence[str], AssetKey]]