MetaflowOfflineProcess Class¶
The MetaflowOfflineProcess class is a subclass of the BaseProcess class and provides functionality to execute the offline Metaflow flow.
Attributes¶
METAFLOW_CLASS- The name of the class that inherits from metaflow'sFlowSpec
Configuration¶
Required Configuration¶
The MetaflowOfflineProcess requires the following components:
data_transformermetadata_trackerresource_version_controldata_checkerdata_profiler
Methods¶
run¶
Execute the offline Metaflow flow.
def run(self, source_data_name, source_data, **kwargs)
Arguments:
source_data_name(str): The name of the data source.source_data(object): The object containing the source data.
Returns
None
get_artifacts¶
Retrieve artifacts from the latest run of the pipeline.
get_artifacts(self, artifact_keys)
Arguments:
artifact_keys(list): A list of artifact keys to retrieve.
Returns
artifacts(dict): A dictionary containing the requested artifacts.
MetaflowOfflineProcessSpec Methods¶
MetaflowOfflineProcessSpec contains the following methods. These are mirrored from the OfflineProcess class, and you should see that documentation for more information (Note: instead of these method explicitly using arguments, they instead access saved artifacts during the Metaflow run).
starttransform_datatrack_dataprofile_datacheck_datacompare_dataend