src.pipeline.verbose_transformation_pipeline
Module containing the VerboseTransformationPipeline class.
Classes
VerboseTransformationPipeline – A verbose transformation pipeline that logs to the terminal.
- class src.pipeline.verbose_transformation_pipeline.VerboseTransformationPipeline(steps=<factory>, title='Transformation Pipeline')
A verbose transformation pipeline that logs to the terminal.
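The `steps=<factory>` default in the signature is how Python renders a dataclass field declared with `default_factory`, which suggests (but does not confirm) that the class is a dataclass. A minimal sketch of that pattern follows. `SketchPipeline` is a hypothetical stand-in, not the actual class, and the element type of `steps` is an assumption.

```python
import inspect
from dataclasses import dataclass, field


@dataclass
class SketchPipeline:
    """Hypothetical stand-in; NOT the real VerboseTransformationPipeline."""

    # default_factory gives every instance its own list instead of a shared one.
    steps: list = field(default_factory=list)
    title: str = "Transformation Pipeline"


a = SketchPipeline()
b = SketchPipeline()
a.steps.append("step-1")

# Each instance owns a separate list, so b is unaffected by the append on a.
print(a.steps, b.steps)

# The generated __init__ signature shows the same "<factory>" placeholder.
signature_text = str(inspect.signature(SketchPipeline))
print(signature_text)
```

Using `default_factory` rather than a literal `[]` default avoids sharing one mutable list across all pipeline instances.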
- get_cache_exists(cache_args=None)
Check whether a cache exists.
- Parameters:
cache_args (CacheArgs | None) – The cache arguments.
- Return type:
bool
- Returns:
Whether the cache exists.
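A common use of such a check is to skip recomputation when a cached result is already on disk. The sketch below illustrates the idea with hypothetical stand-ins: `SketchCacheArgs` and the free function `cache_exists` are assumptions for illustration only, since this page does not document the fields of the real `CacheArgs` or the storage layout. Treating `None` as "no cache" is likewise an assumption.

```python
from __future__ import annotations

import tempfile
from dataclasses import dataclass
from pathlib import Path


@dataclass
class SketchCacheArgs:
    """Hypothetical stand-in for CacheArgs; the real fields are not documented here."""

    storage_path: Path  # assumed: directory holding cache files
    name: str           # assumed: file name of the cached artifact


def cache_exists(cache_args: SketchCacheArgs | None = None) -> bool:
    """Return True only if cache arguments are given and the file is present."""
    if cache_args is None:
        # Assumption: without cache arguments there is nothing to look up.
        return False
    return (cache_args.storage_path / cache_args.name).is_file()


with tempfile.TemporaryDirectory() as tmp:
    args = SketchCacheArgs(storage_path=Path(tmp), name="step_0.cache")
    before = cache_exists(args)        # nothing has been written yet
    (Path(tmp) / "step_0.cache").write_text("cached")
    after = cache_exists(args)         # the cache file is now present
    without_args = cache_exists(None)  # no arguments, so no lookup

print(before, after, without_args)
```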
- run_dossier_pipeline(dossier_id, raw_data_path, processed_data_path, final_data_path)
Run the pipeline on the data.
It also creates a cache after each step in the pipeline.
- Parameters:
dossier_id (str) – The dossier ID.
raw_data_path (Path) – The path to the raw data.
processed_data_path (Path) – The path to the processed data.
final_data_path (Path) – The path to the final data.
- Return type:
DataFrame
- Returns:
The processed data.
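The behaviour described for `run_dossier_pipeline` (run each step, then cache its result) can be sketched roughly as follows. This is not the library's implementation: `run_with_step_cache`, the JSON cache files, the `step_<i>.json` naming, and the reuse of an existing cache file are all hypothetical, and plain lists of dicts stand in for the `DataFrame` to keep the example dependency-free.

```python
import json
import tempfile
from pathlib import Path
from typing import Callable, List

Step = Callable[[list], list]  # a step maps rows to rows


def run_with_step_cache(data: list, steps: List[Step], cache_dir: Path) -> list:
    """Apply each step in order, writing a cache file after every step."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    for index, step in enumerate(steps):
        cache_file = cache_dir / f"step_{index}.json"
        if cache_file.is_file():
            # Assumed behaviour: reuse a cached result instead of recomputing.
            data = json.loads(cache_file.read_text())
        else:
            data = step(data)
            cache_file.write_text(json.dumps(data))
    return data


def drop_empty(rows: list) -> list:
    """Remove rows whose value is missing."""
    return [row for row in rows if row["value"] is not None]


def double(rows: list) -> list:
    """Double the value of every row."""
    return [{**row, "value": row["value"] * 2} for row in rows]


with tempfile.TemporaryDirectory() as tmp:
    raw = [{"id": "a", "value": 1}, {"id": "b", "value": None}]
    cache_dir = Path(tmp) / "cache"
    result = run_with_step_cache(raw, [drop_empty, double], cache_dir)
    cached_files = sorted(p.name for p in cache_dir.iterdir())

print(result)
print(cached_files)
```

Caching after every step means an interrupted run can resume from the last completed step, at the cost of extra disk writes. A real implementation would also need a way to invalidate caches when the input or the steps change, which this sketch omits.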