src.pipeline.verbose_transformation_pipeline
Module containing the VerboseTransformationPipeline class.
Classes
VerboseTransformationPipeline | A verbose transformation pipeline that logs to the terminal.
- class src.pipeline.verbose_transformation_pipeline.VerboseTransformationPipeline(steps=<factory>, title='Transformation Pipeline')
A verbose transformation pipeline that logs to the terminal.
- get_cache_exists(cache_args=None)
Check if some cache exists.
- Parameters:
cache_args (CacheArgs | None) – The cache arguments.
- Return type:
bool
- Returns:
Whether the cache exists.
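As a rough illustration of what a cache-existence check can look like, the sketch below uses a stand-in class. Everything in it is hypothetical: `SketchPipeline`, the `storage_path` key, and the plain dict used in place of `CacheArgs` are assumptions made for illustration, not the implementation in this module.

```python
import tempfile
from pathlib import Path


class SketchPipeline:
    """Hypothetical stand-in; not the real VerboseTransformationPipeline."""

    def get_cache_exists(self, cache_args=None):
        # Without cache arguments there is nothing to look up.
        if cache_args is None:
            return False
        # Assumption: the cache arguments carry a path to the cached file.
        return Path(cache_args["storage_path"]).exists()


with tempfile.TemporaryDirectory() as tmp:
    cache_file = Path(tmp) / "step_0.cache"
    pipeline = SketchPipeline()

    # Checked before the file is written, after it is written, and with no arguments.
    missing = pipeline.get_cache_exists({"storage_path": cache_file})
    cache_file.write_text("cached")
    present = pipeline.get_cache_exists({"storage_path": cache_file})
    no_args = pipeline.get_cache_exists()
```

In this sketch a missing `cache_args` is treated as "no cache"; the real method may handle `None` differently.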
- run_dossier_pipeline(dossier_id, raw_data_path, processed_data_path, final_data_path)
Run the pipeline on the data.
It also creates a cache after each step in the pipeline.
- Parameters:
dossier_id (str) – The dossier ID.
raw_data_path (Path) – The path to the raw data.
processed_data_path (Path) – The path to the processed data.
final_data_path (Path) – The path to the final data.
- Return type:
DataFrame
- Returns:
The processed data.
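The "cache after each step" behaviour described above can be pictured with a small, dependency-free sketch. It is not the module's code: `run_with_step_cache`, the `step_<n>.json` file naming, and the use of plain lists of dicts in place of a DataFrame are all assumptions chosen to keep the example self-contained.

```python
import json
import tempfile
from pathlib import Path


def run_with_step_cache(rows, steps, cache_dir):
    """Apply each step in order, writing a cache file after every step."""
    cache_dir = Path(cache_dir)
    cache_dir.mkdir(parents=True, exist_ok=True)
    for index, step in enumerate(steps):
        rows = step(rows)
        # Hypothetical naming scheme: one JSON file per completed step.
        (cache_dir / f"step_{index}.json").write_text(json.dumps(rows))
    return rows


def drop_missing(rows):
    # Remove rows whose "value" field is missing.
    return [row for row in rows if row["value"] is not None]


def double_values(rows):
    # Return new rows with "value" doubled.
    return [{**row, "value": row["value"] * 2} for row in rows]


raw_rows = [
    {"id": "a", "value": 1},
    {"id": "b", "value": None},
    {"id": "c", "value": 3},
]

with tempfile.TemporaryDirectory() as tmp:
    result = run_with_step_cache(raw_rows, [drop_missing, double_values], tmp)
    cached_files = sorted(path.name for path in Path(tmp).iterdir())
    first_cache = json.loads((Path(tmp) / "step_0.json").read_text())
```

Writing the intermediate result after every step means a later run could, in principle, resume from the last completed step instead of starting from the raw data; whether and how the real pipeline reuses its caches is not stated here.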