Shared utilities for the ML pipeline (data loaders, eval helpers)
Go to file
2026-01-28 16:52:37 +00:00
src transforms: clean — handle inf and nan separately 2026-01-28 16:52:37 +00:00
tests eval: plots — switch to plotnine for ggplot-style 2025-11-29 16:52:37 +00:00
LICENSE transforms: clean — log row drop counts per rule 2025-12-29 16:52:37 +00:00
pyproject.toml loaders: parquet — preserve column dtypes from arrow schema 2025-12-11 16:52:37 +00:00
README.md eval: metrics — handle multi-class classification 2025-12-23 16:52:37 +00:00
requirements.txt tests: add property-based tests for clean.py via hypothesis 2025-12-17 16:52:37 +00:00
setup.py requirements: pin numpy<2 (breaks downstream) 2025-12-05 16:52:37 +00:00

ml-pipeline-utils — README

(Initial — see git history.)

update 11 (2026-04)