Token-exact RAG ingestion where the hot path runs in Rust.
Token-aware RAG ingestion pipeline where Rust handles performance-critical chunking via PyO3, Python orchestrates embeddings, and Qdrant stores searchable vectors.
M.Sc. Data Science. 7+ years enterprise engineering.
Available for senior engineering roles and freelance projects in Python, ML systems, and data products.
I build ML systems, data pipelines, and Python tools that run in production. Day job developing features in financial risk software, open-source projects on the weekends, and freelancing for clients. Here's where to go depending on what brought you here.
Building across the full engineering stack.
Reproducible ML experiments, statistical analysis, ETL pipelines, DuckDB/Parquet, scientific Python
market-lab →AI/ML · Data Science · Cloud · Full-Stack
Token-exact RAG ingestion where the hot path runs in Rust.
Token-aware RAG ingestion pipeline where Rust handles performance-critical chunking via PyO3, Python orchestrates embeddings, and Qdrant stores searchable vectors.
A stateless weekly pipeline running on free infrastructure
Analytics pipeline that transforms raw job snapshots into curated DuckDB/Parquet datasets, bilingual reports, and a public MkDocs documentation site.
Local pipeline that turns a Twitch VOD into publish-ready 9:16 clips using Whisper + LLM selection.
Local media automation pipeline that downloads VODs, transcribes Spanish audio, ranks candidate moments by chat activity, uses an LLM to select highlights, and cuts 9:16 MP4 clips.
The fractal-Purkinje method as an installable library — PyPI 0.4.0, OIDC-released, 107 tests on dual-Python CI
Modular scientific Python package for generating Purkinje-network geometries over cardiac surface meshes, with simulation, visualization, and PyPI packaging.