Token-exact RAG ingestion where the hot path runs in Rust.
Token-aware RAG ingestion pipeline where Rust handles performance-critical chunking via PyO3, Python orchestrates embeddings, and Qdrant stores searchable vectors.
M.Sc. Data Science. 7+ years enterprise engineering.
Open to select freelance projects in Python pipelines, RAG, and ML tooling.
At MSCI I build data pipelines, APIs, and analytics tooling for large-scale financial data systems. Outside of that: 7 published Python packages, select freelance engagements, and a research track through an M.Sc. in Data Science. Here's where to go depending on what brought you here.
Building across the full engineering stack.
Reproducible ML experiments, statistical analysis, ETL pipelines, DuckDB/Parquet, scientific Python
market-lab →AI/ML · Data Science · Cloud · Full-Stack
Token-exact RAG ingestion where the hot path runs in Rust.
Token-aware RAG ingestion pipeline where Rust handles performance-critical chunking via PyO3, Python orchestrates embeddings, and Qdrant stores searchable vectors.
A stateless weekly pipeline running on free infrastructure
Analytics pipeline that transforms raw job snapshots into curated DuckDB/Parquet datasets, bilingual reports, and a public MkDocs documentation site.
Local pipeline that turns a Twitch VOD into publish-ready 9:16 clips using Whisper + LLM selection.
Local media automation pipeline that downloads VODs, transcribes Spanish audio, ranks candidate moments by chat activity, uses an LLM to select highlights, and cuts 9:16 MP4 clips.
The fractal-Purkinje method as an installable library
Python package for generating Purkinje-network geometries over cardiac surface meshes, with simulation, visualization, and PyPI packaging.