Python scraping pipeline that collects job listings, normalizes records, persists run history in SQLite, and exports datasets through CLI workflows.
Projects
A full view of public work across automation, data engineering, AI tooling, and scientific software.
9 projects
Analytics pipeline that transforms raw job snapshots into curated DuckDB/Parquet datasets, bilingual reports, and a public MkDocs documentation site.
Repository analysis tool that scans codebases, detects language and framework signals, summarizes dependencies, and outputs structured Markdown or JSON audit reports.
Token-aware RAG ingestion pipeline where Rust handles performance-critical chunking via PyO3, Python orchestrates embeddings, and Qdrant stores searchable vectors.
Local media automation pipeline that downloads VODs, transcribes Spanish audio, ranks candidate moments by chat activity, uses an LLM to select highlights, and cuts 9:16 MP4 clips.
Modular scientific Python package for generating Purkinje-network geometries over cardiac surface meshes, with simulation, visualization, and PyPI packaging.
Package-first research toolkit for market experiments: data preparation, baselines, ML model training, walk-forward evaluation, diagnostics, reports, and paper-trading workflows.
Refactored and packaged myocardial mesh utilities into a maintainable Python library for computational cardiology workflows.
Modernized a Bayesian optimization library for current Python/JAX versions, improving documentation, testing, and demos, and expanding optimizer functionality.