Senior Software Engineer building AI-powered products with TypeScript, Rust, and LLMs. Writes about AI agents, eval-driven development, and edge computing.
View all authorsAI-Powered Skill Extraction with Cloudflare Embeddings and a Vector Taxonomy
This bulk processor extracts structured skill tags for job postings using an AI pipeline that combines:
AI Observability for LLM Evals with Langfuse
This article documents an evaluation harness for a Remote EU job classifier—but the real focus is AI observability: how to design traces, spans, metadata, scoring, and run-level grouping so you can debug, compare, and govern LLM behavior over time.
Schema-First RAG with Eval-Gated Grounding and Claim-Card Provenance
This article documents a production-grade architecture for generating research-grounded therapeutic content. The system prioritizes verifiable artifacts (papers → structured extracts → scored outputs → claim cards) over unstructured text.
You can treat this as a “trust pipeline”: retrieve → normalize → extract → score → repair → persist → generate.
Evals for Workflow-First Production LLMs: Contracts, Rubrics, Sampling, and Observability

Agentic Job Pre-Screening with LangGraph + DeepSeek: Auto-Reject Fake “Remote” Roles
Introduction
Building Long-Running TTS Pipelines with LangGraph: Orchestrating Long-Form Audio Generation
Introduction
DeFiTuna: On-Chain Limit Orders on Solana
TL;DR
Deploying Real-Time Solana Data Streams on Cloudflare Containers with LaserStream

