Blog
AI engineering insights, practical advice, and things I'm learning.
Ai Agents
How to Build Long-Running AI Agents With Google ADK 1.0
Google's Agent Development Kit 1.0 enables multi-day workflows that survive restarts. Learn to configure durable state machines and persistent session storage.
Google Adk · Durable Computing · State Machines
Ai Engineering
How to Scale PyTorch Training With AWS Building Blocks
Learn how to configure AWS infrastructure and Hugging Face tools to optimize large-scale foundation model pre-training and inference workflows.
Pytorch · Aws Cloud · Foundation Models
Ai Agents
How to Build Multi-Agent CNC Workflows on AMD MI300X
Learn how to coordinate LangChain agents and Qwen 2.5 7B on the AMD MI300X to reduce CNC manufacturability analysis time from hours to seconds.
Amd Mi300x · Langchain · Qwen Model
Ai Engineering
How to Fine-Tune Qwen3 on AMD MI300X Using ROCm
Learn how to configure ROCm 6.1 environment variables and use the Hugging Face stack to fine-tune Qwen3-1.7B on AMD hardware without CUDA.
Fine Tuning · Amd Rocm · Qwen 3
Ai Coding
Cursor's Autoinstall Bootstraps RL Training Environments
Learn how Cursor uses previous model generations to automate reinforcement learning environment setups, mock dependencies, and verify target commands.
Cursor Editor · Reinforcement Learning · Automated Testing
Ai Engineering
How to Implement Event-Driven Webhooks in the Gemini API
Learn how to configure static and dynamic webhooks in the Gemini API to eliminate polling overhead for long-running AI operations and agent workflows.
Gemini Api · Webhooks · Event Driven Architecture
Ai Coding
How to build an iOS app with Claude Code subagents
Learn how to orchestrate parallel subagents in Claude Code to build and ship a production Swift application using natural language and specialized roles.
Claude Code · Ios Development · Ai Agents
Ai Agents
How to Run Multi-Tenant Agents With Dynamic Workflows
Learn how to use Cloudflare Dynamic Workflows to execute durable, stateful operations for multi-tenant applications and long-running AI agents.
Cloudflare Workers · Multi Tenancy · Durable Execution
Ai Engineering
How to Build Cross-Modal RAG Pipelines With Gemini Embedding 2
Learn how to process text, images, video, and audio into a single semantic vector space using Google's natively multimodal Gemini Embedding 2 model.
Multimodal Rag · Gemini Embedding 2 · Vector Databases
Ai Coding
How to Build Programmatic Agents With the Cursor SDK
Learn how to automate engineering workflows and deploy specialized coding agents using the TypeScript-based Cursor SDK and Cloud Agents API.
Cursor Sdk · Cloud Agents · Typescript
Ai Coding
How to Integrate Claude Code into Large Legacy Codebases
Learn how to integrate Claude Code into massive legacy projects using incremental context and the new native binary features in version 2.1.119.
Claude Code · Legacy Systems · Software Engineering
Ai Agents
Google's 5-Day Vibe Coding Course Returns to Kaggle in June
Learn how to build production-ready agents and use natural language as a programming interface in Google's returning 5-day intensive course on Kaggle.
Vibe Coding · Google Kaggle · Agentic Workflows
Ai Coding
OpenAI Codex Desktop Adds 90 Plugins and Reusable Skills
Learn how to configure OpenAI Codex plugins and reusable skills to automate desktop tasks and connect your coding workflows to external data sources.
Openai Codex · Workflow Automation · Plugin Architecture
Ai Engineering
Google Graduates LiteRT NPU Acceleration to Production
Learn how to configure LiteRT for hardware-accelerated on-device AI inference using Google's production-ready NPU capabilities.
Litert · On Device Ai · Npu Acceleration
Ai Agents
How to Refactor Monolithic Agents with Google ADK
Learn how to transition monolithic prompt scripts into production-ready multi-agent pipelines using Google's Agent Development Kit and the Agent2Agent protocol.
Multi Agent Systems · Google Adk · Agent To Agent Protocol
Ai Agents
Build Korean AI Agents with Nemotron Synthetic Personas
Learn how to use NVIDIA Nemotron-Personas-Korea to ground AI agents in authentic South Korean demographics, cultural norms, and honorifics.
Nvidia Nemotron · Synthetic Data · Sovereign Ai
Ai Agents
Build a Multi-Agent AI Code Review System with OpenCode
Learn how Cloudflare uses specialized AI agents and OpenCode to automate security, performance, and compliance reviews across thousands of repositories.
Opencode · Automated Code Review · Ci Cd Automation
Ai Engineering
Build Real-Time Voice Agents with Cloudflare Agents SDK
Learn how to integrate low-latency voice interactions into your AI agents using Cloudflare's new @cloudflare/voice package and Durable Objects.
Cloudflare Workers · Voice Ai · Stt
Ai Engineering
Build a Fast Multilingual OCR with Nemotron-OCR-v2
Learn how to deploy NVIDIA Nemotron-OCR-v2 for high-speed document extraction across six languages using synthetic data and GPU acceleration.
Nvidia Nemotron · Multilingual Ocr · Synthetic Data
Ai Engineering
Train Multimodal Sentence Transformers for Visual Retrieval
Learn how to finetune multimodal embedding and reranker models for text, image, and audio using the updated Sentence Transformers library.
Sentence Transformers · Multimodal Ai · Embedding Models