Blog

AI engineering insights, practical advice, and things I'm learning.

Latest AI news, updated daily. Go to News →

Ai Agents

How to Build Long-Running AI Agents With Google ADK 1.0

Google's Agent Development Kit 1.0 enables multi-day workflows that survive restarts. Learn to configure durable state machines and persistent session storage.

Google Adk · Durable Computing · State Machines

Ai Engineering

How to Scale PyTorch Training With AWS Building Blocks

Learn how to configure AWS infrastructure and Hugging Face tools to optimize large-scale foundation model pre-training and inference workflows.

Pytorch · Aws Cloud · Foundation Models

Ai Agents

How to Build Multi-Agent CNC Workflows on AMD MI300X

Learn how to coordinate LangChain agents and Qwen 2.5 7B on the AMD MI300X to reduce CNC manufacturability analysis time from hours to seconds.

Amd Mi300x · Langchain · Qwen Model

Ai Engineering

How to Fine-Tune Qwen3 on AMD MI300X Using ROCm

Learn how to configure ROCm 6.1 environment variables and use the Hugging Face stack to fine-tune Qwen3-1.7B on AMD hardware without CUDA.

Fine Tuning · Amd Rocm · Qwen 3

Ai Coding

Cursor's Autoinstall Bootstraps RL Training Environments

Learn how Cursor uses previous model generations to automate reinforcement learning environment setups, mock dependencies, and verify target commands.

Cursor Editor · Reinforcement Learning · Automated Testing

Ai Engineering

How to Implement Event-Driven Webhooks in the Gemini API

Learn how to configure static and dynamic webhooks in the Gemini API to eliminate polling overhead for long-running AI operations and agent workflows.

Gemini Api · Webhooks · Event Driven Architecture

Ai Coding

How to build an iOS app with Claude Code subagents

Learn how to orchestrate parallel subagents in Claude Code to build and ship a production Swift application using natural language and specialized roles.

Claude Code · Ios Development · Ai Agents

Ai Agents

How to Run Multi-Tenant Agents With Dynamic Workflows

Learn how to use Cloudflare Dynamic Workflows to execute durable, stateful operations for multi-tenant applications and long-running AI agents.

Cloudflare Workers · Multi Tenancy · Durable Execution

Ai Engineering

How to Build Cross-Modal RAG Pipelines With Gemini Embedding 2

Learn how to process text, images, video, and audio into a single semantic vector space using Google's natively multimodal Gemini Embedding 2 model.

Multimodal Rag · Gemini Embedding 2 · Vector Databases

Ai Coding

How to Build Programmatic Agents With the Cursor SDK

Learn how to automate engineering workflows and deploy specialized coding agents using the TypeScript-based Cursor SDK and Cloud Agents API.

Cursor Sdk · Cloud Agents · Typescript

Ai Coding

How to Integrate Claude Code into Large Legacy Codebases

Learn how to integrate Claude Code into massive legacy projects using incremental context and the new native binary features in version 2.1.119.

Claude Code · Legacy Systems · Software Engineering

Ai Agents

Google's 5-Day Vibe Coding Course Returns to Kaggle in June

Learn how to build production-ready agents and use natural language as a programming interface in Google's returning 5-day intensive course on Kaggle.

Vibe Coding · Google Kaggle · Agentic Workflows

Ai Coding

OpenAI Codex Desktop Adds 90 Plugins and Reusable Skills

Learn how to configure OpenAI Codex plugins and reusable skills to automate desktop tasks and connect your coding workflows to external data sources.

Openai Codex · Workflow Automation · Plugin Architecture

Ai Engineering

Google Graduates LiteRT NPU Acceleration to Production

Learn how to configure LiteRT for hardware-accelerated on-device AI inference using Google's production-ready NPU capabilities.

Litert · On Device Ai · Npu Acceleration

Ai Agents

How to Refactor Monolithic Agents with Google ADK

Learn how to transition monolithic prompt scripts into production-ready multi-agent pipelines using Google's Agent Development Kit and the Agent2Agent protocol.

Multi Agent Systems · Google Adk · Agent To Agent Protocol

Ai Agents

Build Korean AI Agents with Nemotron Synthetic Personas

Learn how to use NVIDIA Nemotron-Personas-Korea to ground AI agents in authentic South Korean demographics, cultural norms, and honorifics.

Nvidia Nemotron · Synthetic Data · Sovereign Ai

Ai Agents

Build a Multi-Agent AI Code Review System with OpenCode

Learn how Cloudflare uses specialized AI agents and OpenCode to automate security, performance, and compliance reviews across thousands of repositories.

Opencode · Automated Code Review · Ci Cd Automation

Ai Engineering

Build Real-Time Voice Agents with Cloudflare Agents SDK

Learn how to integrate low-latency voice interactions into your AI agents using Cloudflare's new @cloudflare/voice package and Durable Objects.

Cloudflare Workers · Voice Ai · Stt

Ai Engineering

Build a Fast Multilingual OCR with Nemotron-OCR-v2

Learn how to deploy NVIDIA Nemotron-OCR-v2 for high-speed document extraction across six languages using synthetic data and GPU acceleration.

Nvidia Nemotron · Multilingual Ocr · Synthetic Data

Ai Engineering

Train Multimodal Sentence Transformers for Visual Retrieval

Learn how to finetune multimodal embedding and reranker models for text, image, and audio using the updated Sentence Transformers library.

Sentence Transformers · Multimodal Ai · Embedding Models