Blog

AI engineering insights, practical advice, and things I'm learning.

Career

Why Most AI Advice Is Terrible

Most AI advice falls into hype or fear. Neither helps. What actually matters: understanding the mechanics, building real skills, and thinking for yourself.

AI · Career · Critical Thinking

AI Engineering

How to Build a RAG Application (Step by Step)

A practical walkthrough of building a RAG pipeline from scratch: chunking documents, generating embeddings, storing vectors, retrieving context, and generating grounded answers.

RAG · Retrieval-Augmented Generation · Embeddings
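
The five stages named in the teaser above (chunking, embedding, storing, retrieving, generating) can be sketched in a few lines. This is only a toy illustration under loud assumptions: the "embedding" is a bag-of-words count rather than a learned model, the "vector store" is a plain list, and the final generation step is left as a prompt string to hand to whatever model you use. All function names here are hypothetical.

```python
import math
import re
from collections import Counter


def chunk(text, size=12):
    """Split text into chunks of at most `size` words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def embed(text):
    """Stand-in embedding (assumption): lowercase word counts, not a learned model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, store, k=2):
    """Return the k chunk texts most similar to the query."""
    q = embed(query)
    ranked = sorted(store, key=lambda item: cosine(q, item[1]), reverse=True)
    return [text for text, _ in ranked[:k]]


def build_prompt(query, context):
    """Assemble a grounded prompt; sending it to a model is left to the reader."""
    joined = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {query}"


docs = [
    "Ollama runs language models on a local machine.",
    "Embeddings map text to vectors so similar passages sit close together.",
]
# "Store" step: keep each chunk next to its vector in a plain list.
store = [(c, embed(c)) for d in docs for c in chunk(d)]

question = "What do embeddings do?"
print(build_prompt(question, retrieve(question, store, k=1)))
```

A real pipeline would swap `embed` for an embedding model and the list for a vector database; the control flow stays the same.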

AI Engineering

How to Run LLMs Locally on Your Machine

Running AI models locally gives you privacy, speed, and zero API costs. Here's what hardware you need, which tools to use, and how to choose the right model.

Local LLMs · Ollama · Llama

AI Engineering

Structured Output from LLMs: JSON Mode Explained

LLMs generate text, but applications need structured data. Here's how JSON mode, function calling, and schema enforcement turn free-form AI output into reliable, typed data.

Structured Output · JSON Mode · Function Calling

AI Coding

The AI Coding Workflow That Actually Works

A practical workflow for coding with AI: what to hand the model, what to review line by line, and when to throw the output away.

AI Coding · Developer Tools · Workflow

AI Engineering

Fine-Tuning vs RAG: When to Use Each Approach

RAG changes what the model knows. Fine-tuning changes how it behaves. Here's when to use each approach, their real tradeoffs, and why the answer is usually both.

Fine-Tuning · RAG · LLM

AI Agents

What Are AI Agents and How Do They Work?

AI agents can plan, use tools, and take action autonomously. Here's what they are, how they work under the hood, and what separates useful agents from overhyped demos.

AI Agents · LLMs · Automation

AI Engineering

What Is the Model Context Protocol (MCP)?

MCP standardizes how AI models connect to tools and data. Here's what the Model Context Protocol is, how it works, and why it matters for developers building AI applications.

MCP · Model Context Protocol · AI Agents

AI Engineering

What Is RAG? Retrieval-Augmented Generation Explained

RAG lets AI models pull in real data before generating a response. Here's how retrieval-augmented generation works, why it matters, and where it breaks down.

RAG · Retrieval-Augmented Generation · LLMs

AI Coding

What Is Vibe Coding? The Developer's Guide

Vibe coding means describing what you want in natural language and letting AI write the code. Here's what it actually looks like, where it works, where it fails, and how to do it well.

Vibe Coding · AI Coding · Developer Tools

AI Engineering

What Are Embeddings in AI? A Technical Explanation

Embeddings turn text into numbers that capture meaning. Here's how they work, why they matter for search and RAG, and how to choose the right model for your use case.

Embeddings · Vector Search · AI Architecture

Prompt Engineering

Prompt Engineering Guide: How to Write Better AI Prompts

Prompting isn't about magic phrases. It's structured thinking that determines output quality. Here's how to write prompts that actually work, from frameworks to chain-of-thought to system prompts.

Prompt Engineering · LLM · AI Engineering

AI Engineering

Why AI Hallucinates and How to Reduce It

AI hallucination isn't a bug you can patch. It's a consequence of how language models work. Here's what causes it, how to measure it, and what actually reduces it.

Hallucination · LLMs · AI Safety

AI Engineering

What Is AI Temperature and How Does It Affect Output?

Temperature controls how random or deterministic an AI model's output is. Here's what it does technically, how it relates to top-p and top-k, and when to adjust it.

Temperature · LLM · AI Engineering
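
To make the temperature teaser above concrete, here is a minimal sketch of the usual formulation: logits are divided by the temperature before the softmax, so low values sharpen the distribution and high values flatten it. The logits below are made-up numbers for illustration, and the function name is hypothetical; real inference stacks apply the same idea over the full vocabulary, often alongside top-p or top-k filtering.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Convert raw logits to probabilities after scaling by 1/temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Illustrative logits for three candidate tokens (assumed values).
logits = [2.0, 1.0, 0.1]

for t in (0.5, 1.0, 2.0):
    probs = softmax_with_temperature(logits, t)
    print(f"temperature={t}: {[round(p, 3) for p in probs]}")
```

Running it shows the top token's share shrinking as the temperature rises, which is why higher settings produce more varied output.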

AI Engineering

Context Windows Explained: Why Your AI Forgets

Context windows determine how much an AI model can 'see' at once. Here's what they are technically, how attention scales, and practical strategies for working within their limits.

Context Windows · LLMs · Prompt Engineering

AI Engineering

What Is an LLM? How Large Language Models Actually Work

LLMs predict text; they don't understand it. Here's how large language models work under the hood, from training to transformers to next-token prediction, and why it matters for how you use them.

LLM · Large Language Models · AI Engineering

AI Engineering

What Tokenization Means for Your Prompts

Tokenization isn't just a technical detail. It shapes how LLMs process your input. Understanding it changes the way you write prompts.

Tokenization · LLMs · Prompt Engineering