AI News

Latest AI engineering news, updated daily.

In-depth tutorials and guides. Go to Blog →

Ai Engineering

Pentagon Approves Eight AI Vendors For IL7 Classified Networks

The Department of War has authorized models from OpenAI, Google, and six other vendors for classified networks following its dispute with Anthropic.

Defense Technology · Classified Networks · Government Procurement

Ai Engineering

Pre-Trial AI Toxicity Filters Isolate IRS4 Cancer Target

Researchers at St. Jude used AI safety filtering to identify IRS4 as a high-potential target for solid tumors by predicting toxicity before clinical trials.

Ai Safety Filters · Drug Discovery · Biotech Innovation

Ai Agents

On-Call Agent TasksMind Drops Incident Resolution to 60 Seconds

TasksMind has introduced an autonomous incident response agent that writes patches and resolves production alerts in under 60 seconds.

Autonomous Agents · Incident Response · Devops Automation

Ai Coding

RLHF Leak Forces OpenAI to Ban Goblin Metaphors in Codex

OpenAI hardcoded a ban on goblin metaphors in the GPT-5.5 Codex CLI after an unintended reinforcement learning generalization corrupted bug descriptions.

Rlhf · Openai Codex · Reinforcement Learning

Ai Engineering

Grok Training Partly Relied on OpenAI Model Distillation

Elon Musk testified in federal court that xAI partly relied on model distillation from OpenAI to validate and train the Grok chatbot.

Model Distillation · Large Language Models · Xai Grok

Ai Agents

GPT-5.5-Cyber Launch Restricted to Trusted Defense Partners

OpenAI has launched GPT-5.5-Cyber for autonomous vulnerability detection, restricting access to government and critical infrastructure through its TAC program.

Cybersecurity · Autonomous Agents · Government Ai

Ai Agents

128B Mistral Medium 3.5 Moves Vibe Coding Agents to the Cloud

Mistral AI's new 128-billion parameter dense model introduces configurable reasoning alongside asynchronous cloud-based execution for coding agents.

Mistral Medium · Large Language Models · Vibe Coding

Ai Coding

Anthropic's Claude Security Beta Patches Code With Opus 4.7

Anthropic released the public beta of Claude Security, an Opus 4.7-powered defensive tool that scans codebases for vulnerabilities and generates patches.

Anthropic · Vulnerability Detection · Automated Patching

Ai Agents

DeepMind AI Co-Clinician Logs Zero Critical Errors in 97 Cases

Google DeepMind introduced the AI co-clinician to support physicians in real-world care settings, logging zero critical errors across 97 primary care cases.

Healthcare Ai · Google Deepmind · Clinical Decision Support

Ai Coding

Claude Code Retrospective Details 5x Drop in Session Costs

Anthropic's new technical retrospective reveals that prompt caching and prefix compaction act as strict architectural constraints for complex agentic workflows.

Anthropic · Prompt Caching · Agentic Workflows

Ai Coding

Agent Harness Tuning Gives Cursor a 26-Point Lead Over Codex

Anysphere released the Cursor SDK and new benchmarks showing its customized agent harness improves GPT-5.5 functional correctness by 26 percentage points.

Cursor Editor · Agent Harness · Benchmarking

Ai Agents

Claude Cowork brings sandboxed agent workflows to local desktops

Anthropic released a five-level enterprise deployment guide for Claude Cowork outlining sandboxed desktop execution, MDM support, and third-party inference.

Anthropic · Claude Cowork · Enterprise Ai

Ai Engineering

Malicious element-data Release Steals Cloud API Credentials

A supply-chain attack on the popular element-data Python package exposed cloud provider keys and warehouse credentials for roughly 12 hours.

Supply Chain Attack · Python Security · Cloud Api Credentials

Ai Coding

JetBrains and Warp Bundle Claude API Skill for Opus Migrations

Anthropic has integrated its open-source claude-api skill into major developer tools to automate model upgrades, context compaction, and caching strategies.

Anthropic Claude · Developer Tools · Api Integration

Ai Engineering

DeepInfra Brings $0.08/1M Inference to Hugging Face Hub

Developers can now route Hugging Face API requests directly to DeepInfra's serverless GPU infrastructure for high-performance model inference.

Hugging Face · Gpu Infrastructure · Inference Optimization

Ai Engineering

Evaluation Now Consumes 20% of AI Compute Budgets

Hugging Face and the EvalEval Coalition report that evaluating frontier AI models now requires massive inference compute, driving up development costs.

Ai Benchmarking · Inference Compute · Model Evaluation

Ai Agents

Agents Can Provision Cloudflare Accounts via Stripe Projects

Cloudflare has partnered with Stripe to launch a protocol allowing AI agents to autonomously create accounts, manage billing, and register domains.

Autonomous Agents · Cloud Infrastructure · Api Integration

Ai Engineering

IBM Granite 4.1 Pushes Dense 8B Model Past Previous 32B MoE

IBM released the Granite 4.1 open-source model family featuring dense text architectures, a 512K context window, and specialized vision and speech variants.

Open Source Llm · Ibm Granite · Dense Architecture

Ai Coding

Lovable Ships Voice-to-React Vibe Coding App for iOS

Lovable has launched its mobile application for iOS and Android, allowing developers to generate and deploy React applications directly via voice prompts.

Vibe Coding · Mobile Development · React Applications

Ai Agents

Tank OS Hardens OpenClaw Agent Deployments via Rootless Podman

Red Hat engineer Sally O'Malley released Tank OS, an open-source tool that secures OpenClaw AI agents using immutable Linux environments and rootless Podman.

Open Source · Enterprise Security · Rootless Podman