AI News
Latest AI engineering news, updated daily.
Ai Engineering
Pentagon Approves Eight AI Vendors For IL7 Classified Networks
The Department of War has authorized models from OpenAI, Google, and six other vendors for classified networks following its dispute with Anthropic.
Defense Technology · Classified Networks · Government Procurement
Ai Engineering
Pre-Trial AI Toxicity Filters Isolate IRS4 Cancer Target
Researchers at St. Jude used AI safety filtering to identify IRS4 as a high-potential target for solid tumors by predicting toxicity before clinical trials.
Ai Safety Filters · Drug Discovery · Biotech Innovation
Ai Agents
On-Call Agent TasksMind Drops Incident Resolution to 60 Seconds
TasksMind has introduced an autonomous incident response agent that writes patches and resolves production alerts in under 60 seconds.
Autonomous Agents · Incident Response · Devops Automation
Ai Coding
RLHF Leak Forces OpenAI to Ban Goblin Metaphors in Codex
OpenAI hardcoded a ban on goblin metaphors in the GPT-5.5 Codex CLI after an unintended reinforcement learning generalization corrupted bug descriptions.
Rlhf · Openai Codex · Reinforcement Learning
Ai Engineering
Grok Training Partly Relied on OpenAI Model Distillation
Elon Musk testified in federal court that xAI partly relied on model distillation from OpenAI to validate and train the Grok chatbot.
Model Distillation · Large Language Models · Xai Grok
Ai Agents
GPT-5.5-Cyber Launch Restricted to Trusted Defense Partners
OpenAI has launched GPT-5.5-Cyber for autonomous vulnerability detection, restricting access to government and critical infrastructure through its TAC program.
Cybersecurity · Autonomous Agents · Government Ai
Ai Agents
128B Mistral Medium 3.5 Moves Vibe Coding Agents to the Cloud
Mistral AI's new 128-billion parameter dense model introduces configurable reasoning alongside asynchronous cloud-based execution for coding agents.
Mistral Medium · Large Language Models · Vibe Coding
Ai Coding
Anthropic's Claude Security Beta Patches Code With Opus 4.7
Anthropic released the public beta of Claude Security, an Opus 4.7-powered defensive tool that scans codebases for vulnerabilities and generates patches.
Anthropic · Vulnerability Detection · Automated Patching
Ai Agents
DeepMind AI Co-Clinician Logs Zero Critical Errors in 97 Cases
Google DeepMind introduced the AI co-clinician to support physicians in real-world care settings, logging zero critical errors across 97 primary care cases.
Healthcare Ai · Google Deepmind · Clinical Decision Support
Ai Coding
Claude Code Retrospective Details 5x Drop in Session Costs
Anthropic's new technical retrospective reveals that prompt caching and prefix compaction act as strict architectural constraints for complex agentic workflows.
Anthropic · Prompt Caching · Agentic Workflows
Ai Coding
Agent Harness Tuning Gives Cursor a 26-Point Lead Over Codex
Anysphere released the Cursor SDK and new benchmarks showing its customized agent harness improves GPT-5.5 functional correctness by 26 percentage points.
Cursor Editor · Agent Harness · Benchmarking
Ai Agents
Claude Cowork brings sandboxed agent workflows to local desktops
Anthropic released a five-level enterprise deployment guide for Claude Cowork outlining sandboxed desktop execution, MDM support, and third-party inference.
Anthropic · Claude Cowork · Enterprise Ai
Ai Engineering
Malicious element-data Release Steals Cloud API Credentials
A supply-chain attack on the popular element-data Python package exposed cloud provider keys and warehouse credentials for roughly 12 hours.
Supply Chain Attack · Python Security · Cloud Api Credentials
Ai Coding
JetBrains and Warp Bundle Claude API Skill for Opus Migrations
Anthropic has integrated its open-source claude-api skill into major developer tools to automate model upgrades, context compaction, and caching strategies.
Anthropic Claude · Developer Tools · Api Integration
Ai Engineering
DeepInfra Brings $0.08/1M Inference to Hugging Face Hub
Developers can now route Hugging Face API requests directly to DeepInfra's serverless GPU infrastructure for high-performance model inference.
Hugging Face · Gpu Infrastructure · Inference Optimization
Ai Engineering
Evaluation Now Consumes 20% of AI Compute Budgets
Hugging Face and the EvalEval Coalition report that evaluating frontier AI models now requires massive inference compute, driving up development costs.
Ai Benchmarking · Inference Compute · Model Evaluation
Ai Agents
Agents Can Provision Cloudflare Accounts via Stripe Projects
Cloudflare has partnered with Stripe to launch a protocol allowing AI agents to autonomously create accounts, manage billing, and register domains.
Autonomous Agents · Cloud Infrastructure · Api Integration
Ai Engineering
IBM Granite 4.1 Pushes Dense 8B Model Past Previous 32B MoE
IBM released the Granite 4.1 open-source model family featuring dense text architectures, a 512K context window, and specialized vision and speech variants.
Open Source Llm · Ibm Granite · Dense Architecture
Ai Coding
Lovable Ships Voice-to-React Vibe Coding App for iOS
Lovable has launched its mobile application for iOS and Android, allowing developers to generate and deploy React applications directly via voice prompts.
Vibe Coding · Mobile Development · React Applications
Ai Agents
Tank OS Hardens OpenClaw Agent Deployments via Rootless Podman
Red Hat engineer Sally O'Malley released Tank OS, an open-source tool that secures OpenClaw AI agents using immutable Linux environments and rootless Podman.
Open Source · Enterprise Security · Rootless Podman