AI News

Latest AI engineering news, updated daily.


AI Engineering

vLLM V1 Migration: Fix Logprobs Before RL Corrections

ServiceNow's vLLM V1 migration shows why RL pipelines need backend logprob parity before objective-level corrections.

LLM Inference · vLLM Framework · Reinforcement Learning

AI Engineering

GPT-5.5 Instant Cuts ChatGPT Hallucinations by 52.5%

OpenAI has replaced ChatGPT's default engine with GPT-5.5 Instant, a less verbose model featuring improved factuality, personalization, and memory sources.

LLM Optimization · Hallucination Reduction · OpenAI Updates

AI Engineering

Private Evaluation Track Deters Open ASR Benchmaxxing

Hugging Face partnered with Appen and DataoceanAI to introduce a private evaluation track to the Open ASR Leaderboard, mitigating test-set contamination.

Automatic Speech Recognition · Model Evaluation · Benchmarking Integrity

AI Engineering

GENE-26.5 Gives Hardware-Agnostic Robots Human-Scale Dexterity

The French robotics startup Genesis AI has released GENE-26.5, a hardware-agnostic foundation model paired with a custom human-scale robotic hand.

Robotics · Foundation Models · Hardware Agnostic

AI Agents

Claude Managed Agents Add Background Dreaming and Subagents

Anthropic updated Claude Managed Agents with background memory consolidation, multiagent orchestration, and rubric-based output grading for complex workflows.

Multiagent Orchestration · Memory Consolidation · Autonomous Workflows

AI Engineering

Steering Chemical Synthesis via LLM Evaluation in EPFL's Synthegy

EPFL researchers have developed Synthegy, a framework that uses large language models to evaluate and guide traditional computational chemistry algorithms.

Computational Chemistry · LLM Evaluation · Chemical Synthesis

AI Engineering

Native iOS 27 Workloads Can Now Route to Claude and Gemini

Apple's Extensions framework for iOS 27 allows developers to integrate third-party AI models directly into native Siri and Writing Tools workflows.

iOS Development · Apple Intelligence · Model Integration

Prompt Engineering

Mindgard Uses Visible Thinking to Jailbreak Claude Sonnet 4.5

Security firm Mindgard bypassed Claude Sonnet 4.5 safety filters by using psychological pressure to manipulate the model's visible internal reasoning process.

Adversarial Attacks · Jailbreaking · Large Language Models

AI Agents

$27M Funding Round Backs CopilotKit's App-Native Agent Stack

CopilotKit has raised $27 million to expand its generative UI framework and launch a self-hostable enterprise intelligence platform for app-native AI agents.

Venture Capital · App-Native Agents · Generative UI

AI Agents

AWS Tackles Agent Drift With Bedrock AgentCore Optimization

AWS has introduced AgentCore Optimization in preview to automate prompt updates and A/B testing, alongside a new desktop AI assistant called Amazon Quick.

AWS Bedrock · Agent Drift · Prompt Optimization

AI Engineering

Runpod Flash Removes Container Overhead for AI Inference

The open-source Flash Python SDK allows developers to convert local functions into auto-scaling serverless AI inference endpoints without Dockerfiles.

Serverless Inference · Cloud Infrastructure · Open-Source SDK

AI Engineering

DeepSeek V4 Pro Trails GPT-5.5 by 8 Months in NIST Benchmarks

The Center for AI Standards and Innovation evaluated DeepSeek-V4-Pro, placing its capabilities eight months behind U.S. frontier models while matching GPT-5.

DeepSeek V4 · NIST Benchmarks · LLM Evaluation

AI Engineering

TPU v5p Inference Speeds Triple With DFlash Block-Diffusion

Google and UCSD researchers released DFlash, a block-diffusion speculative decoding method that achieves a 3.13x average inference speedup on TPU v5p hardware.

LLM Inference · Google TPU · Speculative Decoding

AI Engineering

PyTorch Lightning 2.6.2 Drops Self-Spreading Credential Stealer

Threat actors hijacked the PyPI credentials for PyTorch Lightning to publish two malicious versions containing a self-propagating credential stealer.

PyPI Security · PyTorch Lightning · Supply Chain Attack

AI Engineering

ChatGPT Images 2.0 Adds Multilingual Text and Thinking Mode

OpenAI released ChatGPT Images 2.0 with the gpt-image-2 model, adding agentic web search, 2K resolution, and non-Latin script rendering capabilities.

Generative AI · Multilingual Support · Agentic Workflows

AI Engineering

Hybrid ML-KEM Arrives in Cloudflare IPsec for WAN Tunnels

Cloudflare has made post-quantum IPsec tunnels generally available, using a hybrid ML-KEM handshake compatible with Cisco and Fortinet hardware.

Post-Quantum Cryptography · Hybrid ML-KEM · IPsec Tunnels

AI Engineering

CVE-2026-31431 Grants Local Root via Linux Page Cache Write

A logic bug in the Linux kernel's userspace crypto API allows unprivileged local users to gain root access across major distributions dating back to 2017.

Linux Kernel · Privilege Escalation · Security Vulnerability

AI Engineering

xAI Ships 2-Minute Voice Clones and Grok 4.3 APIs

xAI has introduced a fast custom voice cloning suite and a new Voice Library alongside the launch of its 1M-context Grok 4.3 model.

Voice Cloning · Grok API · Multimodal AI

AI Engineering

Meta Acquires ARI for Open Humanoid Intelligence Platform

Meta has acquired robotics startup Assured Robot Intelligence to build foundational control and behavioral models for third-party humanoid hardware.

Robotics AI · Foundational Models · Meta Acquisition

AI Engineering

Amazon Bedrock Gains GPT-5.5 and Codex in $50B OpenAI Deal

Following the end of Microsoft's exclusive distribution rights, Amazon Web Services has introduced OpenAI's GPT-5.5 and Codex models to the Bedrock platform.

Amazon Bedrock · OpenAI Partnership · GPT-5.5