AI News
Latest AI engineering news, updated daily.
Ai Engineering
vLLM V1 Migration: Fix Logprobs Before RL Corrections
ServiceNow's vLLM V1 migration shows why RL pipelines need backend logprob parity before objective-level corrections.
Llm Inference · Vllm Framework · Reinforcement Learning
Ai Engineering
GPT-5.5 Instant Cuts ChatGPT Hallucinations by 52.5%
OpenAI has replaced ChatGPT's default engine with GPT-5.5 Instant, a less verbose model featuring improved factuality, personalization, and memory sources.
Llm Optimization · Hallucination Reduction · Openai Updates
Ai Engineering
Private Evaluation Track Deters Open ASR Benchmaxxing
Hugging Face partnered with Appen and DataoceanAI to introduce a private evaluation track to the Open ASR Leaderboard, mitigating test-set contamination.
Automatic Speech Recognition · Model Evaluation · Benchmarking Integrity
Ai Engineering
GENE-26.5 Gives Hardware-Agnostic Robots Human-Scale Dexterity
The French robotics startup Genesis AI has released GENE-26.5, a hardware-agnostic foundation model paired with a custom human-scale robotic hand.
Robotics · Foundation Models · Hardware Agnostic
Ai Agents
Claude Managed Agents Add Background Dreaming and Subagents
Anthropic updated Claude Managed Agents with background memory consolidation, multiagent orchestration, and rubric-based output grading for complex workflows.
Multiagent Orchestration · Memory Consolidation · Autonomous Workflows
Ai Engineering
Steering Chemical Synthesis via LLM Evaluation in EPFL's Synthegy
EPFL researchers have developed Synthegy, a framework that uses large language models to evaluate and guide traditional computational chemistry algorithms.
Computational Chemistry · Llm Evaluation · Chemical Synthesis
Ai Engineering
Native iOS 27 Workloads Can Now Route to Claude and Gemini
Apple's Extensions framework for iOS 27 allows developers to integrate third-party AI models directly into native Siri and Writing Tools workflows.
Ios Development · Apple Intelligence · Model Integration
Prompt Engineering
Mindgard Uses Visible Thinking to Jailbreak Claude Sonnet 4.5
Security firm Mindgard bypassed Claude Sonnet 4.5 safety filters by using psychological pressure to manipulate the model's visible internal reasoning process.
Adversarial Attacks · Jailbreaking · Large Language Models
Ai Agents
$27M Funding Round Backs CopilotKit's App-Native Agent Stack
CopilotKit has raised $27 million to expand its generative UI framework and launch a self-hostable enterprise intelligence platform for app-native AI agents.
Venture Capital · App Native Agents · Generative Ui
Ai Agents
AWS Tackles Agent Drift With Bedrock AgentCore Optimization
AWS has introduced AgentCore Optimization in preview to automate prompt updates and A/B testing, alongside a new desktop AI assistant called Amazon Quick.
Aws Bedrock · Agent Drift · Prompt Optimization
Ai Engineering
Runpod Flash Removes Container Overhead for AI Inference
The open-source Flash Python SDK allows developers to convert local functions into auto-scaling serverless AI inference endpoints without Dockerfiles.
Serverless Inference · Cloud Infrastructure · Open Source Sdk
Ai Engineering
DeepSeek V4 Pro Trails GPT-5.5 by 8 Months in NIST Benchmarks
The Center for AI Standards and Innovation evaluated DeepSeek-V4-Pro, placing its capabilities eight months behind U.S. frontier models while matching GPT-5.
Deepseek V4 · Nist Benchmarks · Llm Evaluation
Ai Engineering
TPU v5p Inference Speeds Triple With DFlash Block-Diffusion
Google and UCSD researchers released DFlash, a block-diffusion speculative decoding method that achieves a 3.13x average inference speedup on TPU v5p hardware.
Llm Inference · Google Tpu · Speculative Decoding
Ai Engineering
PyTorch Lightning 2.6.2 Drops Self-Spreading Credential Stealer
Threat actors hijacked the PyPI credentials for PyTorch Lightning to publish two malicious versions containing a self-propagating credential stealer.
Pypi Security · Pytorch Lightning · Supply Chain Attack
Ai Engineering
ChatGPT Images 2.0 Adds Multilingual Text and Thinking Mode
OpenAI released ChatGPT Images 2.0 with the gpt-image-2 model, adding agentic web search, 2K resolution, and non-Latin script rendering capabilities.
Generative Ai · Multilingual Support · Agentic Workflows
Ai Engineering
Hybrid ML-KEM Arrives in Cloudflare IPsec for WAN Tunnels
Cloudflare has introduced general availability for post-quantum IPsec tunnels using a hybrid ML-KEM handshake compatible with Cisco and Fortinet hardware.
Post Quantum Cryptography · Hybrid Ml Kem · Ipsec Tunnels
Ai Engineering
CVE-2026-31431 Grants Local Root via Linux Page Cache Write
A logic bug in the Linux kernel's userspace crypto API allows unprivileged local users to gain root access across major distributions dating back to 2017.
Linux Kernel · Privilege Escalation · Security Vulnerability
Ai Engineering
xAI Ships 2-Minute Voice Clones and Grok 4.3 APIs
xAI has introduced a fast custom voice cloning suite and a new Voice Library alongside the launch of its 1M-context Grok 4.3 model.
Voice Cloning · Grok Api · Multimodal Ai
Ai Engineering
Meta Acquires ARI for Open Humanoid Intelligence Platform
Meta has acquired robotics startup Assured Robot Intelligence to build foundational control and behavioral models for third-party humanoid hardware.
Robotics Ai · Foundational Models · Meta Acquisition
Ai Engineering
Amazon Bedrock Gains GPT-5.5 and Codex in $50B OpenAI Deal
Following the end of Microsoft's exclusive distribution rights, Amazon Web Services has introduced OpenAI's GPT-5.5 and Codex models to the Bedrock platform.
Amazon Bedrock · Openai Partnership · Gpt 5 5