AI News

Latest AI engineering news, updated daily.

In-depth tutorials and guides. Go to Blog →

Ai Engineering

IBM MAMMAL Foundation Model Unifies Gene and Protein Analysis

IBM Research released MAMMAL, a unified 458-million parameter foundation model that processes genes, proteins, and molecules in a single shared framework.

Foundation Models · Computational Biology · Ibm Research

Ai Engineering

Wirestock DaaS Platform Lands $23M for Ethical Multimodal Data

Wirestock raised $23 million to expand its data-as-a-service platform, supplying foundation model makers with ethically licensed images, video, and 3D assets.

Multimodal Data · Ethical Ai · Data As A Service

Ai Coding

Mobile Codex Command Center Enters Preview for macOS Hosts

Developers can now monitor, approve, and redirect long-running Codex coding tasks directly from the ChatGPT mobile app for iOS and Android.

Openai Codex · Mobile Development · Developer Tools

Ai Agents

Osaurus Pivots to Unified macOS Agent Platform With Linux VMs

The open-source Osaurus app now routes local MLX models and cloud APIs through a hardware-isolated agent harness natively built for Apple Silicon.

Apple Silicon · Open Source Ai · Macos Automation

Ai Engineering

32K Context Hits IBM's Open Multilingual Embedding R2 Models

IBM released Granite Embedding Multilingual R2, upgrading its Apache 2.0 encoder models with a 32,768-token context window and ModernBERT architecture.

Embedding Models · Multilingual Ai · Vector Search

Ai Agents

Anthropic Limits Claude Mythos Following 83% Exploit Success

Anthropic has restricted its new Claude Mythos model to select partners after pre-release testing revealed autonomous cyberattack capabilities.

Autonomous Capabilities · Cybersecurity Risk · Model Alignment

Ai Engineering

Async CUDA Streams Eliminate 25% GPU Wait in Transformers

Hugging Face implemented asynchronous continuous batching in the transformers library, using CUDA streams to recover 25% of runtime lost to CPU idle gaps.

Cuda Streams · Gpu Optimization · Continuous Batching

Ai Coding

Cursor Adds Multi-Repo Support to Cloud Agent Environments

Cursor's updated Cloud Agent Development Environments introduce multi-repo capabilities, layer caching, and scoped egress for autonomous coding tasks.

Cursor Editor · Cloud Agents · Multi Repo Support

Ai Engineering

Google AI Edge Taps Arm SME2 for 5x Faster CPU Inference

Google and Arm have integrated SME2 micro-kernels into LiteRT, accelerating on-device generative AI workloads by up to 5x without custom assembly code.

Edge Ai · On Device Inference · Arm Architecture

Ai Agents

Claude 4.7 UI Guidelines Require Strict Screenshot Downscaling

Anthropic's new best practices for computer use identify click accuracy bottlenecks, providing precise screenshot limits and token configurations for Opus 4.7.

Anthropic Claude · Computer Use · Token Optimization

Ai Agents

Browser Run Migrates to Edge Containers for 4x Concurrency

Cloudflare rebuilt its Browser Run platform on native edge containers, quadrupling concurrency limits and halving latency for automated web tasks.

Edge Computing · Browser Automation · Infrastructure Scaling

Ai Engineering

Gemini 3.1 Flash-Lite Ships 1M Context at $0.25 Per Million

Google's lowest-latency Gemini model is now generally available, introducing variable thinking levels and a 1M token context window for high-volume routing.

Gemini Flash Lite · Context Window · Google Cloud

Ai Agents

Android 17 Integrates OS-Level Gemini Agentic Automation

Google previewed Android 17, introducing cross-app Gemini agents, generative UI widgets, and a biometric lockout feature for lost devices.

Android 17 · Gemini Ai · Agentic Workflows

Ai Engineering

Origin Lab Raises $8M for Game Engine Telemetry Marketplace

Origin Lab has secured $8 million in seed funding to launch a platform that converts raw video game engine data into licensed datasets for world model research.

World Models · Dataset Licensing · Video Game Telemetry

Ai Engineering

AutoScientist Automates Simultaneous Data and Weight Tuning

Adaption launched AutoScientist to automate model fine-tuning by optimizing training datasets and model weights simultaneously.

Fine Tuning · Model Optimization · Automated Machine Learning

Ai Agents

FrontierMath Tier 4 Record Falls to DeepMind Co-Mathematician

Google DeepMind's AI Co-Mathematician agent workbench doubled the baseline Gemini 3.1 Pro score to reach 48% on the FrontierMath Tier 4 benchmark.

Deepmind · Frontiermath · Mathematical Reasoning

Ai Coding

Anthropic Builds CLUE Threat Detection Platform in One Week

Anthropic's internal security team used Claude Code to develop and deploy CLUE, a natural language threat detection platform, in just seven days.

Anthropic · Claude Code · Cybersecurity

Ai Agents

$50M Series B Values Voice Infrastructure Provider Vapi at $500M

Vapi secured a $50 million Series B funding round at a $500 million valuation after Amazon Ring shifted its entire inbound call volume to the voice platform.

Voice Ai · Series B Funding · Ai Infrastructure

Ai Engineering

TML-Interaction-Small Achieves 0.40s Full-Duplex Latency

Thinking Machines Lab has released a research preview of TML-Interaction-Small, a 276-billion-parameter Mixture-of-Experts model for full-duplex conversation.

Mixture Of Experts · Full Duplex Communication · Large Language Models

Ai Engineering

Gemini Intelligence System Debuts With Googlebooks Platform

Google introduced the Gemini Intelligence system, a unified Android and ChromeOS core powering a new laptop hardware category called Googlebooks.

Google Gemini · Android Os · Chromeos