AI News
Latest AI engineering news, updated daily.
Ai Engineering
IBM MAMMAL Foundation Model Unifies Gene and Protein Analysis
IBM Research released MAMMAL, a unified 458-million parameter foundation model that processes genes, proteins, and molecules in a single shared framework.
Foundation Models · Computational Biology · Ibm Research
Ai Engineering
Wirestock DaaS Platform Lands $23M for Ethical Multimodal Data
Wirestock raised $23 million to expand its data-as-a-service platform, supplying foundation model makers with ethically licensed images, video, and 3D assets.
Multimodal Data · Ethical Ai · Data As A Service
Ai Coding
Mobile Codex Command Center Enters Preview for macOS Hosts
Developers can now monitor, approve, and redirect long-running Codex coding tasks directly from the ChatGPT mobile app for iOS and Android.
Openai Codex · Mobile Development · Developer Tools
Ai Agents
Osaurus Pivots to Unified macOS Agent Platform With Linux VMs
The open-source Osaurus app now routes local MLX models and cloud APIs through a hardware-isolated agent harness natively built for Apple Silicon.
Apple Silicon · Open Source Ai · Macos Automation
Ai Engineering
32K Context Hits IBM's Open Multilingual Embedding R2 Models
IBM released Granite Embedding Multilingual R2, upgrading its Apache 2.0 encoder models with a 32,768-token context window and ModernBERT architecture.
Embedding Models · Multilingual Ai · Vector Search
Ai Agents
Anthropic Limits Claude Mythos Following 83% Exploit Success
Anthropic has restricted its new Claude Mythos model to select partners after pre-release testing revealed autonomous cyberattack capabilities.
Autonomous Capabilities · Cybersecurity Risk · Model Alignment
Ai Engineering
Async CUDA Streams Eliminate 25% GPU Wait in Transformers
Hugging Face implemented asynchronous continuous batching in the transformers library, using CUDA streams to recover 25% of runtime lost to CPU idle gaps.
Cuda Streams · Gpu Optimization · Continuous Batching
Ai Coding
Cursor Adds Multi-Repo Support to Cloud Agent Environments
Cursor's updated Cloud Agent Development Environments introduce multi-repo capabilities, layer caching, and scoped egress for autonomous coding tasks.
Cursor Editor · Cloud Agents · Multi Repo Support
Ai Engineering
Google AI Edge Taps Arm SME2 for 5x Faster CPU Inference
Google and Arm have integrated SME2 micro-kernels into LiteRT, accelerating on-device generative AI workloads by up to 5x without custom assembly code.
Edge Ai · On Device Inference · Arm Architecture
Ai Agents
Claude 4.7 UI Guidelines Require Strict Screenshot Downscaling
Anthropic's new best practices for computer use identify click accuracy bottlenecks, providing precise screenshot limits and token configurations for Opus 4.7.
Anthropic Claude · Computer Use · Token Optimization
Ai Agents
Browser Run Migrates to Edge Containers for 4x Concurrency
Cloudflare rebuilt its Browser Run platform on native edge containers, quadrupling concurrency limits and halving latency for automated web tasks.
Edge Computing · Browser Automation · Infrastructure Scaling
Ai Engineering
Gemini 3.1 Flash-Lite Ships 1M Context at $0.25 Per Million
Google's lowest-latency Gemini model is now generally available, introducing variable thinking levels and a 1M token context window for high-volume routing.
Gemini Flash Lite · Context Window · Google Cloud
Ai Agents
Android 17 Integrates OS-Level Gemini Agentic Automation
Google previewed Android 17, introducing cross-app Gemini agents, generative UI widgets, and a biometric lockout feature for lost devices.
Android 17 · Gemini Ai · Agentic Workflows
Ai Engineering
Origin Lab Raises $8M for Game Engine Telemetry Marketplace
Origin Lab has secured $8 million in seed funding to launch a platform that converts raw video game engine data into licensed datasets for world model research.
World Models · Dataset Licensing · Video Game Telemetry
Ai Engineering
AutoScientist Automates Simultaneous Data and Weight Tuning
Adaption launched AutoScientist to automate model fine-tuning by optimizing training datasets and model weights simultaneously.
Fine Tuning · Model Optimization · Automated Machine Learning
Ai Agents
FrontierMath Tier 4 Record Falls to DeepMind Co-Mathematician
Google DeepMind's AI Co-Mathematician agent workbench doubled the baseline Gemini 3.1 Pro score to reach 48% on the FrontierMath Tier 4 benchmark.
Deepmind · Frontiermath · Mathematical Reasoning
Ai Coding
Anthropic Builds CLUE Threat Detection Platform in One Week
Anthropic's internal security team used Claude Code to develop and deploy CLUE, a natural language threat detection platform, in just seven days.
Anthropic · Claude Code · Cybersecurity
Ai Agents
$50M Series B Values Voice Infrastructure Provider Vapi at $500M
Vapi secured a $50 million Series B funding round at a $500 million valuation after Amazon Ring shifted its entire inbound call volume to the voice platform.
Voice Ai · Series B Funding · Ai Infrastructure
Ai Engineering
TML-Interaction-Small Achieves 0.40s Full-Duplex Latency
Thinking Machines Lab has released a research preview of TML-Interaction-Small, a 276-billion-parameter Mixture-of-Experts model for full-duplex conversation.
Mixture Of Experts · Full Duplex Communication · Large Language Models
Ai Engineering
Gemini Intelligence System Debuts With Googlebooks Platform
Google introduced the Gemini Intelligence system, a unified Android and ChromeOS core powering a new laptop hardware category called Googlebooks.
Google Gemini · Android Os · Chromeos