Nothing OS 4.1 Adds On-Device Voice Dictation
Nothing released Essential Voice, an on-device AI dictation tool in Nothing OS 4.1 that removes filler words, translates languages, and applies formatting.
Nothing released Essential Voice as part of the Nothing OS 4.1 update, transitioning the smartphone maker’s ecosystem toward a voice-first interface. Announced via Nothing’s Intelligence Toolkit integration, the feature is an on-device AI dictation and transcription tool. By refining raw speech into polished text without cloud latency, Nothing aims to replace traditional keyboard input, shifting users from an average typing speed of 36 words per minute to a speaking speed of 150 words per minute.
Real-Time Speech Processing and Translation
Essential Voice operates primarily as a real-time speech refinement engine. The system automatically detects and strips filler words, interjections, and stutters from raw audio before outputting text. Users can apply structural formatting dynamically by dictating commands like “make this a bulleted list” or “format as steps.” The engine parses and executes these commands in real-time without requiring manual text editing.
The tool supports over 100 languages and includes automatic language detection. It accounts for regional variant preferences, distinguishing between dialects like Latin American and European Spanish. A built-in live translation agent allows users to speak in their native language while outputting text in another. This expands the utility of local models compared to traditional cloud-dependent voice agents.
Users can configure personal mappings for frequent text insertions. Pre-configured voice shortcuts trigger specific outputs. A user can map the phrase “my email” to insert a primary email address, or say “office address” to input a full location string and map link directly into the text field.
Hardware Integration and Privacy Controls
Accessing the tool relies on physical hardware triggers or native software hooks. Users activate Essential Voice via a long-press on the Essential Key, a dedicated hardware button on supported Nothing devices, or directly through the system keyboard. Because the tool operates at the system level, it functions across third-party applications including WhatsApp, Gmail, Google Keep, and Slack.
Processing is handled strictly on-device, prioritizing privacy for sensitive dictation tasks. Nothing encrypts the audio during the transcription process and states that no voice data is stored on external servers after execution. This local AI inference model mirrors broader industry moves toward native offline dictation, keeping user data out of cloud transmission logs.
Availability Schedule
Essential Voice is bundled with the Nothing OS 4.1 update, which also includes the April 2026 Android security patch and community-inspired lock screen customizations. The rollout spans three primary hardware tiers:
- Nothing Phone (3): Available immediately as of the April 23 launch.
- Nothing Phone (4a) Pro: Scheduled for late April 2026.
- Nothing Phone (4a): Expected early May 2026.
Future updates to the Intelligence Toolkit will introduce context awareness to the dictation engine. This update will enable the system to automatically adjust the tone and structural output of transcribed text based on the active application, distinguishing between professional phrasing in an email client and casual phrasing in a messaging app. Developers building context-aware mobile applications should note Nothing’s system-level approach to interpreting user intent across application boundaries.
Get Insanely Good at AI
The book for developers who want to understand how AI actually works. LLMs, prompt engineering, RAG, AI agents, and production systems.
Keep Reading
Google Graduates LiteRT NPU Acceleration to Production
Learn how to configure LiteRT for hardware-accelerated on-device AI inference using Google's production-ready NPU capabilities.
Google AI Edge Eloquent brings free offline dictation to iOS
Google's new AI Edge Eloquent app uses Gemma 4 models to offer high-quality, offline-first transcription and text polishing for free on iPhone.
Outpacing Whisper: Cohere Transcribe Hits Top ASR Speed
Experience enterprise-grade audio intelligence with Cohere Transcribe, a new open-weights model topping the ASR leaderboard with 3x faster speeds than Whisper.
Microsoft Releases MAI-Transcribe-1 to Rival Whisper
Microsoft AI unveils MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 to reduce reliance on OpenAI with high-efficiency, in-house foundational models.
Cohere Transcribe debuts as open-source ASR model
Cohere Transcribe launches as a 2B open-source speech-to-text model with 14-language support, self-hosting, and vLLM serving.