Domain Experts Sweep Claude Opus 4.7 Hackathon Results
Anthropic's latest hackathon highlights a shift in AI development, with doctors and teachers using Opus 4.7 to build complex agentic applications.
On June 15, 2026, Anthropic announced the results of the Built with Opus 4.7: a Claude Code hackathon, a global virtual competition focused on agentic applications. Run alongside Cerebral Valley, the event provided 500 selected participants with $100,000 in API credits. The winning results demonstrated a definitive shift in application development, with non-traditional builders and subject matter experts consistently outperforming career software engineers.
Winning Domain Implementations
The standout projects solved narrow, complex problems rather than generic productivity tasks. This emphasizes how raw technical proficiency is giving way to deep domain expertise when architecting AI systems.
| Placement | Project | Domain | Core Functionality |
|---|---|---|---|
| First | Medkit | Healthcare | Voice-first clinical simulator evaluating users against NICE, ESC, and AHA guidelines. |
| Second | Wrench Board | Electronics Repair | High-resolution schematic and image analysis for microsoldering diagnostics. |
| Third | Kyōjo | Education | Interactive computer science instruction platform for teachers in Chile. |
Medkit took the top spot by utilizing four separate Claude Code sessions to run voice processing, clinical content generation, a 3D game layer, and the core application. By orchestrating parallel subagents under the Claude Managed Agents framework, the tool operates as a real-time patient simulator with distinct AI personalities. It enters pilot testing at three medical faculties in Istanbul in late June 2026.
Other notable entries included Aria, a factory maintenance tool that ingests audio and shift logs to predict machine failure, and Beyond The Limit, a memory recall assistant for dementia patients.
Opus 4.7 Capabilities
The projects relied heavily on the specifications of the Opus 4.7 release from April 2026. The model features a 1 million-token context window and supports up to 128,000 output tokens, allowing for large codebase manipulation and dense document ingestion.
Developers utilized the model’s new “xhigh” effort level for adaptive reasoning to process multi-step logic. The baseline capabilities of Opus 4.7 scored 87.6% on SWE-bench Verified, allowing participants to assemble complex architectures within the strict one-week timeline. Inference costs are fixed at $5 per million input tokens and $25 per million output tokens.
The Transition to 4.8
The hackathon results coincided with Anthropic’s Claude Build Day in San Francisco, which introduced the next iteration of the model. Claude Opus 4.8 pushes baseline performance higher, achieving 88.6% on SWE-bench Verified. Enterprise teams and individual builders are already migrating applications to the new endpoints to capture the incremental reasoning improvements.
If you build agentic tools, the success of these projects indicates that specific industry knowledge is your primary structural advantage. Grounding an application in authoritative guidelines, precise hardware schematics, or strict domain rules yields substantially higher utility than generalized workflows.
Get Insanely Good at AI
The book for developers who want to understand how AI actually works. LLMs, prompt engineering, RAG, AI agents, and production systems.
Keep Reading
How to Build Long-Running AI Agents With Google ADK 1.0
Google's Agent Development Kit 1.0 enables multi-day workflows that survive restarts. Learn to configure durable state machines and persistent session storage.
Turn Prompts Into Prototypes Using Claude Design
Anthropic Labs debuts Claude Design, a conversational AI tool powered by Opus 4.7 that generates brand-aware visual assets and interactive prototypes.
Claude Opus 4.7: Better Coding, 3x Vision, Cyber Controls
Anthropic releases Claude Opus 4.7 with major software engineering gains, 3x higher image resolution, automated cybersecurity safeguards, and a new xhigh effort level.
OpenEnv Standardizes Agentic RL With Universal Action Space API
Hugging Face and academic partners have released OpenEnv, providing a unified API and 1,200 tasks to train agents across digital and physical interfaces.
Anthropic Moves Claude Mythos Toward Public Agent Access
Anthropic's autonomous vulnerability discovery model, Claude Mythos, has appeared in Claude Code, suggesting an upcoming public release for the restricted tier.