Ai Agents 3 min read

Domain Experts Sweep Claude Opus 4.7 Hackathon Results

Anthropic's latest hackathon highlights a shift in AI development, with doctors and teachers using Opus 4.7 to build complex agentic applications.

On June 15, 2026, Anthropic announced the results of the Built with Opus 4.7: a Claude Code hackathon, a global virtual competition focused on agentic applications. Run alongside Cerebral Valley, the event provided 500 selected participants with $100,000 in API credits. The winning results demonstrated a definitive shift in application development, with non-traditional builders and subject matter experts consistently outperforming career software engineers.

Winning Domain Implementations

The standout projects solved narrow, complex problems rather than generic productivity tasks. This emphasizes how raw technical proficiency is giving way to deep domain expertise when architecting AI systems.

PlacementProjectDomainCore Functionality
FirstMedkitHealthcareVoice-first clinical simulator evaluating users against NICE, ESC, and AHA guidelines.
SecondWrench BoardElectronics RepairHigh-resolution schematic and image analysis for microsoldering diagnostics.
ThirdKyōjoEducationInteractive computer science instruction platform for teachers in Chile.

Medkit took the top spot by utilizing four separate Claude Code sessions to run voice processing, clinical content generation, a 3D game layer, and the core application. By orchestrating parallel subagents under the Claude Managed Agents framework, the tool operates as a real-time patient simulator with distinct AI personalities. It enters pilot testing at three medical faculties in Istanbul in late June 2026.

Other notable entries included Aria, a factory maintenance tool that ingests audio and shift logs to predict machine failure, and Beyond The Limit, a memory recall assistant for dementia patients.

Opus 4.7 Capabilities

The projects relied heavily on the specifications of the Opus 4.7 release from April 2026. The model features a 1 million-token context window and supports up to 128,000 output tokens, allowing for large codebase manipulation and dense document ingestion.

Developers utilized the model’s new “xhigh” effort level for adaptive reasoning to process multi-step logic. The baseline capabilities of Opus 4.7 scored 87.6% on SWE-bench Verified, allowing participants to assemble complex architectures within the strict one-week timeline. Inference costs are fixed at $5 per million input tokens and $25 per million output tokens.

The Transition to 4.8

The hackathon results coincided with Anthropic’s Claude Build Day in San Francisco, which introduced the next iteration of the model. Claude Opus 4.8 pushes baseline performance higher, achieving 88.6% on SWE-bench Verified. Enterprise teams and individual builders are already migrating applications to the new endpoints to capture the incremental reasoning improvements.

If you build agentic tools, the success of these projects indicates that specific industry knowledge is your primary structural advantage. Grounding an application in authoritative guidelines, precise hardware schematics, or strict domain rules yields substantially higher utility than generalized workflows.

Get Insanely Good at AI

Get Insanely Good at AI

The book for developers who want to understand how AI actually works. LLMs, prompt engineering, RAG, AI agents, and production systems.

Keep Reading