AWS Bedrock Agents Automate Pen-Testing and Incident Triage
AWS has launched two persistent Frontier Agents for automated penetration testing and SRE investigations built on Bedrock AgentCore and MCP.
AWS has introduced a new class of persistent, goal-driven systems with the general availability of its Frontier Agents. Powered by Amazon Bedrock AgentCore, the AWS Security Agent and AWS DevOps Agent operate autonomously for hours or days to execute complex infrastructure workflows.
Industry data from 2025 indicated that 81% of organizations deployed vulnerable code due to release pressure. AWS positions these autonomous AI agents as a mechanism to close the gap between rapid deployment cycles and necessary operational oversight.
Automated Penetration Testing
The Security Agent functions as a virtual security engineer capable of continuous reconnaissance, vulnerability discovery, and active exploitation. It demonstrates real-world impact by chaining vulnerabilities. For example, it can use a stored XSS vulnerability to capture administrative session cookies without human intervention.
In testing against the 40-vulnerability CVE Bench, the agent recorded an 80% success rate without hints and a 92.5% success rate when provided hints. It integrates directly into development workflows via the AWS Security Agent GitHub App to conduct pull request reviews and autonomously open remediation PRs.
Autonomous SRE and Incident Management
The DevOps Agent serves as an automated Site Reliability Engineer designed to manage the full incident lifecycle. It triages alerts and correlates telemetry across CloudWatch, Datadog, Splunk, and Dynatrace to map application resource topologies. The agent generates an immutable audit journal detailing the exact steps of its investigation.
AWS reports the agent accelerates investigations by 80% and reduces Mean Time to Resolution by up to 75% with 94% root cause accuracy. Early adopters reported shrinking production investigation times from two hours down to 28 minutes.
Pricing is usage-based at $0.498 per agent-minute, which equates to $0.0083 per second. Customers on Business Support+, Enterprise Support, and AWS Unified Operations plans receive the agent as an included feature with allocated usage credits.
Technical Architecture and Integration
AWS defines Frontier Agents as autonomous, scalable, and persistent systems. Both agents rely heavily on the Model Context Protocol to interact safely with on-premises systems and third-party observability tools. This architecture demonstrates how MCP standardizes context access for multi-cloud telemetry environments.
| Agent | Primary Function | Key Benchmarks / Metrics | Core Integrations |
|---|---|---|---|
| Security Agent | Pen-testing and PR review | 80% CVE Bench success (no hints) | GitHub App |
| DevOps Agent | SRE and incident triage | 75% MTTR reduction, 94% accuracy | CloudWatch, Datadog, Splunk |
The agents are currently available in six AWS regions, including US East (N. Virginia). A third system targeting autonomous coding tasks, named Kiro, remains in public preview.
If your organization relies on manual log correlation during critical incidents, configuring the DevOps Agent with your specific observability stack provides an immediate baseline for automated triage. Establish strict boundaries using IAM roles to ensure the agent has read access to telemetry without possessing unintended modification privileges.
Get Insanely Good at AI
The book for developers who want to understand how AI actually works. LLMs, prompt engineering, RAG, AI agents, and production systems.
Keep Reading
How to run Claude Managed Agents in self-hosted sandboxes
Learn how to deploy Claude Managed Agents using self-hosted sandboxes and MCP tunnels to securely execute tools and access private data.
GPT-5.5 Hits Bedrock as AWS Ships First-Party Autonomous Agents
AWS has launched autonomous Frontier Agents for security and SRE tasks alongside a native Amazon Bedrock integration for OpenAI's GPT-5.5 and Codex models.
AWS Tackles Agent Drift With Bedrock AgentCore Optimization
AWS has introduced AgentCore Optimization in preview to automate prompt updates and A/B testing, alongside a new desktop AI assistant called Amazon Quick.
Anthropic's Claude Security Beta Patches Code With Opus 4.7
Anthropic released the public beta of Claude Security, an Opus 4.7-powered defensive tool that scans codebases for vulnerabilities and generates patches.
Cisco Secures IDEs With New AI Agent Security Scanner
Cisco's open-source tool protects IDEs like Cursor and VS Code by scanning AI agents and MCP servers for prompt injection and memory poisoning threats.