Amazon CloudWatch GenAI observability now supports Amazon AgentCore Evaluations

Published

December 2, 2025

Amazon CloudWatch AgentCore Evaluations

Amazon CloudWatch introduces AgentCore Evaluations, a new feature for automated quality assessment of AI agents. This capability helps developers monitor and improve agent performance based on real-world interactions, ensuring quality issues are identified and addressed before affecting customers.

AgentCore Evaluations includes 13 pre-built evaluators for quality dimensions such as helpfulness, tool selection, and response accuracy, along with support for custom model-based scoring systems. Quality metrics and agent telemetry are available in CloudWatch dashboards, with end-to-end tracing to correlate evaluation metrics with prompts and logs. This feature integrates with existing CloudWatch capabilities like Application Signals, Alarms, Sensitive Data Protection, and Logs Insights.

AgentCore Evaluations is available in US East (N. Virginia), US West (Oregon), Europe (Frankfurt), and Asia Pacific (Sydney). To learn more, visit the documentation and pricing details. Standard CloudWatch pricing applies for underlying telemetry data.

Source: AWS release notes

If you need further guidance on AWS, our experts are available at AWS@westloop.io. You may also reach us by submitting the Contact Us form.

Amazon CloudWatch GenAI observability now supports Amazon AgentCore Evaluations

Amazon CloudWatch AgentCore Evaluations

Follow our blog

Related posts

Amazon EC2 C7i-flex, M7i-flex & M7i instances now available in Asia Pacific (Hyderabad) region

AWS AppConfig launches managed experimentation tools for A/B testing

AWS Private CA now supports post-quantum digital certificates

Email

Phone

Office