Amazon Q Developer vs OpenAI Codex CLI: Benchmark Comparison

Independent benchmark data · Real published scores only
📊 SWE-bench Verified
Amazon Q Developer
Amazon Web Services
61.3
Trust Score V2
95% CI: 57.8 – 64.7
View full profile →
VS
🏆 Higher Score
OpenAI Codex CLI
OpenAI
68.4
Trust Score V2
95% CI: 64.9 – 71.9
View full profile →
Score Comparison
Amazon Q Developer
OpenAI Codex CLI
Trust Score
61.3
68.4
Functional Acc.
38.8
69.1
Reliability
67.2
63.7
Policy Compliance
94.5
90.1
Key Metrics
Metric Amazon Q Developer OpenAI Codex CLI
Trust Score V2 61.3 68.4
Functional Accuracy 38.8 69.1
Reliability Score 67.2 63.7
Policy Compliance 94.5 90.1
SWE-bench Pass@1 0.4% 0.7%
Benchmark SWE-bench Verified SWE-bench Verified
Last Evaluated Mar 17, 2026 Mar 13, 2026
Model Base Custom o3

Need a procurement-grade evaluation report?

Get cost-of-failure modeling, compliance validation, and a certified comparison report for Amazon Q Developer and OpenAI Codex CLI — built for enterprise procurement decisions.

Request Evaluation Report →