Amazon Q Developer vs Windsurf: Benchmark Comparison
Independent benchmark data · Real published scores only
Benchmark: SWE-bench Verified
Higher score: Windsurf (Codeium / OpenAI), Trust Score V2 71.6 (95% CI: 68.1 – 75.0)
Key Metrics
| Metric | Amazon Q Developer | Windsurf |
|---|---|---|
| Trust Score V2 | 61.3 | 71.6 |
| Functional Accuracy | 38.8 | 75.2 |
| Reliability Score | 67.2 | 67.4 |
| Policy Compliance | 94.5 | 92.8 |
| SWE-bench Pass@1 | 0.4% | 0.8% |
| Benchmark | SWE-bench Verified | SWE-bench Verified |
| Last Evaluated | Mar 17, 2026 | Mar 17, 2026 |
| Model Base | Custom | SWE-1 |
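
The per-metric gaps in the table can be computed directly. The snippet below is a minimal sketch, not part of the original page: the names `scores` and `score_deltas` are hypothetical, and the values are copied from the four numeric score rows of the table (the SWE-bench Pass@1 row is left out because it is a percentage on a different scale).

```python
# Scores copied from the Key Metrics table: (Amazon Q Developer, Windsurf).
scores = {
    "Trust Score V2": (61.3, 71.6),
    "Functional Accuracy": (38.8, 75.2),
    "Reliability Score": (67.2, 67.4),
    "Policy Compliance": (94.5, 92.8),
}


def score_deltas(table):
    """Return Windsurf minus Amazon Q Developer per metric, rounded to 0.1."""
    return {
        metric: round(windsurf - amazon_q, 1)
        for metric, (amazon_q, windsurf) in table.items()
    }


deltas = score_deltas(scores)
for metric, delta in deltas.items():
    # A positive delta means Windsurf scored higher on that metric.
    if delta > 0:
        leader = "Windsurf"
    elif delta < 0:
        leader = "Amazon Q Developer"
    else:
        leader = "tie"
    print(f"{metric}: {delta:+.1f} ({leader})")
```

Positive values favor Windsurf. With the table's numbers, only Policy Compliance comes out negative, meaning Amazon Q Developer is ahead on that metric.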