gpt-5.1 vs gpt-5.4 Benchmark Comparison
Direct benchmark comparison between gpt-5.1 and gpt-5.4 based on LMArena Elo and the latest 2026 API pricing.
Direct Technical & Pricing Comparison
| Frontier Model |
LMArena Elo |
API Cost (1M) |
Throughput |
| gpt-5.1 |
1439 |
$0.00000125 |
76 |
| gpt-5.4 |
1466 |
$0.0000025 |
32 |
*These models represent the Pareto Frontier (optimal cost-to-performance).*
Comparison Summary: These models are highly competitive, with a negligible Elo gap of only 27 points. The choice between them should be driven by specific API features or provider preference. However, neither model is currently Pareto-optimal. Developers looking for peak efficiency should investigate gpt-5.2-chat-latest-20260210, which offers a superior benchmark-to-price ratio.