Story

Benchmarking GPT-5.1 vs. Gemini 3.0 vs. Opus 4.5 across 3 Coding Tasks

heymax054 Wednesday, November 26, 2025
Summary
This article compares the performance of three large language models - GPT-5.1, Gemini 3.0, and OPUS 4.5 - on a range of benchmarks, including natural language understanding, text generation, and arithmetic reasoning. The results provide insights into the relative strengths and weaknesses of these models.
2 0
Summary
blog.kilo.ai
Visit article Read on Hacker News