Gemini 2.5 Ultra Benchmarks Leaked
Narrative
Internal benchmarks show 95.2% MMLU-Pro, exceeding all public models. Training completed. Release pending safety review.
Reality
Google confirmed training but not benchmarks. Community skepticism due to Gemini 1 demo controversy. Actual capability unverified. Release date not confirmed.
Implication
Heightened frontier model expectations. But leak skepticism reflected eroded trust from past marketing. Benchmark gaming concerns resurfaced. Transparency pressure increased.