Debates over AI benchmarking have reached Pokémon

Besvar
nyheder
Indlæg: 8614
Tilmeldt: tirs sep 22, 2020 3:13 pm

Debates over AI benchmarking have reached Pokémon

Indlæg af nyheder »

Not even Pokémon is safe from AI benchmarking controversy. Last week, a post on X went viral, claiming that Google’s latest Gemini model surpassed Anthropic’s flagship Claude model in the original Pokémon video game trilogy. Reportedly, Gemini had reached Lavendar Town in a developer’s Twitch stream; Claude was stuck at Mount Moon as of late […]

Source: https://techcrunch.com/2025/04/14/debat ... d-pokemon/
Besvar