Study accuses LM Arena of helping top AI labs game its benchmark

nyheder
Posts: 8567
Joined: Tue Sep 22, 2020 3:13 pm

Post by nyheder »

A new paper from AI lab Cohere, Stanford, MIT, and Ai2 accuses LM Arena, the organization behind the popular crowdsourced AI benchmark Chatbot Arena, of helping a select group of AI companies achieve better leaderboard scores at the expense of rivals. According to the authors, LM Arena allowed some industry-leading AI companies like Meta, OpenAI, […]

Source: https://techcrunch.com/2025/04/30/study ... benchmark/