Study accuses LM Arena of helping top AI labs game its benchmark

Thu May 01, 2025 12:08 am
by nyheder
A new paper from AI lab Cohere, Stanford, MIT, and Ai2 accuses LM Arena, the organization behind the popular crowdsourced AI benchmark Chatbot Arena, of helping a select group of AI companies achieve better leaderboard scores at the expense of rivals. According to the authors, LM Arena allowed some industry-leading AI companies like Meta, OpenAI, […]

Source: https://techcrunch.com/2025/04/30/study ... benchmark/