The leaderboard “you can’t game,” funded by the companies it ranks | TechCrunch

Artificial intelligence models are multiplying fast, and competition is stiff. With so many players crowding the space, which one will be the best — and who decides that? Arena, formerly LM Arena, has emerged as the de facto public leaderboard for frontier LLMs, influencing funding, launches, and PR cycles. In just seven months, the startup went from a UC Berkeley PhD research project to being valued at $1.7 billion.

Watch as Equity host Rebecca Bellan catches up with Arena co-founders Anastasios Angelopoulos and Wei-Lin Chiang about how their platform became the go-to leaderboard for frontier AI models, and how they’re trying to build a neutral benchmark even as companies like OpenAI, Google, and Anthropic back the project.

They break down how Arena works and why it’s harder to game than static benchmarks, what “structural neutrality” actually means, why Claude is currently topping expert leaderboards in legal and medical use cases, and how the company is expanding beyond chat to benchmark agents, coding, and real-world tasks with a new enterprise product.

Subscribe to Equity on YouTube, Apple Podcasts, Overcast, Spotify and all the casts. You also can follow Equity on X and Threads, at @EquityPod.

Source link

The Republic News News for Everyone | News Aggregator

The leaderboard “you can’t game,” funded by the companies it ranks | TechCrunch

About The Republic

Check Also

The Artemis II astronauts are back after a 10-day journey around the moon

Leave a Reply Cancel reply

Pakistan playing pivotal role in global peace, major talks underway in Islamabad, Sherry Rehman – BOL News

Chris Brown and Usher set for joint stadium tour in 2026

IPL anchor Sahiba Bali comes in support of Samay Raina after ‘Still Alive’ release

High-level U.S. delegation lands in Islamabad for peace talks

What issues will dominate the US–Iran talks in Islamabad?

Understanding the Concept of Imamate in Shia Thought

Revised arbitration law ‘very good news’ for Hong Kong

Polygon Labs and Cypher Capital Expand Institutional Access to POL Across the Middle East – Pakistan News Express

Mainland China plans to defeat Taiwan’s most advanced weapons

Bitget Announces Smart Awards 2025 to Honor Top Trading Talent – Pakistan News Express