[State of Evals] LMArena's $1.7B Vision — Anastasios Angelopoulos, LMArena - Latent Space Recap

Podcast: Latent Space

Published: 2026-01-06

Duration: 24 minutes

Guests: Anastasios Angelopoulos

Summary

Anastasios Angelopoulos discusses LMArena's journey from an academic project to a $1.7 billion valuation company, emphasizing their focus on platform integrity, real-world AI model evaluation, and expansion into various verticals.

What Happened

Anastasios Angelopoulos, founder of LMArena, discusses the company's growth from a project in a Berkeley basement to a startup valued at $1.7 billion. LMArena began as an academic incubation under Anjney Midha at a16z, who provided early grants and support. The decision to transition from an academic project to a company was driven by the need for scale and resources to accurately evaluate AI models in real-world scenarios. The company raised $100 million, which is primarily allocated for inference costs and platform developments, including a technical migration from Gradio to React.

LMArena has become a leading platform for evaluating AI models, trusted by major labs and industry players. With over 250 million conversations on the platform, it provides insights into which models work best for various real-world applications. The platform has a diverse user base, including a significant number of software professionals who contribute to its evaluations.

The episode covers the controversy surrounding the 'leaderboard delusion' paper, which criticized LMArena for undisclosed private testing. Angelopoulos refutes these claims, highlighting factual errors in the paper and emphasizing the transparency and community-driven nature of their evaluation process. The platform's integrity is maintained by not allowing models to pay for placement or removal in the leaderboard.

Angelopoulos also discusses the importance of community involvement in LMArena, with features like secret codenames and early access to unreleased models fostering engagement. The 'Nano Banana' moment is discussed as a significant event, illustrating the economic impact of multimodal models and their potential in marketing and design.

Looking forward, LMArena plans to expand into new categories such as occupational and expert arenas, and is working towards launching a video arena. These expansions aim to provide specialized evaluations for sectors like medicine, legal, and finance, further establishing LMArena as a central evaluation platform.

The conversation highlights the challenges and considerations in managing a large-scale consumer platform, including the need for constant innovation and user retention strategies. Angelopoulos highlights the role of platform integrity and transparency as core principles guiding LMArena's operations and future directions.

Key Insights