
About

OCR Arena is a free playground for testing and evaluating leading foundation VLMs and open-source OCR models on document parsing tasks. Upload a document, measure accuracy, and vote for the best models on a public leaderboard. OCR Arena was built by the team at Extend. We launched with 10+ models, powered by our friends over at Baseten, and will add new models as they're released. Have feedback or want to see additional OCR models? Let us know via email or X below.

Why did we build OCR Arena?

Document processing has become a core part of building AI applications, and OCR is evolving faster than ever. New models are released frequently, but evaluating them remains difficult. Benchmarks only tell part of the story, and most teams care about how models perform on their own documents and edge cases. Our goal is to reduce the friction of testing new models and make OCR evaluation open, unbiased, and grounded in real-world performance.

Join the community

Connect with other users, share feedback, and stay updated on new features.