Show HN: Qwen-2.5-32B is now the best open source OCR model

Last week was big for open source LLMs. We got:- Qwen 2.5 VL (72b and 32b)- Gemma-3 (27b)- DeepSeek-v3-0324And a couple weeks ago we got the new mistral-ocr model. We updated our OCR benchmark to include the new models.We evaluated 1,000 documents for JSON extraction accuracy. Major takeaways:- Qwen 2.5 VL (72b and 32b) are by far the most impressive. Both landed right around 75% accuracy (equivalent to GPT-4o’s performance). Qwen 72b was only 0.4% above 32b. Within the margin of error.- Both Qwen models passed mistral-ocr (72.2%), which is specifically trained for OCR.- Gemma-3 (27B) only scored 42.9%. Particularly surprising given that it's architecture is based on Gemini 2.0 which still tops the accuracy chart.The data set and benchmark runner is fully open source. You can check out the code and reproduction steps here:- https://getomni.ai/blog/benchmarking-open-source-models-for-...- https://github.com/getomni-ai/benchmark- https://huggingface.co/datasets/getomni-ai/ocr-benchmark Comments URL: https://news.ycombinator.com/item?id=43549072 Points: 41 # Comments: 8

Avr 1, 2025 - 21:56

0

Show HN: Qwen-2.5-32B is now the best open source OCR model

Last week was big for open source LLMs. We got:

- Qwen 2.5 VL (72b and 32b)

- Gemma-3 (27b)

- DeepSeek-v3-0324

And a couple weeks ago we got the new mistral-ocr model. We updated our OCR benchmark to include the new models.

We evaluated 1,000 documents for JSON extraction accuracy. Major takeaways:

- Qwen 2.5 VL (72b and 32b) are by far the most impressive. Both landed right around 75% accuracy (equivalent to GPT-4o’s performance). Qwen 72b was only 0.4% above 32b. Within the margin of error.

- Both Qwen models passed mistral-ocr (72.2%), which is specifically trained for OCR.

- Gemma-3 (27B) only scored 42.9%. Particularly surprising given that it's architecture is based on Gemini 2.0 which still tops the accuracy chart.

The data set and benchmark runner is fully open source. You can check out the code and reproduction steps here:

- https://getomni.ai/blog/benchmarking-open-source-models-for-...

- https://github.com/getomni-ai/benchmark

- https://huggingface.co/datasets/getomni-ai/ocr-benchmark

Comments URL: https://news.ycombinator.com/item?id=43549072

Points: 41

# Comments: 8

Tags :

Article précédent

How AI is creating a rift at McKinsey, Bain, and BCG

Article suivant

A man powers home for eight years using a thousand old laptop batteries

Articles similaires

Exposing concurrency bugs with a custom scheduler

Fév 14, 2025 0

Writing your own C++ standard library from scratch

Writing your own C++ standard library from scratch

Mar 25, 2025 0

Watch R1 "think" with animated chains of thought

Watch R1 "think" with animated chains of thought

Fév 17, 2025 0

Ce site utilise des cookies. En continuant à naviguer sur le site, vous acceptez notre utilisation des cookies.