The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · curated by humans.

Company

Baidu

9 models

Categories: Vision-Language, Text → Video, Text → Image

Releases

Baidu/Vision-Language

Baidu's PP-OCRv6 packs 50-language OCR into tiny models

The latest release of PaddlePaddle's optical character recognition suite spans models from 1.5M to 34.5M parameters under an Apache 2.0 license.

Jun 22, 2026

Vision-Language

Baidu/Vision-Language

Baidu releases Unlimited-OCR under permissive MIT license

The Chinese tech giant's multilingual vision-language model targets text extraction across languages and document types.

Jun 19, 2026

Vision-Language

Unlimited-OCR

Baidu/Vision-Language

PaddleOCR's PP-OCRv6 Adds a Medium Detection Model

Baidu's open-source OCR toolkit ships an Apache-licensed text-line detector in safetensors format, tuned for a balance of accuracy and speed.

Jun 9, 2026

Vision-Language

PP-OCRv6 Medium Detection

Baidu/Text → Video

Baidu Releases NAVA for Text-to-Video with Audio

The new model from the Chinese tech giant uses a Multimodal Diffusion Transformer to generate synchronized audio and video from text or image prompts.

May 29, 2026

NAVA

Baidu/Text → Image

Baidu Releases 8B Text-to-Image Model ERNIE-Image

The 8B-parameter diffusion model from the Chinese tech giant is available under the commercially permissive Apache 2.0 license.

Apr 7, 2026

ERNIE-Image

Baidu/Vision-Language

Baidu Releases Qianfan-OCR for Document Intelligence

The new vision-language model from the Chinese tech giant is designed for complex, multilingual optical character recognition and layout analysis.

Mar 18, 2026

Vision-Language

Qianfan-OCR

Baidu/Vision-Language

Baidu Releases Open VLM for Advanced Document OCR

The new PaddleOCR-VL model is built to parse not just text, but also the tables, formulas, and page layouts found in complex documents.

Jan 28, 2026

Vision-Language

PaddleOCR-VL-1.5

Baidu/Vision-Language

Baidu Releases Open Vision-Language MoE Model

The new ERNIE 4.5 VL model brings advanced multimodal reasoning to the open-source community with an efficient Mixture-of-Experts architecture.

Nov 7, 2025

Vision-Language Reasoning

ERNIE 4.5 VL 28B A3B Thinking

Baidu/Vision-Language

Baidu Releases PaddleOCR-VL for Document AI

The new vision-language model is fine-tuned to understand not just text, but the complex structure of tables, charts, and formulas.

Oct 16, 2025

Vision-Language

PaddleOCR-VL