The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.


PaddleOCR's PP-OCRv6 Adds a Medium Detection Model

Baidu's open-source OCR toolkit ships an Apache-licensed text-line detector in safetensors format, tuned for a balance of accuracy and speed.

Jun 9, 2026
Update · Apache 2.0
Baidu · Vision-Language
PP-OCRv6 Medium Detection

Baidu has published PP-OCRv6 Medium Detection, the text-line detection component of the latest generation of its widely used PaddleOCR toolkit. The model is available on Hugging Face in safetensors format under a permissive Apache-2.0 license.

Text detection is the first stage in most OCR pipelines: before any characters can be read, a model has to locate where text actually sits on the page. This "medium" variant targets the practical middle ground between lightweight mobile-friendly detectors and heavier, higher-accuracy ones, making it a reasonable default for documents, receipts, and natural-scene imagery.
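
The detect-then-read flow described above can be sketched in a few lines of Python. This is a toy illustration, not PaddleOCR's API: `toy_detect_lines`, `toy_recognize`, and `run_ocr` are hypothetical names, the "image" is a plain list of pixel rows, and the detector simply groups rows that contain dark pixels. A real detector such as PP-OCRv6 Medium Detection would take the place of the detection function.

```python
from typing import Callable, List, Sequence, Tuple

Image = Sequence[Sequence[int]]   # toy grayscale image: rows of 0-255 pixel values
Box = Tuple[int, int, int, int]   # (left, top, right, bottom); right and bottom are exclusive


def _tight_box(image: Image, top: int, bottom: int, threshold: int) -> Box:
    """Shrink a band of rows to the columns that actually contain dark pixels."""
    cols = [x for row in image[top:bottom] for x, p in enumerate(row) if p < threshold]
    return (min(cols), top, max(cols) + 1, bottom)


def toy_detect_lines(image: Image, threshold: int = 128) -> List[Box]:
    """Hypothetical stand-in for a text-line detector.

    Consecutive rows containing at least one dark pixel are merged into one box.
    """
    boxes: List[Box] = []
    start = None
    for y, row in enumerate(image):
        has_ink = any(p < threshold for p in row)
        if has_ink and start is None:
            start = y                      # a new text line begins
        elif not has_ink and start is not None:
            boxes.append(_tight_box(image, start, y, threshold))
            start = None                   # the text line ended on the previous row
    if start is not None:                  # text line runs to the bottom edge
        boxes.append(_tight_box(image, start, len(image), threshold))
    return boxes


def crop(image: Image, box: Box) -> List[List[int]]:
    """Cut the detected region out of the image."""
    left, top, right, bottom = box
    return [list(row[left:right]) for row in image[top:bottom]]


def toy_recognize(region: List[List[int]]) -> str:
    """Hypothetical stand-in for a recognizer: it only reports the crop size."""
    return f"{len(region[0])}x{len(region)} region"


def run_ocr(
    image: Image,
    detect: Callable[[Image], List[Box]],
    recognize: Callable[[List[List[int]]], str],
) -> List[Tuple[Box, str]]:
    """Stage 1 locates text, stage 2 reads each located crop."""
    return [(box, recognize(crop(image, box))) for box in detect(image)]
```

Because `run_ocr` takes the detector and recognizer as arguments, either stage can be replaced without touching the other, which is the property a standalone detection model relies on.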

Why it matters

PaddleOCR has become one of the more popular open OCR stacks, particularly for Chinese and multilingual text, and a modular detection model lets developers swap stages without committing to the full framework.

  • Apache-2.0 licensing allows commercial use and redistribution.
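  • The safetensors format stores raw tensors with a plain JSON header rather than pickled Python objects, so loading weights cannot execute arbitrary code, and the files can be read from multiple frameworks.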
  • A sub-1B-parameter footprint keeps inference costs modest.
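
To make the safetensors point concrete, here is a minimal standard-library sketch of how such a file's header can be inspected. It relies on the published layout of the format (an 8-byte little-endian length, then a UTF-8 JSON header, then raw tensor bytes). The function names and the toy tensor `toy.weight` are made up for illustration, and nothing here reflects the actual contents of the PP-OCRv6 repository. In practice the `safetensors` library would normally be used instead of hand-parsing.

```python
import io
import json
import struct
from typing import Any, BinaryIO, Dict, List, Tuple


def read_safetensors_header(stream: BinaryIO) -> Dict[str, Any]:
    """Parse only the JSON header; no tensor data is loaded and no code is run."""
    prefix = stream.read(8)
    if len(prefix) != 8:
        raise ValueError("too short to be a safetensors file")
    (header_len,) = struct.unpack("<Q", prefix)   # unsigned 64-bit, little-endian
    raw = stream.read(header_len)
    if len(raw) != header_len:
        raise ValueError("truncated header")
    return json.loads(raw.decode("utf-8"))


def list_tensors(header: Dict[str, Any]) -> Dict[str, Tuple[str, List[int]]]:
    """Map tensor name to (dtype, shape), skipping the optional metadata entry."""
    return {
        name: (info["dtype"], info["shape"])
        for name, info in header.items()
        if name != "__metadata__"
    }


def make_toy_file() -> io.BytesIO:
    """Build an in-memory file in the same layout, holding one made-up tensor."""
    header = {
        "__metadata__": {"note": "toy example"},
        "toy.weight": {"dtype": "F32", "shape": [2], "data_offsets": [0, 8]},
    }
    header_bytes = json.dumps(header).encode("utf-8")
    payload = struct.pack("<2f", 1.0, 2.0)        # two float32 values, 8 bytes
    return io.BytesIO(struct.pack("<Q", len(header_bytes)) + header_bytes + payload)
```

Calling `list_tensors(read_safetensors_header(make_toy_file()))` lists the tensor names, dtypes, and shapes without deserializing any Python objects, which is the safety property the bullet above refers to. A production reader would also cap the header length before reading it.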

As an early v6-generation release, the model signals continued investment in PaddleOCR's pipeline approach rather than a single end-to-end system. Teams already building on PaddleOCR can treat it as a drop-in detection upgrade, while others may find it a useful standalone building block for document-processing workflows.

Sources

  • PaddlePaddle/PP-OCRv6_medium_det_safetensors (Hugging Face)


Specs

Parameters: not listed
Context window: not listed
License: Apache-2.0
Downloads: 718

Modalities

Vision-Language

More in Vision-Language

Moonshot AI
Kimi-K2.7-Code
Moonshot AI · Code

Moonshot AI Releases Kimi, a Multimodal Coding Model

The new Mixture-of-Experts model from the Chinese AI company can generate code while also understanding visual inputs, a rare combination in open models.

Jun 11, 2026
Google DeepMind
DiffusionGemma 26B-A4B Instruct
Google DeepMind · Text / LLM

Google Releases Open-Source DiffusionGemma 26B Model

The new 26B parameter model from DeepMind uses a diffusion-based architecture, a technique more common in image generation, to produce text.

Jun 9, 2026
MiniMax
MiniMax-M3
MiniMax · Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026