The Open Weights
© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.


Datalab Releases Chandra, a New OCR Vision Model

The new vision-language model from Datalab is fine-tuned from Qwen2-VL to specialize in extracting text and structure from complex documents.

Oct 21, 2025
Notable · License: OpenRAIL-M
Datalab · Vision-Language

Datalab has introduced Chandra, a new open-weight model for optical character recognition (OCR) and document understanding. As a vision-language model (VLM), Chandra goes beyond plain text extraction and aims to interpret the layout and structure of documents such as forms, receipts, and invoices.

The model is a specialized fine-tune of Alibaba's Qwen2-VL-7B, which gives it a foundation in both visual perception and language comprehension. Datalab has released Chandra under an OpenRAIL-M license, which permits a wide range of uses while imposing certain use restrictions intended to encourage responsible deployment.

While traditional OCR tools are effective at converting clean, printed text into digital formats, they often falter with varied layouts, tables, or handwritten content. By leveraging a VLM architecture, Chandra can analyze a document holistically, understanding the relationship between different visual elements and the text they contain. This capability is key for automating data entry and digitizing complex archives more accurately.

Key Applications

  • Extracting structured data from invoices and receipts.
  • Parsing complex tables and forms.
  • Digitizing handwritten notes and annotations.
  • Analyzing documents with multi-column layouts.
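Whichever OCR model sits upstream, a pipeline still has to turn the extracted text into records. The snippet below is a minimal sketch of that step, assuming (the article does not say this) that a table comes back as a Markdown pipe table. The function name `parse_markdown_table` and the sample invoice are hypothetical and are not part of Chandra.

```python
def parse_markdown_table(text: str) -> list[dict[str, str]]:
    """Turn the first Markdown pipe table found in `text` into a list of row dicts.

    Assumption: the upstream OCR step emitted tables in Markdown pipe syntax.
    """
    rows = []
    for line in text.splitlines():
        line = line.strip()
        # Only lines fully wrapped in pipes are treated as table rows.
        if not (len(line) >= 2 and line.startswith("|") and line.endswith("|")):
            continue
        rows.append([cell.strip() for cell in line[1:-1].split("|")])

    if len(rows) < 2:
        return []

    header, body = rows[0], rows[1:]

    def is_separator(cells: list[str]) -> bool:
        # Separator rows look like |---|:--:| and carry no data.
        return all(cell and set(cell) <= set("-:") for cell in cells)

    return [dict(zip(header, cells)) for cells in body if not is_separator(cells)]


# Hypothetical OCR output for a small invoice.
sample = """
Invoice 0001

| Item   | Qty | Price |
|--------|-----|-------|
| Widget | 2   | 4.50  |
| Gadget | 1   | 12.00 |
"""

records = parse_markdown_table(sample)
for record in records:
    print(record)
```

All values stay as strings here; converting quantities and prices to numbers is left to the caller, since OCR output may contain misread characters that need validation first.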

Chandra applies a large vision model to a persistent business problem: getting accurate, structured data out of real-world documents. For developers building document processing pipelines, it is a candidate for the layouts where conventional OCR struggles, such as tables, forms, and handwriting. The model and further documentation are available on the Hugging Face Hub.
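For readers who want to inspect the release, here is a minimal sketch of fetching the repository files with the `huggingface_hub` client. The repository ID is taken from the Sources entry; the function name `download_chandra`, the `config_only` flag, and the file patterns are illustrative assumptions. The sketch only downloads files and says nothing about how to run inference.

```python
REPO_ID = "datalab-to/chandra"  # repository ID as listed under Sources


def download_chandra(local_dir: str, config_only: bool = True) -> str:
    """Download files from the Chandra repository and return the local path.

    With config_only=True only small JSON and Markdown files are fetched
    (an assumed pattern for metadata and docs), which avoids pulling the weights.
    """
    # Imported lazily so the module loads even without the package installed.
    # Install with: pip install huggingface_hub
    from huggingface_hub import snapshot_download

    patterns = ["*.json", "*.md"] if config_only else None
    return snapshot_download(
        repo_id=REPO_ID,
        local_dir=local_dir,
        allow_patterns=patterns,
    )
```

Calling `download_chandra("./chandra")` would place the selected files in `./chandra`; passing `config_only=False` fetches the full repository, including the weights.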

Sources

  • datalab-to/chandra (Hugging Face)

Specs

  • Parameters: not listed
  • Context window: not listed
  • License: OpenRAIL
  • Downloads: 165.7K
  • Modalities: Vision-Language
