The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LightOn Releases OCR-2, a 1B Document AI Model

The new vision model from the Paris-based AI lab uses a Mistral-based architecture to extract text and structure from complex documents such as PDFs and forms.

Jan 16, 2026
LightOn · Vision-Language
LightOnOCR-2 1B

Parisian AI company LightOn has released LightOnOCR-2, a 1-billion-parameter vision-language model specialized in document understanding. The model performs optical character recognition (OCR) on complex documents, extracting structural information as well as text.

Unlike simple OCR tools, LightOnOCR-2 is built to parse challenging layouts like tables, forms, and multi-column PDFs. This makes it suitable for enterprise automation tasks such as processing invoices or digitizing records, a domain often dominated by proprietary, API-gated services.

A Mistral-based Architecture

The model uses a Transformer-based encoder-decoder architecture. Its decoder was initialized from a subset of the weights of Mistral-7B-v0.1, so it inherits some of that open model's language capabilities while keeping a much smaller footprint.
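To put the "smaller footprint" in rough numbers, here is a back-of-the-envelope sketch in Python. It assumes the nominal parameter counts (1 billion and 7 billion, not exact totals) and counts weight storage only, ignoring activations, caches, and runtime overhead. The function name and the precision table are illustrative, not part of any LightOn or Mistral tooling.

```python
# Rough weight-memory estimate: parameter count times bytes per parameter.
# Assumption: nominal "1B" and "7B" parameter counts, weights only.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}

NOMINAL_PARAMS = {
    "LightOnOCR-2 1B": 1e9,
    "Mistral-7B-v0.1": 7e9,
}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Return approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


for name, params in NOMINAL_PARAMS.items():
    for dtype in BYTES_PER_PARAM:
        print(f"{name:>16} @ {dtype:<7}: ~{weight_memory_gb(params, dtype):.0f} GB")
```

Under these assumptions, a 1B-parameter model stored in 16-bit precision needs about 2 GB for its weights, against about 14 GB for a 7B-parameter model at the same precision.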

The complete model weights and code are available on the Hugging Face Hub for developers to download and use. The model is released under a custom LightOnAI-OpenRAIL-M license, which permits commercial use but includes some use-case restrictions common to Responsible AI licenses.

Sources

  • lightonai/LightOnOCR-2-1B (Hugging Face)

Specs

  • Parameters: 1B
  • Context window: not listed
  • License: Other
  • Downloads: 210.5K

Modalities

Vision-Language
