LightOn Releases OCR-2, a 1B Document AI Model

The new vision model from the Paris-based AI lab uses a Mistral-based architecture to extract text and structure from complex documents such as PDFs and forms.

Jan 16, 2026

Paris-based AI company LightOn has released LightOnOCR-2, a 1-billion-parameter vision-language model specialized in document understanding. The model performs optical character recognition (OCR) on complex documents, extracting structural information as well as text.

Unlike simple OCR tools, LightOnOCR-2 is built to parse challenging layouts such as tables, forms, and multi-column PDFs. That makes it suitable for enterprise automation tasks such as processing invoices or digitizing records, a domain often dominated by proprietary, API-gated services.

A Mistral-based Architecture

The model uses a Transformer-based encoder-decoder architecture. Its decoder was initialized from a subset of the weights of Mistral-7B-v0.1, so it inherits language capabilities from that open model while keeping a much smaller footprint: about 1 billion parameters, against 7 billion for the Mistral model.
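To make the idea of initializing from a subset of weights concrete, here is a minimal Python sketch under stated assumptions. The article does not say which Mistral-7B-v0.1 weights were kept or how they were chosen, so the every-other-layer selection, the `select_decoder_subset` helper, and the toy parameter names are all hypothetical. Strings stand in for real tensors.

```python
import re

# Hypothetical naming scheme: per-layer weights are called "layers.<index>.<rest>".
LAYER_PATTERN = re.compile(r"^layers\.(\d+)\.(.+)$")


def select_decoder_subset(state_dict, keep_layers):
    """Return a smaller checkpoint holding only the chosen layers, renumbered from 0.

    Weights that do not belong to a numbered layer (for example embeddings)
    are copied unchanged.
    """
    new_index = {old: new for new, old in enumerate(keep_layers)}
    subset = {}
    for name, tensor in state_dict.items():
        match = LAYER_PATTERN.match(name)
        if match is None:
            subset[name] = tensor  # non-layer weight, copied as-is
            continue
        old = int(match.group(1))
        if old in new_index:
            subset[f"layers.{new_index[old]}.{match.group(2)}"] = tensor
    return subset


# Toy 8-layer "checkpoint"; a real one would map names to tensors.
large = {"embed_tokens.weight": "E"}
for i in range(8):
    large[f"layers.{i}.attn.weight"] = f"A{i}"
    large[f"layers.{i}.mlp.weight"] = f"M{i}"

# Assumption for illustration only: keep every other layer.
small = select_decoder_subset(large, keep_layers=[0, 2, 4, 6])
```

In this toy case `small` holds four layers instead of eight, and its layer 1 carries what was layer 2 in the larger checkpoint. A real initialization would also have to reconcile tensor shapes and would normally be followed by further training.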

The complete model weights and code are available on the Hugging Face Hub for developers to download and use. The model is released under a custom LightOnAI-OpenRAIL-M license, which permits commercial use but includes use-case restrictions common to Responsible AI licenses.

Sources

lightonai/LightOnOCR-2-1B, Hugging Face
