The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestGoogle DeepMind1.5
Google DeepMindVision-Language

Google's MedGemma brings open vision AI to medicine

The new 4-billion-parameter vision-language model is specialized for tasks in radiology, pathology, and complex clinical reasoning.

Jan 7, 2026
NotableGemma
Google DeepMind · Vision-Language
MedGemma 1.5 4B IT
MedGemma 1.5 4B IT

Google DeepMind has released MedGemma 1.5, a new open-weights model designed specifically for the medical domain. This 4-billion-parameter model extends the capabilities of the Gemma family into specialized areas, focusing on vision-language tasks that are critical for healthcare.

As a vision-language model (VLM), MedGemma is trained to understand and reason about both text and images. According to its official model card, its instruction tuning targets complex use cases like interpreting radiological scans, analyzing pathology slides, and aiding in multifaceted clinical reasoning.

The release marks a significant step for open-source AI in healthcare. By providing researchers and developers with a capable, specialized foundation model, MedGemma could accelerate the development of new diagnostic aids, medical education tools, and clinical support systems that were previously reliant on proprietary, closed-source models.

MedGemma 1.5 is available under the Gemma license, which permits commercial use and distribution subject to its terms. This particular version, medgemma-1.5-4b-it, is an instruction-tuned variant, making it ready for conversational and question-answering applications out of the box.

Sources

  • google/medgemma-1.5-4b-it

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters4B
Context window—
LicenseGEMMA
Downloads338.7K

Modalities

Vision-LanguageReasoning

More in Vision-Language

Moonshot AI
Kimi-K2.7-Code
Kimi-K2.7-Code
Moonshot AI/Code

Moonshot AI Releases Kimi, a Multimodal Coding Model

The new Mixture-of-Experts model from the Chinese AI company can generate code while also understanding visual inputs, a rare combination in open models.

Jun 11, 2026
Google DeepMind
DiffusionGemma 26B-A4B Instruct
DiffusionGemma 26B-A4B Instruct
Google DeepMind/Text / LLM

Google Releases Open-Source DiffusionGemma 26B Model

The new 26B parameter model from DeepMind uses a diffusion-based architecture, a technique more common in image generation, to produce text.

Jun 9, 2026
MiniMax
MiniMax-M3
MiniMax-M3
MiniMax/Vision-Language

MiniMax Releases M3, a Multimodal MoE Model

The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.

Jun 2, 2026