The Open Weights
LatestModelsLeaderboardsUpcomingCompanies
Subscribe
The Open Weights

The daily record of open-source AI. New model releases, leaderboards, and what's coming next — written for people who ship.

Refreshed every 12 hours

Discover

  • Latest releases
  • New today
  • Trending models
  • Upcoming launches

Browse

  • All models
  • Companies
  • Categories
  • Leaderboards

About

  • About
  • Editorial policy
  • RSS feed
  • Newsletter

© 2026 The Open Weights. An independent publication.

Aggregated by Claude · written with Gemini · curated by humans.

LatestMisoLabs1
MisoLabsText → Speech

MisoLabs Debuts MisoTTS, an Open Voice Model

The new text-to-speech system adapts the decoder-only architecture of language models like Llama to generate more natural-sounding speech.

May 21, 2026
UpdateOther
MisoLabs · Text → Speech
MisoTTS
MisoTTS

A new contender has entered the open-source speech synthesis space. Startup MisoLabs has released MisoTTS, a text-to-speech (TTS) model that applies popular architectural patterns from large language models to the challenge of generating human-like audio.

Unlike many traditional TTS systems, MisoTTS uses a decoder-only transformer architecture, a design heavily inspired by models in the Llama family. This approach treats audio generation as a sequence-to-sequence task, similar to how an LLM predicts the next word in a sentence. The goal is to produce more natural and expressive speech by leveraging the same principles that have dramatically advanced text generation.

MisoTTS at a Glance

The model was trained on a foundation of public domain audiobooks and currently supports two languages. Key features of the initial release include:

  • Architecture: 24-layer decoder-only transformer.
  • Languages: English and Japanese.
  • License: Creative Commons BY-NC-SA 4.0 (non-commercial use).

The release of MisoTTS highlights a growing trend of cross-pollination in AI research, where successful architectures from one domain are adapted to solve problems in another. While its non-commercial license limits its use in products, it provides researchers and hobbyists a new tool for exploring the intersection of language and speech. The model and code are available now on the Hugging Face Hub.

Sources

  • MisoLabs/MisoTTS

    Hugging Face

    Visit

0 comments

Protected by Turnstile

No comments yet. Be the first to weigh in.

Get the model

Weights

Specs

Parameters—
Context window—
LicenseOTHER
Downloads0

Modalities

Text → Speech

More in Text → Speech

Zyphra
Zonos 2
Zonos 2
Zyphra/Text → Speech

Zyphra Releases Open-Source Zonos 2 TTS Model

The new text-to-speech model offers a commercially permissive alternative for developers in a field still dominated by closed-source APIs.

Jun 11, 2026
Boson AI
Higgs Audio v3 TTS 4B
Higgs Audio v3 TTS 4B
Boson AI/Text → Speech

Boson AI's Higgs Audio v3 Offers Expressive, Multilingual TTS

The new 4-billion-parameter text-to-speech model is available for non-commercial use, promising fine-grained control over vocal delivery.

Jun 4, 2026
OpenMOSS
MOSS-TTS v1.5
MOSS-TTS v1.5
OpenMOSS/Text → Speech

MOSS-TTS Aims for More Robust Speech Synthesis

A new text-to-speech model introduces 'delay-pattern decoding' to solve common word skipping and repetition errors in parallel generation.

May 25, 2026