T-TechSpeech → Text

T-Tech Releases T-one for Russian Speech Recognition

The new streaming Conformer model from the Russian digital bank is optimized for real-time transcription of telephone conversations.

Jul 14, 2025

UpdateOther

T-Tech, the AI research arm of Russian digital bank Tinkoff, has released T-one, a new model for automatic speech recognition (ASR). The model is specifically designed for transcribing Russian-language audio from telephone calls, a common but challenging domain for speech-to-text systems.

The model uses a streaming Conformer architecture, which allows it to process audio in real-time with low latency. This makes it well-suited for live transcription applications like call center monitoring or voice assistant interactions. According to its creators, T-one was trained on a substantial dataset of over 30,000 hours of Russian speech to achieve its specialized performance.

Use Cases and Licensing

Given its focus on telephony, T-one is primarily aimed at enterprise and research applications involving voice data. The model's architecture and training are tailored to handle the lower audio quality and specific conversational patterns found in phone calls.

While the model weights are publicly available on the Hugging Face Hub, they are released under a Creative Commons Non-Commercial (CC BY-NC 4.0) license. This permits use for research and experimentation but restricts deployment in commercial products without a separate agreement with T-Tech.

Sources

t-tech/T-one
Hugging Face
Visit

0 comments

No comments yet. Be the first to weigh in.

KRAFTON releases A.X-K2 Raon speech MoE model

The game maker's new open model blends text-to-speech and speech recognition in a single 21B mixture-of-experts system with just 3B active parameters.

Jul 27, 2026

Microsoft/Speech → Text

Microsoft's VibeVoice ASR Goes BitNet for CPU Speech

A BitNet-quantized speech recognition model trades GPU dependence for efficient CPU inference in English and Chinese.

Jul 24, 2026

Nyralabs/Speech → Text

CrisperWhisper 2.0 Large targets verbatim transcription

A Whisper-based ASR model that keeps every filler word and stamps timestamps to the individual word, now covering English and German.

Jul 15, 2026

Use Cases and Licensing