Skip to content
#

speech-representation

Here are 17 public repositories matching this topic...

MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA reconstruction and strong performance in generation and understanding—serving as a unified interface for next-generation native audio language models.

  • Updated Feb 13, 2026
  • Python

Improve this page

Add a description, image, and links to the speech-representation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-representation topic, visit your repo's landing page and select "manage topics."

Learn more