kyutai-labs

kyutai-labs/moshi

steady

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python View on GitHub

Stars

10,501

Forks

975

Open issues

69

24h

+2

+0.0%

7d

+46

+0.4%

Refresh

2h

Star history (7 days)

Last checked

1h ago

Last pushed

16 May 2026

Next check

just now