kyutai-labs/moshi
steady
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Python
Stars
10,501
Forks
975
Open issues
69
24h
+2
+0.0%
7d
+46
+0.4%
Star history (7 days)
Last checked
1h ago
Last pushed
16 May 2026
Next check
just now