jundot/omlx
steady
LLM inference server with continuous batching & SSD caching for Apple Silicon, managed from the macOS menu bar
Python
Stars: 16,961
Forks: 1,436
Open issues: 491
24h: +96 (+0.7%)
7d: +935 (+6.9%)
Refresh: 2h
Star history (7 days)
Last checked: 1h ago
Last pushed: 10h ago
Next check: just now