xorbitsai/inference
steadySwap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
Python
View on GitHub
Stars
9,393
Forks
841
Open issues
14
24h
+4
+0.0%
7d
+22
+0.2%
Refresh
1h
Star history (7 days)
Last checked
54m ago
Last pushed
4h ago
Next check
just now