mit-han-lab/streaming-llm
steady[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Python
View on GitHub
Stars
7,239
Forks
399
Open issues
47
24h
+2
+0.0%
7d
+6
+0.1%
Refresh
2h
Star history (7 days)
Last checked
1h ago
Last pushed
11 Jul 2024
Next check
just now