Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
1.
RIS-Kernel: Running 64k context LLMs on CPU via sparse attention (github.com/santosardr)
2 points
santosardr
4 days ago
discuss