Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
1.
Ternative – C++/CUDA inference engine for ternary LLMs with runtime LoRA (github.com/michelangeloromerochisco)
3 points
michelangeloro
16 days ago
1 comment