Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
1.
Show HN: Run Qwen3-Next-80B on 8GB GPU at 1tok/2s throughput (github.com/Mega4alik)
123 points
anuarsh
9 months ago
17 comments
2.
Show HN: Run gpt-oss-20b on 8GB GPUs (github.com/Mega4alik)
6 points
anuarsh
9 months ago
discuss
3.
Show HN: oLLM – LLM Inference for large-context tasks on consumer GPUs (github.com/Mega4alik)
3 points
anuarsh
9 months ago
7 comments
4.
Show HN: Fine-tune Llama3-8B on 8GB GPU without quantization (github.com/Mega4alik)
3 points
anuarsh
7 months ago
discuss