Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
1.
▲
Show HN: Run Qwen3-Next-80B on 8GB GPU at 1tok/2s throughput
(github.com/Mega4alik)
123 points
anuarsh
9 months ago
17 comments
2.
▲
Show HN: Run gpt-oss-20b on 8GB GPUs
(github.com/Mega4alik)
6 points
anuarsh
9 months ago
discuss
3.
▲
Show HN: oLLM – LLM Inference for large-context tasks on consumer GPUs
(github.com/Mega4alik)
3 points
anuarsh
9 months ago
7 comments
4.
▲
Show HN: Fine-tune Llama3-8B on 8GB GPU without quantization
(github.com/Mega4alik)
3 points
anuarsh
7 months ago
discuss