Show HN: Lightweight Llama3 Inference Engine – CUDA C | Heykuki News