Why LLM decode is memory-bound, not compute-bound | Heykuki News