Show HN: Nvidia's CUDA libraries are generic and not optimized for LLM inference | Heykuki News