llm.c is now down to 26.2ms/iteration, matching PyTorch | Heykuki News