There's (exactly) seven ways to optimize latency in an LLM application | Heykuki News