Implementing Semantic Cache to Reduce LLM Cost and Latency | Heykuki News