Show HN: InferShrink – Cut LLM API costs 10x with automatic model routing | Heykuki News