GPT-4-turbo-2024-04-09 "wins" simple evals benchmark | Heykuki News