LLM INQUISITOR: Evaluating how AI models handle long, realistic tasks | Heykuki News