Anthropic's SHADE-Arena: Evaluating sabotage and monitoring in LLM agents | Heykuki News