GRPO experiment - I trained a Language Model to schedule events | Heykuki News