Models self-report difference between RLHF-trained responses and base cognition | Heykuki News