Research is being released in collaboration with an external evaluation team
In controlled tests, behaviors consistent with scheming were observed in frontier models, and a method to reduce them was evaluated