Open R1: Update #2
Hugging Face is replicating DeepSeek-R1 in the open. It has successfully run distillation on R1 itself, generating 800k reasoning traces.