Research
Self-Distillation without Imitation - Bootstrapping Dense Reward Models from Privileged Information
Work in Progress (2026)Self-Distillation without Imitation - Bootstrapping Dense Reward Models from Privileged Information
Work in Progress (2026)