For a long time, training large models has relied heavily on the guidance of a "teacher." This could either be human-annotated "standard answers," which are time-consuming and labor-intensive, or ...