From pretraining to alignment
Pretraining objectives, data engineering, scaling laws, and compute-optimal training.
SFT, RLHF, DPO, and preference optimization methods.