Fine-tuning and RLHF

Fine-tuning and RLHF#

Slides from the lecture introducing different fine-tuning techniques (PEFT, instruction fine-tuning, RL-based fine-tuning) can be found here.

Additional materials#

If you want to dig a bit deeper, here are (optional!) supplementary readings on some of the topics covered in class:

Supervised fine-tuning:

RLHF: