1 Comment
User's avatar
Daniel Popescu / ⧉ Pluralisk's avatar

Wow, the way you connect RLHF to our own reward systems in daily life truely stood out to me. It makes me wonder if having that self-awareness of our internal 'loss functions' is the first step towards fine-tuning our own human models for desired outcomes.

Expand full comment