Wow, the way you connect RLHF to our own reward systems in daily life truely stood out to me. It makes me wonder if having that self-awareness of our internal 'loss functions' is the first step towards fine-tuning our own human models for desired outcomes.
Wow, the way you connect RLHF to our own reward systems in daily life truely stood out to me. It makes me wonder if having that self-awareness of our internal 'loss functions' is the first step towards fine-tuning our own human models for desired outcomes.