((Bit-Mage) 'buffer) ::
RLHF
Table of Contents
1. Relevant Nodes
1.1. Direct Preference Optimization
1.2. RLAIF
2. Resources
1.
Relevant Nodes
1.1.
Direct Preference Optimization
1.2.
RLAIF
2.
Resources
https://openai.com/index/instruction-following/
Tags::rl:ai: