Friendly Wiki
RLHF
#REDIRECT [[Reinforcement learning from human feedback]]
{{rcatsh |
{{R from initialism}}
}}