Svelte Hacker News logo
  • top
  • new
  • show
  • ask
  • jobs
  • about

Reinforcement Learning from Human Feedback (RLHF) in Notebooks

github.com

68 points by ash_at_hny 12 hours ago

kcdom1000f 10 hours ago

Hl

careful_ai 6 hours ago

[dead]

bobvylan 5 hours ago

[dead]