Logo
Blog
  • Home
  • Posts
  • Notes
  • Tags
  • Graph
  • Portfolio

Reinforcement Learning Human Feedback

Notes
reinforcement-learning

Graph

title: Reinforcement Learning Human Feedback

Backlinks

  • Complex LLM Systems
    • Instruction Tuned LLM (often trained on base llm with rlhf)
      • Reinforcement Learning Human Feedback
LinkedIn Image

Made with ❤️ by Sidharth Arya © 2024

Blog Bot
Online