To make large models smarter, humans need to teach and supervise them. This is human-in-the-loop RL.



I recently completed a task on @JoinSapien that involved reviewing a reasoning process written by an AI:

🌱 How does AI think?
🌱 Does it make sense?
🌱 At what step did it start to go wrong?

This task is called CoT (Chain-of-Thought) evaluation, and I am its logic auditor.
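As a rough illustration of what a logic auditor does, here is a minimal Python sketch. It assumes each reasoning step gets a simple valid/invalid verdict from the human reviewer. The names `StepReview` and `first_error_index` are made up for this example and are not part of any Sapien tooling.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class StepReview:
    step: str        # one reasoning step written by the model
    is_valid: bool   # the human reviewer's verdict on that step (assumed format)


def first_error_index(reviews: List[StepReview]) -> Optional[int]:
    """Return the index of the first step marked invalid, or None if all pass."""
    for i, review in enumerate(reviews):
        if not review.is_valid:
            return i
    return None


# Hypothetical chain of thought with a slip in the second step.
reviews = [
    StepReview("12 apples, 5 eaten, so 7 remain", True),
    StepReview("7 remaining plus 4 bought gives 12", False),  # 7 + 4 is 11
    StepReview("So the answer is 12", False),
]
print(first_error_index(reviews))
```

The index answers the third question above: the step where the reasoning first went wrong.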

There is also Tree of Thoughts, where the AI tries multiple branches of reasoning and humans pick the best path.
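The branch filtering can be sketched in a few lines of Python. This is only an assumption about how it might look: every branch carries a human rating, and the reviewer's choice is modeled as greedily following the highest-rated child at each level. `Thought`, `human_score`, and `best_path` are hypothetical names for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Thought:
    text: str
    human_score: float  # hypothetical rating given by a human reviewer
    children: List["Thought"] = field(default_factory=list)


def best_path(root: Thought) -> List[str]:
    """Greedy walk: at each level, follow the child the human rated highest."""
    path = [root.text]
    node = root
    while node.children:
        node = max(node.children, key=lambda child: child.human_score)
        path.append(node.text)
    return path


# Hypothetical tree: the model proposes two approaches, one with a follow-up.
root = Thought("Solve the puzzle", 1.0, [
    Thought("Try brute force", 0.3),
    Thought("Look for a pattern", 0.9, [
        Thought("Pattern repeats every 4 steps", 0.8),
        Thought("Pattern is random", 0.1),
    ]),
])
print(best_path(root))
```

A greedy walk is the simplest choice here. A real setup could just as well compare complete paths instead of deciding level by level.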

You can think of it as explaining a problem to a child, except this child is a GPT-level brain of the future.

This task cannot be done by AI alone; human participation is necessary.

@JoinSapien #sapien @KaitoAI #KaitoAI #SapienProtocol