Ting's Knowledge Fragments (汀的知识碎片)

Tag: RLHF

2 items with this tag.

Mar 04, 2026
Guide to the LLM Principles Column
Mar 04, 2026
Instruction Tuning and RLHF: From Base Model to Conversational Assistant
