2024-04-03-insights

Posted on 2024-04-03, updated on 2024-08-09, in Arxiv-Insights

Advancing LLM Reasoning Generalists with Preference Trees

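Recommending a paper from our group: it constructs large-scale, high-quality SFT data for reasoning tasks and uses it to train a 70B model, which also performs well on out-of-distribution (OOD) reasoning tasks.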