2026-04-01-llm #252

2026-04-03T14:34:04Z

giscus[bot]
Bot Apr 3, 2026

2026-04-01-llm

很多人会把模型升级理解为参数变大，但线上体感差异常常出在后半段训练和发布链路。这篇从预训练一路讲到蒸馏上线，重点看数据工程、系统配方、后训练、评测奖励和 Agent 训练怎么一起影响最终表现。最后会看到，模型变强通常是权重、训练链路和部署决策共同作用的结果，不只是参数规模。

https://tw93.fun/2026-04-03/llm.html

xiaoxiaohub · 2026-04-14T01:58:06Z

xiaoxiaohub
Apr 14, 2026 — with giscus

图是用claude画的

1 reply

tw93 Apr 20, 2026 — with giscus
Maintainer

对对对

MrSchnappi · 2026-04-16T08:00:25Z

MrSchnappi
Apr 16, 2026 — with giscus

感谢博主讲解，内容上基本把主流工作模式和每个节点的目标和难点都讲解了，很到位！。如果从传统 LLM 训练 -> LLM + 对齐人类偏好 -> 以agent 的LLM，讲解似乎更好。如果全文再贯穿某一个团队的持续工作（比如deepseek， openai，anthropic等）讲解会更直观（个人观点，个人建议）～

1 reply

tw93 Apr 20, 2026 — with giscus
Maintainer

哈哈哈你这个也更清晰

zionuke · 2026-04-28T09:06:57Z

zionuke
Apr 28, 2026 — with giscus

配图都好好看！（有啥提示词可参考的嘛）；整体内容量很扎实，收获很大！

0 replies

nornand · 2026-05-30T13:33:05Z

nornand
May 30, 2026 — with giscus

感谢博主讲解，而且流程图很好看！请问是用哪个skill画的吗？

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

2026-04-01-llm #252

Uh oh!

{{title}}

Uh oh!

Replies: 4 comments 2 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

2026-04-01-llm #252

Uh oh!

giscus[bot] Bot Apr 3, 2026

2026-04-01-llm

Replies: 4 comments · 2 replies

Uh oh!

xiaoxiaohub Apr 14, 2026 — with giscus

Uh oh!

tw93 Apr 20, 2026 — with giscus Maintainer

Uh oh!

MrSchnappi Apr 16, 2026 — with giscus

Uh oh!

tw93 Apr 20, 2026 — with giscus Maintainer

Uh oh!

zionuke Apr 28, 2026 — with giscus

Uh oh!

nornand May 30, 2026 — with giscus

giscus[bot]
Bot Apr 3, 2026

Replies: 4 comments 2 replies

xiaoxiaohub
Apr 14, 2026 — with giscus

tw93 Apr 20, 2026 — with giscus
Maintainer

MrSchnappi
Apr 16, 2026 — with giscus

tw93 Apr 20, 2026 — with giscus
Maintainer

zionuke
Apr 28, 2026 — with giscus

nornand
May 30, 2026 — with giscus