Two prominent Chinese AI startups, DeepSeek and Moonshot AI, unveiled reasoning models that rival OpenAI’s o1, and shared insights into large-scale reinforcement learning for training LLMs.
Nice Tony. Just saw this. Great stuff. Have repacked it (since I had only done notes earlier). Yours has good relevant details.
ReStaCked. Hahah damn spellcheck
haha thanks for reading!
Great work Tony, I am referencing this in a new Substack post that will go up tomorrow. Wanted to see what you thought of this: https://www.amd.com/en/developer/resources/technical-articles/amd-instinct-gpus-power-deepseek-v3-revolutionizing-ai-development-with-sglang.html
I am trying to determine what GPU resources DeepSeek has access to. All of the discussion over the past week has focused on H800, A100, and H100, but not on AMD... what is your view of this press release, which talks about the "long standing collaboration" between AMD and DeepSeek?
Seemingly AMD optimized its GPUs to better run DeepSeek-V3 for inference?