返回笔记列表

CS336

课程笔记

17 篇笔记

0 个标签

CS336: Language Models from Scratch (Lecture 1)

CS336: PyTorch Primitives & Resource Accounting (Lecture 2)

LLM 架构与超参数 (Architectures and Hyperparameters)

Mixture of Experts (MoE) From Scratch to DeepSeek V3

GPU 架构与大模型高效计算 (GPU Architecture & Efficient Computing for LLMs)

GPU 高性能编程与内核优化 (High Performance GPU Programming & Kernels)

Parallelism 1 (Multi-Machine Optimization)

大规模语言模型训练——并行计算基础 (Parallelism 2)

Scaling Laws (Part 1) - 基础与数据/模型扩展

从零构建语言模型 (Language Modeling from Scratch) - 第10讲：推理 (Inference)

Scaling Laws 2 (扩展定律进阶与实战)

斯坦福 CS336：从零构建语言模型 (Language Modeling from Scratch)

CS336 Language Modeling from Scratch | Lecture 13: Data 1

CS336 Language Modeling from Scratch

Stanford CS336 Lecture 15: Alignment - Supervised Fine-Tuning (SFT) & RLHF

Alignment - RL from Verifiable Rewards (基于可验证奖励的强化学习)

从零构建语言模型 - 对齐与强化学习进阶 (Alignment - RL 2)