Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM | Jared Casper
Vídeos relacionados
34:14
Understanding the LLM Inference Workload - Mark Moyou, NVIDIA
1:22:58
Ultimate Guide To Scaling ML Models - Megatron-LM | ZeRO | DeepSpeed | Mixed Precision
6:28
Why Is Claude Opus 4.7 Being Lazy? And How to Properly Fix It With Opus 4.8
1:25:56
The Ultra-Scale Playbook: Training LLMs on GPU Clusters
1:12:53
Stanford CS231N | Spring 2025 | Lecture 11: Large Scale Distributed Training
1:02:38
Applications of Transformer Based Language Models to Comparative Evolutionary Genomics | Kevin Liu
32:46
Chinchilla Explained: Compute-Optimal Massive Language Models
56:00
Training LLMs at Scale - Deepak Narayanan | Stanford MLSys #83
32:31
How Fully Sharded Data Parallel (FSDP) works?
41:41
Multi-GPU Communication Libraries for Scaling HPC and AI Workloads | NVIDIA GTC 2025
33:39
Mastering LLM Inference Optimization From Theory to Cost Effective Deployment: Mark Moyou
21:18