Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM | Jared Casper

⏱ 24:04 | 👁 8.2K views | 🗓 2 years ago

Related videos

Understanding the LLM Inference Workload - Mark Moyou, NVIDIA (34:14)
27k • 1 year ago

Ultimate Guide To Scaling ML Models - Megatron-LM | ZeRO | DeepSpeed | Mixed Precision (1:22:58)
36k • 3 years ago

Why Is Claude Opus 4.7 Being Lazy? And How to Properly Fix It With Opus 4.8 (6:28)
92 • 1 hour ago

The Ultra-Scale Playbook: Training LLMs on GPU Clusters (1:25:56)
3k • 1 year ago

Stanford CS231N | Spring 2025 | Lecture 11: Large Scale Distributed Training (1:12:53)
46k • 9 months ago

Applications of Transformer Based Language Models to Comparative Evolutionary Genomics | Kevin Liu (1:02:38)
80 • 3 weeks ago

Chinchilla Explained: Compute-Optimal Massive Language Models (32:46)
22k • 4 years ago

Training LLMs at Scale - Deepak Narayanan | Stanford MLSys #83 (56:00)
16k • Streamed 2 years ago

How Fully Sharded Data Parallel (FSDP) works? (32:31)
34k • 2 years ago

Multi-GPU Communication Libraries for Scaling HPC and AI Workloads | NVIDIA GTC 2025 (41:41)
7.7k • 1 year ago

Mastering LLM Inference Optimization From Theory to Cost Effective Deployment: Mark Moyou (33:39)
46k • 1 year ago

Turing-NLG, DeepSpeed and the ZeRO optimizer (21:18)
21k • 6 years ago