FirefliesAudio

🏠 Home ❤️ Liked ⏳ History

Deep Dive: Optimizing LLM inference

⏱ 36:12 | 👁 49 mil visualizações | 🗓 2 years ago

🎵 Baixar MP3 🎥 Baixar MP4

Vídeos relacionados

baixar Deep Dive: Quantizing Large Language Models, part 2 mp3

Deep Dive: Quantizing Large Language Models, part 2

4.5k • 2 years ago

baixar Why Inference is hard.. mp3

Why Inference is hard..

158k • 1 month ago

baixar AMA: What's Actually Working in AI for Coaches & Educators Right Now mp3

AMA: What's Actually Working in AI for Coaches & Educators Right Now

153 • Streamed 17 hours ago

baixar Mastering LLM Inference Optimization From Theory to Cost Effective Deployment: Mark Moyou mp3

Mastering LLM Inference Optimization From Theory to Cost Effective Deployment: Mark Moyou

46k • 1 year ago

baixar Exploring the Latency/Throughput & Cost Space for LLM Inference // Timothée Lacroix // CTO Mistral mp3

Exploring the Latency/Throughput & Cost Space for LLM Inference // Timothée Lacroix // CTO Mistral

28k • 2 years ago

baixar 🔍 AI Serving Frameworks Explained: vLLM vs TensorRT-LLM vs Ray Serve | Which One Should You Use? mp3

🔍 AI Serving Frameworks Explained: vLLM vs TensorRT-LLM vs Ray Serve | Which One Should You Use?

1.6k • 8 months ago

baixar Deep dive - Better Attention layers for Transformer models mp3

Deep dive - Better Attention layers for Transformer models

15k • 2 years ago

baixar The Big LLM Architecture Comparison mp3

The Big LLM Architecture Comparison

41k • 8 months ago

baixar Deep Dive: Quantizing Large Language Models, part 1 mp3

Deep Dive: Quantizing Large Language Models, part 1

23k • 2 years ago

baixar How the VLLM inference engine works? mp3

How the VLLM inference engine works?

21k • 8 months ago

baixar Nicholas Carlini - Black-hat LLMs | [un]prompted 2026 mp3

Nicholas Carlini - Black-hat LLMs | [un]prompted 2026

353k • 2 months ago

baixar State of LLMs 2026: RLVR, GRPO, Inference Scaling — Sebastian Raschka mp3

State of LLMs 2026: RLVR, GRPO, Inference Scaling — Sebastian Raschka

19k • 4 months ago