
Accelerating LLM Inference with vLLM 

4.1K views

vLLM is an open-source, high-performance engine for LLM inference and serving, developed at UC Berkeley. It has been widely adopted across the industry, with 12K+ GitHub stars and 150+ contributors worldwide. Since its initial release, the vLLM team has improved performance by more than 10x. This session first covers core topics in LLM inference performance, including paged attention and continuous batching. It then focuses on recent additions to vLLM and the technical challenges behind them: speculative decoding, prefix caching, disaggregated prefill, and multi-accelerator support. The session concludes with industry case studies of vLLM and the future roadmap.
Takeaways: vLLM is an open-source engine for LLM inference and serving that provides state-of-the-art performance and an accelerator-agnostic design. By focusing on production readiness and extensibility, vLLM's design choices have led to new system insights and rapid community adoption.
Talk By: Cade Daniel, Software Engineer, Anyscale; Zhuohan Li, PhD student, UC Berkeley / vLLM
Here's more to explore:
LLM Compact Guide: dbricks.co/43WuQyb
Big Book of MLOps: dbricks.co/3r0Pqiz
Connect with us: Website: databricks.com
Twitter: databricks
LinkedIn: www.linkedin.com/company/data…
Instagram: databricksinc
Facebook: databricksinc

Science

Published: Jul 23, 2024

Comments: 8
@SilasEgbert-i7s 11 days ago
Era Brooks
@MukulTripathi 1 month ago
Once it starts supporting tool calling with local models, I will switch to it.
@VirginiaMarrone-p1v 5 days ago
Benton Club
@AmySmith-w5n 7 days ago
McDermott Lake
@LawsonLynn-o9v 12 days ago
Crawford Meadows
@BensonBetsy-w3u 16 days ago
Miller Views
@JosephCherry-y1f 17 days ago
Troy Motorway
@RichardsonSandy-p5h 17 days ago
Jerome Cliff