
Accelerate Your GenAI Model Inference with Ray and Kubernetes - Richard Liu, Google Cloud 

CNCF [Cloud Native Computing Foundation]

Generative AI has become increasingly prevalent in recent years and is reaching a critical point, as models are demonstrating human-level capabilities. However, serving these massive models has presented new technical challenges: they contain hundreds of billions of parameters and require massive computational resources. In this talk, we will discuss how to serve GenAI models using KubeRay on Kubernetes with hardware accelerators like GPUs and TPUs. Practitioners will learn how to get these large models into production on a performant and cost-effective Kubernetes platform. Ray is an open-source framework for distributed machine learning that enables ML practitioners to scale their workloads out to large clusters of machines. Ray Serve offers a scalable, framework-agnostic library for online inference suitable for large and complex models. The audience will learn how integrating Ray with accelerators can create a powerful platform for serving GenAI models.
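
As a rough illustration of the deployment pattern the abstract describes (serving a Ray Serve application on Kubernetes via KubeRay, with GPU workers), here is a minimal sketch of a RayService manifest. The application name, import path, image tag, and resource counts are illustrative assumptions, not taken from the talk:

```yaml
# Hypothetical RayService manifest for GPU-backed Ray Serve inference.
# Names, image tag, and replica/GPU counts are placeholders.
apiVersion: ray.io/v1
kind: RayService
metadata:
  name: genai-inference
spec:
  serveConfigV2: |
    applications:
      - name: llm-app
        import_path: serve_app:deployment   # assumed module:object in the image
  rayClusterConfig:
    headGroupSpec:
      rayStartParams:
        dashboard-host: "0.0.0.0"
      template:
        spec:
          containers:
            - name: ray-head
              image: rayproject/ray-ml:2.9.0
    workerGroupSpecs:
      - groupName: gpu-workers
        replicas: 2
        rayStartParams: {}
        template:
          spec:
            containers:
              - name: ray-worker
                image: rayproject/ray-ml:2.9.0
                resources:
                  limits:
                    nvidia.com/gpu: 1   # one GPU per worker pod, assumed
```

Applied with `kubectl apply -f rayservice.yaml` on a cluster where the KubeRay operator is installed, this asks the operator to stand up a Ray cluster with GPU worker pods and run the Serve application on it; TPU serving follows the same shape with TPU resource requests instead.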

Published: 4 Oct 2024
