Google Cloud AI Platforms and Infrastructure 

Tech Field Day

In this session, we’ll explore how Vertex AI, Google Kubernetes Engine (GKE), and Google Cloud’s AI Infrastructure provide a robust platform for AI development, training, and inference. We’ll discuss hardware choices for inference (CPUs, GPUs, TPUs), with real-world examples. We’ll also cover distributed training and inference with GPUs/TPUs, and how to optimize AI performance on GKE using tools like autoscaling and dynamic workload scheduling.
Brandon Royal, product manager at Google Cloud, discusses how to deploy AI workloads on Google Cloud's AI infrastructure. The session focuses on how Google Cloud applies AI to customer problems and on current trends in AI, particularly the platform shift towards generative AI. Brandon walks through the infrastructure designed for generative AI, covering inference, serving, training, and fine-tuning, and how each is supported in Google Cloud.
Brandon explains the evolution of AI models, particularly open models, and their importance for flexibility in deployment and optimization. He highlights that many AI startups and unicorns choose Google Cloud for their AI infrastructure and platforms. He also introduces Gemma, a new open model released by Google DeepMind, which is lightweight, state-of-the-art, and built on the same technology as Google's Gemini model. Gemma is available with open weights on platforms like Hugging Face and Kaggle.
The session then shifts to a discussion about AI platforms and infrastructure, with a focus on Kubernetes and Google Kubernetes Engine (GKE) as the foundation for open models. Brandon emphasizes the importance of flexibility, performance, and efficiency in AI workloads and how Google provides a managed experience with GKE Autopilot.
He also touches on the hardware choices for inference, including CPUs, GPUs, and TPUs, and states that Google Cloud offers the largest selection of AI accelerators in the market. Brandon shares customer stories, such as Palo Alto Networks' use of CPUs for deep learning models in threat detection systems. He also discusses deploying models on GKE, including autoscaling and dynamic workload scheduling.
Finally, Brandon gives a live demo of deploying the Gemma model on GKE, showing how to prompt the model for responses and how retrieval-augmented generation can make those responses more grounded. He also demonstrates a chat interface built with Gradio, an open-source Python library for building web UIs for machine learning models, and discusses scaling and managing AI workloads on Google Cloud.
Recorded live at AI Field Day 4 in Santa Clara, California, on February 22, 2024. Watch the entire presentation at techfieldday.com/appearance/g... or visit cloud.google.com/ai/generativ... or TechFieldDay.com/event/aifd4/ for more information.

Science

Published: February 23, 2024
