Ep#7 Managing Generative AI and APIs with Josh Collier 

FinOps NEXUS

In this episode of the FinOps NEXUS Podcast, host Jon Myer is joined by Josh Collier from Grammarly to dive into the intricate world of managing Generative AI and APIs. Josh shares his expertise on choosing the right capacity type, the benefits and challenges of using third-party LLMs, and how these decisions impact financial operations. Whether you're exploring how to optimize costs with AI or seeking insights into the technical aspects of API management, this episode has you covered.
🔑 Key Topics Covered:
The importance of choosing the right LLM capacity type
Differences between hosted and third-party LLMs
How Generative AI impacts FinOps
Cost management and optimization strategies
Practical insights into API performance and latency
📅 Podcast Timeline:
0:00 - Introduction and welcome
0:06 - Guest introduction: Josh Collier
0:22 - Overview of managing Generative AI APIs
0:55 - Explanation of LLM and its relevance
1:37 - Benefits of using third-party LLMs
2:55 - Cost implications and FinOps relevance
3:16 - Hosted LLM vs. third-party LLM comparison
3:55 - Understanding token consumption and costs
5:05 - Performance considerations for shared vs. dedicated capacity
7:39 - Support and troubleshooting differences
7:55 - Key considerations for performance and latency
10:51 - Acceptable latency levels and impact on user experience
12:20 - FinOps and cost optimization in Generative AI
13:44 - Rapid adoption of GenAI and cost implications
15:06 - Advice for implementing Generative AI effectively
15:59 - Wrap-up and closing remarks
👉 Check out our community website and more episodes: finopsnexus.com
👉 Listen on Spotify: open.spotify.com/show/1iyJVVI...
#FinOps #GenerativeAI #APIs #CloudOptimization #Podcast

Published: 14 Jun 2024
