Deploy ANY Open-Source LLM with Ollama on an AWS EC2 + GPU in 10 Min (Llama-3.1, Gemma-2 etc.)

Developers Digest

24K subscribers

3.6K views

Published: Oct 21, 2024

Comments: 19

@DevelopersDigest 2 months ago

The best way to support this channel? Comment, like, and subscribe!

@hpongpong 2 months ago

Great concise presentation. Thank you so much!

@DevelopersDigest 2 months ago

Thank you! 🙏

@ryanroman6589 2 months ago

this is super valuable. awesome vid!

@DevelopersDigest 2 months ago

Thank you! 🙏

@rembautimes8808 2 months ago

Thanks very nice tutorial

@DevelopersDigest 2 months ago

Thank you

@alejandrogallardo1414 2 months ago

For models at ~70B, I am getting timeout issues using vanilla Ollama. It works with the first pull/run, but times out when I need to reload the model. Do you have any recommendations for persistently keeping the same model running?

@DevelopersDigest 2 months ago

github.com/ollama/ollama/pull/2146
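
For readers following this thread: assuming the linked pull request concerns Ollama's `keep_alive` request option, a minimal sketch of a request that asks the server to keep a model loaded might look like the following. The helper name `build_generate_request` and the model tag `llama3.1:70b` are placeholders chosen for illustration; `11434` is Ollama's default port, and a negative `keep_alive` value is documented as "keep loaded indefinitely".

```python
import json
import urllib.request


def build_generate_request(base_url, model, prompt, keep_alive=-1):
    """Build a POST request for Ollama's /api/generate endpoint.

    keep_alive=-1 asks the server to keep the model in memory
    indefinitely; a duration string such as "30m" is also accepted.
    (Hypothetical helper, not part of any Ollama client library.)
    """
    payload = {
        "model": model,
        "prompt": prompt,
        "stream": False,
        "keep_alive": keep_alive,
    }
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder host and model tag; adjust to your own instance.
    req = build_generate_request("http://localhost:11434", "llama3.1:70b", "Hello")
    # A generous timeout gives a large model time to load on first use.
    with urllib.request.urlopen(req, timeout=600) as resp:
        print(json.loads(resp.read())["response"])
```

The same default can also be set server-wide with the `OLLAMA_KEEP_ALIVE` environment variable, which avoids passing the option on every request.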

@danielgannage8109 2 months ago

This is very informative! Thanks :) Curious why you used a g4dn.xlarge GPU ($300/month) instead of a t3.medium CPU ($30/month)? I assumed the 8 billion parameter model was out of reach with regular hardware. What max model size works with the g4dn.xlarge GPU? To put it into perspective, I have a $4K MacBook (16 GB RAM) that can really only run the large (150 million) or medium (100 million parameter) sized model, and I think the t3.medium CPU on AWS can only run the 50 million parameter (small) model.
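
As a rough aid for the sizing question above, here is a back-of-the-envelope sketch that estimates the memory needed for model weights alone. It assumes memory scales as parameters times bits per parameter, and it ignores the KV cache, activations, and runtime overhead, so real requirements are higher. The function name `weight_memory_gib` is hypothetical. For reference, a g4dn.xlarge carries a single NVIDIA T4 with 16 GB of GPU memory.

```python
def weight_memory_gib(num_params, bits_per_param):
    """Approximate memory for model weights only, in GiB.

    Excludes KV cache, activations, and runtime overhead,
    so treat the result as a lower bound.
    """
    return num_params * bits_per_param / 8 / 2**30


if __name__ == "__main__":
    for bits in (4, 8, 16):
        gib = weight_memory_gib(8e9, bits)
        print(f"8B parameters at {bits}-bit: about {gib:.1f} GiB of weights")
```

Under these assumptions, an 8B model quantized to 4 bits needs under 4 GiB for weights, while the same model at 16 bits needs close to 15 GiB, which leaves very little headroom on a 16 GB card.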

@rehanshaikh2708 12 days ago

How can I use this endpoint in LangChain ChatOllama?

@dylanv3044 2 months ago

Maybe a dumb question: how do you turn the stream data you receive into readable sentences?

@DevelopersDigest 2 months ago

You could accumulate tokens, split them at sentence-ending punctuation (. ! ? etc.), and then send the response after a grouping function like that.
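
The grouping idea in this reply could be sketched as follows. This is an illustration only, under the assumption that the stream is Ollama's newline-delimited JSON from `/api/generate`, where each line carries a text fragment in `response` and the final line has `done` set to true. The function name `sentences_from_stream` is hypothetical, and the simple regex will also split on abbreviations such as "e.g. ".

```python
import json
import re
from typing import Iterable, Iterator

# A sentence boundary: '.', '!' or '?' followed by whitespace.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def sentences_from_stream(lines: Iterable[str]) -> Iterator[str]:
    """Accumulate streamed fragments and yield complete sentences."""
    buffer = ""
    for line in lines:
        if not line.strip():
            continue  # skip blank keep-alive lines
        chunk = json.loads(line)
        buffer += chunk.get("response", "")
        parts = _SENTENCE_END.split(buffer)
        # Every part except the last ends at a sentence boundary.
        for sentence in parts[:-1]:
            yield sentence
        buffer = parts[-1]
        if chunk.get("done"):
            break
    # Flush whatever remains once the stream ends.
    if buffer.strip():
        yield buffer.strip()


if __name__ == "__main__":
    sample = [
        '{"response": "Hello"}',
        '{"response": " world."}',
        '{"response": " How are"}',
        '{"response": " you?"}',
        '{"response": "", "done": true}',
    ]
    for sentence in sentences_from_stream(sample):
        print(sentence)
```

A sentence is emitted only once the text following its punctuation arrives, so the final sentence is flushed when the stream reports `done`.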

@nexuslux 2 months ago

Can you use Open WebUI?

@ConAim 1 month ago

Stay away from AWS; it will cost you an arm and a leg in the long run.

@DevelopersDigest 1 month ago

Which vendors do you prefer? 🙂

@BeCodeless-dot-net 2 months ago

Nice explanation

@DevelopersDigest 2 months ago

Thank you!
