Need a heavy GPU machine? Check out this video on setting up an AWS EC2 GPU instance. If you like this one, check out my video on setting up a full RAG API with Llama3, Ollama, Langchain and ChromaDB - ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-7VAs22LC7WE.html
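For anyone who wants a quick taste of what that RAG video covers, here is a minimal sketch of that stack, assuming an Ollama server is already running locally on its default port with llama3 pulled, and that the langchain, langchain-community, and chromadb packages are installed. The sample text and model choice are placeholders, not the exact setup from the video.

```python
# Minimal RAG sketch: Ollama (Llama3) + ChromaDB via LangChain.
# Assumes: `pip install langchain langchain-community chromadb` and a local
# Ollama server (`ollama pull llama3` already done).
from langchain_community.llms import Ollama
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.vectorstores import Chroma
from langchain.chains import RetrievalQA

# Embed a toy document set into Chroma (placeholder content).
embeddings = OllamaEmbeddings(model="llama3")
vectorstore = Chroma.from_texts(
    ["Ollama serves local LLMs over an HTTP API on port 11434."],
    embedding=embeddings,
)

# Wire the retriever and the LLM into a question-answering chain.
llm = Ollama(model="llama3")
qa = RetrievalQA.from_chain_type(llm=llm, retriever=vectorstore.as_retriever())
print(qa.invoke({"query": "What port does Ollama listen on?"}))
```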
OMG!!! I freaking love you, I've been struggling with Llama deployment on AWS and you've made it crystal clear. I'll do anything to support your channel. YOU'RE THE BEST!!!
Thanks a lot for the video!! Question: is it possible to start the instance only when a request hits the server? It could be useful for limiting costs. I think it is feasible with Kubernetes and Docker, but I would enjoy a video about it :)! Thanks again, very good video.
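Not the author, but one low-tech way to do this without Kubernetes is to keep the instance stopped and start it on demand from a small gateway (e.g. a Lambda fronting your requests). A minimal boto3 sketch, where the instance ID and region are placeholders; a real setup would also stop the instance again after an idle period.

```python
# Sketch: start a stopped EC2 instance on demand and wait until it is running.
# INSTANCE_ID and region are placeholders; in practice this would sit behind
# an API Gateway/Lambda that fronts your Ollama requests.
import boto3

INSTANCE_ID = "i-0123456789abcdef0"  # placeholder
ec2 = boto3.client("ec2", region_name="us-east-1")

def ensure_running(instance_id: str) -> str:
    desc = ec2.describe_instances(InstanceIds=[instance_id])
    state = desc["Reservations"][0]["Instances"][0]["State"]["Name"]
    if state != "running":
        ec2.start_instances(InstanceIds=[instance_id])
        ec2.get_waiter("instance_running").wait(InstanceIds=[instance_id])
    # Return the (possibly new) public IP to forward the request to.
    desc = ec2.describe_instances(InstanceIds=[instance_id])
    return desc["Reservations"][0]["Instances"][0].get("PublicIpAddress", "")

print(ensure_running(INSTANCE_ID))
```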
The video was awesome and pretty helpful, but can you cover the security point of view too? Anyone with the IP and port number can access it, so how can we avoid that?
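Not the author, but the usual first step is to lock the EC2 security group down so only your own IP (or a reverse proxy that handles auth) can reach the Ollama port. A minimal boto3 sketch, assuming Ollama's default port 11434; the group ID and CIDR below are placeholders.

```python
# Sketch: allow inbound access to the Ollama port (default 11434) from one
# CIDR only. Group ID and CIDR are placeholders; you would also want to
# revoke any existing 0.0.0.0/0 rule on that port.
import boto3

ec2 = boto3.client("ec2", region_name="us-east-1")

ec2.authorize_security_group_ingress(
    GroupId="sg-0123456789abcdef0",  # placeholder security group
    IpPermissions=[{
        "IpProtocol": "tcp",
        "FromPort": 11434,
        "ToPort": 11434,
        "IpRanges": [{
            "CidrIp": "203.0.113.7/32",  # your workstation's IP only
            "Description": "allow my workstation",
        }],
    }],
)
```

For anything multi-user, putting Nginx with TLS and an API key in front of Ollama is a sturdier option than IP allow-listing alone.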
@fastandsimpledevelopment If I understand correctly, you can select the base Ubuntu 22.04 image and install everything yourself: NVIDIA driver, CUDA driver, TensorFlow, Python, etc.?
Yes, if the OS has support and you have an AMD or NVIDIA GPU installed, the latest version auto-detects it. You can also configure Ollama to NOT use the GPU, but by default it auto-detects.
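For reference, besides the config-file route, you can also ask Ollama per request not to offload any layers to the GPU via the num_gpu option of its REST API. A small sketch, assuming a local Ollama on the default port with llama3 pulled:

```python
# Sketch: force CPU-only inference for one request by setting num_gpu to 0
# (num_gpu = number of layers offloaded to the GPU) in Ollama's API options.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llama3",
        "prompt": "Say hello in one sentence.",
        "stream": False,
        "options": {"num_gpu": 0},  # 0 layers on GPU -> CPU only
    },
    timeout=300,
)
print(resp.json()["response"])
```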
By itself it is not; you need to add a front end like Nginx and run several Ollama servers behind it. That is the only way I am aware of today, though Ollama gets new updates all the time, so keep track of them.
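To make the "Nginx in front of several Ollama servers" idea concrete without writing out a proxy config: below is a minimal client-side round-robin sketch over multiple Ollama backends, which is the same job an Nginx upstream block does at the proxy layer. The backend addresses are placeholders.

```python
# Sketch: round-robin requests across several Ollama servers, the same idea
# an Nginx upstream block implements at the proxy layer. Hosts are placeholders.
import itertools
import requests

BACKENDS = itertools.cycle([
    "http://10.0.0.11:11434",  # placeholder Ollama server 1
    "http://10.0.0.12:11434",  # placeholder Ollama server 2
])

def generate(prompt: str, model: str = "llama3") -> str:
    base = next(BACKENDS)  # pick the next backend in rotation
    resp = requests.post(
        f"{base}/api/generate",
        json={"model": model, "prompt": prompt, "stream": False},
        timeout=300,
    )
    resp.raise_for_status()
    return resp.json()["response"]

print(generate("Why is the sky blue?"))
```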