Getting Started with Ollama and Web UI

Dan Vega

Подписаться 60 тыс.

Просмотров 19 тыс.

50% 1

Видео Поделиться Скачать Добавить в

Опубликовано:

8 сен 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 35

@hfislwpa Месяц назад

2 videos in 1 day? Woah! Thanks

@user-zk1zm6sm2u Месяц назад

Interesting tutorial with Web UI and Ollama, Thanks!!!

@AleksandarT10 Месяц назад

Great one Dan! Keep ups updated on the AI stuff!

@bause6182 Месяц назад

Ollama should integrate a feature like artifact that allow you to test your html css code in a mini webview

@user-ym6tb5xb2v 24 дня назад

How can I connect my local ollama3 with webUi, My webUI couldn't find the locally running ollama3

@MURD3R3D 7 дней назад

same problem

@MURD3R3D 7 дней назад

from home page of your webUI localhost3000 in your browser, click on your account name in the lower left, then click settings, then "models", then you can pull llama3.1 by typing it in the "pull" box and clicking the download button. when it completes, close webUI and reopen it. then i had the option to select 3.1 8B from the models list

@user-ym6tb5xb2v 6 дней назад

@@MURD3R3D i found that happen due to docker networking.

@vrynstudios 29 дней назад

A perfect tutorial.

@lwjunior2 Месяц назад

This is great. Thank you

@je2587 21 день назад

Love your terminal, which tools do you use to customize it?

@borntobomb Месяц назад

Note for 405B: We are releasing multiple versions of the 405B model to accommodate its large size and facilitate multiple deployment options: MP16 (Model Parallel 16) is the full version of BF16 weights. These weights can only be served on multiple nodes using pipelined parallel inference. At minimum it would need 2 nodes of 8 GPUs to serve. MP8 (Model Parallel 8) is also the full version of BF16 weights, but can be served on a single node with 8 GPUs by using dynamic FP8 (Floating Point 8) quantization. We are providing reference code for it. You can download these weights and experiment with different quantization techniques outside of what we are providing. FP8 (Floating Point 8) is a quantized version of the weights. These weights can be served on a single node with 8 GPUs by using the static FP quantization. We have provided reference code for it as well. 405B model requires significant storage and computational resources, occupying approximately 750GB of disk storage space and necessitating two nodes on MP16 for inferencing.

@user-br4gt7xu2j 29 дней назад

and what about 70B? How it could be served? Could some of llama 3.1 be used by simple 16-cores laptop with integrated GPU and 32GB ram?

@isaac10231 5 дней назад

When you say "we" do you work for meta?

@chameleon_bp Месяц назад

Dan, what the specs for your local machine?

@zo7lef Месяц назад

Would make a video on how to integrate llama 3 to wordpress website, making chatbot or co pilot

@trapez_yt 25 дней назад

Hey, could you make a video on how to edit the login page? I want to make the login page to my liking.

@mochammadrevaldi1790 6 дней назад

in Ollama Is there an admin dashboard for tuning the model, sir?

@expire5050 18 дней назад

finally setup open webui thanks to you. i'd approached it, seen "docker" and left it on my todo list for weeks/months. I'm running gemma2 2b on my gtx 1060 6gb vram. any suggestions on good models for my size?

@NikolaiMhishi Месяц назад

Bro you the G

@khalildureidy Месяц назад

Big thanks from Palestine

@ilkou Месяц назад

❤💚🖤

@elhadjibrahimabalde1234 22 дня назад

be safe

@kashifmanzoor7949 22 дня назад

Stay strong

@vikas-jz3tv 14 дней назад

How we can tune a model with custom data?

@DrMacabre 24 дня назад

hello, any idea how to set keep_alive when running the windows exe ?

@stoicguac9030 Месяц назад

Is WebUI a replacement for aider?

@elhadjibrahimabalde1234 22 дня назад

hello. After installing OpenWebUI, I am unable to find OLLAM under 'Select a Model'. Is this due to a specific configuration? For information, my system is running Ubuntu 24.04.