Enchanted LLM and the bright path for open language models 

Code to the Moon
71K subscribers
6K views

Published: 2 Oct 2024

Comments: 32
@codetothemoon 8 months ago
ERRATA: In the video I mention that setting OLLAMA_HOST is an alternative to using ngrok, but that's only the case when you just need access on your local network. ngrok apparently lets you use your Ollama instance from anywhere, which sounds awesome (thanks to @havokgames8297 for pointing this out)
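For reference, a rough sketch of the two approaches the errata mentions, assuming Ollama's default port 11434 (the addresses below are placeholders):
```sh
# Local network only: have the Ollama server listen on all interfaces
# instead of just localhost (Ollama's default port is 11434).
OLLAMA_HOST=0.0.0.0 ollama serve
# Other devices on the same network can then use http://<your-LAN-IP>:11434

# Access from anywhere: tunnel the local Ollama port through ngrok.
ngrok http 11434
# ngrok prints a public forwarding URL you can point a client
# (e.g. the Enchanted app) at.
```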
@melongrasp 8 months ago
Oh, that's really nice. Thanks for sharing!
@newtonchutney 7 months ago
Yep, ngrok can be used to forward your RPi! 😂 But I'd suggest people start looking into Tailscale instead, as it offers a lot more security and privacy.
@newtonchutney 7 months ago
BTW, Tailscale is a mesh VPN system, not a reverse proxy like ngrok.
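A minimal sketch of the Tailscale route, assuming Ollama is already listening on 0.0.0.0:11434 on the host:
```sh
# Join both the Ollama host and the client device to the same tailnet.
tailscale up

# On the Ollama host, print its Tailscale IPv4 address.
tailscale ip -4

# From the client, reach Ollama over the encrypted mesh from anywhere
# (replace 100.x.y.z with the address printed above).
curl http://100.x.y.z:11434/api/tags
```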
@Kabodanki 8 months ago
LLMs not biased by the Bay Area mentality are the future. I'm glad Mistral is French; there's some hope of getting away from censorship.
@codetothemoon 8 months ago
not sure, it might have a bias towards crepes and baguettes, but I think I'm ok with that!
@fooblahblah 8 months ago
You can use ngrok to proxy to your internal machine, but via an external hostname or IP.
@codetothemoon 8 months ago
Thanks - I’ve added this as a pinned errata comment
@devopstoolbox 8 months ago
That is SO COOL!!!
@codetothemoon 8 months ago
agree - I know you've been on the Ollama train too 🚂
@undefined24 8 months ago
Looks promising, thanks for sharing.
@codetothemoon 8 months ago
thanks for watching!
@lenninlc 8 months ago
So cool!
@codetothemoon 8 months ago
😎
@dpi3981 8 months ago
What GPU do you use for your setup?
@codetothemoon 8 months ago
I have an M1 Max which has an integrated GPU
@vimaximus1360 8 months ago
I would love to see some hardware comparisons between a Mac with 32+ GB of RAM and some Nvidia GPUs.
@codetothemoon 8 months ago
this video might be what you're looking for! ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-jaM02mb6JFM.html
@vimaximus1360 8 months ago
perfect! thank you @codetothemoon!
@youpapai 8 months ago
Why does `ollama run something` pull/download the model every time? Is there a setting to cache it or use the cached downloaded model?
@codetothemoon 8 months ago
it doesn't, at least for me. everything that appears in the list of language models to choose from is already downloaded and ready to go. That said, they might take a few seconds to load into memory, especially if they are on the larger side. Mistral 7B only takes ~10 seconds or so to load into memory for me. Are you seeing an issue where the model is downloaded on every run?
@youpapai 7 months ago
@codetothemoon yes, it's being downloaded on every run.
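For anyone hitting this, a quick sketch of checking the local model cache (model files typically live under ~/.ollama/models):
```sh
# Pull a model once; subsequent runs should reuse the local copy.
ollama pull mistral

# List the models that are already downloaded.
ollama list

# This should start from the cached copy rather than re-downloading.
ollama run mistral
```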
@havokgames8297 8 months ago
Ngrok would let you access your Ollama without being on the same Wi-Fi as your computer.
@codetothemoon 8 months ago
ahh got it - thanks for clarifying this! I should have looked into it a bit more. I'll post this in an errata comment.
@havokgames8297 8 months ago
@codetothemoon no worries. No one would expect you to be an expert at everything. I've used ngrok, for example, when developing a web app locally that has webhooks and I want an external service to be able to access my local development server. It is perfect for this. The issue is that on the free tier it won't keep the same hostname, so when you configure your Enchanted LLM app, if you restart ngrok then the URL will be different. Either you can pay for the service and get static URLs (I believe), or use another static DNS service with a hostname pointing to your machine.
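A rough sketch of the static-hostname approach, assuming a v3 ngrok client and a domain you have reserved in the ngrok dashboard (the hostname below is a placeholder):
```sh
# Start the tunnel bound to the reserved domain (11434 is Ollama's
# default port), so the public URL survives ngrok restarts.
ngrok http 11434 --domain=your-reserved-name.ngrok-free.app

# A client like Enchanted can then keep pointing at
# https://your-reserved-name.ngrok-free.app
```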
@rnp0728 8 months ago
Great
@codetothemoon 8 months ago
thanks!
@TommiNiemi-hu8pb 8 months ago
ngrok is for NAT traversal.
@codetothemoon 8 months ago
Thanks yeah I made a pinned errata comment about this 😎
@CrazyLuke11 8 months ago
First 🎉🎉😂😂
@codetothemoon 8 months ago
you got it! 🥇