
Understanding How Ollama Stores Models 

Matt Williams
29K subscribers
6K views

Where does Ollama store the models? How do you decipher the blobs? Why can't the models just be named something that makes sense? Everything has a reason, and after watching this, it should all make sense.
Be sure to sign up to my monthly newsletter at technovangelist.com/newsletter
And if interested in supporting me, sign up for my patreon at / technovangelist
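
The description's questions about where the models live and how to read the blobs can be sketched in a few lines of Python. This is a minimal sketch, not taken from the video: it assumes a model's manifest is a JSON document with a `layers` list, that each layer has a `mediaType` and a `digest`, and that each digest names a file in a `blobs` directory. The function name `describe_layers` and the sample manifest are hypothetical.

```python
import json
from pathlib import PurePosixPath


def describe_layers(manifest_text, blobs_dir):
    """Map each layer in a manifest to the blob file assumed to hold it.

    Assumption: the manifest is JSON with a "layers" list whose entries
    have "mediaType" and "digest" keys, and blobs are named after the digest.
    """
    manifest = json.loads(manifest_text)
    result = []
    for layer in manifest.get("layers", []):
        # The last dot-separated part of the media type hints at the
        # layer's role (for example "model" or "template").
        role = layer["mediaType"].rsplit(".", 1)[-1]
        result.append((role, PurePosixPath(blobs_dir) / layer["digest"]))
    return result


# Hypothetical manifest for illustration only; the digests are shortened.
sample = json.dumps({
    "layers": [
        {"mediaType": "application/vnd.ollama.image.model",
         "digest": "sha256:aaaa"},
        {"mediaType": "application/vnd.ollama.image.template",
         "digest": "sha256:bbbb"},
    ]
})

for role, path in describe_layers(sample, "blobs"):
    print(role, path.name)
```

Read this way, the opaque blob names are content digests, and the manifest is what gives each blob a meaning.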

Science

Published: 21 Jan 2024

Comments: 14
@kenchang3456 6 months ago
I'm still waiting for some kind of out-take moment at the end of the video :-) Thanks for sharing.
@technovangelist 6 months ago
I make no mistakes of course
@dr.mikeybee 6 months ago
Well done! Thank you.
@technovangelist 6 months ago
Thanks for watching!
@bernard2735 6 months ago
Love your videos, can't wait for the Windows version 😊
@technovangelist 6 months ago
Thanks so much. Anything else I can help you with?
@Tarbard 6 months ago
Good video. The sha256:... format filenames aren't supported on Windows filesystems because of the colon (I noticed this when trying to back up to a drive that was shared with Windows), so I wonder if this is one of the things that needs to be changed for the Windows release.
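
The colon problem raised above can be illustrated with a short Python sketch. It shows one possible workaround, replacing the colon with a hyphen. The thread does not say how, or whether, Ollama handles this, so the function name `windows_safe_name` and the hyphen convention are assumptions made for illustration.

```python
def windows_safe_name(digest):
    """Return a filename for a digest that avoids the colon.

    Assumption: swapping ":" for "-" is an acceptable naming scheme.
    The source thread does not confirm any particular convention.
    """
    algorithm, sep, hex_part = digest.partition(":")
    if not sep or not algorithm or not hex_part:
        raise ValueError(f"expected '<algorithm>:<hex>', got {digest!r}")
    return f"{algorithm}-{hex_part}"


# Shortened digest, for illustration only.
print(windows_safe_name("sha256:abc123"))
```

The digest itself is unchanged, only the separator differs, so the mapping stays reversible.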
@ExploreTogetherYT 4 months ago
Do you have any idea how to give the model context data in a similar way, like how we can use Python libraries to send requests to OpenAI and interact with it via code?
@greyowlaudio 3 months ago
Ollama for Windows has now come out, and I'm wondering where it stores the models on Windows?
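
The thread leaves this question unanswered. As a hedged sketch only: assuming the models sit under a `.ollama/models` folder in the home directory of the user running Ollama, and that an `OLLAMA_MODELS` environment variable overrides that location, the lookup could be written as below. Both assumptions come from outside this page and should be checked against Ollama's own documentation. The helper name `guess_models_dir` is hypothetical.

```python
import os
from pathlib import Path


def guess_models_dir(env=None, home=None):
    """Guess where models are stored.

    Assumptions (not stated on this page): OLLAMA_MODELS, when set,
    overrides the location; otherwise models live in ".ollama/models"
    under the home directory of the user that runs Ollama.
    """
    env = os.environ if env is None else env
    override = env.get("OLLAMA_MODELS")
    if override:
        return Path(override)
    home = Path.home() if home is None else Path(home)
    return home / ".ollama" / "models"


print(guess_models_dir())
```

On a machine where Ollama runs as a dedicated service user, the home directory in question would be that user's, not the one of whoever runs the script.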
@nassosdimou3337 5 months ago
In Ollama, can I run a model by path rather than by name? For example, instead of 'ollama run ' on Linux, or 'ollama.run("model")' with the Python library, could I give a path instead of a model name?
@technovangelist 5 months ago
Well, normally the Modelfile isn't something you use beyond creating the model. If you are thinking about pointing to the model weights, that's just a portion of the model in Ollama. It's strange that some tools suggest that the GGUF or alternative file would be the model on its own.
@truehighs7845 4 months ago
Yes, but more importantly, how do I reuse the already downloaded models with other frameworks? Everyone is downloading files of 5 to 30 GB or more, and it would be good if they were all sourced from the same image. Is there a way to expose those Ollama models for consumption by another engine? Currently Jan doesn't recognise the way models are stored in Ollama. *Edit: and vice versa. I am thinking about vLLM and Jan. PS: Ollama webui lite doesn't post in the chat, and Open-webui is broken: I could not get the container to render the app, no matter how much I played with the compose YAMLs. Then again, I wanted GPU acceleration, so I don't know if the deploy args are going to work without Swarm enabled: deploy: resources: reservations: devices: - driver: ${OLLAMA_GPU_DRIVER-nvidia} count: ${OLLAMA_GPU_COUNT-1} Do you happen to know if those variables are recognised system-wide as they are, or do they need to be in the usr/bin?
@technovangelist 4 months ago
I have another video that shows a tool that supports this. It's pretty easy to use Ollama's models in every other toolset.
@truehighs7845 4 months ago
@technovangelist Great, I'll look for it, thanks!