Тёмный

host ALL your AI locally 

NetworkChuck
Подписаться 4,2 млн
Просмотров 1,1 млн
50% 1

Опубликовано:

 

22 сен 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 2,7 тыс.   
@NetworkChuck
@NetworkChuck 4 месяца назад
Ready to get a job in IT? Start studying RIGHT NOW with ITPro: go.acilearning.com/networkchuck (30% off FOREVER) *affiliate link Discover how to set up your own powerful, private AI server with NetworkChuck. This step-by-step tutorial covers installing Ollama, deploying a feature-rich web UI, and integrating stable diffusion for image generation. Learn to customize AI models, manage user access, and even add AI capabilities to your note-taking app. Whether you're a tech enthusiast or looking to enhance your workflow, this video provides the knowledge to harness the power of AI on your local machine. Join NetworkChuck on this exciting journey into the world of private AI servers. 📓📓Guide and Commands: ntck.co/ep_401 ⌨⌨My new keyboard: Keychron Q6 Max: geni.us/0SGY 🖥🖥My Computer Build🖥🖥 --------------------------------------------------- ➡Lian Li Case: geni.us/B9dtwB7 ➡Motherboard - ASUS X670E-CREATOR PROART WIFI: geni.us/SLonv ➡CPU - AMD Ryzen 9 7950X3D Raphael AM5 4.2GHz 16-Core: geni.us/UZOZ5 ➡Power Supply - Corsair AX1600i 1600 Watt 80 Plus Titanium: geni.us/O1toG ➡CPU AIO - Lian Li Galahad II LCD-SL Infinity 360mm Water Cooling Kit: geni.us/uBgF ➡Storage - Samsung 990 PRO 2TB Samsung: geni.us/hQ5c ➡RAM - G.Skill Trident Z5 Neo RGB 64GB (2 x 32GB): geni.us/D2sUN ➡GPU - MSI GeForce RTX 4090 SUPRIM LIQUID X 24G Hybrid Cooling 24GB: geni.us/G5BZ 🔥🔥Join the NetworkChuck Academy!: ntck.co/NCAcademy **Sponsored by ITProTv from ACI Learning
@MARO_MR
@MARO_MR 4 месяца назад
first reply
@mshark111
@mshark111 4 месяца назад
@@MARO_MR Second reply
@MARO_MR
@MARO_MR 4 месяца назад
@@mshark111 third reply
@xozx1715
@xozx1715 4 месяца назад
I use chat with rtx. Do you advise me to change to this?
@mshark111
@mshark111 4 месяца назад
@@MARO_MR LOL
@grregis
@grregis 3 месяца назад
Awesome video and super easy to follow along. Quick tip: if you forget to run a command as sudo, just type sudo !! and it will run your last command as sudo.
@chornge1
@chornge1 4 месяца назад
That moment when you realize port 11434 looks like the word llama
@arunramachandran5012
@arunramachandran5012 4 месяца назад
lol then it really should be 011434
@ThatRandomDude914
@ThatRandomDude914 4 месяца назад
@@arunramachandran5012you can’t do that
@MrAnt1V1rus
@MrAnt1V1rus 3 месяца назад
l33t knowledge right here
@MrAnt1V1rus
@MrAnt1V1rus 3 месяца назад
@@arunramachandran5012 its too many numbers for a service port, but yes
@9ubagurbi6
@9ubagurbi6 3 месяца назад
@@MrAnt1V1rus 1337
@Zvxers7
@Zvxers7 4 месяца назад
Man really gave his kids 2x rtx 4090s for school, he did the "mom i need this [overkill computer] for school"
@brandonwiederhold2573
@brandonwiederhold2573 4 месяца назад
Its only a $6K build lol
@Zvxers7
@Zvxers7 4 месяца назад
@@brandonwiederhold2573 only $6000 for school...
@notaras1985
@notaras1985 4 месяца назад
​@@brandonwiederhold2573ONLY 6000? You can adopt me any day
@Outsider_07
@Outsider_07 4 месяца назад
@@notaras1985 exactly
@fp1715
@fp1715 4 месяца назад
​@@notaras1985just do a video for vmware
@muditmishra1129
@muditmishra1129 Месяц назад
Bro called us poor in 14 different languages
@Fondofmelobster
@Fondofmelobster 21 день назад
That’s kinda his whole thing
@qkb3128
@qkb3128 18 дней назад
Right
@LordDudeious
@LordDudeious 12 дней назад
"He said we were poor, in fourteen different languages." Enough said.
@jamesbelcher
@jamesbelcher 4 месяца назад
Chuck, I saw the video yesterday on Ollama and I tried it today. I am blown away at how good llama3 is and how fast it is. Running on my i7 linux laptap with a nvidia gpu and it is incredible. Thanks again for your wonderful videos. Keep it up!
@samchris3793
@samchris3793 Месяц назад
Its brilliant isnt. Crazy part is totally free
@MandeepSingh-hn4jd
@MandeepSingh-hn4jd 17 дней назад
Apart from daily conversation what are other task it can do?
@JuankM1050
@JuankM1050 12 дней назад
What gpu?
@JeremyFeldmesser
@JeremyFeldmesser 4 месяца назад
I'm 62 years old and a computer techy, I'm no super genius though and I'm really happy to have been able to run a local AI on my PC. Private AI is the way to go for sure. I signed up for your free academy for now, there's enough in there to keep me learning/busy for a while yet! :)
@nahrafe
@nahrafe 4 месяца назад
Good job pops
@projectptube
@projectptube 4 месяца назад
now if we can just get some models that have no wokeness/leftist insanity.
@gaiustacitus4242
@gaiustacitus4242 4 месяца назад
​@@projectptube I would be happy with an AI that could actually write fairly entry level code instead of churning out garbage code that: 1) won't compile, and efforts to have AI integrated into the development environment correct issues makes it worse with each iteration 2) doesn't actually meet requirements (regardless of how many iterations made to fine tune the output, by which YOU are training the AI) 3) is poorly structured (leading to maintainability problems) 4) lacks proper error handling (leading to problems with stability and data integrity) 5) fails to follow any type of consistent naming convention (code quality/maintainability issues) 6) randomly include variables which determine type on first assignment 7) creates classes where local data types do not correspond to the columns defined in database tables: 7.a) string data types do not enforce the defined length limits 7.b) numeric variables are of inconsistent types 7.c) the data access layer doesn't handle null values, always storing 0 for numeric data types or zero-length strings for (n)varchar fields 8) thrashes database connections (a problem that connection pooling implemented in the client stack doesn't reliably solve) 9) introduces security vulnerabilities. I could go on, but why bother? The current state of AI for software development is to have companies and sole developers pay to use it while the AI is trained on the well-written source code (or at least better written) the developers end up producing. A packet sniffer will detect that not only is the corrected AI generated code being shared but also proprietary code which has not been authorized for such use.
@legendaryphoenix8607
@legendaryphoenix8607 4 месяца назад
​​​@@projectptube exactly cough... Gemini... cough. But what do you have in mind when you said that? I am interested to know
@HandFromCoffin
@HandFromCoffin 4 месяца назад
@@projectptube Hi my name is Richard, I always have to inject my views on things in to every topic. That’s my skill.
@OgBrog
@OgBrog 4 месяца назад
Alright, now integrate it into home assistant with text to speech and voice to text so you can have your own alexa that controls your home automation.
@shannonbreaux8442
@shannonbreaux8442 3 месяца назад
That's what I would like to see a video of him do
@sonofsid1
@sonofsid1 3 месяца назад
@@shannonbreaux8442 the ollama get hub has a plug in on how to do this. Also ollama has a python library so you can write your own python scripts to interact with ollama
@Mr_LA_Z
@Mr_LA_Z 3 месяца назад
Yeah, we need API access for home assistant. Does anyone know how we can do that, or that is too much of a challenge?
@miroslavwiesner7366
@miroslavwiesner7366 2 месяца назад
@@Mr_LA_Z ask AI
@rickeeepps6461
@rickeeepps6461 2 месяца назад
Read the HA release notes, they are working on this as we speak
@alexclark6777
@alexclark6777 4 месяца назад
This video was an absolute gem, thank you so much. I've been struggling with setting up local AI and the majority of videos I've watched have resulted in me having to try and learn concepts while also deciphering a very heavy accent from the narrator, which made it so much harder for me to focus. This was clear, to the point, and covered everything I wanted. Thank you!
@JG27Korny
@JG27Korny 4 месяца назад
Just use LM studio. You will get just that. Also recommendation of models and information if they can run on your machine. Also the models get downloaded authomatically from hugging face.
@mad_engineer3254
@mad_engineer3254 3 месяца назад
Just wanna say Huge Thanks to you! Your video inspired me to give another try on my way to local LLMs and I was literally blown away with how fast my RTX 2060 could actually generate with Llama3 and ollama. A year go I tried local Pygmalion and when I saw literally one word per 2 seconds I decided "'Nah, local AI is only for happy guys with 4090 on board". Once again, thank you, you made my life better!
@danielmpr
@danielmpr 3 месяца назад
Hello, Chuck! I tried this on my OLD, upgraded to it's max Dell 660s, which I have to date: Intel Core i7 3770, running at 3.40ghz, 16GB ram, Windows 11, and a 1TB SSD.... Followed your tutorial, and didn't expect it to work on my system! "I have NO GPU!" it runs SUPER SLOW, but works! installed llama3 model, gonna try some more!!! LOVE your videos! Greetings from Puerto Rico!!! 😁
@donnymontreano9235
@donnymontreano9235 25 дней назад
is it super slow? oh noo... is adding ram will make it faster?
@Johnsormani
@Johnsormani 19 дней назад
Nice project but in my opinion it’s totally useless to run ai on your own server. It’s being on 24/7 ,using tons of energy, and is not so often used. This is typically something that is better off in the cloud. If not for this reason, than it is for training the models and neural networks. Tesla wouldn’t be able to exist if they had gone this route
@kuthub1989
@kuthub1989 18 дней назад
Try to get NVIDIA Tesla K80 24GB Kepler gpu. It's super cheap in used market.
@IPLAYMTG628
@IPLAYMTG628 4 месяца назад
I am using Ollama on my 13 year Old MacBook Pro and it's running pretty fine. Thanks a lot. Keep the great work. Thanks for the videos!! :)!
@Grandwigg
@Grandwigg 4 месяца назад
That is about how old my desktop is. Maybe i have a chance after all.
@UmeshJoshi333
@UmeshJoshi333 4 месяца назад
Good idea ;)
@Shadow_Banned_Conservative
@Shadow_Banned_Conservative 4 месяца назад
I want to play with this as well. I wound up with a Best Buy open-box i5-12400, 32gb or ram, and an open-box Nvidia 4060 OC 8GB. So I'm in for about $600 all together. I wanted to start as cheap as I could and be power efficient at the same time, at least to start with. Hopefully I'll start playing with it in the next couple of weeks. One thing I'm curious about though. I wonder how secure these are. Are they really secure, or is it one of those "not too many of them today so nobody is bothering to hack them, yet" situations?
@kulligo3192
@kulligo3192 4 месяца назад
@@Shadow_Banned_Conservative selfhosted LLMs are completly local, there isnt really anything to hack
@ronilevarez901
@ronilevarez901 4 месяца назад
The magic is that the GPU is more powerful than the average 13yo GPU. In my 15yo pc nothing can run.
@guitarguy911
@guitarguy911 4 месяца назад
Ollama troubleshooting: if you can’t run Ollama on the first try, open a new terminal and type “Ollama serve”
@ezradevs
@ezradevs 4 месяца назад
On my Mac, I had to keep an ollama serve window open and in a new terminal window running the ollama commands would work.
@Jalan-Api
@Jalan-Api 4 месяца назад
@@ezradevs you do not have to do that to work...
@nuggetbugget9305
@nuggetbugget9305 4 месяца назад
@@Jalan-Api I had to use the ollama serve command on my computer for it to work on WSL, but the windows preveiw works without using the ollama serve command.
@itachi_shrestha
@itachi_shrestha 4 месяца назад
Try ollama run llama3
@Jalan-Api
@Jalan-Api 4 месяца назад
@@nuggetbugget9305 No no, I meant like you do not need the terminal open in background running "ollama serve" on Mac
@chinmaykapoor962
@chinmaykapoor962 4 месяца назад
Man!!! My boss showed me the last local AI video of yours, introducing me to your channel. Now I feel any video you’re making on similar topics I need to see them! Make more videos on this, exploring what all we can do, in workplaces. This is so interesting and cool! Thanks man!
@matrixploit
@matrixploit 3 месяца назад
What do you work as a?
@chinmaykapoor962
@chinmaykapoor962 3 месяца назад
@@matrixploit Data Scientist/ML engineer for a startup (Co-op)
@matrixploit
@matrixploit 3 месяца назад
@@chinmaykapoor962 which country bro?
@chinmaykapoor962
@chinmaykapoor962 3 месяца назад
@@matrixploit canada
@adityasingh017
@adityasingh017 4 месяца назад
Hey guys👋🏽 , I've installed this on my pc successfully. Running on my pc(host machine), But I'm accessing it on my another pc (connected in same network) but its not opening. Could you guys plz help me
@techlitindia
@techlitindia 4 месяца назад
Over the past year, I've incorporated all your tips into my daily routine, and they've definitely helped me feel more energetic and productive! For a while, I even felt unstoppable! But lately, I've been feeling down and sluggish again. It seems like forcing yourself to do all these things every day can feel like a chore, especially if they're not natural habits. It takes time and effort to build new routines, and it can be frustrating to miss a day. This made me realize that feeling good is more about your mind than your body. When you have a positive outlook, you naturally have more energy. The key is to find activities that make YOU happy. There's no one-size-fits-all solution! You might find great advice from others, but ultimately, you need to discover what works for you. I'm not trying to be negative, just to remind everyone that feeling good and bad is a normal part of life. You might have some fantastic days, but there will also be times when you feel down. Don't let that stop you from doing the important things, even if it's just for a few minutes each day. Consistency is key! Successful people might not always feel amazing, but they make time for what matters most in their lives.
@markverstappen1365
@markverstappen1365 4 месяца назад
I love these plain simple straight on explanation videos. A suggestion or addition to this would be: - how to add or restrict the knowledge base. For example: - corporate data, pdf's, tables, pictures, statistics etc and how to purely add this info as knowledge. - Ask the AI questions and so that it only searches the corporate data and doesn't get blurred with other data. - let the AI do analysis on the data and pull conclusions on it. This would be a perfect addition.
@tonymburu7804
@tonymburu7804 4 месяца назад
No one does it better, NC is awesome. Simple and very intuitive videos.
@jesuiscool7
@jesuiscool7 4 месяца назад
"- how to add or restrict the knowledge base." Well, he shows exactly that by showing you the system prompt he gives. You can kinda do whatever you want there, like banning words etc. Looking into Ollama, you can also train your model on specific data which can help for your your specific uses cases. There is a lot of documentation/videos on that topic on YT if you want. But that's more relevant of AI training than "easy and fast setup" which was the scope of this video.
@matthewarchibald5118
@matthewarchibald5118 4 месяца назад
check out his last local AI video and his mentions of "Private GPT"
@kiranwebros8714
@kiranwebros8714 4 месяца назад
Instead of chatting with models there should be agents with specific skills. why nobody creating something like that?
@randallrulo2109
@randallrulo2109 4 месяца назад
@@kiranwebros8714 this is what i thought modelfiles were supposed to be, but it doesnt really look like it...
@Bdantioch
@Bdantioch 4 месяца назад
Easy mode: 1. Microcenter's RTX 3090TI x2 (24gb VRAM x2) OR get the Tesla K80's (cheaper) . 2. MOBO that supports either x16 x 2 or x8 x 2. 3. Get at least 64gb system ram (GGUF models run on CPU/RAM/ GPU combined). 4. A 850 - 1,000 Watt power supply. Congrats. You have a computer that almost rivals a system with RTX A6000 (5,000$) card.
@sil778
@sil778 4 месяца назад
Thx Man..
@sisakamence
@sisakamence 4 месяца назад
I m building cheap home server for cloud gaming.. for 4 VM : Dell T7810 (200euro) 2x Xeon E5-2697v3 (50euro), ECC 64GB 2400Mhz in quad channel (70euro) Nvidia Tesla P100 16GB (160euro) and added Tesla M40 12G , second PSU 1000w . I hope Llama will use 2 different GPUs. Now the server will be for cloudgaming and AI, so cool :)
@randallrulo2109
@randallrulo2109 4 месяца назад
tesla k80... dude, your a lifesaver... i feel seriously dumb for not having found this a year ago...
@ToucheFarming
@ToucheFarming 4 месяца назад
@@randallrulo2109 something you need to know about the K80's is that it is not a normal PCIe cable needed, it uses a 8 PIN CPU plug. you can get an adapter to convert 2 PCIe 8 pins to 1 8 pin CPU connector
@VioFax
@VioFax 4 месяца назад
@@ToucheFarming Its also a Pita to get working on some workstations like Dell or HP without Rebar. I'd skip the Tesla's TBH. Ive been fooling with 2 P40s for 2 months. Really not worth the trouble they caused me. Its a good option if you have no money but plenty of time on your hands and really want to be a masochist trying to keep them cool enough ect... I ended up getting the 3090's and am much happier. Yeah I lose ECC but whoopty doo, i rather just not be waiting on replies from the model... and to run without compression that's already messing with accuracy. 2x 3090's just end up making more sense for the time/money ratio. I ended up getting the Teslas to work on a Dell 5820 and you have to change the Vbios mode to the GPU with nvflash to be in graphics mode instead of compute. You lose a lot of performance doing it this way though. Cuts it in half. But it will work. Was a week of research to figure that out. I gave up on the Teslas and the dell after finally pulling this off and having to get a windows machine to change the vbois anyway... and just got 2 3090's in a cheap gaming board. Works so so much better. Looking back i wish i had not wasted my time. I hope i save someone else some time by sharing my experience with the Tesla cards.
@p4l4d1n7
@p4l4d1n7 3 дня назад
came back to get this running on my school laptop. Chuck you rock.
@lorenzoplaatjies8971
@lorenzoplaatjies8971 3 месяца назад
Man really skipped the part where it works on other computers too
@carmody90
@carmody90 2 месяца назад
It's on the network so use the same url that you'd use on the machine it's running on
@jonjayb
@jonjayb 4 месяца назад
Maaaaaan i did this last week on my own, i just had to wait for the master to come along and do it better haha
@jonathonvargas8724
@jonathonvargas8724 4 месяца назад
That’s awesome bro!
@eropoke
@eropoke 4 месяца назад
Me too!
@murlock666
@murlock666 4 месяца назад
if you did this alone. be proud of that. don't lessen your achievement. there's enough people out there that will do it as it is. don't help them by doing to yourself.
@jonjayb
@jonjayb 4 месяца назад
It all turned out okay. This video helped with Stable Diffusion. Also had some jankyness with WSL networking to work around.
@RashadPrince
@RashadPrince 4 месяца назад
Same 😁
@Hack_O_Lantern
@Hack_O_Lantern 4 месяца назад
Another fantastic video! And your on screen graphics are some of the best on RU-vid.
@KipIngram
@KipIngram 4 месяца назад
Chuck, THIS has got to be the most significant video I've seen in ages. Thank you for sharing this information. I LOVE the idea that we can now have this power under our own control. I will definitely have to do this when I can gather up enough money to build my own Terry (if I'm going to do it I want to do it right).
@emirmustafic786
@emirmustafic786 4 месяца назад
With the example where you asked two models a question: you can clearly see the "2/2" in the bottom left part of the answer, so you have two answers available (one from each model, i guess). :)
@Marustic
@Marustic 4 месяца назад
I only watched like 4 minutes of your video and I wanted to try asap. Not only did I get it up and running in like an hour but I also configured it to be accessed anywhere in the world I want. Thank you for sparking this fun little piece of technology I can utilize in my own home. This is actually much more useful than I thought because I can have my mother utilize this in her everyday life since I’m all grown up now and out of the house.
@maxhaberstroh2504
@maxhaberstroh2504 4 месяца назад
can you hint me in a direction for making it accessible from other pcs in a local network?
@Satan-Claus
@Satan-Claus 4 месяца назад
@@maxhaberstroh2504 Tailscale is probably your easiest solution
@HansrajTechTips
@HansrajTechTips 4 месяца назад
Hi, can you please tell me how you're accessing it on other networks
@Marustic
@Marustic 4 месяца назад
@@HansrajTechTips I’m hosting it on a site I can access
@DmitryAvramenko
@DmitryAvramenko 4 месяца назад
Can you share configuration of your PC?
@DanielNeedles
@DanielNeedles 4 месяца назад
One caveat. Using Windows WSL access from the outside is not possible without a lot of hoop-jumping. Though the "--network=host" will sync up Docker on Ubuntu in WSL2, there is a whole lot more hoop-jumping required to get WSL2 to talk to your local network as there is no "bridging" option like there is with VMware or Virtualbox.
@ichirokun6275
@ichirokun6275 3 месяца назад
Thanks man I noticed this Trying to use Ubuntu for this was quite tasking as I did not know how to install the cuda drivers properly 😅. Ended up breaking the Grub boot loader of the Os😂😂
@Outcast100
@Outcast100 3 месяца назад
Thats why Ive been having all this trouble😫 omg...any tips
@BrookStockton
@BrookStockton 3 месяца назад
Hi Dan!
@DanielNeedles
@DanielNeedles 3 месяца назад
@@BrookStockton lol. Small world. I am up in Port Townsend these days. I believe you are just south in the same area as Dave McKinnon.
@VincentWillcox
@VincentWillcox 4 месяца назад
Thank you for making it simple! I've followed several tutorials for getting these running locally and they all have their own plus points. Your's with its Stable Diffusion addition is a nice added touch!
@jeredblumenfeld8556
@jeredblumenfeld8556 4 месяца назад
Which other videos do you rec?
@iPadChannel
@iPadChannel 3 месяца назад
This tutorial is insane! Many thanks! The steps are so easy to follow and implement. I just finished the tutorial, and currently enjoying the local AI in my laptop.
@viztrak_2606
@viztrak_2606 4 месяца назад
That's just insane ! I'm simply following your steps to test it on my gaming PC, there are no words for this. I'm eager to start building a case for it. NEW PROJECT UNLOCKED
@DiannaGold
@DiannaGold 4 месяца назад
I love this ... I was wondering what PC project I wanted to do next. NOW I KNOW!
@iant720
@iant720 4 месяца назад
This will greatly help my daughter in the future as we plan to homeschool especially since private GPT can be loaded with local sources like PDF's of books. Very hyped for this content!
@SpragginsDesigns
@SpragginsDesigns 4 месяца назад
Dude, your videos are so good. I never miss a video from you. Im working on a project analyzing sports data with local AI for work, so its been very interesting going outside the realm of the simple UIs from OpenAI/Anthropic etc.
@BWane-wd7zz
@BWane-wd7zz 4 месяца назад
Hmm... May be a huge vegas hit
@RyWilliamz
@RyWilliamz 3 месяца назад
ugh im trying to install docker so many times in wsl unbuntu and its giving me such a headache. heck even installing unbuntu and ollama gave me ttrouble. I had to use AI to walk me through how to install an AI which is so weird to think about.
@mr.gk5
@mr.gk5 День назад
it took me solid 3 days on first try
@This_Guy_is_not_real
@This_Guy_is_not_real Месяц назад
I followed your video slightly off the beaten path but it works and im now running all my AI locally. Thanks
@peacemaker9807
@peacemaker9807 4 месяца назад
I was literally thinking of doing exactly this recently, great timing. Thanks.!
@dariushoniball3825
@dariushoniball3825 4 месяца назад
"We can hold hands and sing," 😂😂😂 That was the most hilarious thing I've heard all week Thank you for keeping it authentic
@alpine7840
@alpine7840 4 месяца назад
This is sweet! Just did this on my spare system and it was faster then I thought it would be. I9-10900 with 64gb and a SFF Quadro RTX A2000 12gb. Thank you Chuck
@Brax1982
@Brax1982 4 месяца назад
What was faster? These cheap models he is showing? Or got anything better to run?
@CafeComClicks
@CafeComClicks 4 месяца назад
lol, wish i had a spare system like that! that´s a beast.
@TradeWizardPremiumGuide-qktz
@TradeWizardPremiumGuide-qktz 3 дня назад
After restarting, it worked! Great software!
@x12_79
@x12_79 2 месяца назад
This is cool but the majority of people don't have a computer nearly good enough for this to be practical. I tried it and the model was both slow and incredibly dumb due to me being limited to llama3 8B. That being said you should still try it out, pretty cool.
@kristoftorres
@kristoftorres 4 месяца назад
Hi @NetworkChuck At 13:25 you explain that if you want someone else use this server on your PC or Laptop, they can access it from anywhere, as long they have your IP Address. How exactly do you do that?
@Kkkkkkkk-bf5ne
@Kkkkkkkk-bf5ne 3 месяца назад
There's this little thing called port forwarding :)
@AdrianC2006Uk
@AdrianC2006Uk 4 месяца назад
Great video. I've just gone through all of this myself. Looks like they have also added a few more features (LiteLLM, Whisper). Local AI is where it's at!!! Privacy first!!! Hoping they add MemGPT and CrewAI/AutoGen to it.
@MichelBertrand
@MichelBertrand 4 месяца назад
I've had it running - slowly - on a RaspberryPi 5. Love the imploementation on WSL in Windows 11, **BUT** we definitely need a complete guide for those of us who are running an AMD GPU in Windows. Not everyone had $10K lying around to build a server with TWO $3200CAD Nvidia cards, Chuck...
@antonyaustin1388
@antonyaustin1388 4 месяца назад
the updated version of ollama checks amd graphics
@MichelBertrand
@MichelBertrand 4 месяца назад
@@antonyaustin1388 I found that on the ollama website - unfortunately it looks like the cutoff is 6800XT, right above my 6750XT. Oh well.
@BrandonHurt
@BrandonHurt 4 месяца назад
I have it running via docker using an old radeon 7 and a ryzen 9 with 12 cores 24 threads and 32gb ram and it runs decently fast on gentoo, and downloaded the auto1111 the way he showed how and its not any slower than his shows.
@MichelBertrand
@MichelBertrand 4 месяца назад
@@BrandonHurt does it actually use your GPU? If so I'd be interested to see what your docker config is exactly. It runs ok on just my CPU (13700k), but would be faster using the GPU from what I can tell.
@krzysmis2366
@krzysmis2366 Месяц назад
its not 10k I believe ... It would be close to 7-8k though ?
@comptvlee
@comptvlee 3 месяца назад
everything about this video is sick. excellent quality, excellent overview and use cases, and excellent server build. 🎉🎉🎉
@jburnash
@jburnash 3 месяца назад
This was an ABSOLUTELY fabulous tutorial on AI. It was (as others have commented) *extremely* accessible to somebody starting out with self hosted AI, but with a background in Linux and system administration. Well done sir! I will use this to setup my own install on a currently underutilized but reasonably powerful server in my homelab.
@Adopted_Gaming
@Adopted_Gaming 4 месяца назад
Would be great if you could make a video on setting up a local AI language model to be trained on documents that get permanently saved in its memory. Seems like there is potential for that using webAI? I want to use this program to be able to reference a part number and have it give me information on the product or manual for that specific part number in my company.
@hillishudson32
@hillishudson32 4 месяца назад
Check out RAG ( retrieval augmented generation). Essentially use a model to store docs into a vector database which is queried by the AI when sending prompts to use in its context window. Lots of videos on RAG out there
@whok2
@whok2 2 месяца назад
any update about this topic???
@vittoriodangiolino334
@vittoriodangiolino334 29 дней назад
ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-nPpgh_KaNng.htmlsi=81MvlhId2dDeYEd4​@@whok2
@briantcosta
@briantcosta 4 месяца назад
This is some next level content, man!! All love from Brazil
@DaengRosanda
@DaengRosanda 2 месяца назад
I was experimenting this on my local from Feb 2024. And it was so powerful. I've often used this for calculating some data, convert it into models, and doing some cool stuff like: "Hey, what is gross margin for my local store branch in Jan 2024?" Then the bot give awesome answer with correct data..
@TrueMystique
@TrueMystique 4 месяца назад
Im at the part where im trying to download pyenv. However the prereqs wont download. It seems like they are missing. How do yoy go about fixing that?
@RyWilliamz
@RyWilliamz 2 месяца назад
Anyone else stuck on the Docker Container part? heres what I get E: Malformed entry 1 in list file /etc/apt/sources.list.d/docker.list ([option] no value) E: The list of sources could not be read. E: Malformed entry 1 in list file /etc/apt/sources.list.d/docker.list ([option] no value) E: The list of sources could not be read. curl: (22) The requested URL returned error: 404 -bash: /docker.asc: No such file or directory chmod: cannot access '/etc/apt/keyrings/docker.asc': No such file or directory
@gordonpollock6079
@gordonpollock6079 Месяц назад
yup
@grambam
@grambam 15 дней назад
same yep :(
@RahnAbbott
@RahnAbbott 3 месяца назад
One of the first uses I thought of for summarizing files was to load a linux log file and have it summarize it for me. AMAZING!
@jimarasthegod
@jimarasthegod 4 месяца назад
Cheaper alternatives that can be combined with other nvidia GPUs, solely for running AI, are used Nvidia Tesla P40, (24GBof VRAM) currently about ~200 bucks each on the used market. Otherwise go AMD 6800 or newer/better, (16GB+ of VRAM) which are also supported out of the box.
@Brax1982
@Brax1982 4 месяца назад
Are you kidding? These go for 7k new. I can see that there are a lot of these offers for used ones, but did you ever confirm that it is legit? Looks like very obvious fraud. Or are you trying to run a scam, yourself?
@VioFax
@VioFax 4 месяца назад
Those p40's are a pain in the butt though...i'd stay away from them unless you can't do something better.
@VioFax
@VioFax 4 месяца назад
@@Brax1982 I have 2 they work (bought used for $175 each) but they aren't that great and were a pita to get working and keep cool enough... Get a 3090 instead.
@Brax1982
@Brax1982 4 месяца назад
@@VioFax Thanks, I was not considering it, because how could they be that much cheaper than list price? Are you sure you got the real ones? I would seriously doubt that...even if "something" works. I guess this is one of those things where you have to be a master engineer to get it to work and that's why it's so cheap...
@archuser420
@archuser420 2 месяца назад
@@jimarasthegod Nahhh the P40s are horrible at FP16, because the GP104 lacks the capability of fast FP16 computation. Well at least it supports DP4a. I would say use something at least from the Turing Generation. At the AMD side I only tested GCN 5.1 Radeon Pro VII GPU, it was ok for basic PyTorch operations
@duynguyenngoc2174
@duynguyenngoc2174 14 дней назад
Can you share me information for pen and table draw screen?
@themagiccode-fi
@themagiccode-fi 14 дней назад
Me too
@crypto_que
@crypto_que 2 месяца назад
This video should have millions of views. The time value of this video compared to the production value it brings is totally asymmetric. After a week or so I finally figured out that having more than one instance of Linux (WSL & WSL2) running at the same time is really bad for this install. Also you can only have Ollama installed in one place on your machine or Docker will NOT play nice. Finally got it running after just a few minutes of uninstalling and re-configuring and voila! OpenWeb UI has the connection, & all the models can be loaded & used. I am a Wizard.
@AjvarRelish
@AjvarRelish 2 месяца назад
This is truly amazing that this type of content is available for free!
@BarrelOfLube-cl2qq
@BarrelOfLube-cl2qq 4 месяца назад
PS: please support the open source project you use, the devs put in a lot of effort in creating and maintaining them for free, making them accessible for everyone. No pressure tho, enjoy free AI for everyone
@Napert
@Napert 4 месяца назад
Good luck running anything larger than 8B parameters on just the cpu (and even that might be too big for most people) and expecting more than 2 tokens per second A relatively recent 8gb gpu is highly recommended to run up to 8B models at over 50 tokens per second
@touma-san91
@touma-san91 4 месяца назад
And not just that.. You need to get to something like 100-400B models to be comparable to the bigger AI services.. Those small LLM models are good for things like roleplay and such but when it comes to factual information and productive tasks, they tend to be quite poor.
@CappellaKeys
@CappellaKeys 4 месяца назад
@@touma-san91 First time i've seen someone mention the comparison to the larger ones. Never knew nor though of that. I might be doing all this work for nothing lol
@aaroncarroll4158
@aaroncarroll4158 4 месяца назад
I run llama3-70B on CPU only I7-13700K and 64gb ddr5. Is it fast, fast? No, but it runs fine. I can also run it on my 2021 M1 Mac Pro with 64gb of ram. Runs fine there as well.
@touma-san91
@touma-san91 4 месяца назад
@@CappellaKeys If you have lot of RAM (Minimum is something like 64 gigs for 70B-models) and good CPU and good GPU with decent chunk of VRAM, you can run these things using GGUF but it will probably take a few minutes to get a response out of the larger models. And you really should use GGUF because that way you can split the load on both the CPU and GPU so it runs tiny bit faster than fully running on CPU.
@touma-san91
@touma-san91 4 месяца назад
@@aaroncarroll4158 I'm curious, how fast it is for you? Like how long it takes for it to generate a whole message
@DanielsHugo
@DanielsHugo 4 месяца назад
As always, a great analysis. Newcomers often wonder if it's too late to navigate the financial market, but the market is always unpredictable. Trading has more advantages than simply holding, so it's important to learn before diving in. Active trades are necessary to ride the market's waves. Thanks to Linda Sue Baier insights, daily trade signals, and my dedication to learning, I've been increasing my daily earnings. Keep it up!
@Vajsbsbssjssmsnsk
@Vajsbsbssjssmsnsk 4 месяца назад
It's unexpected to come across her name here. She understands every beginner’s intention and fix you to a trading course that matches your capacity, she knows her stuff! Her advice has been invaluable to my trading journey. Definitely worth giving a shot!
@DeboraGruba
@DeboraGruba 4 месяца назад
Investing in alternative income streams that are independent of the government should be the top priority for everyone right now. especially given the global economic crisis we are currently experiencing. Stocks, gold, silver, and virtual currencies are still attractive investments at the moment.
@SixtoClarke
@SixtoClarke 4 месяца назад
Such market uncertainties are the reason I don’t base my market judgements and decisions on rumors' and here-says, got the best of me 2020 and had me holding worthless position in the market, I had to revamp my entire portfolio through the aid of an advisor, before I started seeing any significant results happens in my portfolio, been using the same advisor and I’ve scaled up 950k within a year, whether a bullish or down market, both makes for good profit, it all depends on where you’re looking.
@MichelleDavidson-x8e
@MichelleDavidson-x8e 4 месяца назад
I’ve been down a ton, I’m only holding on so I can recoup, I really need help, who is this investment-adviser that guides you?
@mchisolm0
@mchisolm0 Месяц назад
Thanks for this! I teach computer science at a rural high school and have been thinking about how I could help my students get experience with LLMs while also meeting the expectation of public schools to protect students from harm and protect their privacy. This definitely helps me learn. 😁
@Lampe2020
@Lampe2020 4 месяца назад
3:15 Oh no, a curl piped into a shell… Aargh!
@_modiX
@_modiX 4 месяца назад
Unjustified panic mode. If you install anything from the internet there is always risk to it no matter the install method. The beauty of an installer script is just you just can read it and make sure it's not doing anything nasty.
@Lampe2020
@Lampe2020 4 месяца назад
@@_modiX The problem with curl|sh is that a failed download will still get executed. So if the script e.g. had some "rm -rf /tmp/someapp" and the download happened to fail after "rm -rf /", then you can't do anything about it. Or a failed download may cause the partially downloaded script to break and leave you with a broken configuration. So rather just download the script, quickly check it if it didn't fail (maybe even check the download hash) and _then_ execute it in a seperate step.
@BruceNJeffAreMyFlies
@BruceNJeffAreMyFlies 4 месяца назад
Could you describe how to do it your recommended way? I.E. copy the prompt, but remove " | sh" from the end, and - after SUCCESSFUL download - enter "sh ollama run" ?
@nikolai00115
@nikolai00115 4 месяца назад
@@BruceNJeffAreMyFlies Redirect curl into a file, check the file, and then run it.
@BruceNJeffAreMyFlies
@BruceNJeffAreMyFlies 4 месяца назад
@@nikolai00115 Eh, sorry bro. If someone knows how to 'redirect curl into a file, and then run it', they probably already know the answer to my question.
@kalsiscorpion
@kalsiscorpion 4 месяца назад
Can we run all this in proxmox
@mopeygoff
@mopeygoff Месяц назад
I have my instance set up in a proxmox LXC. You need to pass the GPU(s) through first which is a tiny bit tricky but there's plenty of instructions to be found online (..if you're using proxmox 7+ make sure you use cgroup2's not cgroups). Once you do that, it's a basically the same instructions. I don't care for docker so I actually set up a conda environment. Really just the same thing, mostly.
@fchris82
@fchris82 4 месяца назад
How much energy is eaten by Terry per month? Do you have any data about this? Real question, I am interested in it.
@abitw210
@abitw210 4 месяца назад
totally not worth it over regular subscriptions from OpenAi
@fchris82
@fchris82 4 месяца назад
@@abitw210 I think you haven't watched the video, or you just didn't understand what it is for. He could give a "self prompted" AI for his daughter with limitations. Can you do the same in the OpenAI? And many companies won't share private, sensitive business documents with a third party AI. I can imagine, it is not for you, but it doesn't mean it is not worth it for anybody.
@BaldurNorddahl
@BaldurNorddahl 3 месяца назад
he should really suspend Terry when it is not being used. Unless used for some automated tasks, a private server like that is going to be sitting idle most of the time. However it would not use much if it only was on for responding to a few prompts daily.
@fchris82
@fchris82 3 месяца назад
@@BaldurNorddahl Yes, that is why I asked it, what are the real experiences in a "general" use case.
@farazalimcp1
@farazalimcp1 3 месяца назад
Thanks @NetworkChuck for amazing video. I tried to use my existing PC with an 8GB Nvidia 4060 Ti and a Core i9 9th Gen for my local AI server. While Ollama models worked fine, Stability Diffusion didn't perform as expected and getting "Cuda out of memory..." To address this, I upgraded my setup to: Ryzen 9 7950X3D MSI MAG B650 Tomahawk 128GB Corsair RAM NZXT 1000 PSU NZXT Elite 360 NZXT H9 Elite case 2 x 1TB M.2 Samsung 990 Pro (one for Pop!_OS and one for Windows 11) Nvidia Zotac 4070 Ti Super GPU This new configuration has significantly improved performance and stability for all my AI tasks. Highly recommend the upgrade for anyone facing similar issues!
@egemen9200
@egemen9200 2 месяца назад
Which model u using
@farazalimcp1
@farazalimcp1 Месяц назад
I keep 3 - mistral, llama3 and llava - but recently I saw new version released - will download those as well
@ashwinsarith2152
@ashwinsarith2152 3 месяца назад
How did you get terrys network address ? so that you can access it from another PC
@gravy7861_
@gravy7861_ 4 месяца назад
Terry seems nice
@tdrg_
@tdrg_ 4 месяца назад
He has a great personality
@FATEH-se9kr
@FATEH-se9kr 4 месяца назад
I met him in my dream
@birdboygee9660
@birdboygee9660 4 месяца назад
Have you met Deborah? She is nice to
@markoyos5841
@markoyos5841 4 месяца назад
Ohoho this is fire! 🔥
@truemotivemedia
@truemotivemedia Месяц назад
Your content is so accessible, thanks for taking the time to make it so.
@luminographix
@luminographix 4 месяца назад
Wish I had teachers like @NetworkChuck growing up.. you are simply awesome!
@luminographix
@luminographix 4 месяца назад
And your 20 min video takes me 48 hours to complete step by step, wonder how long it takes for you to produce, cheers thou!
@КравчукІгор-т2э
@КравчукІгор-т2э 20 дней назад
Thank you for the very well prepared material. Classy, localized and interesting. From the bottom of my heart I wish you success and prosperity!
@BraydenCooper-po3ck
@BraydenCooper-po3ck 19 дней назад
0:30 so I can run an AI server on my iPad?
@2024manohya
@2024manohya 3 месяца назад
okay first of all you're so charismatic and you are excellent at what you're doing so thank you very much for this amazing tutorial
@TrejonEdmonds
@TrejonEdmonds 2 месяца назад
Cool idea! While Home Assistant doesn't currently offer built-in voice-to-text, there are add-ons like Whisper and local pipelines that can be integrated for voice control. Text-to-speech options like Google Translate are also available. This could create a more Alexa-like experience for home automation. However, it's important to remember that these integrations might require some technical setup and may not be as seamless as commercial voice assistants
@MuhammadFarhan-tg3pd
@MuhammadFarhan-tg3pd 3 месяца назад
Always the best content from Chuk! Thanks for the great tips on Local AI setup.
@jstoutatl2009
@jstoutatl2009 2 месяца назад
Does anyone else find it odd that when you Google search what time is the presidential debate it doesn't give you specifics about it .. For that matter you can't even search the date it's going to be on “today”
@skperera-g8l
@skperera-g8l 2 месяца назад
The RAG example given is for a single document. Is there a way to bulk-upload many documents at once from the repo (for chunking and embedding)?
@TJ-hs1qm
@TJ-hs1qm Месяц назад
langchain or haystack
@angelique2934
@angelique2934 9 дней назад
I could imagine it would also be helpful, to give your daughters the possibility to use the AI models for language training. I found it very useful to have conversations with an AI to improve my Spanish. For example, you can ask the Model to correct you and give you suggestions (with synonyms) to sound more like a natural speaker and so on.
@MaxVoltageMiningCrypto
@MaxVoltageMiningCrypto 2 месяца назад
oooooooooooo... The sound of that keyboard is fire. Had to stop the video to see which keyboard it was. Thanks for the content. Was looking for an intro to local AI and ollama. Thank you!! EDIT: I managed to convince work to allow me to purchase a Keychron V6 keyboard with browns. I do a lot of typing at work so it was life changing and actually made me more productive so it was a win win. Ok, back to the video...
@Weriwd
@Weriwd 10 дней назад
I followed every step and everything seems to work fine, but for some reason I'm not able to generate an image from the 'Generate Image" icon using stable defusion. It keeps giving me an error message that says 'Server connection faild', but I'm able to access it at 127.0.0.1:7860 where it is hosted...any ideas whats wrong?
@AutisticThinker
@AutisticThinker 4 месяца назад
11:24 - Do you see "< 2 / 2 >"? That means you are seeing the second of two responses generated. The greater than and less than symbols are navigation controls that determine which response is displayed. ChatGPT has similar after you select one of two generated outputs for the same prompt.
@crispytek6783
@crispytek6783 3 месяца назад
That keyboard is MELTING
@mrbabyhugh
@mrbabyhugh 4 месяца назад
4:25 how the video has FLOWED thus far and your SMOOTH TRANSITION in to your sponsor is EXCELLENT. I haven't watched a lot of your videos, but the few I have watched have been fantastic work!
@meh11235
@meh11235 25 дней назад
beautiful design! private hosting ai is so legit
@slimerone
@slimerone 4 месяца назад
this was intense and dense but made so much sense. thank you kind sir
@jonathanrobitaille3756
@jonathanrobitaille3756 6 часов назад
Can't find the model files in my computer... I followed the wsl process, everything works fine, I need to find the location/directory where it was pulled ?
@r-news-ny
@r-news-ny 4 месяца назад
Subject: Inquiry Regarding AI Local Installation and Python Scripting: I'm new to AI projects and Linux, seeking guidance on installing AI Local without Docker, managing Python scripts for module creation/installation, and creating a personal database for AI Local. Appreciate any advice.
@Wynner3
@Wynner3 3 месяца назад
You make it look so easy to set up. I spent hours just trying to find causes of errors and how to fix them. I re-installed Docker and Ubuntu several times without luck. Finally re-installed everything and signed up for Open WebUI again to finally see the AI models appear. I suppose it was for the best since I learned so much along the way. lol
@Yander_van_der_wurff
@Yander_van_der_wurff 2 месяца назад
hiii, did you experience a GPG error where the key was not available, after the first INSTALL DOCKER command? I'm very stuck and can't figure out wat is wrong
@larscw74
@larscw74 2 дня назад
I just found your channel and enjoyed the video, very informational, I would like to see options about personal assistants, where you connect systems like a 3D scanner and different CNC type devices like 3D printer, to basically create a Jarvis like system.
@siddharth1712
@siddharth1712 4 месяца назад
15:18 hey there how to deploy this complete web UI on web
@ryn022
@ryn022 2 месяца назад
As a dad, this hit the money! Thabks for showing the setup for your girls, will be using the same model for my kids!
@RunnerProductions
@RunnerProductions Месяц назад
Would you recommend at this point (8/22/2024) is 4090 still the best to invest in now or should there be a special NPU type GPU coming out? Or are they going to be so similar? OR will NPUs be a new type processor you can just add to the computer. So you can put a GPU, CPU, and maybe a future NPU in this computer box?
@gabrieljudoquinha
@gabrieljudoquinha 17 часов назад
Finally found a working solution, thanks for the video!
@BigGrims2648
@BigGrims2648 3 месяца назад
Kudos to the amazing tutorial! I have attempted to follow many others and yours was by far the most in depth and worked perfectly the first try! Thank you!
@oimwiolevrai
@oimwiolevrai 2 месяца назад
Hey, I am able to use openweb ui from MY pc, but from other devices, no! any help?
@kannansingaravelu
@kannansingaravelu 4 месяца назад
models are not listed on the webui - how do we resolve it
@BirdsPawsandClaws
@BirdsPawsandClaws 22 дня назад
I watch a lot of your content. I love this video tutorial very much. Now I can start to use AI locally. Great video!
@Steve-i5295
@Steve-i5295 48 минут назад
I looked for your mentioned guide but it looks like I just wandered into some chat board where nobody answers questions. May I suggest if you mention a link in a RU-vid video, make it easy to find on your website somehow. There is no way to delete a profile created either. I imagine you might find those suggestions helpful.
@hatterdmad8504
@hatterdmad8504 3 месяца назад
NetworkChuck thanks for your awesome videos. I bought my own version of Terry, his weaker little brother Jerry. I plan on building a server closer to Terry hopefully next year. I put my local AI on it by following your instructions. Thank you, thank you. I need help. How do I access my local AI remotely?
@mr.gk5
@mr.gk5 День назад
Another idea would be to SSH to your remote server to use and take advantage of the more computational power of Terry. Is that even possible? I'd love to watch your guide on that. I'm running a cheap laptop at work which certainly not powerful enough to run local AI but would like to SSH to my PC at home but can't seem to make it work because both are using WSL?
@kirankumar3114
@kirankumar3114 Месяц назад
Hi chuck, When I try to install docker using the apt commands, I am getting the below error: Package docker-ce is not available, but is referred to by another package. This may mean that the package is missing, has been obsoleted, or is only available from another source. Help me in resolving this error.
@RAMATW2023
@RAMATW2023 4 месяца назад
This is great, saw your video a while back and it was only command line, decided to skip doing it, now this one shows up on my "For You", love it! Thank you!
@krzysztofwaclawski9002
@krzysztofwaclawski9002 Месяц назад
That worked beautifully on remote digital ocean droplet! Even though llama2 did not meet install requirements - tiny llama model did. Great straitghforward introduction to the topic - thanks a bunch mate!
Далее
Run your own AI (but private)
22:13
Просмотров 1,5 млн
I Built a CoPilot+ AI PC (without Windows)
12:50
Просмотров 373 тыс.
Wait for winner 🏆 😂 #shorts
00:42
Просмотров 3,9 млн
O’zim bilib ketvotudima😅
01:00
Просмотров 919 тыс.
Why Are Open Source Alternatives So Bad?
13:06
Просмотров 639 тыс.
You've been using AI Wrong
30:58
Просмотров 504 тыс.
Is Computer Science still worth it?
20:08
Просмотров 201 тыс.
Your Old PC is Your New Server
11:07
Просмотров 6 млн
Hide your files like a hacker (5 Ways)
19:17
Просмотров 112 тыс.
How To Access the DARK WEB in 2024 (3 Levels)
15:20
Просмотров 4,9 млн
3 Levels of WiFi Hacking
22:12
Просмотров 2 млн
How I Made AI Assistants Do My Work For Me: CrewAI
19:21
Wait for winner 🏆 😂 #shorts
00:42
Просмотров 3,9 млн