Тёмный

I used the BEST Open Source LLM to build a GPT WebApp (Falcon-40B Instruct) 

Nicholas Renotte
Подписаться 280 тыс.
Просмотров 100 тыс.
50% 1

Опубликовано:

 

25 сен 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 122   
@NicholasRenotte
@NicholasRenotte Год назад
Thanks a million to NordPass Business for helping me end my Shin Ramen speed run…and for sponsoring this video! Whatcha waiting for? Grab a 3-month free trial here nordpass.com/nicholasnord with code nicholasnord!
@kakamoora7874
@kakamoora7874 Год назад
Bro how we can make webscrapping in ai please make one video… or give some tips please
@careyatou
@careyatou Год назад
Yes. A fine tuning video would be amazing.
@officialdiadonacs
@officialdiadonacs Год назад
Bump
@sujith_25
@sujith_25 Год назад
Yeah , Please do it sir.
@muhammadumaranzar3431
@muhammadumaranzar3431 10 месяцев назад
Bump
@AIEntusiast_
@AIEntusiast_ 7 месяцев назад
dont think we get it from this guy, all his videos are just superficial this is possible but never show detailed how
@zd676
@zd676 Год назад
It’s one thing to build something cool with LLM, it’s completely different to bring it to production.
@Someone17122
@Someone17122 Год назад
I started following quite a long time, I could see an incredible transition in video, quality and content. I will be always waiting to explore more new tech from you.
@NicholasRenotte
@NicholasRenotte Год назад
Thank you so much, I've been trying to improve them dramatically as of late!
@ikurious
@ikurious Год назад
did you run that on your local machine?
@NicholasRenotte
@NicholasRenotte Год назад
Nope, RunPod, couldn't get it to run locally.
@jannik3475
@jannik3475 Год назад
Nice! I am currently looking for a way to make a document Q&A Chatbot with Falcon. Also a Video on fine tuning would be helpful! Thanks Nicholas!
@NicholasRenotte
@NicholasRenotte Год назад
Working on it!
@NicholasRenotte
@NicholasRenotte Год назад
Also, thanks a mil for checking out the vid :)
@shawn.builds
@shawn.builds Год назад
I've seen a bunch of your videos and this one has got to be the best. Such a complicated topic turned engaging + informative is no joke. Thanks Nicholas!
@gianlucafiorini
@gianlucafiorini Год назад
amazing content!!!! i was doing almost the same with falcon 7b and langchain last week, happy to see a better explanation on what i was doing heheh!
@NicholasRenotte
@NicholasRenotte Год назад
hahahah, I've been experimenting like crazy with it!! Whatcha building?
@gianlucafiorini
@gianlucafiorini Год назад
@@NicholasRenotte trying to create a Private expert on specific subject but i am seeing i will need to through fine tunninv to do what i want, i love the way you explain this tech, to me as a begginer is hard to realy understand what i am doing, aways watching your content to realy get an understanxing on what i am realy doing
@noobking5056
@noobking5056 Год назад
please make a video on the difference between langchain fine-tuning and normal fine tuning!
@guimaraesalysson
@guimaraesalysson Год назад
Why install torchvision and torchaudio libraries with IMAGE and AUDIO datasets for one text-to-text app ?
@fulltimefrontend
@fulltimefrontend Год назад
How may RTX4090 required for Falcon 40B ? 160gb vRAM via RTX 4090 means 6-7cards , which is roughly $10k and still cheaper than 2 A100's. Using RTX for Stable diffusion already.
@NicholasRenotte
@NicholasRenotte Год назад
Huh, interesting. I couldn't easily find many cloud providers that were using 4090s, there were a bunch offering H100 and A100s though!
@kotykd6212
@kotykd6212 Год назад
​@@NicholasRenottedata centers can't rent them out, but places like runpod and vast ai can since there are hosts instead of only datacenters
@elchippe
@elchippe Год назад
USe the gmml model, can run in a CPU with a lot of memory and can share memory with a GPU running cuda.
@essentials9030
@essentials9030 9 месяцев назад
thanks for this video, but please make a video in which implementation should be there for object detection by using LLM or multimodal, please
@chongdashu
@chongdashu 8 месяцев назад
Have you been able to use VSCode connected to a remote jupyter instance that still allows Pylance to work? e.g., so you can make use of VSCode's nifty features like cmd/ctrl clicking to see a method definition, etc.
@farrukhzamir
@farrukhzamir Год назад
Can we not use model.save pretrained and save the model in the form of shards so that when device_map=auto is used accelerate would kick in and allow to offload the shards to disk and memory. I think that's why you were getting OOM errors.
@Arvolve
@Arvolve Год назад
Wonderful informative and concise content! A video on Fine tunning potentially on a cloud service would be awesome!
@pkmnjourney
@pkmnjourney Год назад
I cannot wait for a finetuning video! Looking forward to it.
@fuba44
@fuba44 Год назад
If truncating the weights to maybe 8 bit or less, can it fit on a high end consumer grade GPU?
@sindoc42
@sindoc42 Год назад
What's the cheapest hardware we can find on the market to be able to run this? And I prefer it be local. Can someone help me order the required hardware for this?
@Movierecap998
@Movierecap998 Год назад
I have purchased amd 6800xt and i want to learn about AI and ML is there any chance i will be able to learn ?
@gluttony4778
@gluttony4778 Год назад
Any idea how one can deploy a web app like this or one using streamlit online? maybe hosted on a domain/ just integrated a chatbot for a site.
@wasgeht2409
@wasgeht2409 Год назад
Nicholas, thank you :) I have two questions. First, is it possible to use this on my personal laptop? I don't have a robust GPU; instead, I have a Mac M1 with 8 GB RAM. Second, could I train the pre-trained 40B model for a specific task in German? I'd like to use it for classifying sentences into labels. Is that feasible?
@AkolytosCreations
@AkolytosCreations Год назад
No, this requires large GPU’s to run. It doesn’t even have the capability to run on CPU but extremely slowly.
@NicholasRenotte
@NicholasRenotte Год назад
Realistically, no, you'll need a beast GPU to run it. Didn't work on my mac. You would probably need to fine tune it for that but you would need even more GPU compute to achieve that. But tbh, if it's just sentence classification there's much easier ways to do that, you could just use a small encoder only model and it would probably work well!
@elchippe
@elchippe Год назад
The bloke gmml falcon version can run in a CPU but 8GB is way to low for this model.
@deanchanter217
@deanchanter217 Год назад
Llama2 with fine tuning...crazy that as you are producing this video a new and better model dropped
@eeshanchanpura3356
@eeshanchanpura3356 Год назад
can i run this on Google colab? it does provide me cloud memory which can be helpful
@rexlaurus5894
@rexlaurus5894 Год назад
Would have been cool to see how you set up runpod.
@adelekefikayomi8351
@adelekefikayomi8351 Год назад
Please can it work with tensorflow????
@so_i_learn_3d549
@so_i_learn_3d549 Год назад
hey Nich thanks for u video but i have i have already devellope Q.A chatbot response and i try to implementation fontionnality , to make him read excel file using text generation, do u have any idea how an i implement this? the only i found is to use Langchain and openai API but i try to do withoug openIA API
@NicholasRenotte
@NicholasRenotte Год назад
Working on it right now!
@akashsavalgi-k4f
@akashsavalgi-k4f Год назад
hey, very nice video. Can you tell me what is the system requirement to train our model.
@luis96xd
@luis96xd Год назад
What is the best LLM model for low RAM memory usage, for example implementing in a free tier hosting service
@wichawt3079
@wichawt3079 Год назад
perhaps a 13b model, " ehartford/Wizard-Vicuna-13B-Uncensored " is the highest ranked 13B on huggingface's leaderboard
@luis96xd
@luis96xd Год назад
@@wichawt3079 Thank you so much for your answer! I will try
@fahnub
@fahnub Год назад
10/10 Content. Engaging, Informative, Precise. ❤
@FunCodingwithRahul
@FunCodingwithRahul Год назад
Excellent video Nich. I am also exploring Falcon for my domain specific requirement using the concept of RAG with Langchain. But the model is taking too much of time to generate the result even after quantization. Do you have any suggestion on how to reduce the runtime? If I set max_length to less than 1000, model is unable to generate anything. Kind of stuck with the issue !!
@NicholasRenotte
@NicholasRenotte Год назад
Yeah I ran into this as well, only way to see fast results is running if on big GPUs.
@alirezagoudarzi1915
@alirezagoudarzi1915 Год назад
Thanks, Hey Nick how can I integrate langchain codes?
@DominicMarrocco
@DominicMarrocco Год назад
your style of production, personality and content is excellent
@abbeynguyen8396
@abbeynguyen8396 Год назад
It is amazing. Could you please do a video about Tree-of-thoughts?
@patchshorts
@patchshorts Год назад
what video card is required to run this?
@i2c_jason
@i2c_jason Год назад
On the math thing... is the word on the street that we're going to handle math just by increasingly larger parameter counts? Because that scares the crap outta me for engineering applications where the math becomes very technical and obscure. Almost like we need a separate ALU baked into the model to make math feasible on lightweight small parameter count models.
@NicholasRenotte
@NicholasRenotte Год назад
Mark my words, some new architecture will come out that will boost performance with dramatically smaller parameter counts. You're right though using a separate ALU could work as well, e.g. Langchain using Wolfram. Also, I can share some of the work our research teams are doing for efficient fine tuning and building smaller parameter efficient models!
@LowestofheDead
@LowestofheDead Год назад
I think they only do arithmetic tests to see how well the model can generalize. Like he said, people already use Langchain or the Wolfram plugin to do math properly.
@i2c_jason
@i2c_jason Год назад
@@LowestofheDead Yes, agree, but that's not going to accelerate us very far. It means you still have to be super specialized in mathematics to know how to use those tools. The promise of AI would be to get to a point where the AI model can use those tools to output highly mathematical solutions with simple prompts. For example, "imagine a geometrically correct STEP file assembly of a handheld drill"... then open it in Fusion360 and print or machine all of the parts. That is the next fundamental step change in this tech, IMO. "3D" images don't count, because they are not geometrical engineering files of reproducible physical objects.
@ShifraTech
@ShifraTech Год назад
Have been fintuning this model into learnign new languages a fun experiment indeed... I think more people need to play around with this. 😇
@ParthPatel-db4tk
@ParthPatel-db4tk Год назад
Hello Nicholas sir, this video was really helpful to learn how to make own chatbot, it would be helpful if you make video upon how to use LLMs to perform classification using fine tuning techniques such as Zero shot & Few shots learning. Thanks.....
@divaxshah9424
@divaxshah9424 Год назад
Really loving this going through technique , what an amazing video that's a lot . Also I have 2 questions.. 1) can I run falcon 40 instruct on Colab free version, which has Tesla T4 16GB ?! 2) can you make a video on Fine Tuning a Stable Diffusion model like sd2.1 or sdxl to make our own checkpoints ?! PS: really amazing video, thank lot❤
@Woollzable
@Woollzable Год назад
Answer to your first question: No. You cannot run falcon 40b-instruct on a Colab free version. Falcon 40b needs 85GB - 100GB of VRAM at 16-bit precision. Even with reduced precision down to 8-bit it still requires some 45 GB VRAM. At 4-bit precision, it requires 35 GB VRAM. You. need to load the entire model on to GPU memory (could be multiple GPUs).
@heltengundersen
@heltengundersen Год назад
please, a video on fine turning falcon 40b on a large code base
@mmmhhh.
@mmmhhh. Год назад
Whats the best model multi target emotionally informed hatespeech detection
@NicholasRenotte
@NicholasRenotte Год назад
Think there's a bunch of those, probably encoder only models. I've seen a few in the HuggingFace model repo.
@rahulkiroriwal8779
@rahulkiroriwal8779 Год назад
why among us sounds lmao 😂😂😂😂
@NicholasRenotte
@NicholasRenotte Год назад
LOL, been watching too many streams.
@jzam5426
@jzam5426 Год назад
thanks for the great content!! Newish to the channel. Has it been tested against gpt-4?
@DCinzi
@DCinzi Год назад
This is so exiting and yet so demoralizing at the same time. Unless you have some real good understanding of coding and llm looks like an impossible task. And rightly so.. but I wish there were more effort out there to make this way more accessible to people that focused on other subjects, also because we may just end up with a lot of very superficial products .
@luis96xd
@luis96xd Год назад
Amazing video, everything was well explained!
@jafferaliumar
@jafferaliumar Год назад
Nice video and very informative. We almost going start the same testing and it definitely useful.
@NicholasRenotte
@NicholasRenotte Год назад
Heya, glad you liked it!
@ragunanthan7499
@ragunanthan7499 Год назад
wonderful content sir, can put a video how to train llm on own data
@foreignconta
@foreignconta Год назад
Excellent!!!!!!! Only if I could run falcon 40B on my 4Gb DDR6 GPU. 😂
@pranavagrawal4324
@pranavagrawal4324 Год назад
Hey, how to train a llm from scratch using multiple dataset from huggingface Video will be amazing
@rbanondo
@rbanondo Год назад
best teacher in youtube. is there any chance you will make a video about working with medical images?
@NicholasRenotte
@NicholasRenotte Год назад
For like segmentation?
@renegadezed
@renegadezed Год назад
id be more interested in integrating this into a discord or twitch bot than some random webapp..
@americanwayformation8717
@americanwayformation8717 Год назад
Pas trop mal ton français et mieux que l'anglais de mes collègues 😜, as an american who has lived in France for more than 10 years, I've both said and heard a lot worse. Love the videos! so much great info and things to learn. Thanks so much for sharing 🙏
@NicholasRenotte
@NicholasRenotte Год назад
Hahahahahah, I was honestly crying with laughter when watching the edit but let's be real it was a 3/10 performance!
@The_Conspiracy_Analyst
@The_Conspiracy_Analyst 8 месяцев назад
is it censored?
@conradcaldeira7131
@conradcaldeira7131 Год назад
a fine tuning video for non- GPU users will be most appreciated
@i2c_jason
@i2c_jason Год назад
Excellent work, as always. Thank you so much!!
@NicholasRenotte
@NicholasRenotte Год назад
Thanks a mil for checking it out Jason!
@Imnotsoumyajit
@Imnotsoumyajit Год назад
Congrats on your new position Nick 👏👏👏👏👏👏
@NicholasRenotte
@NicholasRenotte Год назад
Thanks a mil @developertreats!!
@tkololfi5999
@tkololfi5999 Год назад
Please run MPT-30b
@bvdlio
@bvdlio Год назад
Yay, finally first
@NicholasRenotte
@NicholasRenotte Год назад
Thanks a mil man! Ngl, this one took a while.
@kevynkrancenblum5350
@kevynkrancenblum5350 Год назад
Quelle vidéo ! Tu es le meilleur Nic !!! Love the new videos styles ! Look so nice 💪🏻💪🏻
@NicholasRenotte
@NicholasRenotte Год назад
KEV?!! I didn't know you spoke French? Also, YESSSS, stoked you liked it!
@adeelhasan7536
@adeelhasan7536 Год назад
PLEASE UPLOAD VIDEO ON FINE TUNING
@TheAbdallahk
@TheAbdallahk Год назад
You are the GOAT! This tutorial is fire! 🔥🔥🔥🔥
@NicholasRenotte
@NicholasRenotte Год назад
Thanks a mil man!!!
@wasgeht2409
@wasgeht2409 Год назад
Thanks!
@sauravkumar-sz5zx
@sauravkumar-sz5zx 9 месяцев назад
Fine tuning video please
@fishnchips6627
@fishnchips6627 Год назад
Congratulations on your promotion!
@kakamoora7874
@kakamoora7874 Год назад
Bro how we can make webscrapping in ai please make one video… or give some tips please
@NicholasRenotte
@NicholasRenotte Год назад
I don't think you really need AI or ML for it, BeautifulSoup is your best friend!
@kakamoora7874
@kakamoora7874 Год назад
@@NicholasRenotte actually we have 2000 websites, if I’m trying in beautifulsoupe it’s taking one month it’s so long process…. Selenium also not working
@ChewDaPi
@ChewDaPi Год назад
You should have used Sagemaker chief could have been cheaper
@NicholasRenotte
@NicholasRenotte Год назад
Cheers! Will check it out!
@jonnybrabals
@jonnybrabals Год назад
Give us the lipnet working with our own videos! Pleaseeeeeeee men
@chrisweeks8789
@chrisweeks8789 Год назад
So 2 A100's = 30 sec response? 😅 Inner tom ford 🤣🤣
@NicholasRenotte
@NicholasRenotte Год назад
LOL, honestly it started out as 30 minutes with no response on my local machine.
@DanielCampbellYT
@DanielCampbellYT Год назад
Great Video Nick!
@CMT-p6q
@CMT-p6q 6 месяцев назад
I have to leave a comment for the family
@guimaraesalysson
@guimaraesalysson Год назад
Great video, man
@sabarishrajksabarishrajk292
Nice one man.Keep rocking...
@NicholasRenotte
@NicholasRenotte Год назад
Thanks so much!
@philtoa334
@philtoa334 Год назад
Tu parles bien Français Nicho !! Merci pour Ta vidéo .
@NicholasRenotte
@NicholasRenotte Год назад
Bien merci Phil!! I don’t know if it’s that great though 😂
@jorgefelipegaviriafierro705
Great as always!
@danielgormly6064
@danielgormly6064 Год назад
12 days later this is out dated... damn things are moving fast
@NicholasRenotte
@NicholasRenotte Год назад
ikr
@khalidal-reemi3361
@khalidal-reemi3361 Год назад
Fine Tuning Pleeeeeeeeeeeeeeeeeeeeeeeeeeeeese
@cvs2010
@cvs2010 Год назад
Best vídeo ever
@deathspainvincentblood6745
@deathspainvincentblood6745 Год назад
guys I'll introducing programming helper the programming helper is so powerful and much better than OPENAI wat waiting for now since start 1990 is there Lua language in AI CHAT and many more
@emperor1337
@emperor1337 Год назад
Another pro-falcon, low impressions count video... falcon is a 3rd tier llm at best
@rverm1000
@rverm1000 Год назад
How can you overlay photo's? Open cv? I'm looking at just photo's that taken one after another. What I find interesting is just level of detail. At first glance they look like photo's taken in the 1950's until you hit the zoom button. There are thousands of stars and alot of stuff moving around in space. The just photo's you can see all the stuff. What I want to do I overlay 100 photo's of the same area and color everything that's not in all 100 photo's. See if we can discover new objects moving in space. Here's the starting photo jw0157126001_04201_00001_nis_trapsfilled.jpg the target is Antennea. These photos start around 790 in the list
@Tripp111
@Tripp111 Год назад
Thank you!
Далее
Have You Picked the Wrong AI Agent Framework?
13:10
Просмотров 68 тыс.
Признавайтесь, кто его смыл?
00:54
Avaz Oxun - Turqi sovuq kal
14:50
Просмотров 548 тыс.
БЕЛКА СЬЕЛА КОТЕНКА?#cat
00:13
Просмотров 966 тыс.
How to Code a AI Trading bot (so you can make $$$)
35:09
OpenAI’s New ChatGPT: 7 Incredible Capabilities!
6:27
How ChatGPT was secretly designed to suck at real work
19:21
The BEST Open Source LLM? (Falcon 40B)
23:56
Просмотров 98 тыс.
Build Anything with Llama 3 Agents, Here’s How
12:23
Просмотров 144 тыс.
I Analyzed My Finance With Local LLMs
17:51
Просмотров 480 тыс.
Признавайтесь, кто его смыл?
00:54