Thanks a million to NordPass Business for helping me end my Shin Ramen speed run…and for sponsoring this video! Whatcha waiting for? Grab a 3-month free trial here: nordpass.com/nicholasnord with code nicholasnord!
I started following quite a long time ago, and I've seen an incredible improvement in video quality and content. I'll always be waiting to explore more new tech from you.
I've seen a bunch of your videos and this one has got to be the best. Such a complicated topic turned engaging + informative is no joke. Thanks Nicholas!
@@NicholasRenotte I'm trying to create a private expert on a specific subject, but I'm seeing that I'll need to go through fine-tuning to do what I want. I love the way you explain this tech; as a beginner it's hard to really understand what I'm doing, so I'm always watching your content to really get an understanding of what I'm actually doing.
How many RTX 4090s are required for Falcon 40B? 160 GB of VRAM via RTX 4090s means 6-7 cards, which is roughly $10k and still cheaper than two A100s. I'm already using RTX cards for Stable Diffusion.
Have you been able to use VSCode connected to a remote jupyter instance that still allows Pylance to work? e.g., so you can make use of VSCode's nifty features like cmd/ctrl clicking to see a method definition, etc.
Can we not use model.save_pretrained to save the model in the form of shards, so that when device_map="auto" is used, accelerate would kick in and offload the shards to disk and memory? I think that's why you were getting OOM errors.
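For anyone trying this, here's a minimal sketch of the pattern described above. The directory names and 2GB shard size are illustrative assumptions, not from the video, and the reload function needs transformers + accelerate and real hardware, so only the shard-count arithmetic runs here:

```python
import math

def num_shards(model_size_gb: float, max_shard_size_gb: float) -> int:
    # How many checkpoint files save_pretrained would roughly write
    # for a given model size and max_shard_size setting.
    return math.ceil(model_size_gb / max_shard_size_gb)

def reshard_and_reload(model_id: str, out_dir: str):
    # Requires transformers + accelerate and a big machine, so it is
    # not executed here -- just the call pattern from the comment above.
    from transformers import AutoModelForCausalLM
    model = AutoModelForCausalLM.from_pretrained(model_id)
    model.save_pretrained(out_dir, max_shard_size="2GB")   # write shards
    return AutoModelForCausalLM.from_pretrained(
        out_dir,
        device_map="auto",         # let accelerate place layers
        offload_folder="offload",  # spill what doesn't fit to disk
    )

# e.g. an ~80 GB fp16 Falcon-40B checkpoint in 2 GB shards:
print(num_shards(80, 2))  # -> 40
```

With the weights sharded, accelerate can stream layers between GPU, CPU RAM, and the offload folder instead of loading one giant file, which is what avoids the OOM.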
What's the cheapest hardware we can find on the market to be able to run this? I'd prefer it to be local. Can someone help me order the required hardware for this?
Nicholas, thank you :) I have two questions. First, is it possible to use this on my personal laptop? I don't have a robust GPU; instead, I have a Mac M1 with 8 GB RAM. Second, could I train the pre-trained 40B model for a specific task in German? I'd like to use it for classifying sentences into labels. Is that feasible?
Realistically, no, you'll need a beast GPU to run it. It didn't work on my Mac. You'd probably need to fine-tune it for that, but you'd need even more GPU compute to achieve that. Tbh, if it's just sentence classification there are much easier ways to do that; you could just use a small encoder-only model and it would probably work well!
Hey Nich, thanks for your video. I've already developed a Q&A chatbot and I'm trying to implement functionality to make it read Excel files using text generation. Do you have any idea how I can implement this? The only way I've found is to use LangChain and the OpenAI API, but I'm trying to do it without the OpenAI API.
Excellent video Nich. I'm also exploring Falcon for my domain-specific requirement using RAG with LangChain. But the model is taking too much time to generate results even after quantization. Do you have any suggestion on how to reduce the runtime? If I set max_length to less than 1000, the model is unable to generate anything. Kind of stuck with this issue!
On the math thing... is the word on the street that we're going to handle math just by increasingly larger parameter counts? Because that scares the crap outta me for engineering applications where the math becomes very technical and obscure. Almost like we need a separate ALU baked into the model to make math feasible on lightweight small parameter count models.
Mark my words, some new architecture will come out that will boost performance with dramatically smaller parameter counts. You're right though, using a separate ALU could work as well, e.g. LangChain using Wolfram. Also, I can share some of the work our research teams are doing on efficient fine-tuning and building smaller parameter-efficient models!
I think they only do arithmetic tests to see how well the model can generalize. Like he said, people already use Langchain or the Wolfram plugin to do math properly.
@@LowestofheDead Yes, agree, but that's not going to accelerate us very far. It means you still have to be super specialized in mathematics to know how to use those tools. The promise of AI would be to get to a point where the AI model can use those tools to output highly mathematical solutions with simple prompts. For example, "imagine a geometrically correct STEP file assembly of a handheld drill"... then open it in Fusion360 and print or machine all of the parts. That is the next fundamental step change in this tech, IMO. "3D" images don't count, because they are not geometrical engineering files of reproducible physical objects.
Hello Nicholas sir, this video was really helpful for learning how to make my own chatbot. It would be great if you made a video on how to use LLMs to perform classification with fine-tuning techniques such as zero-shot & few-shot learning. Thanks!
Really loving this walkthrough technique, what an amazing video, thanks a lot. Also I have 2 questions: 1) Can I run Falcon 40B Instruct on the Colab free version, which has a Tesla T4 16GB? 2) Can you make a video on fine-tuning a Stable Diffusion model like SD 2.1 or SDXL to make our own checkpoints? PS: really amazing video, thanks a lot ❤
Answer to your first question: no, you cannot run Falcon 40B Instruct on the Colab free version. Falcon 40B needs 85-100 GB of VRAM at 16-bit precision. Even with precision reduced to 8-bit it still requires some 45 GB of VRAM, and at 4-bit precision it requires about 35 GB. You need to load the entire model onto GPU memory (possibly across multiple GPUs).
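As a rough sanity check on those numbers, weight memory alone scales linearly with precision (params × bytes per param). The 15% overhead factor below is my own rough assumption for buffers, KV cache, and activations, not a measured figure; lower-precision loaders also add their own overhead, which is why real 4-bit usage lands above the bare-weights estimate:

```python
def weight_vram_gb(params_billions: float, bits: int) -> float:
    # Memory for the weights alone: params * (bits / 8) bytes.
    return params_billions * bits / 8

def rough_total_gb(params_billions: float, bits: int,
                   overhead: float = 1.15) -> float:
    # Hypothetical ~15% extra for KV cache, activations and buffers;
    # real overhead varies with batch size and sequence length.
    return weight_vram_gb(params_billions, bits) * overhead

for bits in (16, 8, 4):
    print(f"{bits}-bit: {weight_vram_gb(40, bits):.0f} GB weights, "
          f"~{rough_total_gb(40, bits):.0f} GB rough total")
```

So a 40B model is ~80 GB of weights at fp16 before any overhead, which is why a single 16 GB T4 is nowhere close.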
This is so exciting and yet so demoralizing at the same time. Unless you have a really good understanding of coding, LLMs look like an impossible task. And rightly so... but I wish there were more effort out there to make this way more accessible to people focused on other subjects, also because we may just end up with a lot of very superficial products.
Your French isn't too bad, and it's better than my colleagues' English 😜. As an American who has lived in France for more than 10 years, I've both said and heard a lot worse. Love the videos! So much great info and things to learn. Thanks so much for sharing 🙏
@@NicholasRenotte Actually, we have 2000 websites. If I try with BeautifulSoup it takes a month, it's such a long process… Selenium isn't working either.
Guys, let me introduce Programming Helper. Programming Helper is so powerful and much better than OpenAI, what are you waiting for? Is there Lua language support in AI chat, and many more?
How can you overlay photos? OpenCV? I'm looking at photos that were taken one after another. What I find interesting is the level of detail. At first glance they look like photos taken in the 1950s, until you hit the zoom button. There are thousands of stars and a lot of stuff moving around in space, and in the raw photos you can see all of it. What I want to do is overlay 100 photos of the same area and color everything that's not in all 100 photos, to see if we can discover new objects moving in space. Here's the starting photo: jw0157126001_04201_00001_nis_trapsfilled.jpg; the target is Antennae. These photos start around 790 in the list.
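A minimal sketch of that stacking idea with NumPy (synthetic 8x8 frames stand in for real images here; for actual files you'd load and align them first with OpenCV or astropy, and the threshold value is an assumption you'd tune):

```python
import numpy as np

def highlight_transients(frames, threshold=10.0):
    """Stack aligned frames and flag pixels that differ strongly from
    the per-pixel median in at least one frame -- i.e. things that are
    NOT in all frames (moving objects, satellites, cosmic-ray hits)."""
    stack = np.stack(frames).astype(float)    # shape (n, h, w)
    reference = np.median(stack, axis=0)      # the static sky
    deviation = np.abs(stack - reference).max(axis=0)
    return deviation > threshold              # True = transient pixel

# Synthetic demo: a static "sky" plus one frame with a moving dot.
rng = np.random.default_rng(0)
sky = rng.normal(100, 1, size=(8, 8))
frames = [sky + rng.normal(0, 1, size=(8, 8)) for _ in range(9)]
frames[4] = frames[4].copy()
frames[4][3, 5] += 50                         # object present in one frame only

mask = highlight_transients(frames)
print(mask[3, 5], int(mask.sum()))            # the dot is flagged
```

The median is the key trick: unlike a mean, a bright object passing through a few frames barely shifts it, so the median stack is a clean picture of the static sky and everything transient stands out in the difference.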