Thank you for the great video. It was really helpful for getting everything set up. If I may ask: I have a 4090 graphics card, and I can see it maxing out my GPU usage, so CUDA should be working correctly. However, my prompts take anywhere between 20 seconds and 2 minutes to return, and after a few questions the chatbot stops responding altogether and just stays processing. Is this normal?
@TirendazAI Hi, I'm running Llama 3 with Ollama on this localhost port: 127.0.0.1:11434. But I'm confused: how do I load the model with transformers so I can follow your steps?