Тёмный

Machine Learning Foundations: Ep #8 - Tokenization for Natural Language Processing 

Google for Developers
Подписаться 2,4 млн
Просмотров 27 тыс.
50% 1

Наука

Опубликовано:

 

5 окт 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 33   
@fytubevw
@fytubevw 4 года назад
Nice, practical and clearly delivered. Thanks!
@LaurenceMoroney
@LaurenceMoroney 4 года назад
Thanks!
@jaimeebhayani
@jaimeebhayani 18 дней назад
Great content, as always! Just a quick off-topic question: My OKX wallet holds some USDT, and I have the seed phrase. (air carpet target dish off jeans toilet sweet piano spoil fruit essay). How should I go about transferring them to Binance?
@oraya2689
@oraya2689 4 года назад
Pretty neat I'll just have a question : You first explained that you choose word based encoding over character based because it needs a sequence model to make the difference between 2 similar sets of codes (2:13). But in the end (8:37), you also explain that next step will be about sequences of tokens (which make sense, because there's no meaning without order in a sentence). Why highlighting this difference while we could as well have dealt with character based encoding from the start ? This point is more about the didactic than the subject itself ^^
@LaurenceMoroney
@LaurenceMoroney 4 года назад
At the end we're highlighting that we'll show how to turn word tokens into sequences of word tokens to represent the sentences. So, 'I love my dog' instead of being a bag of words would be tokens in that order.
@emamulmursalin9181
@emamulmursalin9181 3 года назад
Simple and just exactly on the point. Bingo!
@bhovmikvaity701
@bhovmikvaity701 4 года назад
nice explanation
@LaurenceMoroney
@LaurenceMoroney 4 года назад
Thanks!
@zapphodbeeblebrox564
@zapphodbeeblebrox564 4 года назад
Thanks for making it easy to understand...
@user-wp8yx
@user-wp8yx Год назад
How to tokenize "सम्मार्जनंकृतवतीवा"? This means "have you swept the floor?" In Sanskrit. All my models break it down to individual characters.
@RAVIKUMAR-xv9sy
@RAVIKUMAR-xv9sy 4 года назад
Looking for more NLP content now on.
@LaurenceMoroney
@LaurenceMoroney 4 года назад
Thanks!
@sanooosai
@sanooosai 8 месяцев назад
great thank you
@RajaSekharaReddyKaluri
@RajaSekharaReddyKaluri 4 года назад
@Laurence From here on, I assuming NLP zero to hero content would repeat. I'm optimistic about few additional learnings though.
@LaurenceMoroney
@LaurenceMoroney 4 года назад
It's close, but this is a bit more detailed, and includes exercises
@thijsoudeavenhuis999
@thijsoudeavenhuis999 4 года назад
Very good examples, clear explanation. Good stuff. Thank you.
@LaurenceMoroney
@LaurenceMoroney 4 года назад
Thanks!
@شيفاحمدالمصري-و3ش
@شيفاحمدالمصري-و3ش 4 года назад
Good job
@LaurenceMoroney
@LaurenceMoroney 4 года назад
Thanks!
@madhusudaneyunni7816
@madhusudaneyunni7816 4 года назад
Hello Mr Lauren, How does the tool determine what is the most common word?
@LaurenceMoroney
@LaurenceMoroney 4 года назад
For the *how* -- it's best to look at the source code. For the *why*, if you're not going to keep everything, you, logically, want to keep the most common words in it.
@madhusudaneyunni7816
@madhusudaneyunni7816 4 года назад
@@LaurenceMoroney , can you please point me to the source code?
@laurencemoroney655
@laurencemoroney655 4 года назад
@@madhusudaneyunni7816 github.com/tensorflow
@alistairdelacour6104
@alistairdelacour6104 4 года назад
Just finished my dissertation on this 😂
@LaurenceMoroney
@LaurenceMoroney 4 года назад
How'd it go?
@alistairdelacour6104
@alistairdelacour6104 4 года назад
Laurence Moroney pretty well, haven’t got the grade yet but the company I made the model for loved it.
@alistairdelacour6104
@alistairdelacour6104 4 года назад
Laurence Moroney it was based on classification by clustering to determine the similarity in written fault reports
@earlbullock8367
@earlbullock8367 4 года назад
How Can I Get Your Full Machine Learning Course?
@LaurenceMoroney
@LaurenceMoroney 4 года назад
It's on Coursera (TensorFlow: In Practice and TensorFlow: Data and Deployment)
@abdulspeak5322
@abdulspeak5322 4 года назад
great
@LaurenceMoroney
@LaurenceMoroney 4 года назад
Thank you!
@earlbullock8367
@earlbullock8367 4 года назад
I Really Wanna Learn How To Get A High Paying Profession
@LaurenceMoroney
@LaurenceMoroney 4 года назад
Then go for it!
Далее
Bro's Using 3 Weapons
00:36
Просмотров 3,3 млн
ВЫЖИЛ В ДРЕВНЕМ ЕГИПТЕ!
13:09
Просмотров 211 тыс.
RAG vs. Fine Tuning
8:57
Просмотров 23 тыс.
Machine Learning Zero to Hero (Google I/O'19)
35:33
Просмотров 1,8 млн
Machine Learning Foundations: Ep #1 - What is ML?
15:34
Career Advice For A World After AI
23:07
Просмотров 220 тыс.
3x 2x 1x 0.5x 0.3x... #iphone
0:10
Просмотров 2,8 млн
Заказал Li 7 из Китая
0:59
Просмотров 178 тыс.
CED: часть 1
23:37
Просмотров 103 тыс.