
Machine Learning Lecture 28 "Ball Trees / Decision Trees" - Cornell CS4780 SP17

Kilian Weinberger
22K subscribers · 30K views

Published: 8 Oct 2024

Comments: 32
@aleksandarcvetkovic7045 · 2 years ago
I needed to learn about KD Trees and Ball Trees for a specific application but after just 2 videos I am hooked! Now I am going to watch the whole course, thank you so much Kilian for making it public :)
@Dragon-Slay3r · 1 year ago
🐘? 😂
@shrek22 · 1 year ago
this guy inspires me. thank you
@bgtheaicompany214 · 5 years ago
Thank you for uploading this video.
@marcelo.5271 · 4 years ago
Really great content. And very much appreciated! Thanks :)
@abhinav9561 · 4 years ago
Decision Trees at 35:14
@salmaabdelmonem7482 · 4 years ago
Thanks a lot for the very unique explanation! :)
@allennelson1987 · 4 years ago
Helpful. Sololearn uses KD trees in the machine learning tutorial.
@adiflorense1477 · 4 years ago
Great
@vincentxu1964 · 5 years ago
Question about ball tree complexity: to construct the ball tree, we need to perform argmax distance(x, xi) for xi in S, so the algorithm still needs to go through all the data points and compute their distance to the randomly chosen point. In this sense, compared to k-NN, I don't see any advantage in using a ball tree, since the complexity is almost the same. Or is it because we consider constructing the ball tree a precomputed step before testing, and we don't count that part toward the final running time? Many thanks!
@jeeveshjuneja445 · 5 years ago
Yes. It is precomputed.
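As the reply says, the scan is part of offline construction. A minimal sketch of the O(n) step the question describes, finding the point farthest from a chosen pivot (hypothetical helper name, not the lecture's code):

```python
import math

def farthest_point(points, idx):
    """Index of the point farthest from points[idx]: one O(n) scan."""
    return max(range(len(points)),
               key=lambda j: math.dist(points[j], points[idx]))

# Toy data: each split of the construction costs O(n), but that cost is
# paid once before any test queries arrive, not per query.
pts = [(0.0, 0.0), (1.0, 0.0), (5.0, 5.0), (0.0, 1.0)]
far = farthest_point(pts, 0)   # farthest from the chosen pivot pts[0]
```

The payoff comes at query time, where the precomputed balls let you prune large parts of the data set instead of scanning all n points per query.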
@scchouhansanjay · 4 years ago
1:20 I thought he would say, "if something does not make sense then contact me" 😂😂😂
@rjain3443 · 1 year ago
A huge thanks to you, Prof. Kilian!!! Big fan of your teaching. You really kill the topic, and it seems nothing more is left to know about it. 😀 I have a little question: why aren't ball trees more popular, given that they work so well and are invariant to the dimensionality of the feature space?
@kilianweinberger698 · 1 year ago
They have gotten a little out of fashion, because they are not that suitable for GPUs. On GPUs it is often more effective to perform approximate nearest neighbor search with brute force parallelism.
@angelmcorrea1704 · 4 years ago
Thanks for sharing these lectures. :)
@bhupensinha3767 · 5 years ago
Hi, I got a slightly different understanding of KD trees from the Python scikit-learn implementation. It says it uses, or switches to, a brute-force method in the final leaf to find the k nearest neighbors. The documentation does not talk about going up the tree and checking for neighbors hiding in other partitions; not sure if I am stating my confusion clearly. Your ball tree example seems good, but scikit-learn is quiet about ball trees, although it supports them for the k-NN algorithm.
@kilianweinberger698 · 5 years ago
What you describe sounds like approximate nearest-neighbor search. Here, one variant is to also build a tree (ball tree, KD tree, or any other variant); however, during test time you descend into the leaf into which the test point falls and return the nearest neighbor only from this leaf. There is no guarantee that you will indeed return the exact nearest neighbor in a global sense, but often this is good enough and can be very fast in practice.
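The leaf-only descent described in the reply can be sketched as follows; the tree layout and function names are illustrative, not scikit-learn's actual implementation:

```python
import math

# Build a tiny KD-style tree: cycle through features, split at the median.
def build(points, depth=0, leaf_size=2):
    if len(points) <= leaf_size:
        return {"leaf": points}
    f = depth % len(points[0])                 # feature to split on
    pts = sorted(points, key=lambda p: p[f])
    m = len(pts) // 2
    return {"f": f, "t": pts[m][f],
            "lo": build(pts[:m], depth + 1, leaf_size),
            "hi": build(pts[m:], depth + 1, leaf_size)}

# Approximate query: descend to ONE leaf, brute-force only that leaf.
# No backtracking up the tree, so the answer may miss the true neighbor.
def approx_nn(node, q):
    while "leaf" not in node:
        node = node["lo"] if q[node["f"]] < node["t"] else node["hi"]
    return min(node["leaf"], key=lambda p: math.dist(p, q))
```

The exact algorithm from the lecture would, after reaching the leaf, walk back up and check whether a sibling partition could still contain a closer point; dropping that backtracking step is what makes this variant approximate and fast.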
@vocabularybytesbypriyankgo1558 · 5 days ago
Thanks !!!
@ThePatelprateek · 2 years ago
If axis alignment is the issue, would random axes work? Why did we create spheres instead of randomly aligned hyperplanes? Are there any particular advantages to spheres vs. hyperplanes? Any pointers would help, thanks.
@saketdwivedi8187 · 4 years ago
What if we incorporate the label as an additional dimension and begin splitting on it first? Won't that always ensure that our leaf has only one label? (Because in a way we are trying to explain the variability in labels through variability in dimensions/features, so the variability in labels must be significant.)
@genesisevolution4243 · 3 years ago
@Saket Dwivedi That wouldn't help, because during testing your feature vector will not have that label dimension you just added; the label is exactly what we want to predict.
@subhasdh2446 · 2 years ago
That answer was really psych. Damnnnn!!!
@sudhanshuvashisht8960 · 3 years ago
Could anyone explain to me why ball trees are better than KD trees in a high-dimensional space (with a lower-dimensional manifold)? I find it hard to understand when the professor explains (12:55) that it is the axis-aligned splits that make KD trees bad even in this setting (a low-dimensional manifold in a high ambient dimension).
@forthrightgambitia1032 · 2 years ago
Because in higher-dimensional space the boxes are more 'crushed up' together. In high dimensions points become almost equally far apart, so their bounding boxes become indistinguishable. One of the earlier lectures covers this.
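The concentration effect mentioned above can be illustrated with a quick experiment (illustrative code, not from the lecture): the ratio between the farthest and nearest distance from a random query collapses as the dimension grows, which is what makes distance-based pruning bounds nearly useless in high dimensions.

```python
import math
import random

def contrast(dim, n=2000, seed=0):
    """Ratio of farthest to nearest distance from a random query point.
    A large ratio means distances are well separated; a ratio near 1
    means they have concentrated and neighbors are indistinguishable."""
    rng = random.Random(seed)
    X = [[rng.random() for _ in range(dim)] for _ in range(n)]
    q = [rng.random() for _ in range(dim)]
    dists = [math.dist(p, q) for p in X]
    return max(dists) / min(dists)
```

For uniform data, `contrast(2)` is typically in the tens or hundreds, while `contrast(500)` hovers just above 1: in 500 dimensions essentially every point is "about as far" as every other.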
@prateekpatel6082 · 2 years ago
Question: why construct spheres instead of hyperplanes like in a KD tree, just not axis-aligned?
@kilianweinberger698 · 2 years ago
A sphere is typically a tighter bound on the distance. In these data structures you want to bound the distance between the test point and a potential training point, and see if it could possibly be your nearest neighbor. In KD trees you say the distance from the test point to the training point is _at least_ the distance to the hyperplane. In ball trees you say it is at least the distance to the sphere. Typically the latter is larger, i.e., a better approximation, and allows you to prune away more points. Hope this helps.
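The pruning rule described in the answer can be sketched in a few lines (hypothetical helper names): the distance from a query to any point inside a ball is at least the distance to the ball's center minus its radius.

```python
import math

def ball_lower_bound(q, center, radius):
    """Optimistic lower bound on dist(q, x) for any x inside the ball."""
    return max(0.0, math.dist(q, center) - radius)

def can_prune(q, center, radius, best_dist):
    """Skip the ball entirely if even the optimistic bound cannot beat
    the best (smallest) distance found so far."""
    return ball_lower_bound(q, center, radius) >= best_dist
```

The tighter (larger) this lower bound is, the more balls get skipped without ever touching the points inside them, which is exactly the advantage over the hyperplane bound of a KD tree.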
@sandeshhegde9143 · 5 years ago
Decision Trees start at: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-E1_WCdUAtyE.html
@vatsan16 · 4 years ago
there goes the professor recruiting arthur for a phd :P
@Dragon-Slay3r · 2 months ago
I always tried to help, each time i did the cult took advantage now its their problem, and i dont know if i can help anybody everything has changed since last night
@RGDot422 · 11 months ago
Hi. Are ball trees and KD trees the same?
@kilianweinberger698 · 9 months ago
No, KD trees only split along features and essentially place the data into boxes. Ball trees fit the data into little spheres. Another way of thinking about it: imagine you have a data set and you rotate it. KD trees will now create a very different data structure, because the axes have changed; ball trees will be exactly the same. In general, KD trees tend to be better in very low-dimensional spaces (2D, 3D) and are often used in computer graphics to find nearest neighbors for meshes. Ball trees tend to be better at medium dimensionalities (e.g. 10D-50D).
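The rotation argument above can be checked numerically (illustrative sketch): a rotation preserves every pairwise distance, and pairwise distances are all a ball tree depends on, whereas a KD tree reads the raw, now-changed coordinates.

```python
import math

def rotate2d(p, theta):
    """Rotate a 2D point by angle theta around the origin."""
    x, y = p
    return (x * math.cos(theta) - y * math.sin(theta),
            x * math.sin(theta) + y * math.cos(theta))

pts = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0), (3.0, 1.0)]
rot = [rotate2d(p, 0.7) for p in pts]

# All pairwise distances are identical before and after rotation, so any
# structure built purely from distances (a ball tree) comes out unchanged.
before = [math.dist(a, b) for a in pts for b in pts]
after = [math.dist(a, b) for a in rot for b in rot]
```

Meanwhile the raw coordinates in `rot` differ from `pts`, so a KD tree's axis-aligned median splits land in different places on the rotated copy.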
@adiflorense1477 · 4 years ago
6:20 It's like the bottom-up method, huh