This is my second time going through your Ray videos, about 6mo after first watching them. I just want to say not only is this some of the most thorough and digestible Ray content I've come across, it's also some of the best technical tutorial/explanation content I've come across. You have a real knack for this stuff.
Listen man... this can't be an "I made five videos and it's done" kinda thing. You should pick this back up! This series is one of the best explanations of a topic like this I've ever seen on RU-vid. I know it probably took a lot of work to make these even though they're "simple" videos, but they are so well thought out and planned and executed. You have a talent for explaining this kind of stuff.
Hi Jonathan, I am wondering whether serverless will replace Flink and Spark in the future? I am thinking Beam could be something a serverless platform uses to replace Flink and Spark.
I see them as somewhat complementary. Serverless is really an infrastructure concept, whereas Spark/Flink/Ray/etc. are more programming models (kind of analogous to the distinction of user space vs kernel space for personal computers). So you can in theory have a serverless deployment of Flink/Spark/etc., and that is exactly what products like AWS Serverless EMR provide.
@@JonathanDinu Thanks for the reply! Btw, if I have Beam running on top of a FaaS framework that provides alternative runners other than Spark/Flink, does this make sense? Is it an alternative and direct competitor to them?
@@wilsonnybinghamton without knowing the exact FaaS platform internals it is hard to say for sure, but if you are writing/running Beam it shouldn't really matter what the underlying runners are.
are there any guarantees when you call ray.get() to get age that all of the grow_older invocations have resolved? I presume yes for each actor (in line with each actor being synchronous)
yeah, that is the case for a specific actor but not necessarily guaranteed between actors. To synchronize between actors you usually pass object references between them via remote() function calls.
Hey Jonathan, this is great! Thank you! Quick question: the GH repo link you posted is not taking me to the one in your video, and I also tried to look up the content using GH search but I can't find it!
sorry about that, I changed the repo name (so the link in the video is outdated), but the code and examples should be the same as what is in github.com/jonathandinu/spark-ray-data-science. Let me know if that doesn't work
Amazing, I was searching for this explanation. Is there a way to make only a single method inside a class remote? Maybe by overriding the remote method accordingly?
hmmmm, I actually haven't ever tried that. You could try using a regular (un-decorated) Python class and applying the @ray.remote decorator to just a single method. The thing to keep in mind though is what variables/state from the enclosing class the remote method might access.
Awesome explanation! (Someone in the comments section said this is an understatement, but I can't think of any other adjective!😅) But seriously, great videos.
Hi! I have set up a local cluster and connected another laptop, but the client node (the second laptop) won't do any work at all: no CPU usage or memory. What is the issue? Please help. All the work is done by my host laptop (laptop 1). How can I do load sharing?
Hey, when I start Ray on my own system it starts up, and I connect another laptop using ray start --address='xxxxx' --redis-password='xxxxxx'. The laptop gets connected to my computer, but when I run a heavy task only my computer uses 80% of its memory and CPU, while the worker laptop doesn't use much CPU or memory at all. It gets connected but isn't contributing its resources, and I wonder why. Do you have any idea where I am going wrong? Thanks
it is hard to say without knowing more about the specifics of the machines and the code that is running. My guess is that the task might not need more resources than the one computer already has (hence only 80% utilization). So Ray might be using only the resources it needs from the single machine, and since it is more efficient to avoid communication over the network, it never uses the "cluster"
Hi Jonathan - This is your 450th subscriber speaking. Just found this series and I'm absolutely loving it! Please continue making more content on Ray (and maybe RLlib!) Let's get you to 100k soon!
conceptually they are a little similar, but Ray has a higher level API that feels very Pythonic, has built in fault tolerance, and likely different performance characteristics for different types of jobs. From a technology standpoint, Ray is much more similar to Akka than it is to MPI.
I believe that the default is to use Ray's native scheduling, usually referred to as the autoscaler. In practice though, you usually run Ray on a cloud provider (like GCP or AWS) or on-premise using Kubernetes. Ray has a nice built-in module to manage all of this complexity for you: docs.ray.io/en/latest/cluster/deploy.html
Thank you for this video. I was wondering: were Ray Datasets released after you made this video? It must be the case, because otherwise I'm sure you'd have mentioned them.
yeah, at the time of the video the only official Ray modules were the ones from the diagram, and the ecosystem has actually changed pretty dramatically in the time since
This is very well explained. I particularly liked the second section as few people actually talk about sustainable/scalable machine learning deployments!
let me know if the Docker setup earlier in the video doesn't work for you. Unfortunately the dashboard is somewhat dependent on your individual machine setup, but hopefully Docker helps. I also set up a Discord server for general chat and Q&A: discord.gg/nbyZ6EpUum
@@JonathanDinu Actually I already had Docker installed on my machine, but the process to install Ray in Docker seems too complicated for me. I don't have that much experience with Docker tbh, so I was just using Ray on my Anaconda server in a Jupyter notebook. Thanks for the Discord server!
Thank you. This is by far the best overview of the distinction between these platforms. I'm a long-time Spark user and Ray newbie and this breakdown rings true. I really like Ray for hyperparameter tuning and model serving.
Ray has a pretty nice built-in cluster launcher that manages all the coordination needed to set up the library on multiple machines (or deploy to the cloud). docs.ray.io/en/master/cluster/index.html
only the Actor API can be thought of as similar to Akka. The Ray project itself has many other components and is designed for ML and reinforcement learning, hence the focus on Python. So even though you could probably do similar things with Akka, they have somewhat different target audiences and use cases.
Thanks for the clear explanations! I'd be interested to see an example of recovery in a multi node cluster where a node fails and its actors are recovered on another node.