What is AutoML? A conversation with Gnosis Data Analysis

StatQuest with Josh Starmer

35K views

Published: 11 Oct 2024

Comments: 94

@statquest 4 years ago

NOTE: This StatQuest is sponsored by JADBIO. Just Add Data, and their automatic machine learning algorithms will do all of the work for you. For more details, see: bit.ly/3bxtheb BAM! Support StatQuest by buying The StatQuest Illustrated Guide to Machine Learning!!! PDF - statquest.gumroad.com/l/wvtmc Paperback - www.amazon.com/dp/B09ZCKR4H6 Kindle eBook - www.amazon.com/dp/B09ZG79HXC

@murilopalomosebilla2999 3 years ago

Really hope to see AutoML become more popular. Too many people are wasting time on things like hyperparameter optimization when there are much more important things to look at.

@statquest 3 years ago

bam!

@Ballbagsaggins 1 month ago

Genuinely love StatQuest! This channel has helped me through so many knowledge gaps while trying to get through my degree programme. I'm looking to assess AutoML systems for my dissertation, so thanks for this video. I would love to see some practical examples on the channel, if that's at all possible or interesting to you. 🙂

@statquest 1 month ago

Thanks! I'll keep that in mind.

@silsed 4 years ago

Hi Josh and all. Thanks for the video! Could you maybe do an AutoML video series? For example, interviews with people from Auto-Keras or H2O.io, or other auto-model companies? Thanks

@statquest 4 years ago

That's a great idea!

@edwardswing5203 4 years ago

Very good video on autoML. It would be nice to have some tutorials on AutoML, including the whole development cycle of a problem using AutoML. BAM!!!

@statquest 4 years ago

It's in the works.

@gnosisdataanalysis8595 4 years ago

Hi Edwards, JADBIO does not have a video tutorial... yet :-), but if you want, we have a two-week trial, sample data, and several step-by-step tutorials that will walk you through the process of regression, binary or multi-class classification, and survival analysis. You can find them all at JADBIO.com under our use cases.

@rhn122 4 years ago

This kind of topic always reminds me of the statement by Harari in Homo Deus: _"Eventually, algorithms will be so advanced that machines will be in your position to be interviewed, only to find out the company also deployed machines to interview the applicants"_

@statquest 4 years ago

@ISK_VAGR 3 years ago

Well, it might be in the fuuuuture, but for now, I can tell you of some companies that are testing this, and it is a catastrophe. Avira doesn't provide call center services anymore. When I had a problem with the VPN, the only option was email. I sent the email and received an answer from an AI that correctly directed me to the troubleshooting question, but the troubleshooting didn't solve the problem and the AI assumed that it did, so it sent me another email saying that the problem was successfully resolved, after wasting an hour of my time trying to solve it. As a result, I asked for a reimbursement, which I got, and I rated Avira VPN as trash and as a company that doesn't take care of its clients. That means we are not quite there.

@StratosFair 4 years ago

Hey Josh, thank you so much for all these videos on your channel! They saved my life more than once! As a suggestion, I think you should consider making a video about Partial Least Squares (PLS) regression. It's quite a nice and efficient method, but I remember struggling so much with it back then!

@statquest 4 years ago

I'll keep that in mind.

@saiprakashssp1820 4 years ago

Hi StatQuest team, your approach to theoretical explanations is simply superb (BAM BAM). Can you make videos with coding examples and exercises?

@statquest 4 years ago

I have a few videos along those lines. You can see all of my videos here: statquest.org/video-index/

@dheerajkura5914 4 years ago

Building a machine learning model involves a lot of analysis, discussion, and decision making while treating the data, choosing an ML algorithm, and interpreting the model's results... Automation can make ML model building faster, through scalable infrastructure or in other ways. But the irony is that building an ML model is only the next priority after understanding the business problem, sourcing the right data, and treating the data.

@statquest 4 years ago

It's true - the hardest part of any "data science" project is understanding the problem and getting the data into the correct format.

@ran_domness 3 years ago

Love Crete!... Now love AutoML! Thanks Josh, very interesting.

@statquest 3 years ago

Thanks for watching!

4 years ago

I'd say AutoML is the future... like automation always is. Think of the things we use today, where we believe we have to do a lot manually... in fact, earlier generations called building the basis for those things automation.

@statquest 4 years ago

@gnosisdataanalysis8595 4 years ago

Double Bam!

@buithanhlam3726 4 years ago

Hi StatQuest! Could you please make videos about Markov nets and conditional random fields? Thank you.

@statquest 4 years ago

I'll keep that in mind.

@nishant8507 1 year ago

Hi StatQuest. Love your content. Been following for some time. I'm looking at entering the data science and ML field. Should I learn how to do hyperparameter optimization, or just learn AutoML for doing it? Your response will be much appreciated.

@statquest 1 year ago

It's still important to learn how to tune hyperparameters because autoML is only getting started and you might want to use a model that isn't part of it.
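
For readers wondering what tuning a hyperparameter by hand looks like, here is a minimal sketch, not anything shown in the video. It assumes a tiny k-nearest-neighbours classifier written from scratch, made-up toy data, and a hypothetical candidate list for `k`; each candidate is scored with leave-one-out cross-validation and the best one is kept. AutoML tools automate loops of roughly this kind, though their actual search strategies may differ.

```python
import math
import random
from collections import Counter


def knn_predict(train, query, k):
    """Predict the label of `query` by majority vote among its k nearest training points."""
    neighbours = sorted(train, key=lambda row: math.dist(row[0], query))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


def loo_accuracy(data, k):
    """Leave-one-out cross-validation accuracy for a given k."""
    correct = 0
    for i, (x, y) in enumerate(data):
        train = data[:i] + data[i + 1:]  # every point except the held-out one
        if knn_predict(train, x, k) == y:
            correct += 1
    return correct / len(data)


def grid_search(data, candidate_ks):
    """Score every candidate k and return (best_k, scores)."""
    scores = {k: loo_accuracy(data, k) for k in candidate_ks}
    best_k = max(scores, key=scores.get)
    return best_k, scores


# Toy data (made up for illustration): two noisy 2-D clusters.
rng = random.Random(0)
data = [((rng.gauss(0, 1), rng.gauss(0, 1)), "a") for _ in range(30)]
data += [((rng.gauss(3, 1), rng.gauss(3, 1)), "b") for _ in range(30)]

best_k, scores = grid_search(data, candidate_ks=[1, 3, 5, 7, 9])
for k, acc in scores.items():
    print(f"k={k}: accuracy={acc:.3f}")
print("best k:", best_k)
```

The same pattern (pick candidates, score each with cross-validation, keep the best) applies to any model's hyperparameters, which is why it is still worth understanding even when a tool runs it for you.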

@rmfalcao 4 years ago

Hi! Thanks for creating and maintaining such a useful channel! Multiple BAMs! :) I looked for videos that clearly explain Apriori and Ripper, but couldn't find them... Did you create those? YouTube is full of examples... but no one else does it like you do. (Thanks again!)

@statquest 4 years ago

I don't have videos on those topics yet. All of my videos are listed here: statquest.org/video-index/

@RollingcoleW 1 year ago

Great episode and song! lol

@statquest 1 year ago

Thank you!

@anothervanwinkle 4 years ago

Well, an amazing and slightly frightening perspective at the same time. But as I. Tsamardinos pointed out, the role of the human data scientist will shift focus to data preparation and outcome evaluation, as it should. The image of sci-fi geniuses fostering their artificial "child" (HAL or Mr. Data) comes to mind. For this to work instantly, I see an obstacle in the computational power needed for ML processes. Furthermore, prediction is only part of the story; to be exact, the second part. Knowing what constitutes a model is implicit in any scientific field aiming at explaining the world, and it gains importance where model-based decisions have severe consequences for individuals' lives, e.g. bank lending or forensic risk assessment. But yeah, having AutoML would leave more time for other nice things in life, too. Looking forward to seeing how the story evolves.

@statquest 4 years ago

Nice!

@boyangsong3091 4 years ago

I have used Azure AutoML and it is powerful.

@PraveenKumar-pd9sx 4 years ago

Can it be used in Python as an import?

@boyangsong3091 4 years ago

@@PraveenKumar-pd9sx Yes, you can import it from the Azure ML SDK.

@PraveenKumar-pd9sx 4 years ago

@@boyangsong3091 Thanks dude

@sps014 4 years ago

I am also enjoying Azure Services with Student Account..

@GamerAlphaInd 4 years ago

GPT-3 is also hosted on Azure.

@tiwa2929 4 years ago

Thanks for the video. It is really interesting to watch your videos, and I can now easily understand the statistics I did not understand before. Could you also make one for PLS and PLS-DA?

@statquest 4 years ago

I'll keep those topics in mind! :)

@khoaphamdang6745 4 years ago

When will you post the next video? I'm really looking forward to it.

@statquest 4 years ago

As soon as I can. Maybe in one or two weeks.

@shichengguo8064 4 years ago

I have a dataset with a dependent variable and multiple independent variables (features). Is there any platform I can use to train the most frequently used machine learning models, like random forest, SVM, naive Bayes..., and return a comparison table showing the performance of these models?

@statquest 4 years ago

I'm pretty sure that's one of the things they do at JADBIO: bit.ly/3bxtheb

@gnosisdataanalysis8595 4 years ago

To add to what Josh just said, we have just added a new feature that reports, in addition to the detailed performance of the best performing model and the best interpretable model, a report of the best performance of each machine learning algorithm we test. Would this be what you want?
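
To make the idea of a "comparison table" concrete, here is a rough sketch of the general pattern. It is not how JADBIO works internally. Everything in it is an assumption for illustration: the toy data is made up, and three small from-scratch classifiers (a majority-class baseline, nearest centroid, and Gaussian naive Bayes) stand in for the random forest and SVM mentioned above, which would plug into the same loop via any library that offers fit/predict-style models.

```python
import math
import random
from collections import Counter, defaultdict


class MajorityClass:
    """Baseline: always predict the most common training label."""
    def fit(self, X, y):
        self.label = Counter(y).most_common(1)[0][0]
        return self

    def predict(self, x):
        return self.label


class NearestCentroid:
    """Predict the class whose mean feature vector is closest."""
    def fit(self, X, y):
        groups = defaultdict(list)
        for xi, yi in zip(X, y):
            groups[yi].append(xi)
        self.centroids = {
            label: tuple(sum(col) / len(rows) for col in zip(*rows))
            for label, rows in groups.items()
        }
        return self

    def predict(self, x):
        return min(self.centroids, key=lambda c: math.dist(self.centroids[c], x))


class GaussianNaiveBayes:
    """Naive Bayes with one Gaussian per feature per class."""
    def fit(self, X, y):
        groups = defaultdict(list)
        for xi, yi in zip(X, y):
            groups[yi].append(xi)
        self.params = {}
        for label, rows in groups.items():
            stats = []
            for col in zip(*rows):
                mean = sum(col) / len(col)
                var = sum((v - mean) ** 2 for v in col) / len(col) + 1e-9
                stats.append((mean, var))
            self.params[label] = (math.log(len(rows) / len(X)), stats)
        return self

    def predict(self, x):
        def log_posterior(label):
            log_prior, stats = self.params[label]
            return log_prior + sum(
                -0.5 * math.log(2 * math.pi * var) - (v - mean) ** 2 / (2 * var)
                for v, (mean, var) in zip(x, stats)
            )
        return max(self.params, key=log_posterior)


def cv_accuracy(make_model, X, y, folds=5):
    """Mean accuracy over `folds` folds (every folds-th sample is held out)."""
    accuracies = []
    for f in range(folds):
        test_idx = set(range(f, len(X), folds))
        X_train = [x for i, x in enumerate(X) if i not in test_idx]
        y_train = [t for i, t in enumerate(y) if i not in test_idx]
        model = make_model().fit(X_train, y_train)
        hits = sum(model.predict(X[i]) == y[i] for i in test_idx)
        accuracies.append(hits / len(test_idx))
    return sum(accuracies) / folds


# Toy data (made up): two noisy 2-D clusters, shuffled.
rng = random.Random(1)
rows = [((rng.gauss(0, 1), rng.gauss(0, 1)), 0) for _ in range(50)]
rows += [((rng.gauss(2.5, 1), rng.gauss(2.5, 1)), 1) for _ in range(50)]
rng.shuffle(rows)
X = [x for x, _ in rows]
y = [label for _, label in rows]

models = {"majority baseline": MajorityClass,
          "nearest centroid": NearestCentroid,
          "gaussian naive Bayes": GaussianNaiveBayes}
results = {name: cv_accuracy(cls, X, y) for name, cls in models.items()}

print(f"{'model':<22}{'CV accuracy':>12}")
for name, acc in sorted(results.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:<22}{acc:>12.3f}")
```

Including a trivial baseline in the table is a deliberate choice: it shows at a glance whether the other models are learning anything beyond the class frequencies.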

@adamc6821 4 years ago

Sorry to hijack this comment section, but I was wondering whether StatQuest talks about the DBSCAN algorithm in one of his videos (since I tried searching for it, but couldn't find any). TY in advance.

@statquest 4 years ago

Not yet.

@diegobuenovillafane869 4 years ago

Gotta love his songs

@statquest 4 years ago

BAM! :)

@puzzlecollector11235 4 years ago

awesome video! thanks :)

@statquest 4 years ago

Glad you liked it!

@moisesdiaz9852 4 years ago

Super interesting video! BAM!!

@statquest 4 years ago

Thanks!

@gnosisdataanalysis8595 4 years ago

Glad you liked it Moises - Ioannis did a detailed video on using JADBIO with Covid19 data that is also available as a hands-on tutorial, if you would like to try it. (ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-lHCjEmlOigc.html). You can also sign up to use JADBIO at our web site. It is a friendly trial subscription (no automatic access to your credit card)

@chloeh7119 4 years ago

Like before watching the video

@statquest 4 years ago

BAM!

@胡愉霄 4 years ago

How can I get the PPT/courseware?

@statquest 4 years ago

I've made study guides for some of my videos and they are available here: statquest.org/studyguides/

@vinusujithamichael2314 4 years ago

Please do a statquest on ANOVA

@statquest 4 years ago

I have one already. Just follow the playlist on linear models: ru-vid.com/group/PLblh5JKOoLUIzaEkCLIUxQFjPIlapw8nU

@Lotrhandler 4 years ago

Hi StatQuest! Could you please make a video on the Cox Proportional-Hazards Model? Thanks a lot!

@statquest 4 years ago

I'll keep that in mind.

@thirtysixnanoseconds1086 4 years ago

I see this as the ode45 of machine learning: incredibly powerful, but with issues around understanding the underlying system, instability, etc. I don't think it's good to have people using tools they don't understand. Machine learning models are, for the most part, black boxes, and interpretability techniques such as LIME/SHAP are not a substitute for model understanding; the feature importance of an instance isn't an understanding. We already see this kind of sausage-machine, handle-cranking mentality among electronics professionals with circuit simulation, etc. The more trust we put in a system we don't ourselves understand, and the more we use tools unknown to us, the more dangerous they are.

@statquest 4 years ago

To a certain extent, we address this at 10:46. Using a high-level programming language like Python, which takes care of memory management and other low-level things that used to be programmed by hand in assembly language, doesn't make one a bad programmer. In fact, it makes you a programmer who is in demand, job-wise. Likewise, learning high-level ML tools doesn't make you bad at ML, but you still need to know what you're doing.

@_Chafia 4 years ago

@@statquest... and still need to follow StatQuest :) Thank you Dr Starmer... Learning so much from your channel.. wish you the Best Sir.

@statquest 4 years ago

@@_Chafia BAM!

@gnosisdataanalysis8595 4 years ago

I remember when people used to say the same sort of things about bioinformatics tools. Gradually the tools and the users evolved in a way that let domain experts outside of bioinformatics use user-friendly software tools in their research. The bioinformaticians were then able to focus on much more interesting challenges. I think the same will be true for AutoML, but I agree with you: prior to that evolution, it is possible for a novice to draw erroneous conclusions, and it is the responsibility of the developers of friendly tools to provide safeguards against that naivety. #StatQuest also helps a lot!

@statquest 4 years ago

@@gnosisdataanalysis8595 BAM!

@sharan9993 3 years ago

At 12:40 he says data scientists should provide more to justify the salary. Can you explain what those skills might be?

@statquest 3 years ago

Interpreting the models, data presentation, communication skills etc.

@pankajmodi8009 4 years ago

Do we have to follow the ML tutorials?

@statquest 4 years ago

Can you elaborate on what you mean by that?

@pankajmodi8009 4 years ago

Do I need to watch or study the full machine learning playlist, or do I only have to learn AutoML? Thanks for replying. Your videos and explanations are awesome. Thank you!

@statquest 4 years ago

@@pankajmodi8009 It really depends on what you want to get out of it. At a bare minimum, you should be familiar with the general idea of ML and my four "Machine Learning Fundamentals". These are the first 5 videos in the ML section on this page: statquest.org/video-index/

@chanman1568 4 years ago

It will take the job of the data scientist

@statquest 4 years ago

I'm not sure. 90% of what we do is format/clean data, deal with outliers, etc. Only a small percentage of time is spent fitting models to the data. This makes that part easier.