Тёмный

Project 10. Credit Card Fraud Detection using Machine Learning in Python | Machine Learning Projects 

Siddhardhan
Подписаться 119 тыс.
Просмотров 284 тыс.
50% 1

Hi! I will be conducting one-on-one discussion with all channel members. Checkout the perks and Join membership if interested: / @siddhardhan Check membership Perks: / @siddhardhan
. In this video we have built a Credit card Fraud Detection system using Machine Learning with Python. For this project, we have used the Logistic Regression model.
All presentation files for the Machine Learning course as PDF for as low as ₹200 (INR): Drop a mail to siddhardhans2317@gmail.com
Enroll at One Neuron to learn from 100 courses in one subscription with 5% discount: courses.ineuron.ai/neurons/Te...
Machine Learning Projects Playlist: • Machine Learning Projects
Machine Learning Course with Python Playlist: • Machine Learning Cours...
Hello everyone! I am setting up a donation campaign for my RU-vid Channel. If you like my videos and wish to support me financially, you can donate through the following means:
From India 👉 UPI ID : siddhardhselvam2317@oksbi
Outside of India? 👉 Paypal id: siddhardhselvam2317@gmail.com
(No donation is small. Every penny counts)
Thanks in advance!
Hi guys! I am Siddhardhan. I work in the field of Data Science and Machine Learning. It all started with my curiosity to learn about Artificial Intelligence and the ability of AI to solve several Real Life Problems. I worked on several Machine Learning & Deep Learning projects involving Computer Vision.
I am on this journey to empower as many students & working professionals as possible with the knowledge of Machine Learning and Artificial Intelligence.
Let's build a Community of Machine Learning experts! Kindly Subscribe here👉 tinyurl.com/md0gjbis
I am making a "Hands-on Machine Learning Course with Python" in RU-vid. I'll be posting 3 videos per week: Monday Evening; Wednesday Evening; Friday Evening.
Dataset file: www.kaggle.com/mlg-ulb/credit...
Colab File Link: colab.research.google.com/dri...
Download the Course Curriculum File from here: drive.google.com/file/d/17i0c...
LinkedIn: / siddhardhan-s-741652207
Telegram Group: t.me/siddhardhan
Facebook group: groups/49085... Instagram: / siddhardhan23

Опубликовано:

 

8 апр 2021

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 319   
@rajatagarwal4436
@rajatagarwal4436 2 года назад
wow, this is such a fluent, smooth explainer, I am a mere beginner, and still able to understand almost everything. My teacher was not even able to explain a line of it correctly.
@saisasikanthduvvuri2209
@saisasikanthduvvuri2209 Год назад
Hello, @Siddhardhan Your presentation feed me more than I thought. Really awesome accuracy score and good EDA. Thank you for your video.
@kandrunaresh-mx9zj
@kandrunaresh-mx9zj Год назад
Hi siddhardhan..Your explaination is awesome.. keep up the good work..Nice comparision between over fitting and under fitting over accuracy.. and nice example too..
@marcoandresc.5560
@marcoandresc.5560 2 года назад
Many RU-vid videos and even courses in Spanish do not explain very well and do not cover everything necessary to do a machine learning project, but you explain it very well. I'm not very good at English but I understood the procedure very well, excellent video
@Siddhardhan
@Siddhardhan 2 года назад
Thanks a lot 😇
@Ricksanches_2001
@Ricksanches_2001 Год назад
Great work Siddhardhan, you really explained it in an amazing way
@virago8883
@virago8883 2 года назад
The way you explained was really amazing. I was able to clear all my doubts regarding this project. Keep up the good work and seriously thanks a lot for providing such a great content!!!!
@Siddhardhan
@Siddhardhan 2 года назад
thanks a lot for your positive words 😇
@nandinimadan6421
@nandinimadan6421 Год назад
when I am loading the dataset and checking the null values I am getting some,but he is not is anyone else getting this error
@musical_touch.
@musical_touch. 11 месяцев назад
​@@Siddhardhan accuracy of this prj?
@jeeruveeresh8942
@jeeruveeresh8942 11 месяцев назад
simple and nice explanation...,I didn't no machine learning. just, I know python but the way of your explanation helps me lot to understand machine learning.Thanks a lot.
@sandipansarkar9211
@sandipansarkar9211 2 года назад
finished coding practice .Feeling a lot confident
@priyankaeklaspure8163
@priyankaeklaspure8163 2 года назад
Absolutely , Stunning . Your way of explanation is too good 💥💥. Thanks for sharing !! Super excited from more projects videos ...
@Siddhardhan
@Siddhardhan 2 года назад
Thank you so much 😀
@mamondhar1823
@mamondhar1823 8 месяцев назад
Really explain step by step very easy way . Want more vdo on machine learning as Covid Detection etc
@556west
@556west Год назад
Hi, Siddhardhan "I just wanted to take a moment to express my sincere gratitude for the excellent tutorial on Credit Card Fraud Detection using Machine Learning in Python that you posted on RU-vid. Your clear and concise explanations, combined with the practical examples, have helped me tremendously in understanding the fundamentals of this complex topic. Your dedication to providing high-quality content is evident, and I appreciate the time and effort you put into creating such an informative tutorial. Once again, thank you so much for sharing your knowledge with the world - you're making a positive impact on the lives of many, including mine."
@rohitsarkar9338
@rohitsarkar9338 3 года назад
A great teacher ❤️❤️❤️❤️❤️❤️ i have ever seen in my life...who is explaining each and every line of a big project code🔥🔥🔥🔥🔥🔥🔥🔥
@Siddhardhan
@Siddhardhan 3 года назад
Thanks a ton 😇
@rohitsarkar9338
@rohitsarkar9338 3 года назад
@@Siddhardhan i can't explain you how much you help me for clearing my doubt.. that stuck into my brain for atleast a week❤️
@faizanwar_
@faizanwar_ Год назад
Very informative video...thanks for your community work....God bless 🙏
@adityamahamuni7365
@adityamahamuni7365 3 года назад
Thanks a lot! You're doing a great work. Keep it up!!
@Siddhardhan
@Siddhardhan 3 года назад
my pleasure 😇
@tahiraleem
@tahiraleem 9 месяцев назад
Thanks a lot Sid sir. Love and support from Pakistan.
@jeremyheng8573
@jeremyheng8573 Год назад
Thank you for this very informative tutorial!
@mansishrivastava7259
@mansishrivastava7259 3 месяца назад
Your explanation is really wonderful and so easy to understand
@kamaleshsenthilmurugan1561
@kamaleshsenthilmurugan1561 2 года назад
Extraordinary content! I have watched all your videos from hands on ml course to this one.Everything was explained such that even a beginner would understand it. You have a really great gift in teaching complex stuff in a easy manner. My request to you is to keep teaching like this so that you will be able to change the life of lots of people like me. I am going to recommend this channel to all my juniors and friends.
@Siddhardhan
@Siddhardhan 2 года назад
Thanks a lot 😇
@tusharkhatri5795
@tusharkhatri5795 Год назад
@@Siddhardhan why have you done train_test_split after balancing dataset wont it create problem of data leakage?
@user-ub9pd5yl7p
@user-ub9pd5yl7p 9 месяцев назад
can you suggest some key points to add this project in resume. Thanks
@nawin7789
@nawin7789 Месяц назад
man you are doing a really great job!
@sk.creations.
@sk.creations. 10 месяцев назад
great , bhai mst explain kiya
@niteshprajapat7918
@niteshprajapat7918 3 года назад
Thank You Sir, It was one of the best tutorials ever and I loved the way you explaining all lillte things in easiest way..
@Siddhardhan
@Siddhardhan 3 года назад
Thanks a lot for your positive words 😇 happy that you liked it!
@niteshprajapat7918
@niteshprajapat7918 3 года назад
@@Siddhardhan Sir, can you make a complete video on how to start ML, what are its pre-requisite and maths and mathematical intuition of all algorithm . Please Sir .Separate playlist . Because I'm getting confused what to do first ....
@Siddhardhan
@Siddhardhan 3 года назад
You can follow this playlist: ru-vid.com/group/PLfFghEzKVmjsNtIRwErklMAN8nJmebB0I
@asdad9715
@asdad9715 2 года назад
hi Siddhardhan, thank you so much for this content, your explanation is really easy to understand. listening to your step by step explanation and directly practiced it on my google colab really help me to understand it. your content helps me a lot!
@nandinimadan6421
@nandinimadan6421 Год назад
when I am loading the dataset and checking the null values I am getting some,but he is not is anyone else getting this error
@sohamnath76
@sohamnath76 Год назад
Extremely nice explanation. It's explained in such a simple way. Thank you, brother! Really helped a lot. 😁
@rohitgupta4354
@rohitgupta4354 Год назад
*Please tell me what topics of ML topics used in this project so that I can start this project after learning those topics .*
@fiz982000
@fiz982000 2 года назад
Sidhard ,you are awesome!!!!!
@durgavutla1055
@durgavutla1055 3 года назад
Amazing Videos!! Thanks for sharing to get practices on ML Projects
@Siddhardhan
@Siddhardhan 3 года назад
My pleasure 😇
@shahriarafridi
@shahriarafridi 29 дней назад
Thank You for this entire project
@igorsanjane4927
@igorsanjane4927 Год назад
Very good video and easy to understand code, thumbs up!!
@user-vk4xe7vz9k
@user-vk4xe7vz9k 11 месяцев назад
Hey, have build the same project as u teached but with different dataset & while using logistic regression im facing the ValueError: as 'Could not ocnvert string to float' what should i do?
@tanishabiswas6268
@tanishabiswas6268 11 месяцев назад
Thank You! This was very helpful
@dipakdas8887
@dipakdas8887 10 месяцев назад
You explained so well.. 😍
@OceanAlves23
@OceanAlves23 2 года назад
Hi 👨‍🎓, from Brazil/Teresina/PI. 👏👏👏
@farhabikamal302
@farhabikamal302 Год назад
Hi... I am working with the same dataset. But In my pc when I try to find info of the dataset, it shows a different amount of data than yours. I coundn't find why is it.
@ashitnayak1912
@ashitnayak1912 Год назад
Awesome mate, finally my search ends with your content.
@rohitgupta4354
@rohitgupta4354 Год назад
*Please tell me what topics of ML topics used in this project so that I can start this project after learning those topics .*
@pranay6708
@pranay6708 2 года назад
wow... very nice and detail video and smooth explanation. Please add how to inbuilt sampling technique in this video . I liked the video very much .
@vasanthkani2509
@vasanthkani2509 2 года назад
Your teaching good sir your videos very useful my project sir thank you so much sir
@femiOkaseun
@femiOkaseun 4 месяца назад
Awesome video. Thank you.
@shadowalker5467
@shadowalker5467 Год назад
sir I have a question whenever I run this part of the code: credit_card_data['Class'].value_counts() it doesn't show the exact amount of fradulent transaction for example its supposed to show 492 but in my case its showing 239 why is that sir?
@sakshigarg5996
@sakshigarg5996 22 дня назад
Thankyou so much for making this video It helped me alot😄
@yogeetakhatri4015
@yogeetakhatri4015 2 года назад
Very good explanation thank u fro the explanation
@rahulgaud4340
@rahulgaud4340 Год назад
Thankyou so much sir it helped me lot.
@user-zp7uv3vd3r
@user-zp7uv3vd3r Год назад
bro from next video onwards please add some visualizations. any way ur explanation is excellent, thank u for sharing this content to public who needs like this content.
@shaikhirfan7749
@shaikhirfan7749 2 года назад
What if i use Random forest (isolation forest algorithem)Instead of logistic regression?? With svm also bcz of the data is imbalenced and having some outliers?
@subhamsaha2235
@subhamsaha2235 3 года назад
Very nice explanation and really liked the video. The classification report is also very good measure of the model. I think if we do cross validation and use some boosting techniques then the score can be increased more and one more imp thing is that here accuracy score doesnt matters, main is precision and recall because we cant let a fraud trans to become non fraud. Thank you
@Siddhardhan
@Siddhardhan 3 года назад
nice insights. you can definitely try to do some optimizations.
@nandinimadan6421
@nandinimadan6421 Год назад
when I am loading the dataset and checking the null values I am getting some,but he is not is anyone else getting this error
@VarunKumar-ek3kr
@VarunKumar-ek3kr 2 года назад
Hey.... I got total no. Of iterations reached limit warning in logisttic regression model...... What to do to solve this
@swarnavdeb2064
@swarnavdeb2064 2 года назад
I still didn't get the part where groupby is being used for checking on the mean. Why is it determined and what does it conclude? Kindly through some light please
@rachana7044
@rachana7044 2 года назад
Hello sir, i m getting an error like 'dataframe'object not callable whn i run legit_sample= legit.sample() .could yu please help me with this...
@charanravikumar
@charanravikumar 11 месяцев назад
Great Work! but the collab link does not contain the required files like the dataset, it contains other files of sampling house
@rahulvijay6611
@rahulvijay6611 3 года назад
Awesome, very helpful
@Siddhardhan
@Siddhardhan 3 года назад
Thanks 😇
@shubhasmitanayak3495
@shubhasmitanayak3495 10 месяцев назад
Can you please tell me how can I manage to perform test of different algorithms on different dataset in single colab repository ?
@yassine.h3262
@yassine.h3262 Год назад
Great. I would like to add some remarks that might help in future projects : - The way you dealed with imbalanced data will certainly lead to bad predictions, because alot of information were lost when taking a small sample from a large dataset. - use SMOTE instead of sampling - use decision trees or random forests algos they are better when dealing with imbalanced data - use more evaluation metrics like ROC curve, F1 score, recall...
@nandinimadan6421
@nandinimadan6421 Год назад
when I am loading the dataset and checking the null values I am getting some,but he is not is anyone else getting this error
@chanlovebmx
@chanlovebmx 11 месяцев назад
@@nandinimadan6421there are no missing values from the default dataset maybe u wan to download again the data
@srinivasarao416
@srinivasarao416 2 года назад
Dear sir, I am doing my MSc. I am thinking to do my project dissertation on Credit Card Fraud Detection . Every body doing on kaggle dataset. If I do on same data set university people will say "copied" . Could you please suggest any other dataset from another resource . Thanks
@hirakhan8015
@hirakhan8015 Год назад
very good content sir. one question i have a project on this i searched for null values , in mine its showing null values from V8-Class whole. how come in urs not showing null values?
@growingfire
@growingfire Месяц назад
Thanks a lot !
@sreebvmcreation9388
@sreebvmcreation9388 Год назад
Thank u so much sir
@sailakshmi3792
@sailakshmi3792 Год назад
I just want to suggest few points before you post a project 1. Goal of the Project 2. Output of this 3. ways that it can be implemented 4. in realworld how it is checked with the user inputs I think this basic points must be mentioned in the project to be called it as a meaning full project. Anyways thanks for your contribution @Siddhardhan
@jeremycapital
@jeremycapital Год назад
Do you have an idea of how this model could be used in a real-world, for instance, in a web application with a backend in python or node js?
@kabirbhawar2045
@kabirbhawar2045 Год назад
Hey can you please give the answers to these questions
@rudroroy1054
@rudroroy1054 Год назад
@@jeremycapitalyou will have to embed the model in the server. Also you will have to include separate modules for data cleaning and preprocessing. that would include a separate data engineering vertical to your application, we use kafka for stream processing
@fun_with_AI269
@fun_with_AI269 2 года назад
What's the ultimate result out of it? What did we learn? Is there any way to find out fraudulent transactions in legit data set?
@aadarshmishra2375
@aadarshmishra2375 2 месяца назад
after CONCAT I should be getting both 492 values for each class right? But I'M getting 492 values for class 1 and 1 value for class 0. PLS HELP!
@ANURAGSINGH-nl2ll
@ANURAGSINGH-nl2ll 9 месяцев назад
Nice explanations
@rattlesstrings2729
@rattlesstrings2729 2 года назад
hi there, there are missing values in the dataset from kaggle.. pls do check
@JayashreeS-nn6rk
@JayashreeS-nn6rk Год назад
hi siddhardhan, I have an error,kindly resolve it.# training the Logistic Regression Model with Training Data model.fit(X_train, Y_train)...in this part ,I got STOP:TOTAL NO OF ITERATIONS REACHED LIMIT.How can i resolve it,
@amollute4841
@amollute4841 Год назад
This is very good video
@poornimanair8662
@poornimanair8662 Год назад
Can we do over sampling instead of under sampling in this case?
@amani4541
@amani4541 2 года назад
Have you also implemented this for neural network algorithm?
@Sai-ph5vh
@Sai-ph5vh Год назад
Hello sir, amazing lecture, but have a doubt, why did do take those numbers instead of transaction id's and how do you take those numbers
@adnanemehdaoui5487
@adnanemehdaoui5487 Год назад
we can use smote method to resolve unbalanced data, it very useful also
@sulochanakamshetty1711
@sulochanakamshetty1711 3 года назад
great job siddhardhan
@Siddhardhan
@Siddhardhan 3 года назад
thanks 😇
@s.o.r.e8362
@s.o.r.e8362 11 месяцев назад
Bro u could have hyper tune the model..and u could use f1,recall,auc/ruc , precision for checking for more accurate score of its ther
@Jayansh-it2yu
@Jayansh-it2yu 10 дней назад
can anyone tell me where the data preprocessing is done in the video?
@vinsanargeese4384
@vinsanargeese4384 10 месяцев назад
I just wanna know whether it gives the accuracy details only or detect whether card is fraud or not
@maggiewaits3261
@maggiewaits3261 Год назад
Thanks!
@rickricky7847
@rickricky7847 Год назад
I am getting an error after training data and testing data At 36:17 After fitting in model.fit(x_train, y_train) Found input variables with inconsistent numbers of sample: [197,787] Kindly help me in resolving this
@sandipansarkar9211
@sandipansarkar9211 2 года назад
finished watching
@mdmynuddin1888
@mdmynuddin1888 Год назад
Can anyone tell me about encode categorical feature where some feature have more than 10k Category. I am just work with diff dataset
@rish5591
@rish5591 2 года назад
fab!!!
@monicagullapalli6106
@monicagullapalli6106 2 года назад
Hi, amazing video! I wanted to ask that when this is tested against user input, what all inputs are required to be taken from the user? Only the amount?
@moonlight-td8ed
@moonlight-td8ed 2 года назад
adi depend on cases
@sanketgaikwad9803
@sanketgaikwad9803 Год назад
very helpful
@KarthikKannan01
@KarthikKannan01 3 года назад
great video very helpful
@Siddhardhan
@Siddhardhan 3 года назад
thanks 😇
@datharaj1370
@datharaj1370 3 года назад
Hi Siddhardhan The video is very informative and easy to implement. However, undersampling is not the optimal way to approach this problem because we are discarding almost 95% of data and just training over
@Siddhardhan
@Siddhardhan 3 года назад
good insights. I'll research more about this.
@prathikshagowda924
@prathikshagowda924 15 дней назад
hii everytime iam uploading your data there is a missing values in data any suggestions...??
@Akth1518
@Akth1518 3 месяца назад
Hi, i have a doubt. Can i work this project side by side and do everything you did, and just put it in my resume? or should i have to do something else. I'm a darta science fresher who wants to start career in data science. so could you please clarify, how to present projects in my resume
@shivanichouhan-nb9rj
@shivanichouhan-nb9rj Год назад
thanks sir
@user-ps3tl8ze6r
@user-ps3tl8ze6r 5 месяцев назад
how to process the missing values to make as a meaningfull data??
@zabashhd459
@zabashhd459 9 месяцев назад
Hi what did we conculde in the end what is the result ?
@FerinKingsly
@FerinKingsly 15 дней назад
Bro while foolwing the same codes it is showing a name error in the X_train, X_test,Y_train, Y_test = train_test_split(......)...and it shows like namerror : name 'train_test_split' is not defined. But I have givenfrom ""sklearn.model_selection import train_test_split"" in the beginning itself . Could anyone say why this error is happening to me?
@PoojaKumari-wo8qi
@PoojaKumari-wo8qi 3 месяца назад
credit card fraud detection using machine learning iska code app kha pe likh rha hu thora btyia mugha ya pura code milskti ha kya or pppt
@kushsheth4801
@kushsheth4801 2 года назад
Hi, can you give to code to plot the dataframe , to visualise the comparison of fraudulent and legit means. thank you
@user-qi8xq7jk8t
@user-qi8xq7jk8t Месяц назад
Anyone knows what the numbers in the dataset denote. How do they relate to real life transactions?
@vinaynaik953
@vinaynaik953 3 года назад
Thank you
@Siddhardhan
@Siddhardhan 3 года назад
My pleasure 😇
@user-kp4jh7zj9g
@user-kp4jh7zj9g Год назад
Video is good, but I am getting Found input variables with inconsistent numbers of samples: [787, 197] error...can anyone help me out
@DibbyasreedeyMumu
@DibbyasreedeyMumu 3 месяца назад
best sir 🥰
@yashchandak8016
@yashchandak8016 2 года назад
Great effort
@Siddhardhan
@Siddhardhan 2 года назад
Thanks a lot😇
@e321321
@e321321 3 года назад
Very informative thanks a lot. One question: What is the problem statement in this study? I mean, the aim is to evaluate the accuracy of Logistic Regression model on credit card fraud data_set scenario, right?
@Siddhardhan
@Siddhardhan 3 года назад
Hi! It's not just to find the accuracy score. We can also make predictions based on our trained model. You can refer my other project videos to learn how we can build a predictive system. I didn't do that in this video because I wanted it to be a exercise for the viewers and so they can try it out.
@e321321
@e321321 3 года назад
@@Siddhardhan Thanks for such a quick response. Okay, now I see in the Diabetes Prediction Project video you also tested your model with manually input data, highly respect your great contributions.
@Siddhardhan
@Siddhardhan 3 года назад
Thanks a lot for your positive words 😇
@practicemail3227
@practicemail3227 9 месяцев назад
you could have also tried SMOTE technique to better understand and predict.
@sreebvmcreation9388
@sreebvmcreation9388 Год назад
Hello sir , thank u so much sir, and Please upload a Video Deep learning using python .
@oguzaydiner3936
@oguzaydiner3936 2 года назад
How to deal with "ValueError: could not convert string to float " if we are working with a dataset with various data types ? For example my data has columns such as actionType which includes "transfer", "withdrawal" etc.
@renusrimorla1778
@renusrimorla1778 Год назад
I got the same error.. how to solve it?
@user-qi8xq7jk8t
@user-qi8xq7jk8t Месяц назад
Can someone explain how thr logisticregression function works? How the v1 v2 v3...... Values are defined as fradulent or not.
@nandinimadan6421
@nandinimadan6421 Год назад
when I am loading the dataset and checking the null values I am getting some,but he is not is anyone else getting this error
@prateekkukreja3524
@prateekkukreja3524 3 года назад
Hi, great work. I have a question. What is role of time feature in dataset towards prediction of fraud? Does fraudulent translation depend on time feature or we can just drop it? Hope you can answer.
@Siddhardhan
@Siddhardhan 3 года назад
hi! we cannot drop time feature. if you analyse the data more, you will get to know that the fradulent transactions are more at some time period.
@sohelimtiaz9777
@sohelimtiaz9777 2 года назад
Which algorithm u used here?
@adityams1659
@adityams1659 2 года назад
**Bro you dont use Accuracy as a metric here, you can get 99% without doing all the preprocessing you just did. Use F1 or TPR or FPR or AUC ROC!**
@youngsaint1951
@youngsaint1951 11 месяцев назад
Since he is balancing the data, accuracy metric is fine
@ItsVansh2003
@ItsVansh2003 8 месяцев назад
How is he balancing the data ?
@aryanyadav5804
@aryanyadav5804 4 месяца назад
He mentioned
@shubhamlahoti1325
@shubhamlahoti1325 3 месяца назад
Since the data is balanced, accuracy score is a good metric. If the data is not balanced then something like Sensitivity, specificity can work
Далее
I'm Excited To see If Kelly Can Meet This Challenge!
00:16
ML Was Hard Until I Learned These 5 Secrets!
13:11
Просмотров 254 тыс.
Data Analysis with Python for Excel Users - Full Course
3:57:46
PyTorch in 100 Seconds
2:43
Просмотров 882 тыс.
Create a Python GPT Chatbot - In Under 4 Minutes
3:56
Просмотров 188 тыс.
The Complete Machine Learning Roadmap [2024]
5:25
Просмотров 74 тыс.