Below is the outline of this video. First, though, a few relevant links: - Sample data and sample code: www.csdojo.io/data - Introduction to Data Visualization with Python, the full course: www.csdojo.io/moredata OUTLINE: 0:37: Why data visualization? 1:05: Why Python? 1:39: Why Matplotlib? 2:23: Installing Jupyter through Anaconda 3:20: Launching Jupyter 3:41: DEMO begins: create a folder and download data 4:27: Create a new Jupyter Notebook file 5:09: Importing libraries 6:04: Simple examples of how to use Matplotlib / Pyplot 7:21: Plotting multiple lines 8:46: Importing data from a CSV file 10:46: Plotting data you’ve imported 13:19: Using the third argument in the plot() function 13:42: A real analysis with a real data set - loading data 14:49: Isolating the data for the U.S. and China 16:29: Plotting US and China’s population growth 18:22: Comparing relative growths instead of the absolute amount 21:21: About how to get more videos like this - it’s at www.csdojo.io/moredata
CS Dojo Jupyter or Spyder? Which is your advice and can you tell me the reason why? I am a newbie, start learning code with Python for Data analysis. Thank for your answer!
@@TegaraMusic Outline - check the comment section for a clickable version: 0:37 : Why data visualization? 1:05 : Why Python? 1:39: Why Matplotlib? 2:23: Installing Jupyter through Anaconda 3:20: Launching Jupyter 3:41: DEMO begins: create a folder and download data 4:27: Create a new Jupyter Notebook file 5:09: Importing libraries 6:04: Simple examples of how to use Matplotlib / Pyplot 7:21: Plotting multiple lines 8:46: Importing data from a CSV file 10:46: Plotting data you’ve imported 13:19: Using a third argument in the plot() function 13:42: A real analysis with a real data set - loading data 14:49: Isolating the data for the U.S. and China 16:29: Plotting US and China’s population growth 18:22: Comparing relative growths instead of the absolute amount 21:21: About how to get more videos like this - it’s at www.csdojo.io/moredata
@@hdm_vision Nope only watch and click. An algorithm counts the view and click and calculate the amount of money to give. Then Google give him that money. You get 0.5 cent per ad views. Plus YT takes 60% of revenue, so you get like 0.2 cent per ad view.
Dear Dojo, It took only 22 min for you to describe what it takes almost two weeks of study. Well organized and summarised, easy to catch-up, format. Thank you.
I found this video to be absolutely satisfying to watch, due to the speaker not leaving anything to my imagination. Very clear to understand, bro - thanks!
I really loved this! You should consider making more of such short tutorials for those of us who can't afford the course on pluralsight. Thanks again! :)
Clickable Timestamps as it was not worked for me on windows chrome browser : 0:37 Why data visualization? 1:05 Why Python? 1:39 Why Matplotlib? 2:23 Installing Jupyter through Anaconda 3:20 Launching Jupyter 3:41 DEMO begins: create a folder and download data 4:27 Create a new Jupyter Notebook file 5:09 Importing libraries 6:04 Simple examples of how to use Matplotlib / Pyplot 7:21 Plotting multiple lines 8:46 Importing data from a CSV file 10:46 Plotting data you’ve imported 13:19 Using a third argument in the plot() function 13:42 A real analysis with a real data set - loading data 14:49 Isolating the data for the U.S. and China 16:29 Plotting US and China’s population growth 18:22 Comparing relative growths instead of the absolute amount
Your teaching style demonstrates the saying "student doesn't need a perfect teacher instead teacher that will inspire them to learn and to grow in learn further" Indeed you're a hallmark of a true and inspiring teacher in your field. Thank you for such a wonderful video on data analysis and virtualization. Looking forward to seeing more videos.
Thank you so much! You've just saved me a lot of time and delivered me with a lot of knowledge. Your tutorial is so clear and nice to watch. One note: I'd use the terminology of "mask" when filtering by countries. Thank you so much again!
OMG, I've been waiting for something like this!! YK, I want to be a data analyst. What are some features in python that you must know for data analyst and what are features that are okay just to learn the basics? I hope you come out with more videos on data analyst/ data science. I love your python videos, especially recursion. Recursion is hard but you made me understand it a little bit more! Thank You
Thank you so much! I would say, just the fundamental topics are important-- classes and objects, functions, etc. After that, learning statistics fundamentals is probably more important. (Maybe on sites like Khan Academy or Coursera)
0:37 : Why data visualization? 1:05 : Why Python? 1:39 : Why Matplotlib? 2:23 : Installing Jupyter through Anaconda 3:20 : Launching Jupyter 3:41 : DEMO begins: create a folder and download data 4:27 : Create a new Jupyter Notebook file 5:09 : Importing libraries 6:04 : Simple examples of how to use Matplotlib / Pyplot 7:21 : Plotting multiple lines 8:46 : Importing data from a CSV file 10:46 : Plotting data you’ve imported 13:19 : Using a third argument in the plot() function 13:42 : A real analysis with a real data set - loading data 14:49 : Isolating the data for the U.S. and China 16:29 : Plotting US and China’s population growth 18:22 : Comparing relative growths instead of the absolute amount 21:21 : About how to get more videos like this - it’s at www.csdojo.io/moredata
You can see mine too. Most of the Python tutorials in the playlist, fundamental and detailed, step by step and easy to follow. Source files can be checkef in the description of the video. There is another playlist in the channel, R Beginning course, if you are interested in.
when i type the same code, i get this kind of error and kdonw know why... TypeError Traceback (most recent call last) in () 5 plt.plot(x,z) 6 plt.title("test plot") ----> 7 plt.xlabel("x") 8 plt.ylabel("y") 9 plt.show() UserWarning: Legend does not support 'this is y,' instances. A proxy artist may be used instead. See: matplotlib.org/users/legend_guide.html#creating-artists-specifically-for-adding-to-the-legend-aka-proxy-artists "aka-proxy-artists".format(orig_handle)
for personal use 0:37 Why data visualization? 1:05 Why Python? 1:39 Why Matplotlib? 2:23 Installing Jupyter through Anaconda 3:20 Launching Jupyter 3:41 DEMO begins: create a folder and download data 4:27 Create a new Jupyter Notebook file 5:09 Importing libraries 6:04 Simple examples of how to use Matplotlib / Pyplot 7:21 Plotting multiple lines 8:46 Importing data from a CSV file 10:46 Plotting data you’ve imported 13:19 Using a third argument in the plot() function 13:42 A real analysis with a real data set - loading data 14:49 Isolating the data for the U.S. and China 16:29 Plotting US and China’s population growth 18:22 Comparing relative growths instead of the absolute amount 21:21 About how to get more videos like this - it’s at www.csdojo.io/moredata
hey i started learning python a week ago and now i know all the basics like variables,loops,functions,etc which are inlcuded in yk's 15 part series. do u think that i am learning fast enough or am i slow. i am 15 yrs old. what should i do next????
I'm not an expert, but I do have a couple of suggestions: 1. Check out some of the free content that is available for learning Python. I always suggest "Automate the boring stuff with Python", which is a great book that the author has put online for free. There are other great resources out there to help you continue learning as well. 2. Start trying to build actual working programs. This is great because not only will you learn quickly and stay motivated, but you will naturally learn as you try to figure out how to solve the problems that you will run into. It's great that you're starting young, I don't think you're learning slow! Good luck!
Hey man, lovely video. I am interested in your online course, but I find it difficult to find the price of it in the website, could you please tell me how much is it?; also, I was wondering if it came with some sort of certification that you can place in LinkedIn. Thank you!
I've used Matplotlib before, but I never understood the point of it to be honest. If the data is already in an application with the capabilities of making charts and graphs(i.e. excel), why export and do everything manually with code? Hell, excel is powerful enough to perform search and replace, as well as anything else you want to throw at it. Aside from that, businesses are spending more and more money on applications for non-technical staff to feed data into, to create stunning visuals on their own...bypassing code altogether. So honestly, why should I spend the time to learn this material?
It still goes back to where businesses are spending their money though. These applications are made for big data essentially...per end user. It changes drastically by the industry, however it doesn't change the fact that these vast applications are being released quarterly and used by many businesses. I guess I just fail to see the practicality of it. I wouldn't teach my staff python if they can have python packaged up with a front end thrown on it with nice big buttons and easy ways to make charts, trend lines, and maybe even generate infographics
Like you said is all about data... Soon data will come from all the built environment, feeding patterns which will require rapidly changing dynamic systems and new software engineering paradigms. These types of evolving systems which may need to react to the environment may need to expand or remove columns on the fly, a limitation which fixed, static systems like Excel, cannot overcome. Currently you could run a bot which could plot, say, the most popular twit on the net...Now, at the press of a button...'forever'. Try that with Excel. They are different tools, with different end users. One is for users of technology. The other for users who develop technologies.
@@cuchitp Excel has come a long way and it's great at handling dynamic data. Most people who haven't used Excel in a while are amazed when they're shown how good of a tool it is for data visualization. You're right though: Excel is not good for huge data sets. If you have to choose between learning Excel and learning Python, think first about the amount of data you'll need to analyze. Based on that, choose what to focus on. Spend 80% of your time on your first choice, and 20% of your time on the second. After a few months of consistent learning and practice, you'll figure out the best tools and workflow for your job.
If you have access to those tools ,both are great if you already know them well! Both of those require paid accounts and are usually tied to your company. As always, you should follow whatever the team has decided to use so everyone is on the same page, even if it is less efficient. The use case for using python over the other two would be if you already have a lot of python as part of your workflow. I've worked places where they use python over VBA for excel macros and there are a lot of internal tools and applications that use python. The other benefit is you can run these workflows in linux server environments really easily. Lastly, python is a lighter weight solution compared to PowerBI and Tableau. A .py file with these dependencies is much smaller and more portable than the other two. It really depends on your requirements and your use case. One is not necessarily better than the other. There is no "best data analysis tool", you just need the right tool for the job.
What is this problem? --------------------------------------------------------------------------- AttributeError Traceback (most recent call last) in 1 import pandas as pd ----> 2 df = pd.read_csv('temporal.csv') 3 df.head(10) #View first 10 data rows AttributeError: module 'pandas' has no attribute 'read_csv'
My anaconda don't, my anaconda don't, my anaconda don't want none unless you got buns hun... oh cmon you are gonna expect these kinda comments after you said anaconda. xD
Hi, im a beginner in python and i download python but it doesnt seem to open properly.. it shows just something like (Python 3.12.3 (tags/v3.12.3:f6650f9, Apr 9 2024, 14:05:25) [MSC v.1938 64 bit (AMD64)] on win32 Type "help", "copyright", "credits" or "license" for more information.)... . any help please?
Hi, Is it possible to convert below the input Customer_Name,Product_1,Price_1,Product_2,Price_2 Zayn,Milk,30,Chocolate,40 Peter,Cheese,190,Oil,80 Andrew,Coconut,10,Milk,60 Dwayne,Soya,100,Butter,120 to this output where the Product name should be ascending and it should also have its price in the next column Customer_Name,Product_1,Price_1,Product_2,Price_2 Zayn,Chocolate,40,Milk,30 Peter,Cheese,190,Oil,80 Andrew,Coconut,10,Milk,60 Dwayne,Butter,120,Soya,100 Can you please help
Hi, after executing the command ' from matplotlib import pyplot as plt ' in the jupyter notebook using anaconda , I am getting the following error - --------------------------------------------------------------------------- ImportError Traceback (most recent call last) in ----> 1 import matplotlib.pyplot as plt ~\anaconda3\envs otebook\lib\site-packages\matplotlib\pyplot.py in 34 from cycler import cycler 35 import matplotlib ---> 36 import matplotlib.colorbar 37 import matplotlib.image 38 from matplotlib import rcsetup, style ~\anaconda3\envs otebook\lib\site-packages\matplotlib\colorbar.py in 42 import matplotlib.collections as collections 43 import matplotlib.colors as colors ---> 44 import matplotlib.contour as contour 45 import matplotlib.cm as cm 46 import matplotlib.gridspec as gridspec ~\anaconda3\envs otebook\lib\site-packages\matplotlib\contour.py in 15 import matplotlib.collections as mcoll 16 import matplotlib.font_manager as font_manager ---> 17 import matplotlib.text as text 18 import matplotlib.cbook as cbook 19 import matplotlib.mathtext as mathtext ~\anaconda3\envs otebook\lib\site-packages\matplotlib\text.py in 14 from .font_manager import FontProperties 15 from .patches import FancyArrowPatch, FancyBboxPatch, Rectangle ---> 16 from .textpath import TextPath # Unused, but imported by others. 17 from .transforms import ( 18 Affine2D, Bbox, BboxBase, BboxTransformTo, IdentityTransform, Transform) ~\anaconda3\envs otebook\lib\site-packages\matplotlib\textpath.py in 9 from matplotlib.font_manager import FontProperties, get_font 10 from matplotlib.ft2font import LOAD_NO_HINTING, LOAD_TARGET_LIGHT ---> 11 from matplotlib.mathtext import MathTextParser 12 from matplotlib.path import Path 13 from matplotlib.transforms import Affine2D ~\anaconda3\envs otebook\lib\site-packages\matplotlib\mathtext.py in 25 26 import numpy as np ---> 27 from PIL import Image 28 from pyparsing import ( 29 Combine, Empty, FollowedBy, Forward, Group, Literal, oneOf, OneOrMore, ~\AppData\Roaming\Python\Python38\site-packages\PIL\Image.py in 92 # Also note that Image.core is not a publicly documented interface, 93 # and should be considered private and subject to change. ---> 94 from . import _imaging as core 95 96 if __version__ != getattr(core, "PILLOW_VERSION", None): ImportError: cannot import name '_imaging' from 'PIL' (C:\Users\Rohit\AppData\Roaming\Python\Python38\site-packages\PIL\__init__.py) I updated and reinstalled matplotlib , anaconda and jupyter notebook to their respective latest versions but it is showing the same error. It is not showing error if I run the command 'import matplotlib' but it is showing error for this command ' from matplotlib import pyplot as plt' . Also searched in Stack Overflow but still no use. Also updated python3 to the latest cersion. Pls suggest to come out of this problem. What to do?
I'm getting the following error NameError Traceback (most recent call last) ~\AppData\Local\Temp/ipykernel_43496/1751034645.py in 1 x = [1, 2, 3] 2 y = [1, 4, 9] ----> 3 plt.plot(x, y) 4 plt.show() NameError: name 'plt' is not defined
Hey!! I was isolating the data from the sample_data such as it will only have the second row with 4 in column_b but the output did not come out as expected. Can someone please tell what is wrong here? --> c1= sample_data[sample_data.column_b == '4'] output: c1 column_a column_b column_c
If I want to import my data, jupyter compresses all my data into one column instead of the 3 columns I have in my original csv file. How can you fix that?
Thanks YK for sharing your time and knowledge! But what does this mean ? My data doesn't look like yours... it isn't Linear. I tried to use my own data.
Matplotlib sucks. Use ggplot in R instead It's way easier, has more intuitive syntax. I use Python and pandas for more heavy calculations and then plot everything in R
I downloaded Thonny IDE and could follow every step with no problems and plot graphs. Just needed to use print function of course and download the particular libraries.
i am getting this error while performing Histograms can u please help me out C:\Users\User\Anaconda3\lib\site-packages\pandas\core\ops.py:1649: FutureWarning: elementwise comparison failed; returning scalar instead, but in the future will perform elementwise comparison result = method(y)
Hi, what if I'm working on multiple files. How do I do something like vlookup function in Excel? I've been doing analysis on Excel for about 4 months. I'm having a hard time when doing analysis for multiples 350++mb file size. It's slowing me down. Is python able to run calculation faster for big data from multiple Excel sheets?
Try INDEX MATCH it's supposed to be faster than vlookup. On my training course we had a big excel sheet and it went from lagging with vlookup to do the calculation in secs for index match (can't really remember)
Command errored out with exit status 1: python setup.py egg_info Check the logs for full command output whenever i try to install the module visualization in pycharm through cmd in windows 10..it gives me the above error.. please help me sir how could i fix this error..need your help and suggestion..Thank you
Hey sugi, just want to inform you that I was going through your course " Introduction to Data Visualization with Python" in Pluralsight. But at the time of creating the histogram, I am unable to proceed because of the unavailability of the extended "Countries.csv" file. Can you please help me where can I get that file?
I got the plot stuff to work, but when I try to use read_csv('sample_data.csv') my program just sits there and does nothing. No errors, just hangs up doing nothing.
Forgive my stupidity, but why would you do that in Python if you can do it on Excel? I still have a hard time to understand the "practical application" of it clearly... but I'll keep watching some videos on your channel, you're legit!
How is this better than excel pivot table? It also presents graph charts, summarizes from different row/column, and view data from different angle. Can someone explain to me why data visualization analyst uses Python instead of excel? I'm in learning Python and i really hope that it's worth the time. Thanks!
How is this better than excel pivot table? It also presents graph charts, summarizes from different row/column, and view data from different angle. Can someone explain to me why data visualization analyst uses Python instead of excel? I'm in learning Python and i really hope that it's worth the time. Thanks!
Why not just used excel? For instance, I could reproduce every graph on this video with excel...what more can I do with python? Is it because you can manipulate data outputs much more when using python and in a faster manner? I'm pretty knowledgeable with excel, and only really ran into problems when the data I used exceeded what excel supports (had to use powerpivot). How could I benefit from using python when confronted to large amounts of data or what benefits does python bring to data visualization compared to a program like Excel?
Very nice video. Question How can I plot multiple xksx spreadsheets that has same data on x and y on same plots automatically. A.xlsx B.xlsx C.xlsx ... N.xlsx I want to plot same column x and column B on the same graph and move to same x and colum c and so forth Thanks
Hi CS Dojo I would like to use my python programming skills to work in water resource engineering system, which I would like if you can mentor me through this. I would be glad if I can get your email
Excuse me, how do you use indexes instead of labels? You always know the column label before hand (.year, .country, and so on). But it isn't flexible this way.
Can someone tell me why when I try putting sample_data = pd.read_csv('sample_data.csv') it returns that the name 'pd' is not defined even through I wrote the first line that should have defined what pd is?
Superb, your video is very practical and easy to understand, best part is that I have reached 15 minutes of this video but couldn't resist till end to write the comment, moreover your voice and language is very descent. Good work ....
Hi there ... amazing video By the way ... i installed pandas on my anaconda but while doing the import step, i’m getting: no module named pandas ... any solution? .... thanks
Hi man! I have a question. I have a model that produces an output in csv. The columns are as follows: | Car | Price | Year | The car column has different car manufacturers for example, with an average car price for each year in column 'Year'. Example | Car | Price | Year | | BMW | 34000 | 1990 | | BMW | 35000 | 1991 | | BMW | 36000 | 1992 | | BMW | 37000 | 1993 | | AUDI | 32000 | 1991 | | AUDI | 33500 | 1992 | | AUDI | 34000 | 1993 | | AUDI | 35500 | 1994 | | SEAT | 25600 | 1994 | How can I plot the average prices for the years where I have given data, in an area chart, if the names of the different cars and their prices for given years are displayed in 3 columns like this? The total model period time is 20 years, however, BMW may lack prices for some of these years. I would like the data to be a stacked area chart showing the values for each car for each corresponding year that the price is available (between for example 1995-2015). This is just an example for simplification, but the layout of the actual data is the same. Would much appreciate some help on this one!
I followed the tutorial and I was able to plot the x and y fine until I tried to put the title and and x/y label. It gave me error and said the title str is not callable
On that line graph, how can we use logo at the end of this line. Suppose if we want to use Facebook and Twitter logo, small size or even how can make a circle and write"FB" or Twitter like this at the end of each multiple line.
Hallo, thanks for this video amazing video,it really helped me a loot,i just have a question if i want instead of having true or false a 0 and 1 how should i do? for true and false you used "data"
please help me to solve this question 1. Investigate the effects of age-group on attainment and engagement 2. Present the results of the age-group investigation using an appropriate visualisation 3. Investigate the effects of engagement on qualification 4. Test the hypothesis that there is a significant effect on qualification
this is what i came up with def mul(x,z,a,b): print(list(range(a,b))) total=0 total2=0 newval=0 for i in range(a,b): if i % x ==0: total = total + i if i % z ==0: total2 = total2 + i newval=sum(total,total2) return(newval) print(mul(3,5,1,100)) Ans 2633
thank you very much sir, your tutorial is very under stable, i want to work with tweepy in Juypter notebook ,plz help me for this because i am so confused about that how to import data in jupter notebook using tweepy API
Bro this isn't the video im even looking for I have a huge test on Monday my basics are trash wtf am i doing to do the test is in 2 days!!!! I can feel my soul dying send help