Тёмный

Pivot tables with Pandas 

Python and Pandas with Reuven Lerner
Подписаться 19 тыс.
Просмотров 37 тыс.
50% 1

What are pivot tables? How do they work? And how can you create them using Pandas? In this video, I explain why they're such useful tools in data analytics, and how we can easily create them with Pandas.
As usual, the Jupyter notebooks for my videos are on GitHub: github.com/reuven/RU-vid-not...
Don't forget to subscribe to my free, weekly "Better developers" newsletter, at BetterDevelopersWeekly.com/!

Наука

Опубликовано:

 

9 мар 2022

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 85   
@YossiSoberano
@YossiSoberano 2 года назад
Thanks Reuven, for another informative and educational video 👍
@reygaji4001
@reygaji4001 Год назад
Omg, you are a godsend. Where have you been all this time when I was just starting to learn this stuff T_T Your videos are better than many python tutorials out there with millions of views damn
@ReuvenLerner
@ReuvenLerner Год назад
I'm delighted to hear that you enjoyed it! Thanks so much for your kind words.
@joseluisbeltramone599
@joseluisbeltramone599 Год назад
Very cool explanation. Thanks a lot!
@ReuvenLerner
@ReuvenLerner Год назад
Glad you enjoyed it!
@angelhanna2917
@angelhanna2917 Год назад
Easy to understand, thanks!
@ReuvenLerner
@ReuvenLerner Год назад
Glad it helped!
@mcllen
@mcllen Год назад
There was a great class, Thanks!
@ReuvenLerner
@ReuvenLerner Год назад
Glad it helped!
@Kaelthas_Sunstrider
@Kaelthas_Sunstrider Год назад
You’re an amazing teacher! I’m enjoying your book Python Workout :)
@ReuvenLerner
@ReuvenLerner Год назад
Thanks so much for your kind words, and for your support! (And hey, if you have a few moments to review the book on Amazon, that's always highly appreciated...) More is coming soon, BTW, with Pandas Workout!
@Kaelthas_Sunstrider
@Kaelthas_Sunstrider Год назад
@@ReuvenLerner I’m glad you mentioned. Just got it via early access!
@thomasgyting3251
@thomasgyting3251 5 месяцев назад
I was having a difficult time understanding pivot tables and what exactly they are vs regular tables, and your example at the end that included multiple agg functions made the lightbulb go off! Love your content!
@ReuvenLerner
@ReuvenLerner 5 месяцев назад
So happy to hear it; thanks so much!
@selim.vidangos
@selim.vidangos 9 месяцев назад
Thank you! Useful video
@ReuvenLerner
@ReuvenLerner 9 месяцев назад
I'm so glad to know it helped!
@helloworld2740
@helloworld2740 2 года назад
great video sir
@user-uu5sc1hf9p
@user-uu5sc1hf9p 10 месяцев назад
thanks Reuven
@ReuvenLerner
@ReuvenLerner 7 месяцев назад
My pleasure!
@kaiserkonok
@kaiserkonok Год назад
Thank you so much🖤. You are doing amazing for us. We would be more happy if you post videos on machine learning with python. 🙏
@ReuvenLerner
@ReuvenLerner Год назад
I'll try to do some machine learning stuff when I can, but I've got a backlog of other ideas already...
@kaiserkonok
@kaiserkonok Год назад
@@ReuvenLerner Thank you so much
@33samogo
@33samogo Год назад
Great explained! .pivot() is amazing, it's exactly what I was looking for, it's like y axes in "xy coordinate table", fantastic!
@ReuvenLerner
@ReuvenLerner Год назад
Delighted it helped!
@33samogo
@33samogo Год назад
@@ReuvenLerner Thanks Reuven, I have one more question, how to create a list of lists from pivot table? Is there a function like DataFrame.iterrows() for pivot table?
@ReuvenLerner
@ReuvenLerner Год назад
@@33samogo A pivot table is a data frame, so all of the methods you know and love from data frames will also work there. So yes, you can use iterrows, but remember that when you turn a data frame into a list of lists, you're moving data from NumPy's compact and efficient storage to regular Python objects, which are neither compact nor efficient.
@33samogo
@33samogo Год назад
@@ReuvenLerner Thank you! I struggle with data-frames for whole day, especially with multiindexing, I am trying to interpolate pivot table rows, that's why I need a list of row values to create a new interpolated one, but for sure there is a way to interpolate a whole pivot data-frame in to a tubular data data-frame also?!
@33samogo
@33samogo Год назад
Pandas documentation : "Please note that only method='linear' is supported for DataFrame/Series with a MultiIndex." I need to apply quadratic interpolation, so pivot table is useful for me just to reshape the data-frame values, also I am not sure if data-frame length can be automatically adjusted during interpolation operation according to number of output values (from 20-->~1000). So, I know how to retrieve a row values from pivot table, change a type of values, apply interpolation and save values in list variable, now I need to automate the process for all rows in the pivot/data-frame table and save all list variables as data frame using for loop or comprehension. What are the steps to perform this operation in a "clean" way or what functions should I consider? I started with Pandas a week ago just for fun but now I'm fell in love with it, thank you.
@pramishprakash
@pramishprakash Год назад
very helpful sir
@ReuvenLerner
@ReuvenLerner Год назад
Glad it helped!
@mosama22
@mosama22 4 месяца назад
Thank you so much for the beautiful videos, I just wish you had a direct link to the data please, so we can follow up with you. Thanks again though for the beautiful videos 🙂
@ReuvenLerner
@ReuvenLerner 4 месяца назад
Glad you are enjoying them! most of the data is (should be!) available from files.lerner.co.il/ .
@mohemmedkamal
@mohemmedkamal 2 года назад
thank you
@mario1ua
@mario1ua 5 месяцев назад
Hey Reuven, great tutorial, thanks! I've also watched your pivot and **kwargs videos and they were awesome too
@ReuvenLerner
@ReuvenLerner 5 месяцев назад
I'm delighted to hear you enjoyed them! More are (hopefully) coming soon.
@AlexKashie
@AlexKashie 6 месяцев назад
Thank you
@ReuvenLerner
@ReuvenLerner 6 месяцев назад
My pleasure!
@rajeshn5006
@rajeshn5006 Год назад
excellent
@ReuvenLerner
@ReuvenLerner Год назад
Glad you enjoyed!
@bhargavkongara2003
@bhargavkongara2003 Год назад
Thank u Sir
@ReuvenLerner
@ReuvenLerner Год назад
Glad it helped!
@Darkev77
@Darkev77 11 месяцев назад
Great, thanks for explaining it so concisely. However, why would I use a pivot table when I can also group by multiple columns (group by year and team: `groupby(["Year", "Team"], observed=True)`)?
@ReuvenLerner
@ReuvenLerner 7 месяцев назад
Good question! When you group by multiple columns, you get a multi-index as the result. A pivot table takes one dimension of the multi-index and turns it into columns. So instead of a single column of results with a multi-index on the rows, you get a single column of results with a simple index on the rows *and* a simple index on the columns. Same information, presented differently, which can be easier to understand and process.
@Darkev77
@Darkev77 7 месяцев назад
@@ReuvenLerner brilliant, thanks a lot!
@izzy8655
@izzy8655 7 месяцев назад
Thanks
@ReuvenLerner
@ReuvenLerner 7 месяцев назад
Glad it helped!
@rafaelsantana5808
@rafaelsantana5808 Месяц назад
Hello, I would like to know how I can make it count in a column at the end all columns that have values. example: count = ID1 + ID2+ ID3
@KonradTamas
@KonradTamas 10 месяцев назад
One question, do I always need Categorical data to use pivot tables ? Thanks!
@ReuvenLerner
@ReuvenLerner 10 месяцев назад
The answer, of course, is "yes and no." You don't *need* categorical data to do a pivot table, just as you don't *need* categorical table to do a groupby. But think about what's happening in such cases: Every unique value in the column will be a separate row (or column) in the pivot table. If you have only a handful of integers or floats, then that's fine. But if you have several hundred, or several thousand, then you'll end up with a pivot table that doesn't serve anyone well. So as a general rule, we say that you should only create pivot tables with categorical data, even though you could well have numeric data that's so limited in the distinct values that it'll work just fine.
@jingchun21
@jingchun21 6 месяцев назад
Thank you so much! I am curious that can we just show a few columns we want? For example, for the sports, I just want to show baseball, badminton. Thanks.
@ReuvenLerner
@ReuvenLerner 6 месяцев назад
If you only want a few columns, then you have at least two choices: (1) filter the rows before you create the pivot table, or (2) filter the column after the pivot table is created with [[ ]] . I would recommend doing the first, since it reduces the amount of time and memory needed to create the pivot table in the first place.
@joshux3210
@joshux3210 Год назад
In the pivot_table method, you don't specify you want to calculate the mean. It auto assumes that? Can I use other calculation such as median or max?
@joshux3210
@joshux3210 Год назад
Ok I watched to the end: it's by using the 'aggfunc' parameter
@ReuvenLerner
@ReuvenLerner Год назад
​@@joshux3210 Yup! Pretty amazing, right?
@real.samad_
@real.samad_ 28 дней назад
Hello Reuven, can I use smally portion of this video for an instagram reel I am working on?
@ReuvenLerner
@ReuvenLerner 28 дней назад
Maybe -- it depends on the context and what you're using it for. Feel free to e-mail me (reuven@lerner.co.il) to discuss this further.
@AnjanBasumatary
@AnjanBasumatary Год назад
How to display the all the rows and column cobination even values are missing in those combination?
@ReuvenLerner
@ReuvenLerner Год назад
If values are missing from that row-column combination, you'll get a NaN value.
@AnjanBasumatary
@AnjanBasumatary Год назад
@@ReuvenLerner Actually, in my case both rows and columns doesnot appear if those rows and columns combination doesnot have any value... tried usi g dropna parameter but no use
@Amber-rh8dh
@Amber-rh8dh 5 дней назад
Please share the datasource too, if possible...
@ReuvenLerner
@ReuvenLerner 4 дня назад
Data for most of my demos are from files for Pandas Workout, at files.lerner.co.il/pandas-workout-data.zip .
@inderjeetchandnani302
@inderjeetchandnani302 Месяц назад
At time 3:59 if we have a date data, will it work?
@twocentswithdeb
@twocentswithdeb 2 месяца назад
How can I rename the aggregated columns?
@ReuvenLerner
@ReuvenLerner 2 месяца назад
It's a data frame, so you can use the "rename" method to change the column names.
@hieuthuan1999
@hieuthuan1999 Год назад
Thanks @Reuven. I wonder if it is possible to create pivot table with more than one index (row field), and more than 1 value, such as height and weight in average. Excel can do it easily :)
@ReuvenLerner
@ReuvenLerner Год назад
Yup, you can definitely do that! Instead of passing a string for the index or column, pass a list of strings. That'll create a pivot table with a multi-index on one or both axes. It's pretty straightforward - give it a shot!
@proud_indian0161
@proud_indian0161 2 месяца назад
From where can i get this athlete dataset?
@ReuvenLerner
@ReuvenLerner 2 месяца назад
It's in the data set for my book, Pandas Workout, at files.lerner.co.il/pandas-workout-data.zip
@marvinlomo5845
@marvinlomo5845 2 года назад
Thanks Reuven. Is this covered in the Book?
@ReuvenLerner
@ReuvenLerner 2 года назад
Yes, one exercise looks at pivot tables specifically, and a few ask you to create them based on different data sets.
@aakashniture
@aakashniture 5 месяцев назад
Hi Reuven, amazing video. After pivoting a column when you get that column's distinct values as new columns. I want to select those columns or save them in a new df but i can't. I am simply just using names of new columns e.g., df['Afghanistan'] but it gives error even though in df.shape() it says Afghanistan as a column. Kindly help.
@ReuvenLerner
@ReuvenLerner 5 месяцев назад
Hmm, I'm not sure. The result of pivot_table is a new data frame, so you should be able to say pivot_df = df.pivot_table(...) and then retrieve whichever columns you want. I'm guessing there's a syntax bug somewhere.
@aakashniture
@aakashniture 5 месяцев назад
@@ReuvenLerner my column names were numeric.. i pivoted Years column so new column names were 2019,2020,2021. I was selecting them like table['2022'] and was getting a Key error for 2022. I removed apostrophe and tried df22 = table[2022]. And that worked. I did not know that if the column name is in numeric ' ' is not used. Thanks for the reply Reuven
@ReuvenLerner
@ReuvenLerner 5 месяцев назад
@@aakashniture Right, if your column names are integers, then you can't use quotes around their names. I'm glad that you were able to figure it out!
@ravindrareddy5478
@ravindrareddy5478 Год назад
is there any way to merge empty cells using python in a pivot table
@ReuvenLerner
@ReuvenLerner Год назад
I'm not quite sure what you mean by this. Can you give me a short example?
@ravindrareddy5478
@ravindrareddy5478 Год назад
@@ReuvenLerner like I'm having a pivot table in which some cells I'm between the rows are empty, so I want merge those cells with nearest value
@guillegonzalez6146
@guillegonzalez6146 9 месяцев назад
blud i love ya
@ReuvenLerner
@ReuvenLerner 9 месяцев назад
Thanks so much! Delighted to have you here with me.
Далее
Selecting columns when reading a CSV into pandas
5:17
How do I use the MultiIndex in pandas?
25:01
Просмотров 172 тыс.
СПРАВКА ДЛЯ УНИВЕРА
00:44
Просмотров 348 тыс.
Incredible magic 🤯✨
00:53
Просмотров 2,9 млн
Pandas Pivot &  Pivot table Difference
10:05
25 Nooby Pandas Coding Mistakes You Should NEVER make.
11:30
The six most important read_csv arguments in Pandas
16:50
Flipping Data with Pandas: Stack & Unstack
8:17
Просмотров 2,4 тыс.
Сводные таблицы в pandas
17:28
Просмотров 6 тыс.
Stack, Unstack, Melt, Pivot - Pandas
10:23
Просмотров 39 тыс.
My top 25 pandas tricks
27:38
Просмотров 265 тыс.
You Won't Believe These Crazy PIVOT TABLE Hacks!
11:30
Просмотров 672 тыс.
Automate Excel Pivot Table With Python
24:03
Просмотров 25 тыс.
Это спасёт камеру iPhone
0:32
Просмотров 433 тыс.
ОБСЛУЖИЛИ САМЫЙ ГРЯЗНЫЙ ПК
1:00