Foundations of Data Visualisation - Computerphile 

Computerphile
2.4M subscribers
73K views

Following a look at 'Sensemaking' Associate Professor Dr Kai Xu delves into some more tricks of the visualisation trade.
Kai's presentation:
docs.google.com/presentation/...
/ computerphile
/ computer_phile
This video was filmed and edited by Sean Riley.
Computer Science at the University of Nottingham: bit.ly/nottscomputer
Computerphile is a sister project to Brady Haran's Numberphile. More at www.bradyharan.com

Published: 12 Apr 2023

Comments: 83
@patton72010 1 year ago
"Mr. Anderson, you are the expert in all matters related to drawing red lines. We need you to draw seven red lines. All of them strictly perpendicular, some with green ink and some with transparent. Can you do that?"
@3zrv 1 year ago
*nervously sweats with 2 perpendicular sweat drops on his forehead and the 3rd drop doesn't know where to go*
@JinKee 1 year ago
When you inflate the balloon, can you do it in the shape of a kitten?
@patton72010 1 year ago
@phandao5404 You are talking about a different Mr Anderson lol
@arnoldbr8418 1 year ago
yo that's a swastika?
@Aziqfajar 1 year ago
Can't wait to have my data visualized with electric shock intensity! ❤
@alphgeek 1 year ago
Let me run that past the ethics committee for a sec.
@moralboundaries1 1 year ago
If we're talking about visualizations, that must mean the electrodes will be hooked up to our eyeballs. Extra FUN!
@TheAgentOfDeath 1 year ago
Thanks for the free, high-quality education.
@F_L_U_X 1 year ago
Welcome.
@Tiwo1991 1 year ago
When he mentioned marks and channels with the specific examples, I immediately thought of the analogy of how these are used in cartography and the choices made there.
@mattgenovese 1 year ago
I would consider data visualization as closer to the discipline of User Experience design.
@moralboundaries1 1 year ago
This is the other kind of graph theory.
@MaeLSTRoM1997 1 year ago
To anyone who is interested in data visualization: I highly recommend the five books on data visualization by Edward Tufte, particularly the first one, "The Visual Display of Quantitative Information." He is the founding figure of the field of data visualization and his books are very interesting and pleasant to read.
@leogama3422 3 months ago
thanks!
@willemrood 1 year ago
Really interesting stuff. I'm very surprised to see that using color lum/sat for highlighting magnitude is considered worse than marker size. I mean, it makes sense that without a colorbar it is very difficult to say what a given saturation/transparency corresponds to in an absolute or relative sense. So I feel like when utilizing those, a colorbar is mandatory. I'd be interested to see Dr. Xu's take on how to visualize high-density datasets. Because when you have a set of n=1e6, a scatter plot that uses area to denote magnitude will not really be usable due to the markers overlapping or being too small to be visible. I'm expecting that at some point you have to shift from using scatterplots to porkchop plots and so on. Would be nice to see something of an overview between data set size and plotting formats!
@xbzq 1 year ago
RGB on a computer screen is a super poor way to refer to saturation. HSV and HLS are super poor derivatives. If you do not understand color spaces and vision in depth you shouldn't be talking about levels of saturation because that is a very complex topic and knowing only RGB values and your run of the mill Photoshop color pickers will lead you only to talking a lot of pseudoscientific nonsense that misleads people. To be more accurate it is necessary to talk about the Lch, Luv and L*a*b color spaces. Then you can use words like "twice as saturated" and have it actually mean something. Luminance is also fraught with peril. Something twice as bright doesn't look twice as bright. When printing on paper or viewing on a monitor there's a maximum brightness but this isn't true in general. Twice the number of lightbulbs will give it twice the brightness but it doesn't look it. It's all a bit too complicated for this comment but suffice it to say that there're not many people that understand it yet there's a large percentage that really think they know when in fact they do not.
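As a rough illustration of the point above that twice the luminance does not look twice as bright, the sketch below applies the standard CIE 1976 L* lightness formula to a few relative-luminance values. The function name `cie_lightness` and the sample values are illustrative choices, not something taken from the video or the comment.

```python
def cie_lightness(y_rel: float) -> float:
    """CIE 1976 lightness L* (0..100) from relative luminance Y/Yn (0..1)."""
    delta = 6 / 29
    if y_rel > delta ** 3:
        # Cube-root region: used for all but very dark values.
        f = y_rel ** (1 / 3)
    else:
        # Linear segment near black.
        f = y_rel / (3 * delta ** 2) + 4 / 29
    return 116 * f - 16


# Each step doubles the luminance; compare how L* changes.
for y in (0.1, 0.2, 0.4, 0.8):
    print(f"Y = {y:.1f} -> L* = {cie_lightness(y):.1f}")
```

Under this model each doubling of luminance raises L* by far less than a factor of two, which is one reason "twice as bright" needs a perceptual color space before it means anything.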
@willemrood 1 year ago
@xbzq Yeah exactly. I completely agree. To obtain something absolute from "a color" is incredibly sensitive and depends on way more factors than you'd expect. It becomes even worse when you use colormaps that vary multiple channels.
@lens07 1 year ago
I can do another video if there is enough interest about visualising large datasets (I assume this is what you mean by 'high density'). A bit of a spoiler: there is no silver bullet for large datasets, unfortunately.
@xbzq 1 year ago
@lens07 How do you define "saturation" that allows you to say that one area has twice the saturation of another?
@willemrood 1 year ago
@lens07 Yeah, although what I mean by high density is quite specific. So I'm not too sure about the wording. When doing optimization problems, the results should converge to the optimum. So what happens is that your distribution of markers is quite dense around the optimum. Which doesn't leave a lot of space for varying marker size (and thus the possibility to distinguish between the individual markers). Which makes me always opt for varying color (however a porkchop is superior of course). Hopefully that clears the "high density" part up a bit. Thanks for the reply!
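One common way to cope with the overplotting described in this thread is to bin the points into a grid and encode the count per cell rather than drawing every marker. The sketch below is only a minimal pure-Python illustration of that idea, with a hypothetical `bin_points` helper and synthetic Gaussian data standing in for results clustered around an optimum; it is not Dr. Xu's recommendation, and as noted above there is no silver bullet for large datasets.

```python
import math
import random
from collections import Counter


def bin_points(points, cell=0.25):
    """Count how many (x, y) points fall into each square grid cell of width `cell`."""
    return Counter(
        (math.floor(x / cell), math.floor(y / cell)) for x, y in points
    )


# Synthetic stand-in for results that cluster densely around one location.
rng = random.Random(0)
points = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(10_000)]

counts = bin_points(points)
densest_cell, densest_count = counts.most_common(1)[0]
print(f"{len(counts)} occupied cells; densest cell {densest_cell} holds {densest_count} points")
```

The per-cell counts could then be mapped to a color scale (with a colorbar) instead of marker area, which sidesteps overlapping markers at the cost of losing individual points.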
@Nafrodite 1 year ago
dr xu has impeccable drip
@squishmastah4682 1 year ago
I knew I liked Dr. Xu early on, but that sensation only magnified as time went on; much like electroshock.
@Hatamoto95 1 year ago
Computerphile invites data visualisation expert, films his presentation from a distance on a tiny screen
@jamesusespivot 1 year ago
Idk if you were just making a joke or if you actually want to see the presentation. But if so, it's in the description.
@sanketdutta4981 1 year ago
Really informative video! Can someone please name the books on screen between 0:25 - 0:47? I can recognize Tufte's classic from a mile away but I can't see the other two. One of them is surely a Springer handbook, not sure which one.
@AF-lt2fr 1 year ago
The Visual Display of Quantitative Information - second edition
The Grammar of Graphics - second edition
Visualization Analysis and Design
@Computerphile 1 year ago
Apologies: photos.app.goo.gl/brmCVQYgFked85kx8
@sanketdutta4981 1 year ago
@Computerphile Thanks a lot
@sanketdutta4981 1 year ago
@AF-lt2fr Thank you
@AF-lt2fr 1 year ago
@sanketdutta4981 no problem - I ended up getting it by changing the video quality under advanced to 4k and zooming in.
@Pedritox0953 1 year ago
Great video!
@andrewnemov 1 year ago
Could you please provide the names of the books at the beginning of the video?
@kenakins 1 year ago
Can you guys do a video on the new SLP bug CVE-2023-29552? I think it would be really interesting and would love to hear your professional takes on it!
@nervous711 1 year ago
So it's about categories and magnitude, and how you should represent them depends on how accurate you want them to be. But what about relation, trend, and connection intensity?
@pengain4 1 year ago
Is it possible to get the original presentation somewhere?
@31b41a59l26u53 1 year ago
I also would like to have the slides.
@jamesusespivot 1 year ago
It's in the description.
@computer_science_in_depth 1 year ago
good video and explanation
@MrKrock164 1 year ago
Wouldn't a pie chart (area, 0.7, underestimated) be more reliable than a straight line (length, 1, normalized)? I think there are more tricks like that to improve visualization for better precision of the estimated value.
@idaho777 1 year ago
I think pie charts utilize a mix of area and length. The example in the video with squares compares 2 geometrically similar squares with different areas. A pie chart's slices are not geometrically similar as their area changes; instead the arc lengths change (since the radius is fixed by the bounds of the entire pie). I'm assuming the study comparing areas used uniform scaling. We could change the visualization to compare a square with a version scaled along one dimension only (a rectangle), because then you can compare side lengths (a linear term), or display the numerical values inside the marker.
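The figures quoted in this thread (length 1, area 0.7) read like exponents in Stevens' power law, where perceived magnitude grows as the stimulus raised to a channel-specific exponent. Assuming that interpretation, the sketch below shows how a true 2x difference would be perceived through each channel, and how small the change in side length is when a square's area doubles under uniform scaling. The function names are hypothetical and the exponents are taken only from the comment above.

```python
import math

# Exponents as quoted in the comment thread (assumed to be Stevens' power-law exponents).
EXPONENTS = {"length": 1.0, "area": 0.7}


def perceived_ratio(actual_ratio: float, exponent: float) -> float:
    """Perceived ratio between two stimuli whose true ratio is `actual_ratio`."""
    return actual_ratio ** exponent


def side_ratio_for_area_ratio(area_ratio: float) -> float:
    """Side-length ratio of two similar squares whose areas differ by `area_ratio`."""
    return math.sqrt(area_ratio)


for channel, exponent in EXPONENTS.items():
    print(f"{channel}: a true 2x looks like {perceived_ratio(2.0, exponent):.2f}x")

print(f"doubling a square's area scales each side by {side_ratio_for_area_ratio(2.0):.3f}")
```

With an exponent below 1, a doubled area is perceived as less than double, which matches the "underestimated" label in the original question.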
@rugbybeef 1 year ago
No, people's perceptions of triangular areas or sectors (pie slices) are unreliable, especially for rotated shapes, where they rely on the viewer's ability to estimate angular displacement. If instead of percentages of the total they display raw count data, say 9,720 votes, 9,000 votes, 8,280 votes, 7,200 votes, and 1,800 votes, you would have trouble recognizing that these are 27%, 25%, 23%, 20%, and 5% respectively of a total of 36,000 votes. A linear graph with four closely spaced but separate marks at each raw vote count and another very near 0 would show the differences of 720 votes between each of the first three, 1,080 votes to the fourth, and the very wide gap to the last 5%. You may even notice that the gap between the 3rd & 4th values is wider than those between 1st & 2nd and 2nd & 3rd, which are of equal width.
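The arithmetic in this comment can be checked directly. The short sketch below recomputes the percentage shares and the gaps between consecutive counts from the raw vote figures given above; the variable names are illustrative.

```python
# Raw vote counts as given in the comment above.
votes = [9720, 9000, 8280, 7200, 1800]

total = sum(votes)
# Share of the total for each candidate, as a whole-number percentage.
shares = [round(100 * v / total) for v in votes]
# Gap in votes between each count and the next lower one.
gaps = [a - b for a, b in zip(votes, votes[1:])]

print("total:", total)
print("shares (%):", shares)
print("gaps:", gaps)
```

The gaps are what a position-on-a-line encoding exposes directly, whereas a pie chart asks the viewer to recover them from angles.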
@chinobambino5252 1 year ago
Yes, as someone in a field where visualization of data is very important (biology), I have been told to always steer clear of pie charts. Honestly I've never seen someone use one in a talk, and I think there would be snickers from the crowd if they did.
@jmasterX 1 year ago
great video thanks so much!!!!!!!!!!!!
@davidmorrison7742 1 year ago
... but my boss wants 3D pie charts and 3D stacked bar charts. Basically, add 3D to everything.
@gabrote42 1 year ago
Cool video. I wonder if you are preparing one about the GPT-4 pause
@HighMansx 1 year ago
I was quite shocked that they were all 2x darker, longer, and larger! I had guessed 2.5, 2.5 and 2!
@tronster 1 year ago
Overall good talk. Disappointed in tinting most everything green when showing the chart for the Magnitude Channel in the rows for "Color luminance" and "Color saturation" as well as for the Identity Channel for the row "Color hue"; these should not have been tinted to all be green.
@thomash4810 1 year ago
Cool video
@cmuller1441 1 year ago
Can someone add captions (not automatic)?
@thirdcoffee 1 year ago
Thanks for the great video. One question... this laptop looks amazing. Is it a macbook or a windows machine? Which one? Does anyone know?
@matbronk1 1 year ago
The experiment was a bit biased towards giving a different answer for each one, I'd say. Having three things to judge and three judgements to use may cause you to think you have to use all three judgements once
@hanswoast7 1 year ago
Yep, but can you visualize it?
@bscutajar 1 year ago
I wonder why they used voltage with electric shock and not power, since it would make sense that pain is proportional to power.
@griggiorouge 1 year ago
genius stuff.
@goopytoobers9397 1 year ago
Isn't this a re-upload?
@me0101001000 1 year ago
I don't have a CS background. I'm more of a traditional engineer in ChemE/MatSci. For people like me, you really can't separate engineering and design. In fact, I'd argue that Engineering is just a small circle inside of the larger circle that is design. Is it similar for CS, where all kinds of CS work has to involve some kind of design?
@lashoes2207 1 year ago
Electric shocks? Suffer to get your data puny human
@harriehausenman8623 1 year ago
That's obviously how he got his funding 😆
@carl8703 1 year ago
So this suggests to me that any visualization whatsoever should only ever use distance to represent numeric data, since anything else would potentially deceive the audience. Any other channel like RGB, hue, etc. should be used strictly to distinguish nonnumeric data.
@technicalcked 1 year ago
❤❤❤❤
@misium 1 year ago
10:15 Hmm the infamous electric shock visualiser.
@alexxx4434 1 year ago
Strangely, saturation was guessed right, and length wrong.
@jaydeep-p 1 year ago
Wow
@olgierd245 1 year ago
Why the dislikes tho?
@trikers471 1 year ago
Whatever you did to the video made the length example wrong: that line is not twice the first line, it is clearly more, as measured with a ruler on my screen
@SebastianSchleussner 1 year ago
At 10:00? It's your screen doing something funny. Here it is precisely 4.5 vs 9.0 cm.
@HebaruSan 1 year ago
So ironic how there's nothing to look at through so much of this video!
@marklonergan3898 1 year ago
I think visualization is great, but should always be accompanied by the raw data itself, unless the presenter is deliberately trying to mislead. So many charts are presented without labels on the scales - it might not be 0-based, the scale might be logarithmic, etc. The raw data at least can't be "misinterpreted". The main reason I mention this is because of the statement "a small increase in the voltage is perceived as a large increase by the subject". Are we talking a small increase in units or a small increase in percentage? Human perception has been shown to be logarithmic naturally (you can very quickly differentiate 4 lions from 5 lions at a glance, but not 100 lions from 101 lions). I'm not accusing your example of being misleading, but more so backing up my point that raw data should always be included so there's no room for misinterpretation.
@gloverelaxis 1 year ago
and how is the raw data formatted?
@guilherme5094 1 year ago
👍
@arcdam7041 1 year ago
The options in the test were confusing the user and manipulating his mind, so I think if he hadn't intervened and had let the user answer without any assistance, the result would have been more accurate
@TracesOfNuts 1 year ago
9:32 me most of the time
@MrFrondoso 1 year ago
Didn't mention Jacques Bertin in the important books about Data Visualization. Sorry but that's a red flag for me. He, and no one else, made the first and broadest attempt to provide a theoretical foundation for Information Visualization, and his works are still valid. Is it because of Anglocentrism? I'm deeply sorry, much because I deeply like your work and everything you brought to me.
@glamourread9392 1 year ago
It's a QR code 😂
@johnsenchak1428 1 year ago
BORING THIS CHANNEL IS GOING DOWN THE TUBES
@SebastianSchleussner 1 year ago
Nah. It's only you.