Тёмный

The Spectrogram and the Gabor Transform 

Steve Brunton
Подписаться 354 тыс.
Просмотров 63 тыс.
50% 1

Here I introduce the spectrogram, which is a moving-window Fourier transform, giving insight into the time-frequency content of a data set.
Book Website: databookuw.com
Book PDF: databookuw.com/databook.pdf
These lectures follow Chapter 2 from:
"Data-Driven Science and Engineering: Machine Learning, Dynamical Systems, and Control" by Brunton and Kutz
Amazon: www.amazon.com/Data-Driven-Sc...
Brunton Website: eigensteve.com
This video was produced at the University of Washington

Наука

Опубликовано:

 

27 июл 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 114   
@AllElectronicsChannel
@AllElectronicsChannel 4 года назад
Is the Gabor Transform a special case of a STFT ? What are the tradeoffs of using other windows functions in place of the gaussian?
@vinilsimoes
@vinilsimoes 4 года назад
You never cease to amaze us Dr. Brunton!!! Keep up your magnificent work!!
@JulioDiaz614
@JulioDiaz614 4 года назад
Thank you Dr. Brunton for your always insightful and inspiring lectures. I may find a way to use this in my research
@Eigensteve
@Eigensteve 4 года назад
Wonderful!
@bobspianosbffl
@bobspianosbffl 4 года назад
I was trying to understand spectrograms recently and finally thanks to this video it clicked! A few clear visuals do a wonder for elucidating the maths and concepts
@neb5615
@neb5615 4 года назад
Thanks for all lectures, I really appreciate your explanations
@Lrexmo
@Lrexmo 4 года назад
The description is great, the ideas are clear and the logic is coherent. Thanks for your work.
@Eigensteve
@Eigensteve 4 года назад
Glad it was helpful!
@hugeride
@hugeride 3 года назад
This is just amazIng. He can pass the overall concept in 10 minutes better than one can read in books in 1 hour. Conceptual understanding is crucial to guide learning. After understanding what this concept is about and where you are within the topic and how can it be used in practice, its much easier to absort the material and guide the learning by connecting the detailed concepts you learn afterward, by digging deeper. But having this overall knowledge is essential and most books don't give that.
@erikgottlieb9362
@erikgottlieb9362 Год назад
Thank you for clear, concise, organized presentation. Appreciative of how much time and effort such a presentation / explanation takes to create and deliver. Appreciative of the format you use and precision in getting explanation correct. Explanation of terms and where terms originate has always been helpful in your presentations. Thanks again. (Erik Gottlieb)
@succirasuccira
@succirasuccira 3 года назад
Dr. Brunton's concise explanations of all these transform and compression algorithms are first class, and the visual here just incomparable! In this video, on the topic of Shazam's algorithm, you mentioned that it has the caveat that: when the song is stretched in time it makes it harder to match peaks in the power spectrum. This got me thinking about the other dimension: a slightly transposed (pitch shifted) song also breaks the algorithm given the spectrum was measured in fixed frequency. This could be me imagining things but: it might be useful to have spectrum that measures relative frequency. That way you can match songs even if it's transposed to different keys.
@RealMcDudu
@RealMcDudu 3 года назад
You're really an amazing teacher! Explained very clearly. Shows you also understand it very well.
@zoheirtir
@zoheirtir 4 года назад
Thanks a lot for your inspiring lectures ! Zoheir TIR Algeria
@hunters.dicicco1410
@hunters.dicicco1410 4 года назад
i wish i had this when i was first learning the math required for signal processing. great stuff!
@twangist
@twangist Год назад
Thank you Professor Brunton, you're really excellent at this.
@pieterjoubert4346
@pieterjoubert4346 3 года назад
Thank you Dr. Brunton! Clear and concise. Liked and subscribed.
@gamedeeds
@gamedeeds 3 года назад
teaching tech is beyond this gen keep it up prof, very useful and understanding
@davidtolle9533
@davidtolle9533 4 года назад
The bit about Shazam using the power spectral density property to accurately identify songs was interesting. Thanks for the content
@dragoncurveenthusiast
@dragoncurveenthusiast 4 года назад
You are such a good lecturer! Thank you!
@lancelotdsouza4705
@lancelotdsouza4705 2 года назад
Thanks Dr Brunton ...The Gabor transform was very well explained ,,,,needed the code for the same
@spkt1001
@spkt1001 3 года назад
Best video for understanding the intuition of spectrogram!
@danielnagy6360
@danielnagy6360 4 года назад
Awesome videos, really great content and great quality, and also a great topic.
@AnilAgiral
@AnilAgiral 4 года назад
This is really neat!
@jushkunjuret4386
@jushkunjuret4386 3 года назад
This is a wonderful lecture!
@marcogelsomini7655
@marcogelsomini7655 2 года назад
Wow that's awesome! Thank you for introduce it to me Dr. Brunton :)
@BruinChang
@BruinChang 4 года назад
I like the whole series very much.
@geraldheinig1473
@geraldheinig1473 11 месяцев назад
Excellent explanation. I'm particularly happy about the mention of how Shazam works: that's something that's intrigued me for a while now. Thank you!
@-E42-
@-E42- Год назад
I like the creativity with the transparent wall between the lecturer and the camera, on the other hand the presentation seems strangely surreal due to the fact the presenter is only visible as floating head/shoulder and arms
@mehdis.7404
@mehdis.7404 3 года назад
Excellent quality!
@nisanaryal156
@nisanaryal156 4 года назад
I really love this video. I am working in audio classification and I have learned the basic about Spectrogram(STFT), Mel scale and mel spectrogram, MFCC, Consant Q transform etc but I still cant figure out which spectral representation should I use at which condition . Apart from the representation there is the selection of window length and the hop length of the window (trade off between temporal and frequency resolution). At the end of these series I would love to see the comparison and your view on these different representations.
@abhishek_sengupta
@abhishek_sengupta 3 года назад
Very nicely explained!! Thanks!
@rodrigomesquita8741
@rodrigomesquita8741 Год назад
Isso é uma das coisas mais lindas da engenharia. Fundamentalmente você vai calcular a transformada de furrier para janelas de tempo específicas, e vai poder ver quais as componentes de frequencia naquele instante!
@josueprieto7371
@josueprieto7371 2 года назад
Beautiful explanation
@Anorve
@Anorve 3 месяца назад
I love your videos and explanation
@aalselwi
@aalselwi 4 года назад
I am enjoying your trip of learning process
@Eigensteve
@Eigensteve 4 года назад
I'm so glad!
@Thejus_5511
@Thejus_5511 2 года назад
Great explanation
@yuep6808
@yuep6808 2 дня назад
I can't help but think about how you do inverted writing so well 😂
@udomatthiasdrums5322
@udomatthiasdrums5322 3 года назад
love your work!!
@abdellahsellam912
@abdellahsellam912 3 года назад
Thanks, this video was very helpful for me
@kaxxamhinna5044
@kaxxamhinna5044 Год назад
Thank you very much 🎉🎉 you saved my weekend 😂 Have a great day
@trinetram707
@trinetram707 4 года назад
Great video sir please keep posting such videos
@tech01q
@tech01q 3 года назад
If the music is shrinked or stretched, it should still be easy to recognize the music, if the program is adjusted according to the percentages of time intervals between peaks of sound, than a unique pattern can be generated. This has a great potential in the future… Thank you …
@mohamadhamoudy8232
@mohamadhamoudy8232 3 года назад
Thanks a lot Prof. Steve , please could you upload a video for using Spectrogram on sound classifications and feature extraction , regards
@brendawilliams8062
@brendawilliams8062 Год назад
I work with number theory. The teacher is powerful.
@user-px2nq6rk9i
@user-px2nq6rk9i 4 года назад
Please video on Mel spectrogram and why it can't be reversed, thanks for the book and the videos.
@qilinwang5889
@qilinwang5889 Год назад
Hi Steve, I can't thank you enough for making these beautiful videos. I have purchased the book as a way to say thank you. The book is beautifully printed, and if I can give you some feedback from a reader's perspective, I would like the book to have a larger font. They are too small to read for a long time. Anywaysl, thank you for your work!
@StefanT41
@StefanT41 3 года назад
Wow ! Thanks a lot!
@danielparra6902
@danielparra6902 4 года назад
Thank you very much for the clear intuitive explanations Dr. Brunton. I was wondering if there is a Gabor transform analog that uses a data driven approach like the SVD? In the case of SVD would the power spectrum change along with the basis or should one compute the basis with the whole signal and only then with a fixed basis apply the transform?
@SRIMANTASANTRA
@SRIMANTASANTRA 4 года назад
Hi Professor Steve, Nice.
@Eigensteve
@Eigensteve 4 года назад
Glad you like it!
@Jo-ce6gd
@Jo-ce6gd 2 года назад
Perfect! thanks
@banggiangle8258
@banggiangle8258 3 года назад
best explanation ever!
@Eigensteve
@Eigensteve 3 года назад
Wow, thanks!
@sandras2624
@sandras2624 4 года назад
... i'm just silently wondering why this was in my recommendations; i am a social science major and i spend most of my time here on youtube watching cat videos. oh, the yt-algorithm. however: keep up the good work!
@JohnVKaravitis
@JohnVKaravitis 3 года назад
SUE RU-vid! You've been scarred for life!
@yousifyahiaahmed8206
@yousifyahiaahmed8206 3 года назад
Thanks so much indeed
@NaveenKumar-gs8xn
@NaveenKumar-gs8xn 2 года назад
Thanks a lot ...
@Yang-YTchannel
@Yang-YTchannel 3 года назад
Super good explanation!! May I know how do we get the power information if the y-axis is frequency and x-axis is time? Like how large the signal is for each frequency at an instant time?
@zhihuachen3613
@zhihuachen3613 4 года назад
like your video, especially programming in both python and Matlab
@Eigensteve
@Eigensteve 4 года назад
Glad you liked it!
@aefieefnvhas
@aefieefnvhas 3 года назад
Amazing
@user-ml6gr1rk1m
@user-ml6gr1rk1m 4 года назад
Thanks a lot for the series of videos. They are very useful for my projects. How about S-transform? Thank you again.
@Eigensteve
@Eigensteve 4 года назад
Thanks! Maybe I'll make one on the S-transform sometime.
@Via.Dolorosa
@Via.Dolorosa 3 года назад
thank you
@GabrieleNunnari
@GabrieleNunnari 4 года назад
This videos have an incredible quality, really. Content and graphical. My only real question is: how are you able to explain and write mirrored making it look so natural!!!
@evanritchie6195
@evanritchie6195 4 года назад
You can record the video with backwards writing, then mirror the video in an editor afterwards. :)
@pavelkonovalov8931
@pavelkonovalov8931 4 года назад
Thank you so much for your labour. Do you mind to make a video on harmonic distortion?
@Eigensteve
@Eigensteve 4 года назад
I can't promise I'll make one, but I will add it to the list.
@bhaveshamarsingh1656
@bhaveshamarsingh1656 4 года назад
Hello Steven sir, I have gone through wavelet transforms back in the day and i wanted to ask that is it not similar in the sense that they too have evolved/developed because Fourier Transform fails to specify the time at which certain frequency occurred in the original signal. And moreover, please do bring up a short video lecture series on wavelet transforms as well. Thank you.
@philipq6906
@philipq6906 4 года назад
Great explanation can’t wait for the next vid. BTW this is just like a music score. The wave are decomposed by windowed Fournier transformer. I am wondering in real control or identification system, how do we update the realtime signal? We cannot wait a long period and the windows g(x) size also matters. How do we choose a appropriate length
@Eigensteve
@Eigensteve 4 года назад
You are right -- and yes, in control applications, the spectrogram will be computed continuously with a sliding window.
@aliadams516
@aliadams516 4 года назад
Wonderful :)
@Eigensteve
@Eigensteve 4 года назад
Thank you! Cheers!
@haowang4306
@haowang4306 2 года назад
thank you for your lecture. how to make this kind of video in which the drawings can be shown in front of lecturer?
@jm3415
@jm3415 4 года назад
what are the HUP implications on time and freq uncertainty for the Gabor Transform?
@lorenzosparacio1184
@lorenzosparacio1184 Год назад
Beautiful video! I have the following questions. Why does the weight function have to be a gaussian? What would happen if this function is, for instance, a constant of unitary value (so, I'm applying a wight of 1 across the entire window)?
@shashidharmuniswamy2620
@shashidharmuniswamy2620 2 года назад
How do I locate the fundamental frequency at that particular instant? and what do I do to find the ratio of the harmonics to the fundamental frequency as it evolves with time? :)
@carlossama2191
@carlossama2191 4 года назад
This is sweeet! How can this be applied to voice recognition? Go dawgs!
@Eigensteve
@Eigensteve 4 года назад
I think so. Modern voice recognition uses recurrent neural networks, but the spectrogram can be very useful here too.
@udayanbanerjee5271
@udayanbanerjee5271 4 года назад
Dear sir Is W vs t a continuous function? Can we identify the nature of the change in frequency and then invert it back to get that part of 'f'?
@Space-Audio
@Space-Audio 4 года назад
I have a few out-of-this-world examples of spectrograms you just might be interested in.
@ffelixvideos
@ffelixvideos 4 года назад
I would like to know: what are the advantages of this spectrogram over the STFT algorithm or the mel-spectrogram?
@MrKrvo
@MrKrvo 4 года назад
Mr Brunton said some a little misleading information. Gabor transform was the first time-frequency representation of signals, it is special case of STFT (because STFT (Short-Time Fourier Transform) can have any sliding window, Gabor transformation is done using only Gaussian window; there is also Triangular, Hamming, Hann, Blackmann and many others). Each of them has advantage and disadvantage - e.g. wider first lobe in frequency means suppressed the others and vice versa. It always depends on the use and purpose of the analysis. For audio signals, the most common window is Hann window (it doesn't have so sharp edges, e.g. like triangular, it is made by harmonic function - cosinus, so it is more smooth but its also wider for the first frequency lobe). The most common spectrogram is computed with STFT (Fourier T. in general), not Gabor transform. Mel-spectrogram is little different because it uses "mels" - Mel is a unit from psychoacoustic for a subjective melody; it also uses cosine transform but in short, it is another time-frequency representation of a signal and it tries to simulate or imitate human hearing and musical perception. Mel-spectrum (and kepstrum) is commonly used for research purposes of MIR (Music Information Retrieval) because the signal representation is usually closer to the subjective aspects of human hearing and thus is better for most of the applications (so far).
@ffelixvideos
@ffelixvideos 4 года назад
@@MrKrvo Ok, I got it. Thanks for your time.
@MrKrvo
@MrKrvo 4 года назад
@@ffelixvideos No problem.
@Eigensteve
@Eigensteve 4 года назад
Thanks for the great extra information!
@tobi3497
@tobi3497 4 года назад
@@MrKrvo Thanks for this !
@mohammedal-haddad2652
@mohammedal-haddad2652 3 года назад
What about the width of the Gaussian function?
@wesleytaylor-rendal5648
@wesleytaylor-rendal5648 2 года назад
How to find the time resolution? Look at the width of the Gabor function?
@miguelfernandesdesousa7784
@miguelfernandesdesousa7784 3 года назад
is there any difference between Gabor Transform and STFT? Is it just a particular case with a gaussian window and unitary gain?
@neuronneuron3645
@neuronneuron3645 4 года назад
This is behind the uncertainty principle?
@trinetram707
@trinetram707 4 года назад
Doesn't a Gaussian window alter the frequency content of the signal ??
@amruthgadag4813
@amruthgadag4813 Год назад
So basically it is the short-time Fourier transform-based spectrograms. Please reply if yes or no.
@trinetram707
@trinetram707 4 года назад
Why do we use Gaussian window can't we use a rectangular window ??
@numoru
@numoru 11 месяцев назад
i know its late but we ahould definitly have two, since left and right channels/phases can be different . would be a good extension. yYet, even more so 3d phases allow for darn near infinite phases so would lobe a transform that splits that u0 based on color and intensity to see the intricacies. I mean if were not talking polyphonic it should be possible, monophonic yeah . but honestly I dont get why its a problem for multiple phases and poly phonic if we take each initiation of intonation as a phase of a fractional phases that doesn't multiple/algorithmically interact with/ into the greater whole/nor parts (if given the total data first). Further, this is based off of first principle in the idea that we can generate moire patterns even with aperiodic data (Glass patterns) easily yet it is the microstructures (local) that we aren't able to inversely appropriate/segment, yet if the phase data is there we should be able to,..-wrong?
@numoru
@numoru 11 месяцев назад
typos will not be dealt with, it to much to read. so guess
@carlosvargas2907
@carlosvargas2907 4 года назад
You rock
@JohnVKaravitis
@JohnVKaravitis 3 года назад
Zsa Zsa or Eva Gabor?
@Kong9901
@Kong9901 4 года назад
So the spectrogram just shows the frequency played at each time, but there is no information about the amplitude of these frequencies ?
@Kong9901
@Kong9901 4 года назад
I got my answer from the next video. Thank you :)
@arnabprophet
@arnabprophet Год назад
Did anyone else realize that Dr. Brunton is writing backwards on a screen?
@AbhilashIngale
@AbhilashIngale 3 года назад
Pleased to meet you, the Real Men and Women of culture ;)
@rolandstefan5204
@rolandstefan5204 4 года назад
Fun fact: Gabor transform was named after by Gábor Dénes who was a hungarian physicist and electrical engineer and he also got the Phsics Nobel Prize for inventing holography. Sorry for the grammatical mistakes.
@melihozcan8676
@melihozcan8676 2 года назад
@11:50 I miss the times when RU-vid had dislikes, and we had a chance to avoid the garbage...
@Ting3624
@Ting3624 4 года назад
so one can technically write an algorithm to listen to music and generate sheet music.... holy there are plenty after google it
@Eigensteve
@Eigensteve 4 года назад
This is indeed one of the big open challenges that people are working on. Can you imagine if researchers could create an algorithm that would generate new Vivaldi?
@leif1075
@leif1075 3 года назад
@@Eigensteve what exactly is the open challenge? If you can write an algorithm to write mew music from scratch? But it would probsbly just be random stitching together of notes from existing pieces wouldn't it?
@treksis
@treksis Год назад
@6:45 low 22222 high~~~😁😁😁
@kelvinxie1029
@kelvinxie1029 Год назад
Music sheets are spectrograms from Gabor transformation LOL
@v8pilot
@v8pilot 3 года назад
Did you mention the Gabor transform? I must have dozed off if you did. I've been re-reading his paper and want to understand its relation to the spectrogram.
@frankservant5754
@frankservant5754 3 года назад
RU-vid University Freshers where you at!
Далее
Spectrogram Examples [Matlab]
9:57
Просмотров 40 тыс.
Wavelets and Multiresolution Analysis
15:12
Просмотров 137 тыс.
2DROTS vs RISENHAHA! КУБОК ФИФЕРОВ 2 ТУР
11:31
My Problem With (Most) Spectrograms
14:59
Просмотров 20 тыс.
The Laplace Transform: A Generalized Fourier Transform
16:28
Understanding the Z-Plane
16:25
Просмотров 9 тыс.
Wavelets-based Feature Extraction
37:40
Просмотров 59 тыс.
Denoising Data with FFT [Python]
10:03
Просмотров 170 тыс.