Тёмный
Stanford Computational Imaging Lab
Stanford Computational Imaging Lab
Stanford Computational Imaging Lab
Подписаться
PixelRNN | CVPR 2024
5:00
3 месяца назад
Eye Tracking Revisited
34:33
2 года назад
Keyhole Imaging | IEEE TCI 20201
3:39
3 года назад
EE 267 - HW6 with pre-recorded data
1:02
3 года назад
Neural Sensors | ICCP 2020
12:22
3 года назад
Комментарии
@bigzigtv706
@bigzigtv706 3 дня назад
Cool work
@tomkent4656
@tomkent4656 Месяц назад
The future is bright!
@mariaceciliatv
@mariaceciliatv Месяц назад
"Wow, that's amazing! Congratulations on taking a huge step in the world of AI."
@rastermapper
@rastermapper Месяц назад
One can see the immediate applicablity in gaming. With improved source-output resolution, even motion pictures could minimize real-world shots for this approach... saves time and money, and increase safety by not having to close streets in the real world for filming. Exciting stuff.
@jatindhall7633
@jatindhall7633 Месяц назад
Magic🪄😍
@parsfilmstudio2023
@parsfilmstudio2023 Месяц назад
@brianshissler3263
@brianshissler3263 2 месяца назад
has anyone seen my turtle?
@snacxzy
@snacxzy 2 месяца назад
scamp! 😂
@dann_y5319
@dann_y5319 3 месяца назад
Awesome
@MrNoipe
@MrNoipe 3 месяца назад
PixelRNN refers to Oord et al 2016, why name your cool project after something that exists?
@edsonjr6972
@edsonjr6972 4 месяца назад
Did anyone try using this in transformers?
@_zproxy
@_zproxy 5 месяцев назад
is it like a new jpg?
@monkeysfromvenus
@monkeysfromvenus 7 месяцев назад
That retroreflective reconstruction project is absolutely insane, I never thought that would be possible.
@agr8trip
@agr8trip 9 месяцев назад
I'm genuinely excited about computational holography, but I don't understand 80% of this video. I wish you could explain things in simpler terms for youtube.
@suissino9982
@suissino9982 10 месяцев назад
minute 06:24 in the screen there is CPD : does it stand for Contrast detection perimetry (CDP) ???
@IvanEng747
@IvanEng747 10 месяцев назад
No code no interested
@szynkers
@szynkers 10 месяцев назад
i wonder why nobody even attempted to ever use this commercially... it's the only solution that is simultaneously solid-state and doesn't seem to sacrifice resolution. I've heard from a live presentation that generating the 2 plane images was supposedly very computationally heavy for them, but I bet it could be optimized, i.e. by using the z-buffer for depth data instead of rendering multiple view points.
@anilaxsus6376
@anilaxsus6376 Год назад
yeah i was wondering why people weren't using sin's and cosine's cause i watched a video and the guy explained that, a neural network of L number of layers, and N number of nodes per Layer, which use relu activation, can perfectly match a function with N to the power L number of bends or turning points in its curve (assuming the neural network has a single scalar node output), i guess that is why it failed on the audio, there is a lot of turning point in audio data, so technical the SIREN networks performance can be matched by a large enough relu neural network, so am looking at SIREN as an optimization on the usual relu networks. Am glad i saw this, i will look into it further. i suspect that sinusoidal activation will be useful in domains with some sort of repetition, cause relu act more like threshold switches.
@isalutfi
@isalutfi Год назад
Cool
@alexeychernyavskiy4193
@alexeychernyavskiy4193 Год назад
How would this approach compare to InstantNGP?
@tiagotiagot
@tiagotiagot Год назад
How does it compare to using a sawtooth wave in place of the sine wave?
@casev799
@casev799 Год назад
Can't say I completely understand what is said, but it's very promising.
@TileBitan
@TileBitan Год назад
The music part was outstanding. Audio waveforms are just stacked sinewaves, as opposed to images or text where the input may not be too related to the sine function. So it just feels right to use sine activations and the required tweaks to make that work, instead of ReLUs, but I'm going to be careful with this as even though I have some experience in ML i haven't ever touched anything other than ReLUs, sigmoids, tanh and straight up linear activations
@Oktokolo
@Oktokolo Год назад
You can aproximate _everything_ with stacked sine waves. All modern video and image compression algorithms are based on that.
@TileBitan
@TileBitan Год назад
@@Oktokolo let me rephrase that then. Audio waveforms can be approximated by a relatively SMALL number of stacked sine waves, so it feels natural to use them in NNs. Everything can be approximated by infinite numbers of sine waves, but sometimes it doesn't make sense to do it
@Oktokolo
@Oktokolo Год назад
@@TileBitan It obviously makes sense for images as that is how the best compression algorithms use. It should also be possible to encode text reasonably well - even though the resulting set of weights is probably larger than the text itself when not encoding input of a huge language model...
@TileBitan
@TileBitan Год назад
@@Oktokolo i don't understand. Sounds are different amplitude waves with different frequencies inside the hearing range. Images nowadays can be 100M pixels with 3 times 256 on the BEST case scenario, where relationships between pixels can be really close to nothing. The case is completely different. The text case doesn't really have much to do with a wave. They might use FFTs for images but you gotta agree with me, for the same error you need way way less terms for sound than images.
@Oktokolo
@Oktokolo Год назад
@@TileBitan Doesn't matter whether it looks like it has anything to do with a wave or not or whether adjacent values look like they are in any relation to eachother. Treating data as signals and then encoding the signal as stacked waves just works surprisingly well. It might not work well for truly random bit noise. But most data interesting to humans seems to exhibit a surprisingly low entropy and can be compressed using stacked sines.
@taisiralhilo7972
@taisiralhilo7972 Год назад
Hello I am working on a project Eye Tracking Analysis Can you help me with information on how to obtain and deal with data, knowing that I use Matlab
@SandhyaRani-np9be
@SandhyaRani-np9be 2 года назад
There is no response from you Tomorrow I have to submit the details
@DJDextek
@DJDextek 2 года назад
incredible
@SandhyaRani-np9be
@SandhyaRani-np9be 2 года назад
ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-izE7j1b95uI.html I got the idea yesterday to move the cursor in computer by use of this technology(for handicaps). I am sad because this invention was already invented 😔.Will you help me in doing this project to me please. Share the software you used sense the eyeball and I will make it. I think you will help me, thank you.
@SandhyaRani-np9be
@SandhyaRani-np9be 2 года назад
I got the idea yesterday to move the cursor in computer use of this technology(for handicaps). I am sad because this invention was already invented 😔.Will you help me in doing this project to me please. Share the software you used and I will make it. I think you will help me, thank you.
@macratak
@macratak 2 года назад
awesome work
@Yakuo
@Yakuo 2 года назад
ty
@user-qp2ps1bk3b
@user-qp2ps1bk3b 2 года назад
a very nice presentation! Thank you!
@braydenhallie1994
@braydenhallie1994 2 года назад
𝓹𝓻𝓸𝓶𝓸𝓼𝓶 😆
@BellaSportMotoristici
@BellaSportMotoristici 2 года назад
Impressive research, impressive lucky researchers.
@macratak
@macratak 2 года назад
cool work. ur enunciation needs some work tho
@ch1caum
@ch1caum 2 года назад
Waiting for pre-baking radiance fields with bacon
@rewixx69420
@rewixx69420 Год назад
Gona pre-bake pepa with bacon
@HAWXLEADER
@HAWXLEADER 2 года назад
I got here by googling eyeball parallax because I noticed this effect in real life and wanted to see whether people actually thought about simulating it in the VR world. Apparently you guys did ^^
@susiundjohnlesenview-maste5559
@susiundjohnlesenview-maste5559 2 года назад
Hi, we saw your video, liked it and subscribed to Your channel. We are also fascinated by the 3D-VIEW-MASTER and the Stereography. We read the old View-Master Booklets on our Channel to the reels...come and see us :-)
@beardordie5308
@beardordie5308 2 года назад
Today: nightmare fuel. Tomorrow: everybody is half Obama.
@TiceLedbetter
@TiceLedbetter 2 года назад
You did an amazing job on this!! 🥓
@Neptutron
@Neptutron 2 года назад
Can this be combined with DALLE?
@lealemtaye
@lealemtaye 2 года назад
Is the source code publicly available?
@DerekWilsonProgrammer
@DerekWilsonProgrammer 2 года назад
so, you could bounce a laser beam off of the moon and tell the speed at which it's moving away or closer, assuming you had a detector that can sense the reflected light
@atul1004
@atul1004 2 года назад
Hey!! If possible could you please use a more humanistic voice-over for your video? Thank you
@kwea123
@kwea123 2 года назад
1:22 I think Mip-NeRF is single scale, it only trains on the finest scale, and can naturally generate anti-aliased images at any scale
@mannyk7634
@mannyk7634 2 года назад
Very nice work especially the sinusoidal activation. I like to point out Candes in 1997 covered it rigorously in "Harmonic Analysis of Neural Networks" about periodic activation function - "admissible neural activation function". Strangely enough, the paper is not even cited by the authors.
@foreignaustrian
@foreignaustrian 2 года назад
No sound? :-)
@jeremykong7604
@jeremykong7604 2 года назад
good work
@Likeiverson
@Likeiverson 3 года назад
I wish I was smart enough to understand
@emmanueloluga9770
@emmanueloluga9770 2 года назад
Don't wish, put in the work if it is within your means
@buddhagautama673
@buddhagautama673 3 года назад
I understand that some scientists are from outer space.