Stanford Computational Imaging Lab

64
481 932

1:52

Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion

1 month ago

5:00

PixelRNN | CVPR 2024

3 months ago

4:28

PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations | ECCV 2024

4 months ago

8:00

SinGRAF: Learning a 3D Generative Radiance Field for a Single Scene | CVPR 2023

1 year ago

3:17

Towards Attention-aware Foveated Rendering | ACM SIGGRAPH 2023

1 year ago

26:44

Beyond the Metaverse - Towards Human-centric XR

1 year ago

34:33

Eye Tracking Revisited

2 years ago

5:00

Learning to Solve PDE-constrained Inverse Problems with Graph Networks | ICML 2022

2 years ago

6:26

Time-multiplexed Neural Holography | SIGGRAPH 2022

2 years ago

4:30

3D GAN Inversion for Controllable Portrait Image Animation

2 years ago

3:31

Efficient Geometry-aware 3D Generative Adversarial Networks | CVPR 2022

2 years ago

6:49

BACON: Band-limited Coordinate Networks for Multiscale Scene Representation | CVPR 2022

2 years ago

1:19

Partially-coherent Neural Holography | Science Advances 2021

2 years ago

8:31

Time Multiplexed Coded Aperture Imaging | ICCV 2021

2 years ago

5:33

Neural Holography 3D | SIGGRAPH Asia 2021

2 years ago

3:39

Keyhole Imaging | IEEE TCI 2021

3 years ago

2:38

Fast Training of Neural Lumigraph Representations using Meta Learning | NeurIPS 2021

3 years ago

7:25

ACORN: Adaptive Coordinate Networks for Neural Scene Representation | SIGGRAPH 2021

3 years ago

1:02

EE 267 - HW6 with pre-recorded data

3 years ago

3:01

Eccentricity-dependent Spatio-temporal Flicker Fusion for Foveated Graphics | SIGGRAPH 2021

3 years ago

12:22

Neural Sensors | ICCP 2020

3 years ago

5:10

ScanGAN360: A Generative Model of Realistic Scanpaths for 360° Images

3 years ago

3:33

Event-based Near-eye Gaze Tracking at 10,000 Hz | IEEE VR 2021

3 years ago

2:30

Neural Lumigraph Rendering | CVPR 2021

3 years ago

7:16

AutoInt: Automatic Integration for Fast Neural Volume Rendering | CVPR 2021

3 years ago

3:10

pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis | CVPR 2021

3 years ago

2:00

Semantic Implicit Neural Scene Representations with Semi-supervised Training | 3DV 2020

3 years ago

14:53

Gaze Contingent Stereo Rendering | SIGGRAPH Asia 2020

3 years ago

2:46

Confocal Diffuse Tomography | Nature Communications 2020

4 years ago

Comments

@bigzigtv706 3 days ago

Cool work

@tomkent4656 1 month ago

The future is bright!

@mariaceciliatv 1 month ago

"Wow, that's amazing! Congratulations on taking a huge step in the world of AI."

@rastermapper 1 month ago

One can see the immediate applicability in gaming. With improved source-output resolution, even motion pictures could minimize real-world shots in favor of this approach... it saves time and money, and increases safety by not having to close streets in the real world for filming. Exciting stuff.

@jatindhall7633 1 month ago

Magic🪄😍

@parsfilmstudio2023 1 month ago

@brianshissler3263 2 months ago

has anyone seen my turtle?

@snacxzy 2 months ago

scamp! 😂

@dann_y5319 3 months ago

Awesome

@MrNoipe 3 months ago

PixelRNN refers to Oord et al. 2016. Why name your cool project after something that already exists?

@edsonjr6972 4 months ago

Did anyone try using this in transformers?

@_zproxy 5 months ago

is it like a new jpg?

@monkeysfromvenus 7 months ago

That retroreflective reconstruction project is absolutely insane, I never thought that would be possible.

@agr8trip 9 months ago

I'm genuinely excited about computational holography, but I don't understand 80% of this video. I wish you could explain things in simpler terms for youtube.

@suissino9982 10 months ago

At 06:24 the screen shows CPD: does it stand for contrast detection perimetry (CDP)?

@IvanEng747 10 months ago

No code, not interested.

@szynkers 10 months ago

I wonder why nobody has ever even attempted to use this commercially... it's the only solution that is simultaneously solid-state and doesn't seem to sacrifice resolution. I heard at a live presentation that generating the 2 plane images was supposedly very computationally heavy for them, but I bet it could be optimized, e.g. by using the z-buffer for depth data instead of rendering multiple viewpoints.

@anilaxsus6376 1 year ago

Yeah, I was wondering why people weren't using sines and cosines. I watched a video where the presenter explained that a neural network with L layers and N nodes per layer, using ReLU activations, can perfectly match a function with N to the power L bends or turning points in its curve (assuming the network has a single scalar output node). I guess that is why it failed on the audio: there are a lot of turning points in audio data. So technically the SIREN network's performance can be matched by a large enough ReLU network, and I am looking at SIREN as an optimization of the usual ReLU networks. I'm glad I saw this; I will look into it further. I suspect that sinusoidal activations will be useful in domains with some sort of repetition, because ReLUs act more like threshold switches.

@isalutfi 1 year ago

Cool

@alexeychernyavskiy4193 1 year ago

How would this approach compare to InstantNGP?

@tiagotiagot 1 year ago

How does it compare to using a sawtooth wave in place of the sine wave?

@casev799 1 year ago

Can't say I completely understand what is said, but it's very promising.

@TileBitan 1 year ago

The music part was outstanding. Audio waveforms are just stacked sine waves, as opposed to images or text, where the input may not be closely related to the sine function. So it just feels right to use sine activations, with the tweaks required to make that work, instead of ReLUs. But I'm going to be careful with this: even though I have some experience in ML, I haven't ever touched anything other than ReLUs, sigmoids, tanh and straight-up linear activations.

@Oktokolo 1 year ago

You can approximate _everything_ with stacked sine waves. All modern video and image compression algorithms are based on that.

@TileBitan 1 year ago

@Oktokolo let me rephrase that then. Audio waveforms can be approximated by a relatively SMALL number of stacked sine waves, so it feels natural to use them in NNs. Everything can be approximated by an infinite number of sine waves, but sometimes it doesn't make sense to do it.

@Oktokolo 1 year ago

@TileBitan It obviously makes sense for images, as that is what the best compression algorithms use. It should also be possible to encode text reasonably well - even though the resulting set of weights is probably larger than the text itself when not encoding the input of a huge language model...

@TileBitan 1 year ago

@Oktokolo I don't understand. Sounds are waves of different amplitudes with different frequencies inside the hearing range. Images nowadays can be 100M pixels with 3 times 256 in the BEST case scenario, where relationships between pixels can be really close to nothing. The case is completely different. The text case doesn't really have much to do with a wave. They might use FFTs for images, but you have to agree with me: for the same error you need way, way fewer terms for sound than for images.

@Oktokolo 1 year ago

@TileBitan It doesn't matter whether it looks like it has anything to do with a wave or not, or whether adjacent values look like they are in any relation to each other. Treating data as signals and then encoding the signal as stacked waves just works surprisingly well. It might not work well for truly random bit noise. But most data interesting to humans seems to exhibit surprisingly low entropy and can be compressed using stacked sines.

@taisiralhilo7972 1 year ago

Hello, I am working on an eye tracking analysis project. Can you help me with information on how to obtain and handle the data? I use MATLAB.

@SandhyaRani-np9be 2 years ago

There is no response from you. Tomorrow I have to submit the details.

@DJDextek 2 years ago

incredible

@SandhyaRani-np9be 2 years ago

ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-izE7j1b95uI.html I got the idea yesterday to move the computer cursor using this technology (for handicapped users). I am sad because this was already invented 😔. Will you please help me with this project? Share the software you used to sense the eyeball and I will make it. I think you will help me, thank you.

@SandhyaRani-np9be 2 years ago

I got the idea yesterday to move the computer cursor using this technology (for handicapped users). I am sad because this was already invented 😔. Will you please help me with this project? Share the software you used and I will make it. I think you will help me, thank you.

@macratak 2 years ago

awesome work

@Yakuo 2 years ago

@user-qp2ps1bk3b 2 years ago

a very nice presentation! Thank you!

@braydenhallie1994 2 years ago

𝓹𝓻𝓸𝓶𝓸𝓼𝓶 😆

@BellaSportMotoristici 2 years ago

Impressive research, impressive lucky researchers.

@macratak 2 years ago

cool work. ur enunciation needs some work tho

@ch1caum 2 years ago

Waiting for pre-baking radiance fields with bacon

@rewixx69420 1 year ago

Gona pre-bake pepa with bacon

@HAWXLEADER 2 years ago

I got here by googling eyeball parallax because I noticed this effect in real life and wanted to see whether people actually thought about simulating it in the VR world. Apparently you guys did ^^

@susiundjohnlesenview-maste5559 2 years ago

Hi, we saw your video, liked it and subscribed to your channel. We are also fascinated by the 3D View-Master and stereography. On our channel we read the old View-Master booklets that go with the reels... come and see us :-)

@beardordie5308 2 years ago

Today: nightmare fuel. Tomorrow: everybody is half Obama.

@TiceLedbetter 2 years ago

You did an amazing job on this!! 🥓

@Neptutron 2 years ago

Can this be combined with DALLE?

@lealemtaye 2 years ago

Is the source code publicly available?

@DerekWilsonProgrammer 2 years ago

so, you could bounce a laser beam off of the moon and tell the speed at which it's moving away or closer, assuming you had a detector that can sense the reflected light

@atul1004 2 years ago

Hey!! If possible could you please use a more humanistic voice-over for your video? Thank you

@kwea123 2 years ago

1:22 I think Mip-NeRF is single scale, it only trains on the finest scale, and can naturally generate anti-aliased images at any scale

@mannyk7634 2 years ago

Very nice work, especially the sinusoidal activation. I'd like to point out that Candes in 1997 covered periodic activation functions rigorously in "Harmonic Analysis of Neural Networks" (the "admissible neural activation function"). Strangely enough, the paper is not even cited by the authors.

@foreignaustrian 2 years ago

No sound? :-)

@jeremykong7604 2 years ago

good work

@Likeiverson 3 years ago

I wish I was smart enough to understand

@emmanueloluga9770 2 years ago

Don't wish, put in the work if it is within your means

@buddhagautama673 3 years ago

I understand that some scientists are from outer space.