Тёмный

DirectStorage on Windows will make games so much better 

Linus Tech Tips
Подписаться 16 млн
Просмотров 1,5 млн
50% 1

Get your NZXT Function Keyboard today at: nzxt.co/FunctionLTT
Break the code and win awesome prizes, including a $12,000 Dream setup: go.tech/linus
DirectStorage, which lets your GPU access your SSD directly, has promised to make games better for years now. While it’s been on Xbox Series X since launch (and PS5 has its own version), now we’re finally getting it for PC. Why was it worth the wait?
Discuss on the forum: linustechtips.com/topic/14206...
Buy an AMD Ryzen 9 5950X: geni.us/nMA6el
Buy an ASUS Crosshair VIII Hero Wi-Fi: geni.us/B13k
Buy a GeForce RTX 3060: geni.us/WQMAcA
Purchases made through some store links may provide some compensation to Linus Media Group.
► GET MERCH: lttstore.com
► AFFILIATES, SPONSORS & REFERRALS: lmg.gg/sponsors
► PODCAST GEAR: lmg.gg/podcastgear
► SUPPORT US ON FLOATPLANE: www.floatplane.com/
FOLLOW US ELSEWHERE
---------------------------------------------------
Twitter: / linustech
Facebook: / linustech
Instagram: / linustech
TikTok: / linustech
Twitch: / linustech
MUSIC CREDIT
---------------------------------------------------
Intro: Laszlo - Supernova
Video Link: • [Electro] - Laszlo - S...
iTunes Download Link: itunes.apple.com/us/album/sup...
Artist Link: / laszlomusic
Outro: Approaching Nirvana - Sugar High
Video Link: • Sugar High - Approachi...
Listen on Spotify: spoti.fi/UxWkUw
Artist Link: / approachingnirvana
Intro animation by MBarek Abdelwassaa / mbarek_abdel
Monitor And Keyboard by vadimmihalkevich / CC BY 4.0 geni.us/PgGWp
Mechanical RGB Keyboard by BigBrotherECE / CC BY 4.0 geni.us/mj6pHk4
Mouse Gamer free Model By Oscar Creativo / CC BY 4.0 geni.us/Ps3XfE
CHAPTERS
---------------------------------------------------
0:00 Intro
1:00 DirectStorage is underrated
1:40 Playing with the demo
3:18 How game assets are usually loaded
4:26 Previous attempts to address the problem
5:52 Enter DirectStorage - Bigger than you think
6:38 Performance scaling and compression
7:46 Not ready... Yet.

Наука

Опубликовано:

 

31 май 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 2,7 тыс.   
@clebbington
@clebbington 2 года назад
Can't wait for game engines like Unity and Unreal to implement these features - that way a bunch of developers will be able to add DirectStorage to their games without having to completely restructure them.
@andymccray8914
@andymccray8914 2 года назад
Nice
@goilup
@goilup 2 года назад
@@youtubekilledtrustedflaggi9274 I'd love to hear who your source is
@R3DCODE
@R3DCODE 2 года назад
@@youtubekilledtrustedflaggi9274 you think implementing a feature automatically cuts out who has hardware that does not support it? You know, right, it's common practice in game development to probe the hardware you are running on to check for available features and then use slower methods that do not require what are you missing? For example some old video cards do not support Compute shaders. (GPU accelerated calculations for things not related to graphics), easy, check if you have access to Compute shaders at the beginning, if not, you do that on the cpu, at a lower quality even if it would be too slow. Doesn't look like to me that all the games that support RTX do not run on non RTX hardware either...
@UwePieper
@UwePieper 2 года назад
@@youtubekilledtrustedflaggi9274 Existing games that patch in support for DirectStorage won‘t become unplayable for setups that don‘t support it. And with new games it will quite a bit of time before it will be in any way „required“
@635574
@635574 2 года назад
And more headaches for games with their own engines
@junidiff
@junidiff 2 года назад
I'm a simple man, I see an Anthony thumbnail, I watch immediately
@featheryknife
@featheryknife 2 года назад
I simple man. I see funny comment, I reply 😅😂
@clipsdelvilches833
@clipsdelvilches833 2 года назад
You are so based
@King_DarkSide
@King_DarkSide 2 года назад
What else can a simple man hope for. Lol!
@RobotischeHilfe
@RobotischeHilfe 2 года назад
I am even simpler I can only count to 10.
@donniearmstrong9410
@donniearmstrong9410 2 года назад
Can’t imagine not watching an LTT video with my man Anthony as the thumbnail
@drsupergood8978
@drsupergood8978 2 года назад
A few key points were missing from this video... DirectStorage on Windows 11 is using bypass IO if supported by the attached storage device. This greatly reduces kernel overhead by skipping many stages of the IO stack allowing IO to start more quickly and results to be returned more quickly both with lower CPU overhead. This alone will have significant latency improvements as well as save CPU time in ways not previously possible using standard APIs on Windows 10 and earlier. DirectStorage itself is an asynchronous API. Asynchronous APIs have been the ruler of IO performance for a long time. Yet due to the complexities involved with using them their adoption has been quite poor with much slower stream based IO often being used instead due to its simplicity. This can be seen in how a lot of game archive formats are constructed. Anything that requires frequently reading a small amount of data to know what or how much more data to read is not optimised for asynchronous IO. DirectStorage offers batch reading support. Much like modern graphic API command queues you can submit multiple different read requests with a single kernel call. This allows high utilisation of NVMe read queues, such as using a single queue submission to read the meshes and all textures for multiple modules. In the video this would be similar to the shown texture atlas animation except when not using a texture atlas. DirectStorage does not bypass the CPU and RAM entirely. Custom decompression, a supported feature by the API, will still have to use the CPU in some way to process the data from memory. When moving directly to GPU the file data is still temporarily placed into RAM, even if directly into an upload heap staging buffer and not touched by CPU threads. Although the technology does already exist to bypass CPU threads and RAM entirely it is currently unclear why DirectStorage is not using it.
@matyaskatocz9323
@matyaskatocz9323 2 года назад
So if they were to use technology to bypass the CPU, would that mean that CPU bottleneck is eliminated when upgrading a GPU?
@drsupergood8978
@drsupergood8978 2 года назад
GPU upgrade would not matter, unless the current GPU lacks required features forcing the data into the CPU for processing as a work around. The main improvement over the current direct storage implementation would be a saving of some latency and memory (or cache) bandwidth. The data would be moved from the NVMe SSD straight to the GPU VRAM via the PCIe controller cutting out CPU caches and memory entirely. This is apparently what consoles like the PS4 use and does exist in the data centre with a proprietary API for Nvidia enterprise GPUs.
@matyaskatocz9323
@matyaskatocz9323 2 года назад
@@drsupergood8978 oh wow thats very interesting to hear… but wouldn't that crash the GPU market? I mean if it doesn't matter then how will companies compete? There wont be a need for buying new GPUs because they wouldn't speed up the process anyways. Maybe if they break due to aging or manufacturing issues. If thats the case, then that the only way i see companies profiting off of this is if they force us to buy new one by ruining the GPUs in software, hardware or maybe monthly payments in order to use the technology. Of course future upgrades that unlock new features would be a thing and maybe thats where we will want to upgrade. But im sure that im wrong somehere (im also just a child so for anyone reading this, im just saying whats on my mind and don't take anything from this, im sure there are solutions and answers to my questions)
@rba42
@rba42 2 года назад
@@matyaskatocz9323 Why would the GPU Market crash? We're only talking about storage en transferring assets here. It has nothing to do with rendering power and how much FPS/Details a GPU can render. Rendering power will still be something that will grow over time and this is where GPUs will continue to compete.
@matyaskatocz9323
@matyaskatocz9323 2 года назад
@@rba42 oooooooooh right! I had it all confused
@jordan-mn6yy
@jordan-mn6yy 2 года назад
ssd is anything but humble, it's about damn time that other systems are using it to it's full advantage
@neattricks7678
@neattricks7678 2 года назад
TBH its value has been understated for a long time.
@prateekpanwar646
@prateekpanwar646 2 года назад
@@neattricks7678 In respect yeah but it's value in cost was a lot 4 years ago. They got so common only because price lowered.
@menthols4625
@menthols4625 2 года назад
@@prateekpanwar646 They are so common now that I have 2. I have a regular SSD and an m.2, because why not. One is mainly for my OS, the other is dedicated to games I'm playing.
@Demmortal
@Demmortal 2 года назад
other systems? pc doesnt utilize ssd in games to its full advantage while consoles can do that.
@bladeofmiquella8633
@bladeofmiquella8633 Год назад
Hey guys, what did u think about get SSD SATA instead NVME M.2? (Considering DIRECT STORAGE in future)
@scmstr
@scmstr 2 года назад
HEY I'VE GOT AN IDEA: How about, we sell games on their own SSD? And since SSDs are fragile chips, we could enclose them in their own containers. And put labels on them for what game it is! We've come full circle!!!!!!!
@khx73
@khx73 2 года назад
and we'll have to blow on them sometimes if the connection is janky
@ASLUHLUHCE
@ASLUHLUHCE 2 года назад
Oh shit
@kainbrah
@kainbrah 2 года назад
In the form factor of a Sega Card.
@azin6732
@azin6732 2 года назад
Won't be possible considering the business model a lot of companies are following nowadays, selling incomplete games and promising future updates
@93hothead
@93hothead 2 года назад
With the silicon issue now no way
@KenMathis1
@KenMathis1 2 года назад
Compression is not just important for reducing installation file sizes. It also speeds up the I/O since reading the data is a bottleneck in the system. If the data is compressed, you have less data to read, so it takes less time. The consoles have hardware decompression built-in so data can be stored compressed and read in that state. The level of compression achieved then becomes a multiplier to the effective I/O read speed.
@rngQ
@rngQ 2 года назад
But not automatically, like you said consoles have hardware compression available, but PCs have to use the CPU, there are a surprising number of people still running
@KenMathis1
@KenMathis1 2 года назад
@@rngQ Yep. That's the point. Compression is a huge part of I/O speed and this video relegated it only to reducing installation size. The fact that PCs will have to continue supporting CPUs that can't handle decompression in real time while running the game is the reason why PCs as a whole will still be playing catch up to consoles on this issue, contrary to the title of this video. In fact it will likely be the need of games to also run on a wide rage of PCs that will hold back what can fully be done with the much higher SSD I/O speeds for some time to come.
@scorpion07070
@scorpion07070 2 года назад
Same reason file system compression with a decent CPU can greatly improve read/loading speeds. Apparently that may be moving to the GPU sooner than later. Less and less data seems to get processed by the CPU as time goes on and eventually, GPU may just stand for "General Processing Unit"
@flimermithrandir
@flimermithrandir 2 года назад
@@KenMathis1 Kinda. It depends on Nvidia for their GPUs to make something. RTX IO would be the Solution you want here. And that should be out since End of 2021. Since Direct Storage just released and no Game is using it anyways until Forspoken releases… they probably said „Fuck that, we will wait to release it since no one could use that anyways“. I assume Nvidia will (re) announce it when they show the new 40 Series Cards. Also wonder what AMD will say to this. They are working with Xbox and PS together and know how they do it but they never mentioned it til now.
@motorolah8107
@motorolah8107 2 года назад
@@rngQ though some compression can be incredibly efficient even on low end cpus, for example windows's built in compact.exe with lzx has no perf impact even on a lowly i5 2400, and in some games it can go down to like 20% of its original size
@Paul.Reviews
@Paul.Reviews 2 года назад
Just taking a moment to appreciate how good he is as a presenter.
@itsscottiemac
@itsscottiemac 2 года назад
Anthony is LInus medium group's best presenter, bar none. Bravo.
@Wings_of_foam
@Wings_of_foam 2 года назад
Yes, I hear it every fucking time!
@speedfastman
@speedfastman 2 года назад
He could get a better haircut though.
@jamiebeith1875
@jamiebeith1875 2 года назад
Yeah, he's come a long way since his first videos. His scripted stuff comes across as pretty natural
@Axiomatic75
@Axiomatic75 2 года назад
Yeah, he's by far the best on LTT. I just worry he'll die from cardiac arrest soon.
@UncleKennysPlace
@UncleKennysPlace 2 года назад
There was a time in decades past where it was expected that RAM and storage would soon merge; your storage would be a non-volatile RAM that worked at high speed. Sort of like the old RAM-disk utilities. You wouldn't "load" a program into memory, just access the memory where you installed it. I would have hoped that this would have come to pass, but not quite yet.
@grn1
@grn1 2 года назад
There are a few SSDs that can match RAM speeds but it's usually an expensive proposition (Optane and/or certain raid setups with higher end SSDs) and with DDR5 just now coming out it seems that RAM will soon be much faster than storage again in the near future. So long as RAM has a significant speed advantage it's unlikely we'll ever stop using it. I remember hearing about some technologies that were in the works years ago that would supposedly give us Storage with equal speed to RAM using spintronics or carbon nano-tubes but I've not heard anything of those technologies in years suggesting they hit a roadblock they simply couldn't overcome. If we do ever get such hardware capabilities working and affordable enough then we'd still have do so some major work on the software front to take advantage of it or really just to use such a system at all (at first they'd probably have some section of storage emulating RAM for non-aware software but the better solution would be to reinterpret the code or perhaps recompile it to use the new tech).
@syarifairlangga4608
@syarifairlangga4608 2 года назад
they can build it but its impossible, because the number of write we can write in SSD, even SLC wont even close to the durability of Volatile Memory
@grn1
@grn1 2 года назад
​@@syarifairlangga4608 The carbon nanotube storage I mentioned would theoretically solve the issue of durability. The basic idea as I understood and remember it (this was back when Optane was first announced) was to have cells composed of a tiny carbon particle inside a tube that would switch between 2 positions depending on the value. In theory such a system would be far more durable or at least far easier to repair since the memory modules themselves would be unlikely to fail. I presume the road block was either that it was too difficult to manufacturer or they could get them to be fast enough (if memory serves they could theoretically switch faster than Optane but theory and practice often differ).
@AceStrife
@AceStrife 2 года назад
DFM? I remember reading about a non-volatile RAM replacement years ago, but can't remember if it was DFM or something else. Just sad these technologies get announced then.. never seem to come out, or get released to the consumer market.
@VVayVVard
@VVayVVard Год назад
@@grn1 Optane's PCIe latency is still ~1000x slower (on the order of tens of microseconds) compared to low-end DDR4 (tens of nanoseconds). And SSDs with RAID are a joke compared to Optane, never mind RAM. Optane Persistent Memory modules are comparatively faster, but still 10 times slower than DDR4. With Optane's development now halted, it seems unlikely we'll get to the great merge anytime soon.
@chiptrustworthy
@chiptrustworthy 2 года назад
I want to see how DirectStorage will affect load times between various SSDs like comparing a TLC NVMe SSD and an Intel P5800X.
@laitinlok1
@laitinlok1 2 года назад
It didn't just affect load times, some games that doesn't use all the gpu resources because of having too many assets to load and causes a cpu bottleneck will have a huge improvement in performance.
@Cheeseypoofs85
@Cheeseypoofs85 2 года назад
im guessing it will scale with bandwidth of the SSD. so m.2 will perform better than sata SSDs. i could be wrong though. it could also scale directly with RAM frequency and be less effected by latency?? i dunno. just brainstorming here
@Kholaslittlespot1
@Kholaslittlespot1 2 года назад
I would think this kind of tech will only really be a thing with nvme tbh.
@marciusnhasty
@marciusnhasty 2 года назад
@@Cheeseypoofs85 From Microsoft's developer blog directly: DirectStorage is DirectX 12 API for Windows 10 and 11 PC with NVMe storage. Data is loaded to RAM and copied to VRAM without I/O calls involving CPU (GPU was able to read dedicated RAM for decades, now it will have instruments to make I/O call to NVMe storage itself). Purpose of this API is to shift data decompression duty from CPU to GPU to the limit allowed by the hardware and it's firmware (yes some firmware will lock you out on fully capable hardware).
@georgerichards5369
@georgerichards5369 2 года назад
@@Kholaslittlespot1 I guess the real question is will we need something like a PCIe Gen4/Gen5 SSD? Or will there be no notable difference past storing on a 2gb/s Gen3 NVMe drive?
@guycxz
@guycxz 2 года назад
I really want to see how this affects CPU usage, as well as how much it improves loading times on lower end CPUs.
@ceebee
@ceebee 2 года назад
The CPU is still very much involved. The video is misrepresenting quite a bit.
@guycxz
@guycxz 2 года назад
@@ceebee It is stated in the video that its still involved, and that it is an issue that may be solved in the future. The question is how much, and the only way to get a definitive answer is to test it.
@reappermen
@reappermen 2 года назад
It likely won't have much of an impact on load times in practice. It is already just an edge case that the cpu file copying/decompression during load tkmes is the speed limiter. And it should be even less difference in % gain between high and low end CPU's. Keep in mind that you are unlikely to e.g. combine a 3080 12GB with a intel celeron or somethin really low end. And lower end Graphics card usualy also come with less VRAM, and thus also less data loading during loading screens. The actuall rela benefit will be in data streaming, as that happens while the game is actively running and the CPU has plenty to do with that. If your CPU is only running at 20% saving 5% doesn't vhange much. If your CPU is already at 100% and starts having a backlog? Then those 5% make a hughe difference.
@ladislavzima8382
@ladislavzima8382 2 года назад
@@reappermen Smaller VRAM actually means you have to put more data through it in the modern games streaming an "infinite" world (less VRAM to cache existing data you may return to). E.g. it took me 7 hours to run through the starting island in Elden Ring (is there more islands? I didn't fight & level, just travelled to see about 2/3 of the world freely accesible to a beginner character). That happened without the old-school loading screens. Also there are already coming modern GPUs - much faster than what you can have in PC - equipped with "Celeron" class CPU. There's no need to pay for a beefy CPU when the GPU can load the data from NVMe itself. When you scale huge server farms, you save a lot of money on the CPU.
@reappermen
@reappermen 2 года назад
@@ladislavzima8382 You are correct on the smaller VRAm cards needing mroe data streaming. My reference for no load time imapcts there was specifically aimed at the OP's mention of loading screens, which do get faster with smaller VRAM because it can only fill the VRAM once during a loading screen, and the less VRAm there is the fast that is (o nthe same bus). As for there beeing modern GPU's paired with Celerons, that might exist for neural networking or specialised analytics and modelling software on some servers, but those already run their homecooked everything as OS and don't need Direct Storage. For everything gaming related 8which the Op semed to refer to with his post) you absolutely do not want to pair a high end GPU with a cleeron or similar crappy CPUI, as that will massively bottleneck your system no matter whta you play. And finally, there currently isn't really all that much in the GPU server space that is that much 'better' than a 3090/3090ti when it comes to the GPU alone. most of the gains there is in better Bandwith and MASSIVELY more VRAM. I forgot which card it was, but in fact one of Nvidias current highest end Server GPU runs on a slightly changed 3090 Chip, with like 4 times the VRAM and other fringe benefits attached. But for pure cycles, the servers space isn't mcuh ahead. Server software is usualy made to run on multiple GPU's at once if it's really demanding, so getting more cycles in one card has limtied benefits.
@xnadave
@xnadave 2 года назад
Great video, but there are some inaccuracies. You don't have to reload an entire atlas to replace one of the sub-textures. Rather, render the "new" texture into a spot in the atlas to replace one of its components. It's not intuitive, but it's not complicated. Atlases are intended to minimize thrashing when swapping textures, and they work really well. Replacing an element comes with its own cost, but it's not nearly as bad as a round-trip copy. The "blurry lines" between adjacent textures in an atlas are compression and/or sampling artifacts, easily solvable by using lossless compression and properly-selected sampler settings in the shaders. When generating the atlas on the GPU, those artifacts do not exist.
@akiraic
@akiraic Год назад
that is one technique, that is not always used. Because, in the end, the GPU needs to wait for the new changes in the atlas to be made, creating idle time/bottleneck. That's why, in a coding standpoint, just sending a new atlas have the same result in the end.
@xnadave
@xnadave Год назад
@@akiraic Copy and render happen simultaneously. Most asset streaming techniques rely on this.
@Rhythmattica
@Rhythmattica 2 года назад
This was really informative... It actually makes me understand why Sony has licensed Oodle Texture technolog (Oodle & Kraken Decoder for the PS5.)
@darkcaste
@darkcaste 2 года назад
Yeah, there are so many bad takes in the comments section, so I'm glad someone gets it. Btw, this was all explained very clearly in Sony's "The Road to PS5" video two years ago.
@ABaumstumpf
@ABaumstumpf 2 года назад
A few things: First - same as any new API, it must be actively included and developed for by the developers. 2nd - the data still gets loaded into RAM first - it can NOT load directly from storage, so your graphics and assertions are wrong there. 3rd - now the GPU needs to handle decompressions, so it needs a bit more memory and more GPU-time 4th - the comparison shown by calling the same scene twice with different arguments can really change the behaviour as that data is now hot and loads way faster (unless you did multiple runs before, but nothing like that shown). Compression for videogame textures .... that has basically always been around. Doom 1993, nearly 30 years old - RLE compression SuperMario RPG (1996) had hardware decompression And the thing is - there has been, for decades, a way to minimise load-times specifically for PCs: Just load the damn data into RAM BEFORE you need it. If a user has 32 GB of memory there is no reason for a game not to just pre-load the next 4-8 GB of assets the GPU might need soon. And in that case that would yet again be faster than direct-storage as RAM >>> SSD.
@genderender
@genderender 2 года назад
Seriously, this video is shockingly bad coming from LTT
@seeibe
@seeibe 2 года назад
Yeah, that's poor research, especially for Anthony. :( I looked it up, and apparently bypassing RAM and going directly to GPU has been a thing for a while in data center. Would love to see them talk more about that.
@TexelGuy
@TexelGuy 2 года назад
Yeah, the biggest problem in this video is the claim that data goes straight from SSD to GPU VRAM. Microsoft's own directstorage presentation shows that data still has to go through RAM and then to GPU, not sure how they missed that when researching this video. Source: Look up "directstorage flow of GPU assets chart". Posting links is an easy way for a comment to shadow deleted so you'll have to do it yourself.
@ethearnos6397
@ethearnos6397 2 года назад
Lol just witness this dude edit his comment in reall time
@KenS1267
@KenS1267 2 года назад
@@TexelGuy I was almost stunned at how wrong this video was. The traces on the motherboard simply do not allow data to flow the way he claimed. Anthony is usually not this wrong but wow.
@Somerandom1922
@Somerandom1922 2 года назад
I'm curious whether this could be used for render engines. On of the bigger problems for small-medium sized VFX studios is that they'll use GPU rendering, so often (particular with larger or more complex renders, they're limited by VRAM, even with a 3090. Being able to chuck 1TB of super fast tertiary storage, could allow much larger renders on cards that don't have enough VRAM, although likely at a fairly significant performance overhead. However 'slow but available' beats unavailable 100% of the time when it's needed.
@brucepreston3927
@brucepreston3927 2 года назад
I think they just did a video about a Radeon GPU that had SSD slots on the card for this exact purpose...I don't think it was very good though, I can't remember...
@Rugg-qk4pl
@Rugg-qk4pl 2 года назад
@@brucepreston3927 iirc it worked but has a pretty specialized api that wasn't picked up by many major softwares
@mycosys
@mycosys 2 года назад
Theres no reason a fetch over PCIe form the SSD would be faster than a fetch over PCIe from the (much faster) memory controller. Though it would certainly be cheaper, youre probably still better off caching it in system RAM.
@d2factotum
@d2factotum 2 года назад
@@mycosys I agree. Even super-fast NVMe SSDs usually top out at maybe 2-3000Mb/sec, which is around a tenth of the bandwidth of a single DDR4-3200 stick--and once you have dual channel RAM (and what gamer doesn't?) the discrepancy gets even greater.
@Hojogamer
@Hojogamer 2 года назад
True
@Cernunn0s90
@Cernunn0s90 2 года назад
The Crash games for PS1 streamed data off the disc, and Andy Gavin, the lead programmer behind the Crash games has a fantastic dev blog that goes into some of the details. It was groundbreaking stuff at the time, and Sony was worried because they never built the PS1 with that amount of constant disc reads in mind.
@EximiusDux
@EximiusDux Год назад
That's eventually how most big games began to function on DVD based consoles. You can't fit entire big levels in the extremely tiny amount of ram on a PS2, or even the PS3. Crash might have been one of the first to do it, but there is a reason why all PS2 and PS3 DVD based games had to constantly, literally non-stop, read the disc.
@Secret_Takodachi
@Secret_Takodachi 2 года назад
I remember how much Playstation focused on this feature for their PS5 so it's nice to see that in a couple years this console selling feature will be implemented on a platform with a larger number of titles & greater control over system settings!
@samfkt
@samfkt 2 года назад
Still no real PS5 game that use its full potential. Loading times on pc with SSDs and no direct storage are similar to PS5 with it......
@chillnspace777
@chillnspace777 2 года назад
@@samfkt it's coming
@riazuwimpyegg2615
@riazuwimpyegg2615 2 года назад
And so the ps5 continue it's elusiveness till another year if scalper take a lot.... sigh.....
@sanjuskillz8529
@sanjuskillz8529 2 года назад
@@chillnspace777 yes it means PC will be more and more fasters than console.
@sergeysiminyuk
@sergeysiminyuk 2 года назад
@@samfkt thats because consoles don't have the single thread performance that PC does. Traditional loading on PC with a fast CPU and a slower SSD will match the consoles with the fancy APIs. Even Direct Storage doesn't remove the CPU advantage, only helps lower it somewhat.
@Masterrunescapeer
@Masterrunescapeer 2 года назад
Do remember some of the very large size is because on HDD you'd have multiple copies of the same file so it could load it all as one block when you need it, rather than having to jump around (reducing load times), that should drastically reduce some game sizes if HDD support is dropped/ SSD becomes a requirement (as SSD's have way faster load times for lots of small files since no spindle). Note as well, the Nvidia GPUDirect storage is also Linux compatible, currently in beta on Ubuntu.
@rajder656
@rajder656 2 года назад
i don't think the ssd requirement will come to fruition in near future. The high capacity ssds still cost a lot more than HDDs
@Masterrunescapeer
@Masterrunescapeer 2 года назад
@@rajder656 you can get a 1TB SSD for
@demonz9065
@demonz9065 2 года назад
@@rajder656 except high capacity is only a huge worry because the games are so big now. if direct storage also means trimming down file size its not really an issue. also I don't know why people think HDDs would just not work when games are developed for direct storage. itll just become another thing that plays into your overall performance. hdds still see a speed improvement with direct storage. additionally methods to optimize performance on old hardware will be around until its irrelevant. like grabbing smaller assets to display the same thing off a hard drive. a low poly version of the models a hard drive might struggle with
@rajder656
@rajder656 2 года назад
@@Masterrunescapeer and 1 tb hard drive is half of that price in fact you can get 8tb hdd for that price
@rajder656
@rajder656 2 года назад
@@demonz9065 i mean if you store only games good for you. And it also only applies to new new games. If i have idk movies or other stuff alongside games then HDDs are better even without those if you want just games HDDs are better right now and for many years after since we won't see widespread usage until 4-5+ years
@Neoxon619
@Neoxon619 2 года назад
I’m curious to see if more games support DirectStorage in the future. I know Forspoken will, but I hope the likes of Street Fighter 6 follow suit.
@AlRoderick
@AlRoderick 2 года назад
Initially I thought it was weird that you brought up Street fighter because I would think that a fighting game would have one fixed set of assets for an entire match and not need to load anything during gameplay, but then I remember how nice and snappy it was to start matches on old arcade games where everything was in ROM, you could be fighting in under a few seconds. A very nice user experience.
@ItsAkile
@ItsAkile 2 года назад
SF6 of all games, I'm down though
@rajder656
@rajder656 2 года назад
@@AlRoderick let's be honest with ssds it would take seconds to load anyway
@Neoxon619
@Neoxon619 2 года назад
@@ItsAkile It’s more about fast loading, as that’s been an issue with some recent fighting games. Tekken 7 in particular was awful about loading, even on PC.
@ItsAkile
@ItsAkile 2 года назад
@@Neoxon619 early Unreal Engine 4 pains, recent titles much better though. but I get what youre saying, these games should load fast
@IroxX0
@IroxX0 2 года назад
Anthony, thank you for putting in the effort for technical stuff like this.
@captainthunderbolt7541
@captainthunderbolt7541 2 года назад
I think one of the first games to stream in assets was 1999's Soul Reaver on the PS1, so the industry was moving towards that kind of game design even before the turn of the century.
@Kholaslittlespot1
@Kholaslittlespot1 2 года назад
Good game
@Reverii
@Reverii 2 года назад
Love all the new videos with Anthony!
@Ladioz
@Ladioz 2 года назад
Would like more Linus instead..
@DrakkarCalethiel
@DrakkarCalethiel 2 года назад
Heck yeah, give Anthony more screen time!
@EnsignLovell
@EnsignLovell 2 года назад
I love this. I feel like I need to note though, just because hardware is faster shouldn't be an excuse for Devs to cheap out on optimisation. Maybe for demos as proof of concept yeah, but please not for final release.
@electrosssnake1036
@electrosssnake1036 2 года назад
Yea I also can see unity and unreal engine devs be like okay so we put uncompressed files to make load speed faster, right? Oh wait what is this... a 400 gb folder? Also imagine if you're not running SSD. You're IO performance will be killed with these giant uncompressed files that will just eat whole throughput.
@electrosssnake1036
@electrosssnake1036 2 года назад
That might be the reason that DirectStorageAPI will be a niche thing. Because consoles are very cool in a way that you can do all of the tricks with the hardware because you're sure that EVERYBODY will have same hardware.
@Ubreakable-lr2dk
@Ubreakable-lr2dk 2 года назад
they do because it gets the tech industry more money by forcing costumers to buy newer hardware
@gormauslander
@gormauslander 2 года назад
@@Ubreakable-lr2dk you know game devs don't own hardware companies right? They don't benefit from hardware sales
@imo098765
@imo098765 2 года назад
@@electrosssnake1036 It wont be a niche thing because its what developers want
@nicholas9051
@nicholas9051 2 года назад
I literally asked this question about 5 years ago when I was first introduced to m.2 slots... I asked when are they going to allow GPU to directly connect to the PCIe lanes of the m.2 slots instead of going out of the way and then stuff like compression to help.... thanks for the content!
@mycosys
@mycosys 2 года назад
Turns out they still havent - everything teh video discusses is still future plans for DirectStorage :(
@pixels_per_inch
@pixels_per_inch 2 года назад
The data still have to pass through the IO controller, it only bypasses the CPU cores. Since CPUs have embedded northbridge nowadays, which include the IO controller, it means that the data will have to go to the CPU first since there's no direct connection between the PCIE slots
@nightshocker6908
@nightshocker6908 2 года назад
so glad you showed "unreal tournament" i love that game and was really good at it. i won office pro 1997(in 1999) playing that game.
@thatsgottahurt
@thatsgottahurt 2 года назад
Been waiting to see content on DirectStorage for a while. Cant wait to see see what it can really do when we finally a get some games that support it.
@vasocreta
@vasocreta 2 года назад
I pretty much like every video where Anthony is the presenter before I even watch the video now. You win, Anthony.
@19cw
@19cw 2 года назад
Great video, learnt something new and got exicted about it, all at the same time! Thank you
@yoggany
@yoggany 2 года назад
I love tech pieces with Anthony lile this!
@ricardofunes413
@ricardofunes413 2 года назад
In the presentation slides from Microsoft, the data goes from the SSD to the RAM and only then copied into the VRAM. So I am not sure that Direct Storage actually works like depicted in this video, where it is loaded directly from the SSD into the VRAM.
@AlejandroLZuvic
@AlejandroLZuvic 2 года назад
It's kind of weird, in the case of consoles, main memory and video memory are in the same shared pool so you can load assets and both GPU and CPU can point to the same memory address. But in the case of a PC I don't really get how it works, I guess I'll research into it but what you say makes sense.
@mycosys
@mycosys 2 года назад
Youre right - we were told for years it would have DMA and GPU decompression but that hasnt been initially implemented. The vid is completely wrong,
@GebaseerdeKikker
@GebaseerdeKikker 2 года назад
The demo application mentioned and shown, describes this in the README. It is bypassing decompression by doing it in a shader, that's the tech. It's not P2P PCIe. LTT research showing it's worth again.
@yaroslavpanych2067
@yaroslavpanych2067 2 года назад
@@mycosys At last someone remembered about DMA here. When I first heard about this 'Direct Storage API' thingy, my first though was 'DMA? What's new?'
@pixels_per_inch
@pixels_per_inch 2 года назад
Correct, even the CPU has to be involved somewhere since there's CPU connected PCIE. I guess true Direct Storage would only work when everything is connected to the chipset or a switch. Edit: Found more info, apparently it bypasses the CPU (or more specifically the CPU cores) but the data must pass through the IO controller. Since CPUs have embedded northbridge nowadays, which include the IO controller, it means that the data will have to go to the CPU first since there's no direct connection between the PCIE slots
@tynao2029
@tynao2029 2 года назад
Today, yesterday, every day ever...
@KenleyBrooks
@KenleyBrooks 2 года назад
Love Anthony's videos! Love learning the "nitty gritty" of why something works, or works better.
@Bears11422
@Bears11422 2 года назад
I knew this already but I came for. Anthony Thanks for another great show!
@matsv201
@matsv201 2 года назад
While compression streeming have been around pretty much since the 90tys. There have really been very few games that used it. Granted, most game have suported them.... but a lot of them, i would say most, have had it turned of as default and hidden it somewhere in the menues.
@shaunware1232
@shaunware1232 2 года назад
It's not 90tys it's '90s
@catch616
@catch616 2 года назад
Nice English
@CottidaeSEA
@CottidaeSEA 2 года назад
@@shaunware1232 I find that strange to be honest, feels more natural to write 90's, even though I know '90s is the correct way.
@trikewars
@trikewars 2 года назад
@@shaunware1232 you noticed 90tys but missed every other mistake?
@mycosys
@mycosys 2 года назад
So have Direct Memory Access and Bus Mastering - which he completely failed to mention. The transfers havent gone through the CPU cores literally since the first IBM PC. This adds a layer of SOFTWARE primarily, an API to let software tell the OS to use a bus master transfer from the storage to the GPU. It has been possible (just not practical) since the dawn of PCs.
@aaron6807
@aaron6807 2 года назад
08:09 Also, Consoles usually have shared memory, ie.. Cpu and Gpu share the same memory space, so the data is only loaded once So it's already easier to implement there
@wayland7150
@wayland7150 2 года назад
It could be a case of simply passing a pointer rather than moving data. The massive separate VRAM in a GPU is a bit of an anomaly. In home computers of the 1980's the video chip was connected to the same RAM space as the CPU. If the VRAM could be directly addressed by the CPU that could be a huge help. Maybe PCIe gen5 will do this. That's another thing, in 1980's microprocessors everything was on the same bus. IO space and memory space were all the same, just different addresses. Very Alan Turin.
@mycosys
@mycosys 2 года назад
Sadfly the vid is really premature and none of what he talked about has even been implemented. The initial API just provides vastly lower IO overhead on these streaming calls
@JustRockMySoul
@JustRockMySoul 2 года назад
Not even a minute in and you already know that Anthony will once again give insightful foresight presented in an entertaining manner, easy to follow even for people like me that don't have a clue.
@PhonePhreak3z
@PhonePhreak3z 2 года назад
I love all this Anthony content, please keep em coming!
@HalfUnder
@HalfUnder 2 года назад
I'm always happy when there is some new tech that Anthony gets to do a video on. Just has a way of making things easy to digest and understand. So credit both to Anthony and the writing team.
@TheStigma
@TheStigma 2 года назад
I'm pretty hyped for DirectStorage moving forward. I don't think anyone will miss the days of needing loading screens, so I hope and think that it will just be integrated into all modern engines eventually. It will also be very interesting to see how this can be leveraged for non-game apps that need a ton of data-throughput, like video editing for example. That could be a serious game-changer.
@Kholaslittlespot1
@Kholaslittlespot1 2 года назад
Combine tech like this with stuff like UE5s Nanite mesh handler and we're going to see a huge jump in visual fidelity, too. Closer to photo realism than ever.
@AintPopular
@AintPopular Год назад
Very detailed explanation learned alot
@mikejones-nd6ni
@mikejones-nd6ni 2 года назад
Very well explained, i don't think anyone could have done it better 👍👍
@FreeAsInFreeBeer
@FreeAsInFreeBeer 2 года назад
Linux has had the possibility for this for a long time. The question is just if Proton can translate the DirectStorage API in an effective way.
@mycosys
@mycosys 2 года назад
PCs have had the capability to do PCI bus master transfers like this since as long as PCI has existed. And with DMA transfers havent been going thru the CPU since the original PC, cos DMA
@SegaSaturnSubs
@SegaSaturnSubs 2 года назад
Look up io_uring - many of DirectStorage’s ideas come from there, and the main concept of the API (reducing the cost of IO calls) translates pretty much the same into DirectStorage calls. Decompression (when it arrives) will be more complicated, but the base API is already there.
@timothygibney159
@timothygibney159 2 года назад
Linux has non functional Nvidia drivers
@feschber
@feschber 2 года назад
@@timothygibney159 what's that got to do with any of this?
@ethan-fel
@ethan-fel 2 года назад
@@timothygibney159 damn i've been doing deep learning, media streaming and gaming for years with non functional nvidia drivers, thanks for info.
@he1No.
@he1No. 2 года назад
Tysm for the pc build video, i just got finished with my first myself built system!
@meatbleed
@meatbleed 2 года назад
any casualties along the way?
@Ghost-hs6qq
@Ghost-hs6qq 2 года назад
Which one?
@he1No.
@he1No. 2 года назад
@@meatbleed luckyly there wasnt!
@frankcastle13
@frankcastle13 2 года назад
Tell us what you built and your experience doing so please.
@Ja_Crispy
@Ja_Crispy 2 года назад
@@frankcastle13 Is frank castle your real name?? are you the punisher??!
@TYKUHN2
@TYKUHN2 2 года назад
This begs the question though, were the test results valid? You compared compressed vs uncompressed from the looks. This draws into question the validity of the results based on the data given. It is entirely possible it is still a mostly valid result if the compression saved enough time in disk reads that the decompressor was not adding time, but that was not shown.
@jacobbabcock8943
@jacobbabcock8943 2 года назад
the uncompressed is loading faster. thats the important thing. usually compressed assets would load faster
@protator
@protator 2 года назад
You seem to be missing the point. PCs lack the built-in hardware decompression consoles have, so while compressed data doesn't hurt them and basically just increases their effective I/O, the decompression on the cpu has become a serious bottleneck for PC ports. So uncompressed vs. compressed is exactly what this is all about. GPU decompression is going to come but hasn't been implemented yet and is therefore unlikely a part of this demo. Games that use direct storage on PC will most likely have even larger installation sizes as a result, but I'd rather add another $200 ssd than buy an entire new system with higher ipc processor and a 12+GB gpu just to handle the compressed data streams optimized for consoles. Because that's what we've been forced to do until now. A game stuttering on PC while running perfectly on a console with much lower specs has never been anything else than the PC choking on a compressed data stream it wasn't designed to handle. Optimizing a PC port for the most part means changing compression algorithms, package sizes and the overall data streaming behavior of the engine ... so devs should have it a lot easier in the future if optimizing a title for our platform simple means storing assets uncompressed or with lighter compression the GPU can chew through in an instant. With the possible transfer rates of PCIe4.0, the already finalized PCIe5.0 and DDR5x/6/6x memory we are way better equipped to handle that than what we've had to deal with in the past.
@1989ElLoco
@1989ElLoco 2 года назад
@@protator Thanks for your insightful comment mate. So as of know we are waiting for MS to release the second part (gpu decompression). How is this on the Linus side for the steam Deck etc.
@protator
@protator 2 года назад
@@1989ElLoco Haha, thx for the compliment, but though I have been doing some 3d modeling for games in the past I'm basically just another enthusiast who pieces together bits of info from multiple sources. So take my comments should you run across them in the future with the obligatory grain of salt. Either the final or future implementations of this tech will use some form of data compression as well, just in a form that makes it far quicker on a gpu than it used to be using the cpu, bringing consoles and PCs very close together in terms of how they handle asset streaming. What I said before is just my takeaway from what I've read so far and what I assume is showcased in this demo. Well, Linus will probably go all out testing this, but you obviously meant Linux^^. Unfortunately I'm a windows-only guy - although that could always change. The steam deck uses a moderately clocked APU with 16GB shared memory iirc, so the concept doesn't apply 1:1 since you need to shove that data through the cpu and into system memory anyway, even if you can skip the compute intensive decompression. But if there's still some decompression work to be done then at least the cpu will be spared from it and it'll be done on iGPU cores which are still more efficient in that case. With DirectStorage the CPU acts mostly as a bus controller while the gpu exchanges data sets with vram and nvme storage directly, so memory management might be the deciding factor as to how much of the benefits you get with and APU that has tighter memory constraints compared to a desktop pc with dedicated graphics where you simply have more headroom to brute-force things. On the other hand Linux has always been very good in terms of overhead and latency compared to the windows kernel, so I am convinced that steam deck users will see meaningful performance improvements once this gets implemented.
@YezzyHD
@YezzyHD 2 года назад
The results still seem valid in a sense. On consoles the difference is quite huge, but the real world performance we have seen so far has been pretty much useless, but hopefully fixing the compression issues gets us closer to those console loading times.
@stephenmeinhold5452
@stephenmeinhold5452 2 года назад
i am truly impressed by some of the games comeing out recently, i remember the days when we woundered if they would look as good as the movies CGI and now they look as good and in some cases better. gameing or vertual enviroments have so much potential as long as the devs are willing to think outside the box. i think we are just getting started with where games can transport us to and its only being held back by the industry playing it safe.
@photonboy999
@photonboy999 2 года назад
If game's increasingly STREAM assets from an SSD in the future does that mean VIDEO memory requirements will drastically slow down? I've seen demos for DirectStorage that used FAR less video memory to produce the same scene as normal methods.
@Abhi-tr9hj
@Abhi-tr9hj 2 года назад
Probably
@clebbington
@clebbington 2 года назад
Yes, absolutely! Being able to load assets as needed over the PCIe bus instead of prefetching assets will definitely reduce how much data needs to be sitting in VRAM for a scene to be rendered quickly
@devaraft
@devaraft 2 года назад
Idk about that but at most it'll probably be stagnant. But maybe they'll focus on speed more in the future. Nvidia did prove that speed is extremely important. Who knows we might even go back to HBM2 in the future
@mycosys
@mycosys 2 года назад
not enormously - they still need to hold all the textures for the scene being rendered - it would save main system RAM (because you dont need to copy all the stuff there first and probably cache it there) though
@ZackSNetwork
@ZackSNetwork 2 года назад
8gbs should still be the bare minimum for 1080p gaming. 12gbs would be enough for 1440p and 16gbs for 4K.
@Eoraph
@Eoraph 2 года назад
"...is loading nearly THREE TIMES FASER." 0.33 / 0.08 = 4.125 I mean, in the grand sceme of real numbers, 3 and 4 are quite close but woud "OVER 4 TIMES FASTER" sot have sounded even better?
@deadasfak
@deadasfak 2 года назад
Yeah, I was like "wait did my quick maths skill failed me?"
@hxd9321
@hxd9321 2 года назад
Congratulations 🎉🎈 you’re the 500th person to comment this!
@mario9896
@mario9896 2 года назад
I don’t often watch LTT videos but Anthony is killing it and definitely seems much more comfortable.
@Julzilla
@Julzilla 2 года назад
This will be really exciting for editors as well, I presume you will be able to scrub though video\audio without the CPU being affected?
@Catsrules1
@Catsrules1 2 года назад
Decompression on the GPU sounds really cool. My question is will that require new hardware or can it be added later in software? For example could my GTX 1070 do decompression when that feature is available or will it require a new GPU with a specific chip onboard dedicated for decompression?
@flameshana9
@flameshana9 2 года назад
I've always heard it called "hardware decompression."
@Velocifero
@Velocifero 2 года назад
Well, Nvidia allready announced RTX I/O which as you can guess requires a RTX GPU.
@jayrosgameboymodz5438
@jayrosgameboymodz5438 2 года назад
I feel that it could be done in Software on older GPUs, but include an instruction set for compression in future hardware revisions. Kind of like how RTX 20x0/30x0 cards can do real-time ray-tracing in hardware, and GTX 10x0 cards can only do it in software.
@pixels_per_inch
@pixels_per_inch 2 года назад
@@jayrosgameboymodz5438 That seems to be what's happening, RTX series have Tensor cores which will aid in the decompression. It still can be done on the shader cores but that's at the cost of your graphics performance.
@electrosssnake1036
@electrosssnake1036 2 года назад
@@jayrosgameboymodz5438 I mean I doubt nvidia will push such a big driver change including the CUDA version of decompression. It will be a big change that can lead to potentially broken compatibility or just something else buggy. They just won't do it.
@idcrafter-cgi
@idcrafter-cgi 2 года назад
such a feature but opensource would be nice and great for the steam deck
@lennart3100
@lennart3100 2 года назад
Not only for the steam deck, but for linux gamers in general.
@darin7553
@darin7553 2 года назад
The sad reality is that Microsoft does not give a damn about Linux unless it is serving there interests
@kazioo2
@kazioo2 2 года назад
Funfact: Nvidia had a similar feature (GPUdirect) for professional uses on Linux something like 5 years ago.
@idcrafter-cgi
@idcrafter-cgi 2 года назад
@@darin7553 yes like their game streaming thing
@idcrafter-cgi
@idcrafter-cgi 2 года назад
@@lennart3100 i am a Linux gamer since 2019 that is why i mentioned the steam deck as Linux gaming
@andreylucass
@andreylucass 2 года назад
Great video Anthony, as always.
@Kholaslittlespot1
@Kholaslittlespot1 2 года назад
Been banging on about this for a while. Can't wait to see it all come to fruition.
@TexelGuy
@TexelGuy 2 года назад
FYI: SATA SSDs will also see a decent improvement in speed, at least, according to forspoken's benchmarks with and without directstorage. NVMe SSDs will see a much bigger improvement because the overall drive speed itself is much higher. This is pretty surprising as directstorage was always something the was said to _require_ an NVMe SSD to function at all, so not sure why they were saying that when SATA SSDs also see noticeable improvements (Around 20% in forspoken)
@demonz9065
@demonz9065 2 года назад
yeah idk why people would think an SSD was a necessity. the benefit comes in removing steps required to get an asset from the drive to the GPU. those steps arent something that only exist in an SSD so why would other storage media have no benefit?
@TexelGuy
@TexelGuy 2 года назад
@@demonz9065 I think the reasoning I heard most was that NVMe drives use PCIe lanes instead of the SATA interface, which means they have a direct connection with GPU and RAM whereas SATA has to go through the CPU. Despite that SATA SSDs still see improvements, so there's definitely more to directstorage than utilizing the NVMe interface. Still not sure why they _required_ NVMe drives when they announced directstorage, when forspoken devs were able to run it from a SATA drive and still see decent improvements.
@lexsanderz
@lexsanderz 2 года назад
Nobody said that. Tech media said that. Tech media doesn't read dev blogs. DS works on floppy disk.
@lexsanderz
@lexsanderz 2 года назад
@@TexelGuy that is all false. DS takes data from NVME to system RAM and then again to GPU VRAM. It's not direct over PCI to PCI. Watch the microsoft video, it has 4k views cleary nobody wanted to be educated.
@TexelGuy
@TexelGuy 2 года назад
@@lexsanderz 1. NVMe drives were always implied to be the _only_ way to use directstorage in ALL of the information made available by microsoft online. 2. You seem to be misunderstanding my comment, I never said data can go from NVMe to GPU directly, only that it provided a direct path for it through RAM without going through the CPU like on a SATA drive.
@robotparadise
@robotparadise 2 года назад
Wondering if the 2nd render (directstorage) took assets from the SSD's on-board cache (loaded during 1st render).
@carpenecopinum1665
@carpenecopinum1665 2 года назад
The directstorage demo read a different file though, so it's kinda improbable that the SSD had that one cached *. Also I trust Anthony to be competent enough that he ran the demos in the opposite order at some point, just to make sure that the caching isn't massively skewing the results. * "improbable" because SSD manufacturers may implement heuristics and other magic to help caching and make their SSD look/feel faster. But since the file was like 52MB big, I'd feel that the SSD won't decide to pre-load and cache it, even if some magic software layer concluded "He always loads these two files one shortly after the other.".
@robotparadise
@robotparadise 2 года назад
@@carpenecopinum1665 I concur 🤝
@howardlam6181
@howardlam6181 2 года назад
been waiting for this for tensorflow
@TheLargedwarf
@TheLargedwarf 2 года назад
Machine learning will love this. Training on a big dataset can give you lumpy GPU utilisation graphs even with batch streaming, as the data is loaded to CPU/RAM between batches. This is especially bad due to how bad parallelism works in python, but with direct data loading that work can be shifted to a more parallelism friendly API and be faster.
@dan_loup
@dan_loup 2 года назад
This is literally the same thing as the sega genesis "blast processing", an DMA circuit that can copy data from the storage to the video memory a lot faster than the CPU can.
@thatkles
@thatkles 2 года назад
The SNES had DMA too, but it just wasn't part of their advertising campaign.
@mycosys
@mycosys 2 года назад
The IBM PC had DMA. Whats worse this video is REALLY premature and DrectStroage doesnt have DMA or GPU decompression yet. ATM its an API that improves IO call overhead.
@dan_loup
@dan_loup 2 года назад
@@mycosys Sadly they didn't evolved this side as much as they should, otherwise well, we wouldn't quite need direct storage. It would be pretty cool if PCs could do timed DMA transfers etc..
@xeridea
@xeridea 2 года назад
"Blast Processing" was 100% a marketing gimmick. Some marketer overheard discussion of a bug in the VDP, during conversation the words "blast" and "processing" came up, so they just put them together for marketing. It is a real technique, though very obscure, not used by any commercial games, and was only figured out to be viable a few years ago. The fact that it even works is pure luck. It allows using more colors than the color palette allows by abusing a bug, advanced manipulation of the CPU to sync with CRT, and consuming 100% of the CPU, so isn't really feasible in gameplay. The SNES also had DMA, and was used a lot. The genesis was faster CPU, though the SNES had an advantage with Mode 7.
@dan_loup
@dan_loup 2 года назад
@@xerideaOf course it's a marketing ploy. also i think, not 100% sure that the sega genesis can actually copy data outside of the vblank time, which allows it to move a lot more tile data per frame than the snes, which was abused by games like sonic etc..
@ThePlayerOfGames
@ThePlayerOfGames 2 года назад
Fully expecting Microsoft to try and crowbar this into the Linux kernel in order to pull an Embrace, Extend, Extinguish
@Flameancer
@Flameancer 2 года назад
But wouldn’t that be good for Linux? What if they made an open source version of direct storage?
@ThePlayerOfGames
@ThePlayerOfGames 2 года назад
@@Flameancerit's worth understanding the difference between "open source" and "proprietary". Teslas designs are open source and proprietary, which means they can go after anyone that makes anything like a Tesla because they can say you copied them. Linux is open source & GNU licensed which means you can modify & copy to your hearts content. It's worth reading up on Embrace, Extend, Extinguish directly, I will suck at explaining it because it's like the financial system; deliberately hard to look into The Extend, Extinguish aspect of what Microsoft does can include several strategies such as: a) Having an open source nonproprietary version that gets less love than the main one. This is seen in the Open Document Format that Microsoft got forced to use, technically you should be able to use any Microsoft-software made document on any other editor easily but they set about specifically adding functions that competitors couldn't use or replicate easily to make interoperability harder or impossible. You end up with two standards, the official one and the proprietary one at the same time. b) They bloat the standard directly. Assuming it becomes a nonproprietary standard they can have it get bigger, more demanding, harder to develop on until nobody wants to use it, it slows systems down, or slows development down. They can then have the good version they keep internally, rename it something like "Xbox Microsoft Ultimate Gold Windows Fast Loading System", and release it as proprietary in their products. c) Linked to b they can also either use industry links to have hardware that prefers their proprietary standard released or they can include code in their own systems that skips the issues. nVidia does this with their drivers; they include code their old hardware has to run in addition to its normal job that newer hardware will skip over
@elementw74
@elementw74 2 года назад
The groundwork to make this possible was laid all the way back in 2016. It is possible to export an OpenGL or Vulkan buffer object (with GL_EXT_memory_object_fd & VK_KHR_external_memory_fd respectively) as a dma_buf file descriptor. What is left to do is to allow syscalls like "copy_file_range" or "sendfile" to initiate DMA transfers to the GPU buffer from userspace; a thing they can't do yet (only fs-to-fs or fs-to-pipe/socket copies), but in theory is possible. Slap that in an io_uring OR poll the dma_buf object and you've got a DirectStorage equivalent.
@webkahlifa
@webkahlifa 2 года назад
Texture atlases are still a thing for particual use cases, usually enviroments and props. Even today we game devs struggle with drawcalls on lower end platforms like switch and today even PS4. Khm..Cyberpunk..Khm
@tomeg1370
@tomeg1370 2 года назад
I really love how there is more videos with Anthony. He is awesome. :D
@crayzeape2230
@crayzeape2230 2 года назад
There still has to be some CPU involvement to direct the video card to the blocks it needs from the drive. I doubt the video card understands filesystem structure, which could be FAT32, exFAT, or NTFS at this point.
@yaroslavpanych2067
@yaroslavpanych2067 2 года назад
Most likely, API will take care about that: it will determine blockaddress to read from and pass them to GPU, to initiate exchange.
@electrosssnake1036
@electrosssnake1036 2 года назад
@@yaroslavpanych2067 but what if the file is defragged? It will read garbage then.
@electrosssnake1036
@electrosssnake1036 2 года назад
Actually the question of how hardware-deep memory access DirectStorageAPI will use under the hood could actually be a very important performance factor. It'll make huge difference. From my experience from working with DirectAPI's(mostly d3d's) it will have numerous capabilities enumerations that will vary in hardware->software with hardware things being fastest but requiring hardware and software being garbage but working on any system(sorta HAL).
@protator
@protator 2 года назад
@@electrosssnake1036 dude, data fragmentation and read errors are too completely different issues, drive fragmentation just means the read heads of your hdd have to jump from place to place in order to gather the requested data instead of reading it in one go. Basically random I/O verses sequential read performance. But that's mostly a thing with spinning rust, and this tech is meant for current gen systems with NVMe storage, not 20 year old toasters.
@electrosssnake1036
@electrosssnake1036 2 года назад
@@protator dude not everybody has ssds. Question is what will happen if you do not have last gen tech. Such as hdd which could be even slower if fragmented.
@wavehockey150
@wavehockey150 11 месяцев назад
Here we are more than a year later, and there is literally only one game that supports it, and it has very mixed reviews.
@DSON89
@DSON89 2 года назад
Love seeing videos with Anthony. He is wholesome and knowledgeable.
@sr43ezh
@sr43ezh 2 года назад
i realy like the calm way how anthony presents topics. even a noob like me can get the gist of it. big-ups to anthony.
@leesanction2068
@leesanction2068 2 года назад
I really hope this also helps with frametimes. Im so tired of modern titles having horrible stuttering issues. Really breaks immersion and sometimes hinders gameplay. Maybe with faster asset streaming, we can not only see faster load times but also smoother gameplay.
@rajder656
@rajder656 2 года назад
ekhem elden ring ekhem
@GamerzPro1231
@GamerzPro1231 2 года назад
this is something to do with dx12 not ssd. we literally had great frametimes all generation until that piece of shit came out
@rtyzxc
@rtyzxc 2 года назад
@@GamerzPro1231 It's problem in dx11 games too. It's an issue of shitty game engines that are designed for ease of development and not performance.
@motorolah8107
@motorolah8107 2 года назад
@@rtyzxc but MUCH less than dx12 ones, to be honest i can't remember any dx12 title that doesn't have heavy stuttering problems (other than fh5)
@joshjlmgproductions3313
@joshjlmgproductions3313 2 года назад
@@motorolah8107 FH5 doesn't have heavy stuttering? That's news to me. 80 hours in-game and I'm finally getting tired of how bad of a game it is.
@DragoDan7786
@DragoDan7786 2 года назад
Keep up the great content
@saph1235
@saph1235 2 года назад
Could we get testing on different quality drives and motherboards? I'm sure that using a lower bandwidth drive, or even differences between PCI gen 3 and 4, and then different motherboard tiers to see how direct storage is affected
@TheRealEclipse1
@TheRealEclipse1 2 года назад
I love seeing new efficient innovations. One question, is this only for SSDs or will this also work with hard drives as well?
@wyk997
@wyk997 2 года назад
PCIE 3.0 or Higher NVMe SSD with at least 1TB of space and DirectX 12 Ultimate compatible GPUs
@TheRealEclipse1
@TheRealEclipse1 2 года назад
@@wyk997 Ok, that makes sense. I have an RTX 3060 but still using all hard drives. One of the many reasons why I really should upgrade my storage 😂
@wyk997
@wyk997 2 года назад
@@TheRealEclipse1 Getting an NVMe SSD last year was the best upgrade I made to my pc in years it is worth all the investment
@cruizera2194
@cruizera2194 2 года назад
Great job Anthony and gang, you explained this so very well, particularly the industry common practices and remaining challenges. Eager to see what it can do.
@randombailey1166
@randombailey1166 2 года назад
This is going to be a game changer, especially on lower end CPU's.
@alienrenders
@alienrenders 2 года назад
This would be great for megatexturing. You could have a cache of texture tiles on the onboard SSD. So basically two levels of megatexturing. Basically, the GPU could use megatexturing and the SSD could use clip mapping. Megatexturing is using the exact tiles that are seen on-screen. Clip mapping is using loading the highest resolution tiles for those that are near the camera and gradually load coarser resolution tiles the further away they are. This used to be limited when using the GPU. But with an SSD, you could prefetch a lot more data and significantly reduce texture popping when using magatextures. In fact, only far away coarse textures would be prone to texture popping and since they're already coarse, the effect would be far less noticeable if it happens at all. You could do the same for 3D assets and load higher resolution meshes for nearby objects and lower resolution the further away they are with as much overlap as possible for multiple LODs.
@gustavosaliola
@gustavosaliola 2 года назад
Thanks Anthony for a great presentation of a technology unknown for me. Could be this applied to Postproduction / Motion Graphics / Animation on the cache side of things? Like for speeding up previews on After Effects and its interaction with Premiere and C4D? Any Tech Savvy Geek Good Soul in the room?
@Broken4forever
@Broken4forever 2 года назад
Linus got such a good partner, I enjoy Anthony's energy every video.
@thebyzocker
@thebyzocker 2 года назад
Fellow Anthony enjoyer
@ray076NL
@ray076NL 2 года назад
i was just thinking the same and looking for such a comment. also like that his content is more informative and goes deeper, you actually learn something here.
@Broken4forever
@Broken4forever 2 года назад
Reminds me of old days linus is alot more busier now and I'm sure a bit more annoyed, nice breathe of fresh air that reminds me of older days.
@Tom5TomEntertainment
@Tom5TomEntertainment 2 года назад
I just imagine video games in my head. It's free and the cops can't stop you.
@JeremyHansenblue2kid3
@JeremyHansenblue2kid3 2 года назад
This is very helpful and exciting. I love Anthony he's my favorite at LTT
@VonSpud
@VonSpud 2 года назад
Exciting stuff coming down the pipe .. Will we need compatible motherboard, cpu and gpu to qualify for direct storage in future ?
@tudalex
@tudalex 2 года назад
The last 2 generations of consoles (and actually phones) also have unified memory, the gpu uses the same memory as the cpu so there is no need to copy it to vram
@rajder656
@rajder656 2 года назад
i mean apus do that too. I mean apple is using this in m1 macs too
@genderender
@genderender 2 года назад
@@rajder656 APUs do not share memory. They simply segment it. You literally lose chunks of system memory, which then has to be copied from that chunk of system RAM to the chunk allocated for the APU
@rbdunne
@rbdunne 2 года назад
I'm curious to see how difficult this will be to implement in compatibility layers like Proton for Linux/SteamDeck. I don't trust MS to make things easy now that SteamDeck has proven to be so popular.
@FunFreakeyy
@FunFreakeyy 2 года назад
To quote Microsoft: "Microsoft Loves Linux" May 6, 2015 (Can't post the link, because it gets deleted, but you can find it easy) Guess it's time to stand by their word and take action with that statement. Especially because hardware-wise the SteamDeck already is the console-like platform that's also a PC, that gets mentioned in the video at 8:15.
@MostafaAhmed-bp7ce
@MostafaAhmed-bp7ce 2 года назад
In all honesty I don't think Linux competes directly with Microsoft, it is just not convenient enough to be used by normal people for everyday use
@daedalusspacegames
@daedalusspacegames 2 года назад
Word on the street is that the DirectStorage .dll files already work in Wine, no additional effort required.
@potens1
@potens1 2 года назад
I wonder if HMM (Heterogeneous Memory Management, wich does already exists in Linux for ages -2017- and permit, if I'm not wrong using GPU memory "as any RAM") can not help here, or if does solve only a part of the problem (because I don't understand everything but I -think- it will only bypass the RAM storage, but not the fact the CPU still needs to be involved)
@wayland7150
@wayland7150 2 года назад
@@potens1 Direct Memory Access or DMA has been used by network cards for decades. The device is configured to put the data directly into RAM and then ping the CPU when it's done. I don't know the specifics but the same principle should allow storage to put things in VRAM. The CPU could issue a command to the storage which would then get the task done and ping the CPU when done.
@billrichards2177
@billrichards2177 2 года назад
So excited! This will be awesome! I would like to see Steve from GN and Anthony do a collaboration on something.. anything!
@Howling_Mad
@Howling_Mad 2 года назад
great video, thanks Anthony
@MrEnyecz
@MrEnyecz 2 года назад
I hope you know that GPU and SSD can directly read/write system memory, so the CPU cores were never heavily involved (except decompression). This is DMA (well, it's modern version, where there is a request PCIe packet sent). Moreover, usually, there is no other path from the SSD to the GPU but through the PCIe root complex, which is basically the PCIe controller inside the CPU. The only real thing left out is the system memory; we now have a path SSD->PCIe controller->memory->PCIe controller->GPU, and GPU direct can do SSD->PCIe controller->GPU. (And note that for data centers, NICGPU copy is already a thing since at least 6 years; now we just have SSDGPU in Win.)
@mycosys
@mycosys 2 года назад
I made the same point - then discovered the vid is REALLY premature and the DMA and GPU decompression havent even been implemented yet. ATM its just an API that improves IO call overhead.
@jesper164a
@jesper164a 2 года назад
Imagine GTA 5 without 10 min loading screens 😍
@sachoslks
@sachoslks 2 года назад
The PS5 version is loading in around 20 seconds now iirc (still waaaay too long when you compare it to other first party Sony games that load in less than 3 sec)
@jesper164a
@jesper164a 2 года назад
@@sachoslks Are you for real? When i open GTA online it takes between 5-10 mins to actually join a server. us PC users just really get shit on by rockstar xD
@Rikimkigsck
@Rikimkigsck 2 года назад
You are just so good at explaining stuff.
@iplaybose
@iplaybose Год назад
Very nice video thank you anthony!
@yeahhbuddy3932
@yeahhbuddy3932 2 года назад
Consoles forever in PCs shadow
@thedarkdojo5115
@thedarkdojo5115 2 года назад
False...
@sethb1689
@sethb1689 2 года назад
One of the best ways to see the impact of direct storage imo is Ratchet and Clank Rift Apart when going through the rifts, especially the part towards the end of the game when you are quickly going through a sting of rifts bringing you to a bunch of drastically different looking areas. That has been one of the very few moments so far in next gen games that really shows the advantage of next gen hardware. This is why i think Microsoft's decision to keep supporting xbox one with all new games for the foreseeable future is a massive mistake regardless of them claiming last gen isn't holding next gen back, that line is a complete and utter lie, they know it as does anyone who has played a true next gen game. The amazing rift mechanic in the newest Ratchet and Clank game is simply impossible to do without the extremely fast ssd and direct storage, especially when looking as good as it does. Microsoft making all games also work on xbox one prevents them from adding core gameplay changing features like that. Fast/near instant load times are nice but things like the rift mechanic are where the new direct storage feature truly shines. It allows true open world games that don't require ANY tricks to hide loading screens like mini cutscenes when going inside buildings, you just go right in. I am sure it enable even more really cool things that couldn't be done before.
@lalu613
@lalu613 2 года назад
👏🏻👏🏻👏🏻👏🏻
@frazmunir
@frazmunir 2 года назад
beautifully explained. im glad this time it was anthony explaining because last time (like serveral times) linus messed up when this tech was introduced on ps5 and had to sorry later. i guess linus should focus more on understanding tech instead of outright making fun of it
@TheAyanamiRei
@TheAyanamiRei 2 года назад
Will those of us with older hardware be able to get good results too? Or is it something only for the newest hardware? Would love to see this with a variety of hardware!!
@FatheredPuma81
@FatheredPuma81 2 года назад
Oh yea I remember now. This is that tech that everyone praised those devs for implementing into their indie game. Was truly revolutionary seeing a loading screen get replaced... with a... loading... screen...
@AlexYTx1
@AlexYTx1 2 года назад
Direct Storage : Comes onto PCs Rockstar Games : Casually re releasing GTA V for PC , saying "With faster loading Screens". (If you don't get it, it's cuz they said " Seamless character switching " when advertising Expanded and Enhanced, but it was all because of the console's hardware)
@wayland7150
@wayland7150 2 года назад
GTA V had that slow loading issue on the PC for years until last year when someone hacked one of their DLLs and fixed it.
@m1_zt360
@m1_zt360 Год назад
3:42 Damn that shot brought me back
@midnightchurningspriteshaq8533
this will allow for further cpu implementation as well, because you will have the option to use additional cpu resources with your gpu to improve game quality. cpu load reduction will not only save money, but can be diverted to other necessary calculations.
@SapioiT
@SapioiT 2 года назад
Not 3 times faster, in the tool Microsoft provided, but slightly more than 4 times faster. 8 x 3 = 24, 8 x 4 = 32. DirectStorage On = 0.08s, DirectStorage Off = 0.33. So you could load the same thing with Direct Storage 4 times, with time to spare.
@nitemantube
@nitemantube 2 года назад
Indeed. I noticed the same mistake, and watched the video again to make sure. Anthony is great nonetheless 👏
@joshuacheung6518
@joshuacheung6518 2 года назад
Margin of error
@FearInfected
@FearInfected 2 года назад
Thats why he said, clearly, 3 TIMES FASTER, not 3 TIMES AS FAST. Can you spot the difference here?
@Viruzzz
@Viruzzz 2 года назад
@@FearInfected When you use "times" it implies multiplying, i.e. 3 times faster mean 3 times the baseline speed, not 3 times the baseline speed in addition to the baseline speed. If you talk to normal people and use this kind of language, they will naturally assume it's a simple multiplier, which is also why nobody uses "1 times faster" because it doesn't make any sense, you say twice as fast/two times faster, those two will be understood as the same thing by basically everybody. If you google "1 times faster" the top results are people trying to justify why "2 times faster" actually means 3 times by using "1 times faster", in real life "1 times faster" isn't used because it's obviously ambiguous. And besides, he clearly got the math wrong anyway, as he said "nearly 3 times" while in fact it was slightly greater than 4 times faster, I would guess the script may have been written with different numbers and wasn't updated or something like that, because the math is wrong no matter which way you slice it.
@SapioiT
@SapioiT 2 года назад
@@joshuacheung6518 I would beg to differ. At least in gaming and performance-related tasks, it makes a big difference.
@AshtonSnapp
@AshtonSnapp 2 года назад
Now we just need something like DirectStorage but for Linux…
@noodle_Cx
@noodle_Cx 2 года назад
Lol
@Rick-kb
@Rick-kb 2 года назад
Haha no
@seshpenguin
@seshpenguin 2 года назад
Given it's a DirectX technology, it'll be the responsibility of Wine/Proton and DXVK... though it might need some kernel level support too?
@ShiroCh_ID
@ShiroCh_ID 2 года назад
@@seshpenguin might be!
@nrdfoss
@nrdfoss 2 года назад
yea
@jaleger2295
@jaleger2295 2 года назад
Those movements in the scene definitelly gave me some photo epilepsy headache. Nice
@brianb6969
@brianb6969 Год назад
nice Vid Anthony! Could you tell me How to Safely Overclock my build?
@xHdDude
@xHdDude 2 года назад
*Today, PC Gaming catches up to Consoles* r/pcmasterrace: *sweating*
@stiegelzeine2186
@stiegelzeine2186 2 года назад
Nobody is sweating because after all PCs are superior and everyone knows that it’s just a fact and not even in terms of loading times the ps5 is superior because direct storage only reduces the loading time from 2,1 seconds to 1,9 seconds on a m2 ssd thats only 10% shorter PCs have a shorter loading time without direct storage than the ps5 does with direct storage
@ShiroCh_ID
@ShiroCh_ID 2 года назад
@@stiegelzeine2186 tru,is hould say OP means Consoles that sweating instead altough for gaming only console is enough while PC Can multitask
@stiegelzeine2186
@stiegelzeine2186 2 года назад
@Transistor Jump they actually do because the most used consoles are still the last gen and the most used gpu is the 1060 wich is comparable to a ps5
@stiegelzeine2186
@stiegelzeine2186 2 года назад
@Transistor Jump keep dreaming kid
@rklrkl64
@rklrkl64 2 года назад
Let's hope that Proton on Linux will be able to translate DirectStorage calls into native equivalents or Linux gamers (Steam Deck anyone?) could be in for an increasingly rough time w.r.t. new game releases. I'm surprised Anthony didn't at least touch on this subject, especially since his boss is spending a month exclusively playing games on the Steam Deck!
@isntyournamebacon
@isntyournamebacon Год назад
The first time i remember reading about asset streaming was Crash Bandicoot on PS1. Yes PS1, it's why the levels undulate up and down and you couldn't move the camera. Hiding asset loading. The opposite of that on PS1 was Ridge Racer. That game loaded the whole thing at the start and you could swap out the game for a music CD and keep playing. Im thinking Bethesda has to be in on this for Elder Scrolls 6. Since they are using photogrammetry for assets. Loading them in as you move between exterior cells wouldn't be a problem anymore. Anyone who loaded up the original Skyrim with a tone of mods back in the day can tell ya. It would stutter like crazy if you added to many large texture mods.
@pikliGN
@pikliGN 2 года назад
So... Are PC parts afordable now or what?
Далее
This SSD is Faster Than Your RAM - Apex Storage X21
22:36
KO’P GAP ESHAKKA YUK!😂
00:57
Просмотров 689 тыс.
The Slow Death of Windows
17:22
Просмотров 1 млн
Can you game WITHOUT driver updates?
11:55
Просмотров 1,9 млн
When Will Direct Storage Be The Norm In PC Gaming?
2:58
Dear Apple - The iPad needs help.
11:47
Просмотров 907 тыс.
Making Nvidia’s CEO mad - RTX 3090 Review
15:52
Просмотров 4,3 млн
You SHOULD buy Used GPUs - Testing Used Mining Cards
14:38
Don't let Microsoft get away with this!
13:33
Просмотров 1,2 млн
Полезные программы для Windows
0:56
Pratik Cat6 kablo soyma
0:15
Просмотров 8 млн
What’s your charging level??
0:14
Просмотров 7 млн