The Future of Chip Memory. The End of the SRAM Era

Просмотров 160 тыс.

% 9 334

Check out New ASUS Vivobook S 15: asus.click/vbs_anastasi
#ASUSCopilotPlusPC #ASUS #Vivobook #Microsoft
Modern CPUs, GPUs and SoCs have a major problem. SRAM memory scaling is dead. In this video I discuss new disruptive memory technology that may solve this problem.
Timestamps:
00:00 - Major Problem with Modern Chips
09:00 - Possible Solution
11:24 - New Memory Technology Explained
LinkedIn ➜ www.linkedin.com/in/anastasiintech/
Support me at Patreon ➜ www.patreon.com/AnastasiInTech
Sign up for my Deep In Tech Newsletter for free! ➜ anastasiintech.substack.com
Website: anastasiintech.com

Опубликовано:

12 июл 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 580

@AnastasiInTech 29 дней назад

Check out New ASUS Vivobook S 15: asus.click/vbs_anastasi

@luckspell 29 дней назад

Please explain why we don't have quantum computers with Ning Li's room temperature superconductor?

@YodaWhat 29 дней назад

@Anastasi In Tech - What about using i-squared-l logic and/or vacuum channel FETs, possibly on chiplets? I2L seemed very promising when first introduced, but it's power consumption was high since transistors were all large at that time. As a bipolar technology it will not suffer from gate leakage problems. Are there any other reasons why it might not work? As for "vacuum" channel FETs, they are 10 times faster or more, partly because they use free electrons. They also benefit from nanoscale features, are extremely radiation resistant, and they can operate comfortably at temperatures up to hundreds of degrees Celsius. Also they don't actually require vacuum when built at small nanoscales.

@fluiditynz 29 дней назад

@@YodaWhat This is about Anastasi's Asus Vivobook commercial she boldly snuck into her main content?

@YodaWhat 29 дней назад

@@fluiditynz - I left my comments and questioon here because it is the most likely place for her to see it. Nothing to do with the laptop she's promoting.

@hdcomputerkeith 29 дней назад

xoxooxoxoxooxox

@StephenBoothUK 28 дней назад

When I first started programming, and RAM was off chip and typically a few KB, we'd spend a lot of dev time working out how to do as much as possible in as little RAM as possible and as few clock cycles as possible. These days the demands to cut development time and get new features out, more driven by senior management and Product Owners than by real customer demand, seems to have ditched those ideas. If it's too slow the customer is expected to just buy a higher spec machine and new developers are taught ways to shorten development time but not execution time. I think that this is a false economy. About 10 years ago I was able to shorten a big data-processing job from 3 days to under 20 minutes, on the same hardware, by applying the techniques I'd learned back in the 1980s to key functions. It took me 5 days, but when this is something that has to be run every week the saving soon stacks up

@crazyedo9979 27 дней назад

You are absolutely right. Once I participated in a service job to get a power station running. The problem was to bring the gas engines up and running as fast as possible. After a few days the programmer had been flown in and looked for alternative assembler commands to save a clock cycle here and a clock cycle there.😁

@NullHand 26 дней назад

Wirth's Corollary to Moore's Law: Any improvement in Hardware performance will be negated by code bloat at an equivalent rate. Kinda like traffic in London.

@gorilladisco9108 26 дней назад

It's not a false economy, just a different emphasize due to the change in price structure. In the old days, memory were expensive, so we tried to economize its use. Today's memory are so cheap, that software developing time has become the most expensive part of a system.

@StephenBoothUK 26 дней назад

@@gorilladisco9108 the cost of memory is largely immaterial. It’s the cost of execution time. Say you’ve got a transaction that currently takes 10 minutes to complete but if the code was optimised would take 7 minutes. To optimise the code would take the developer an extra 5 days effort and the developer earns £30 an hour (that’s the mid-point for a developer where I work), so that’s about £1100 wage cost but once it’s done that cost is done. Once rolled out the application is used by 200 people paid £16 an hour (I have some specific applications we use in mind here). Saving 3 minutes per transaction means either those same staff can process 30% more transactions or we can lose 60 staff at a saving of just over £7000 a day. That extra development time would repay in a little over an hour on the first day and after that would be pure cost saving.

@mititeimaricei 25 дней назад

NO COPILOT! NO RECALL! This future is PRISONPLANET!

@AdvantestInc 29 дней назад

You really have a knack for making complex topics engaging and easy to follow for everyone! Breaking down the challenges of SRAM and introducing phase change memory in such a clear manner is no small feat. Excited for more content like this!

@soufianebellahbib7808 29 дней назад

👍🏽💚🌴☀️🌏

@KGopidas 28 дней назад

Has datsbus ended?

@soufianebellahbib7808 25 дней назад

@@Raphy_Afk 😂😅no..my English is bad🐪☀️

@soufianebellahbib7808 23 дня назад

@Magastz love💚and peace 🌏

@AnthraxVX 16 дней назад

Not bad on the eyes either

@ego.sum.radius 29 дней назад

Science communicators who actually are professionals in their field are allways welcome. Thank you Anastasi

@nicholasfigueiredo3171 29 дней назад

I didn't even know she was from the field, I thought she was just smart. But I guess that makes sense

@Sergei_Goncharov 27 дней назад

The point "good endurance 2*10^8 cycles" prohibits its use for cache memory. But it's really a viable and competitive option as a replacement for Flash memory!

@timothym.3880 27 дней назад

So, the two biggest old school technologies that are slowing progress seems to be memory and batteries.

@prophetzarquon1922 6 дней назад

Yup! Also, a shortage of railways.

@simonescuderi5977 28 дней назад

The problem with chiplet design is heat management. Since every layer is active, it burns energy and produces heat, and this isn't good. A secondary problem is the bus interconnect because stacking requires shared lanes, so memory layers are in parallel, making the bus interconnect a bottleneck. Last but not least is signal strength and propagation time: stacking layers requires precise alignment and add electron jumping around, so there's a potential limiting factor in electron propagation, noise and eventual errors. This isn't much of a problem if the system is built around it, but it still is a limiting factor. There are solutions: since there's one master and multiple slaves there's no risk of collisions and so you can make a lot of assumptions on the drawing board... but busses are going to become wider and more complex, and that will add latency where you don't want it. My 2 cents.

@gorilladisco9108 26 дней назад

- I wonder if they run veins of metal in between the layers to send the heat to radiator. - They put L3 cache on the second layer, which by virtue is quite removed from the logic circuits.

@pentachronic 23 дня назад

Heat, latency, voltage regulation, signal integrity, etc…. Stacked dies has never been simple which is why there aren’t many of them.

@IragmanI 29 дней назад

I'd be curious about the thermodynamic side effects of phase change memory during transitions as the crystallisation would release heat while amorphization would be cooling

@rchin75 29 дней назад

Thanks. Amazing video. It's kind of interesting how it always comes down to the same principles. First shrinking the size in 2D, then layering stuff, and eventually going into the 3rd dimension. And when that reaches its limits, then change the packaging and invent some hybrid setup. Next, change the materials and go nano or use light etc. instead. Even the success criteria are usually similar: energy consumption, speed or latency, size and area, cost of production, reliability and defect rate, and the integration with the existing ecosystem.

@erroroftheworld6927 28 дней назад

А потом ещё уйти в 4 измерение:D

@rafaelgonzalez4175 29 дней назад

My memory is so fragmented I can't tell which particle remembered me.

@ALTERRAa8 28 дней назад

😂😂😂

@rafaelgonzalez4175 28 дней назад

@@ALTERRAa8 Alterra, also included in a game I enjoyed for a very long time. SubNautica. Thanks for the extra smiles. On my face that is.

@taurniloronar1516 28 дней назад

My memory is fine. Only problem is having the parity bit in a Schrödinger box.

@rafaelgonzalez4175 27 дней назад

@taurniloronar1516 damned light. Kick the box and listen for giggles. Good one.

@vicaya 29 дней назад

It's quite bizarre that you thought the PCM memory is a future replacement of SRAM, as the it has a switching speed of 40ns (on par with DRAM), according to the paper you cited. This is an order of magnitude slower than SRAM. The current only viable option to replace SRAM is SOT-MRAM, which TSMC is working on. Go research SOT-MRAM😁

@kazedcat 29 дней назад

It is good enough for cache application but very bad for register memory.

@jim-co-llier 26 дней назад

It also involves a physical change to the medium, which means wear and limited number of writes. I believe a similar principle has been around since at least the 90s. I used to have a CD-R/W type device that used a laser to heat up spots of a special metallic medium, changing it from smooth to amorphous. Could be rewritten some number of times. I will say though, your point is probably good and valid, but could have been made more constructively.

@cj09beira 24 дня назад

@@kazedcat its not good enough for cache, modern caches are at most in the low dozen of ns, 40ns is DRAM levels of latency

@simontillson482 24 дня назад

This is true. PCM is totally useless as SRAM replacement and doesn’t have sufficient speed or rewrite resilience. Honestly, she really failed to understand its use case. It’s a great alternative to floating-gate FLASH memory, not SRAM!

@stavrozbach3992 20 дней назад

what about 4ds memory? 4.7 nanosecond write speeds

@cpuuk 27 дней назад

The words "dynamic" and "static" are a reference to the powering method between state changes. You kind of hinted at this with the TTL logic diagram, but didn't expand. Static is faster because it doesn't have to wait for the re-fresh cycles before it can change state. Static also runs hotter and consumes more power- there are no free lunches ;-)

@simontillson482 24 дня назад

Not exactly. DRAM consumes power all the time, because it needs constant refresh to preserve contents. SRAM only consumes power during state change. Both consume some leakage current though, and with that, SRAM consumes more due to having more transistors per bit cell. DRAM also consumes considerable current to change state, because of its larger gate capacitance. Overall, DRAM tends to consume more power per bit but costs less and is more compact, which is why we use it for main memory and reserve SRAM for cache and internal registers.

@bobclarke5913 29 дней назад

You explain things so well, thanks for a well thought out presentation

@tappyuser 29 дней назад

Been waiting for your vid.... Love the content

@PeterBergstrom-vv2sl 29 дней назад

Very interesting. Thanks for sharing your expertise. There is always something interesting in your videos. At least in the three or four i have seen so far.😊

@Brodda-Syd 29 дней назад

"And here I wanted to make a memory joke, but I don't remember which one"😂

@jeffbguarino 4 дня назад

I bought a book on how to improve my memory. But I forgot where I put it.

@DCGreenZone 29 дней назад

Linked to my substack, title, "The very definition of brilliant" That meams you Anastasi. 😊

@caltron919 29 дней назад

I worked on micron/intels PCM, optane, for a few years. While we were making peogress on some of the problems you mentioned, the venture ultimately failed due to the economics of producing the chips as well as a lack of customers. Would be cool to see it make a comeback in the future

@thom1218 28 дней назад

I am shocked she failed to mention optane as well - "new technology" lol.

@cj09beira 24 дня назад

had they holded on till CXL was here imo it could have taken off, it had great promise it was just in the wrong interfaces

@complexity5545 16 дней назад

I thank you for your service. When intel announced that they were ending optane, I bought 6 of those PCIE drives; I caught a fire sale. Those drives are the fastest drives I have for doing some disk intensive Studio work. I wish they could've gotten the price down around $100-$200 dollars for the good stuff. I actually got 6 optanes for $45 a piece. I lucked up and bought a box.

@springwoodcottage4248 29 дней назад

Interesting idea, but very speculative and in need of a demonstration at scale to assess its practicality. Moreover, although a 23% decrease in area is good for an existing bottle neck, it is not revolutionary, that would need a factor of at least 10. At the current estimated level of improvement it becomes a commercial decision on whether this improvement has a fast enough pay back to justify the r&d costs to make it practical. Is anyone making the investment to commercialize this discovery? Thank you for sharing!

@Aim54Delta 29 дней назад

Not really, the silicon lattice constant is only 0.7 nanometers. We can't scale in silicon below that. Germanium has a lattice constant of about 0.5. While process nodes and technology are mostly marketing terms and there is room for improvement beyond "1 nanometer process" - we are about at the end of what we can achieve with existing semiconductor paradigms. It will be almost all architecture and material sciences by 2030. We can't get much smaller. A 20% improvement over SRAM is disruptive even if it doesn't scale any smaller. SRAM is unable to be scaled any smaller due to the physics underwriting operation. We only have a few more die shrinks left before we are up against the size of the atom. ... Again, sort of ... A 1 nanometer node doesn't necessarily mean that you can make a grid of 1 nanometer square pads separated by 1 nanometer troughs on all sides, or vice-versa. But as I mentioned, the lattice constant of silicon is 0.7 nanometers, their latest process node is 1.4 nanometers. You can't really cleave off half a crystalline arrangement without having weird things happen, the next die shrink, if it is possible, would come at 0.7 nanometers. We would be, assuming we can make the grid arrangement described, making the smallest transistors possible with silicon, using existing paradigms.... And whatever paradigm comes next would need to use atoms much more efficiently - or some other concepts entirely - to function. On the plus side, it means that in about another 10 years, we might see computers built with the idea they could last decades in their application.

@springwoodcottage4248 29 дней назад

@@Aim54Delta Great points! Thank you for expanding on the technological limits of the underlying physics not covered in the video. Given these fundamental limits to silicon, research efforts will move to entirely different concepts that may or may not work. Perhaps we will not see much further progress ending the decades long run of ever increasing chip performance or something new will make current silicon architectures obsolete. Fascinating field with huge commercial risk/rewards for company boards to ponder. Thank you for your comments.

@garlandgarrett6332 29 дней назад

Very interesting, I like the way you present info clearly and concisely

@garycard1826 29 дней назад

Very comprehensive and interesting video. Thanks Anastasi! 👍

@donaldpmurt2446 29 дней назад

Thank you Anastasi - great presentation!

@danleclaire8110 29 дней назад

I greatly admire the passion you infuse into your presentations. Your work is outstanding, please continue this excellent effort. Thank you!

@rsmrsm2000 27 дней назад

Amazing! This girl researched exactly what I wanted to know. Thanks.

@scottwatschke4192 29 дней назад

That was a great video very informative. You're right, it is an exciting time to be alive with all the evolving technology.

@Sven_Dongle 29 дней назад

I invented stacking when I was 3.

@grndzro777 29 дней назад

Astro blocks.

@snakezdewiggle6084 28 дней назад

@Sven_Dongle Was that you!? I though it was David! Good job 👍😉😆 I enjoy your work.

@fachryaruwija9777 28 дней назад

Yups.. but it keeps bulking

@robertsmith2956 27 дней назад

Not bad. My kid at 2 would stack boxes to make a stair to get over the gate. Necessity is the mother of inventions.

@multivariateperspective5137 26 дней назад

Oh hey Al gore… when did u change your name? Lol

@johnhughes5430 29 дней назад

Thank you for your presentation. I found it fascinating. The phase change memory, amorphous crystal back to uniform array crystal seems like the mental models used to explain demagnetization around the currie point.

@petenielsen6683 26 дней назад

I am probably close to double your age. When I say I forget a memory joke I am not kidding!

@bunkynpaws7369 28 дней назад

Nice idea. Very similar to Nantero NRAM that also uses Van der Walls effect to provide resistive cells using carbon nanotubes for SSD/DRAM universal memory. I've been waiting for NRAM for 20 years, and it is only now beginning to make it's way into the data centre. Let's hope that this technology takes less time to mature.

@marcleblanc2026 27 дней назад

This helps me immensely with my DD into the tech & companies involved in the memory sector, Thank you very much Anastasi!

@simphiwehlela5399 29 дней назад

Great information 😊

@TimothyDanielson 29 дней назад

Well said. Excellent video Anastasi!

@dxd42 29 дней назад

Very well explained. Thanks We need more Journalism with clarity to present for the public the real challenges and advancements of Technology.

@jamesjohn2537 29 дней назад

thank dear, its informative

@supremepartydude 26 дней назад

Great stuff. As someone who built their own desktops through computer conventions in the 90s I appreciate you bringing me up to date on where we stand now in personal computer development😊

@MoiraWillenov 10 дней назад

Subscribed... Always interested in intelligent people. You understand what you are saying and are not just spewing words. Fascinating.

@SalahddineABERKAN 29 дней назад

I Love the joke about Nvidea Cash 😂

@theminer49erz 29 дней назад

I remember hearing about the SRAM scalling issue some time before the Zen4 release, but then haven't heard anything even though I kept hearing about shinking nodes. Been curious what was coming of that. I was thinking that since it's not benefiting from the scaling, if it may have been counterproductive regarding degradation etc. I wonder if that is what is happening with the Intel 13 and 14K skus? I guess we will find out soon enough. Thanks for the update, I'm glad they are on top of it!

@Progameroms 29 дней назад

loved that memory zinger, ur so awesome!

@MrFoxRobert 29 дней назад

Thank you!

@solidreactor 29 дней назад

I believe that down the line we would need to use another processor architecture than the Von Neumann one that we use today (i.e. having logic and memory separated), an architecture that instead has an "on memory compute" design, or perhaps a mix of them. In the end the speed of light makes it hard to compute over longer distances (i.e. CM or even MM) specially when the frequency goes up and the data becomes even larger.

@DFPercush 29 дней назад

So basically smart RAM chips with shaders?

@BilichaGhebremuse 29 дней назад

Great explanation

@TheBann90 24 дня назад

Your channel has really improved over the 2 or so years Ive followed you. Im impressed!

@AnastasiInTech 24 дня назад

Thank you for being here

@ozzymandius666 29 дней назад

I appreciate you giving us glimpses into the future of chip design. I think that soon enough, AI will start to play a role in new designs. Thanks!

@betanapallisandeepra 27 дней назад

Awesome explanation…. Thanks 😊

@ilkoderez601 29 дней назад

Love the channel!

@Ottomanmint 26 дней назад

Thank you for sharing this new & exciting development 😊

@DrinkingStar 29 дней назад

Although I do not comprehend all the things you mentioned, what I do understand I find very fascinating. Yours and videos of others help me to decide on what companies and technologies in which to invest (= gambling) at the Wall Street Casino. Investing in stock is like playing Black Jack. The more you know such as via "card counting", the better your chances of winning. For me, your advice is akin to card counting when it comes to gambling on stock purchases. Thanks for your insight in this realm. BTW, my 1st computer was an Atari 800XL which I purchased in 1985. I also wrote code in Atari Basic and in HiSoft Basic. Ten years later, I used the program I wrote to analyze the data for my Master's degree in Human Nutrition. With the Windows computers, writing code now has become too complicated for me, so I have given up on that endeavor.

@blkcrow 16 дней назад

Well done excellent video and very informative 👍

@BartvandenDonk 28 дней назад

This does remember me of a mechanical (robot related) movement solution. They used the same idea in a mechanical way. It works like muscle cells.

@GaryBeilby 26 дней назад

In addition to learning heaps about memory, I really enjoyed hearing you say SRAM lots.

@robertmiller1638 29 дней назад

Great video. Loved your humor and I learned so much. Thank you!

@jaimeduncan6167 29 дней назад

As always fantastic work. I am not so enthusiastic right now with the new technology an endurance of 2E8 is amazing for something like storage, but the computer will go over that in no time for the cache. Even a microprocessor that is not super scalar and runs on the ghz range will be accessing memory in the other of 10^9 per second. Clearly, that access is per cell, and not for the full memory but they need to improve that number a lot.

@cemery50 29 дней назад

One of the chief benefits I can see in going to optical computing is the ability to have associative addressing through polarization and muliple concurrent optical reading/writing heads for raid like processing.

@patriceesela5000 13 дней назад

Excellent analysis 👏🏾 👍🏾 👌🏾

@GeoffryGifari 29 дней назад

So each of the 2 phases of the PCM has a different resistance, so the computer can tell 1 from 0? Can PCM memory be integrated in the same chip as the processor core? Seems like it requires a unique material to be added on a chip

@clauzone03 29 дней назад

Loved the graph you put together with the memory pyramid (access time vs where is used, with volatility information)!! P.S. Your accent also becomes more and more easy to understand!

@cthulholmhastur5317 25 дней назад

You are brilliant! Great content. Thanks for this. ;)

@marsthunder 29 дней назад

Stacking silicon...who woulda thought ...now it makes perfect sense for chip real estate. Thank you for your brilliant assessment of the latest chip technology. You have expanded my knowledge regularly.

@fhajji 29 дней назад

Non-volatile and low-latency at the same time, coupled with scalability and hopefully cost-effectiveness in manufacturing, would be a huge technological leap. Thank you for the information.

@anirudhapandey1234 20 дней назад

Thanks for the updates, really informative... I was working on OTP memory designs and this new time of glass memory is looking similar to the concept of OTP memory, may be we can see this kind of evolution in OTP memories side also.

@scollins4436 24 дня назад

Nicely done.

@costrio 29 дней назад

What about keeping the heat down. Sure lower power required in some case but stacking should also increase the requirement for improved cooling perhaps?

@kotztotz3530 29 дней назад

I'd love to see a AIT and High Yield collab someday :D

@filker0 29 дней назад

I worry about using non-volatile memory for primary or cache memory because of the security aspect. If the information remains after power is interrupted, quite a few "secrets" will be in clear text, and the determined and well equipped "bad actor" will be able to extract surprising amounts of information from a system. My industry has to issue letters of volatility with everything we produce, and for anything with NVM, the sanitization procedure usually involves removing the part with non-volatile storage and destroying it. The only exception is when it can be proven that the hardware is incapable of writing to that NVM from any component present on the assembly, even if malicious or maintenance software is loaded onto the device. This phase change memory built in the same package as the CPU logic could not be provably zeroized without some sort of non-bypassible hold up power, and that would increase the cost and size of the chip package. I think this is very promising for secondary addressable storage, but I don't see it replacing main memory in most applications.

@hhf39p 16 дней назад

Paul Schnitzlein taught me how to design static RAM cells. This video speaks to me. Yes the set/clear, and sense amps are all in balance. It is an analogish type circuit that can burn a lot of power when being read.

@Noam_Kinrot 23 дня назад

Thank you for this video. It's great. My two issues: (1) heat dissipation, is not addressed (over cycles there is growth of H.A.Z.), (2) One thing I heard about and remember vaguely, was an attempt at self healing logics (rather, materials + control circuitry), which is aimed at reducing the need for redundancy, in elements at the core of the chip (hottest and fastest environment), and attempts to also better the chip lifetime (cycles 'til dead). -I would be grateful if you could address both.

@user-di4bt7qu2i 25 дней назад

This is an excellent explanation of the current state of IC memory. Thanks.

@cyberkiller83 25 дней назад

That memory joke at 2:32 hahahahahaha, it wasn't just a memory, but a recursivity joke hahahahahaha

@gljames24 29 дней назад

It should be mentioned that process node sizes like N3 or N5 nodes are density measurements and not actually a transistor size. Intel 10nm was equivalent to TSMC 7nm as they average over different area sizes and utilize different shapes and can't be compared directly or even with the size of a silicon atom which is only 0.1 nm in "size".

@bhuvaneshs.k638 29 дней назад

Another banger video. Do you have discord channel to reach out to?

@devilsolution9781 29 дней назад

telegram probably if shes russian

@mititeimaricei 25 дней назад

NO COPILOT! NO RECALL! This future is PRISONPLANET! NO WORK NON-STOP!

@Dr.Juergens 15 дней назад

3 nm and so on is a marketing term that has no relation to any dimension of the transistors anymore. The true gate width until now is 14 nm due to asml's lithography machines limitation. The next step for the next decade is going down to 8nm (about 80 atoms wide).

@i2c_jason 29 дней назад

My concern with the phase change memory is just the lifetime and reliability. Do the cells grow oxides or change chemistry over time? Can they be ruined by ripple or electrical noise at scale that hasn't been discovered yet? Etc. Love your videos!

@darkflip 29 дней назад

So fancy! I think I want that laptop

@CosmosNut 21 день назад

I very much appreciate your videos and recommend them to every engineer I know !!

@AnastasiInTech 21 день назад

Thank you

@asm_nop 27 дней назад

This sort of tech is very interesting, because depending on how it advances, it stands to change the computing landscape in one or more different ways. If Phase-Change Memory is fast enough and gets good enough density, it can replace SRAM in L3 cache. If the speed cannot get high enough, it could still find use as an L4 cache or a replacement for DRAM. If all else fails, I bet it could give Flash storage a run for its money.

@christopherdecorte1599 26 дней назад

I love the way you explain the topic it gets me thinking even though I have no idea. Like possibly folding the memory and interconnecting them to form cubes cause I always see dies represented in 2d. Like I said, not my field.

@samuelmoore7768 29 дней назад

Is the new phase change memory you described the GST467 superlattice? Very nicely explained set up for the fact that cache is not scaling, btw.

@pentachronic 23 дня назад

OK I’m calling this out as not feasible in lots of cases. The issue is that SRAM needs to be tightly coupled into an architecture to get the performance benefit. However if a bond-out pad is required (eg chiplet etc) via Bunch Of Wires interface then there will be a delay penalty due to capacitance and transmission line issues. This means added latency and a performance hit. Might be useful for L2 cache but anything local it is of no use. SRAM at the local level is still the best solution.

@dion6146 10 дней назад

It has been discussed for decades that close stacking of chips has advantages of speed and size. The issue is heat generation, thus trying to reduce the total charge (electron count per bit). New memory technology is required with far smaller charge transfered per operation.

@goldark3 29 дней назад

You are an amazing Vlogger and i love your accent :D

@complexity5545 16 дней назад

This was an unexpected good video. This is my first video watch of the channel.

@thom1218 28 дней назад

@Anastasi - you might want to re-post this video taking Intel Optane memory into account. It's a non-volatile PCM memory that's been out for years, and even used as DRAM. As another comment mentioned, it failed due to the expensive manufacturing of the chips and the market simply wasn't there mostly due to fierce competition from flash memory.

@cj09beira 24 дня назад

optane's real problem was it being place in 2 places where it didn't belong, one was a flash competitor which placed it in a much too slow interface, another was in DIMM competing with much faster Dram chips, if we make a dedicated space for it optane could really bring some benefit but we need to do it the right way.

@garylcamp 26 дней назад

I had thought of building memory (and the whole IC) in 3D 10 years ago. I think I even put the idea in my website years ago. One part of my idea that is not used yet is using microfluidics to cool the chips that are stacking transistors in 3D, thus restricting heat transfer. The channels could run many levels, and of course, they need fluid-tight connections (a big problem). And use optics to communicate instead of a BUS. Possibly LED or laser tech.

@MagusArtStudios 29 дней назад

Amazing content! Very interesting and intriguing as always. I was surprised to see how small the logic section of the chip was relative to the sram, it makes sense that it's stuck at 3nm because of the structure is 3 transistors wide.

@panzerofthelake4460 29 дней назад

Skeleton meme hahaha

@jensonee 29 дней назад

when new stuff comes into use it's nice to hear how it works, how it was developed. thanks. i've been retired since 2005, when 3com's cowardly lion closed it's doors.

@wojtekbratek5156 7 дней назад

It's incredible how realistic AI creates movies. You can fall in love.

@rogerthomas7040 27 дней назад

This is not a solution to the SRAM problem, even the authors of the paper state "his work provides key materials and engineering insights towards the design and optimization of energy-efficient PCM, and could inspire the industry-scale adoption of nanoscale superlattice phase-change materials for low-power and high-density storage." The report states that they have a nice cell size of 45 nm, but a switching time of 40ns and endurance of 2 x 10^8 cycles (SRAM is around 10^15). So this is a possible replacement for Flash memory not SRAM. As a side note, the use of any heat based phase change storage solution on or near the CPU die would result in some very interesting performance issues as the heat output of the CPU would be impacted by the number of true values held within the cache storage and the frequency the cache is rewritten.

@bgjohns47 27 дней назад

PCM memory chip technology has been in R&D since the mid 2000s. Intel, StMicroelectronics and Ovonyx were in the game together in a joint development starting around 2005. Samsung was also doing research in PCM. I believe the biggest player now in Micron Technology.. And you are correct about all the advantages of PCM. I believe the two big challenges are being able program the device.into two or more distinct, well defined resistance states reliably coupled with manufacturing very small structures with precise dimensions. Nvidea is talking about PCM.

@GeoffryGifari 29 дней назад

Would be interesting to know what makes 3D stacking structure so difficult to achieve

@GeoffryGifari 29 дней назад

Heat exchange maybe?

@GodbornNoven 29 дней назад

Yes, though theres more limitations, it's really hard to manage heat in a 3d structure. It also requires new innovative ways to do it. This is why a room temperature super conductor would be such an amazing breakthrough. You wouldn't need to worry about heat management and you could up the frequency to Thz levels while maintaining manageability even in a 3d transistor structure. Computing would be millions and billions and even trillions of times faster

@Baikur1 28 дней назад

For quantum computers, this problem is even greater. There, the area of the RAM "pixels" is huge for now. The speed of ordinary RAM is small for quantum gates/switches. RAM is not just memory, but arrays of NDR (negative differential resistance) counters.

@teeborg1519 25 дней назад

About the memory joke, I see you are well trained in dad jokes :D

@ezion67 29 дней назад

I hope this one can make its promise true. I remember FRAM the previous miracle memory tech. That one promised to be, non volatile, have SRAM speed, DRAM durability and surpass Flash density. It's still around, but didn't hold up to expectations.

@hovant6666 27 дней назад

Cooling the buried cores may present a problem in the future

@piginus2 28 дней назад

damn.. really wanted to hear that memory joke

@johnjakson444 26 дней назад

One way of attacking the Memory Wall hierarchy is to attack it from the top, use RLDRAM which has been around for >25 years but only in NPUs (network PUs) since it offers DRAM cycle rates closer to 1ns but latency of 10ns or 8 clocks. Since it is highly banked, 16-64 banks working concurrently allows for 8 memory accesses every 8 clocks so throughput is 2 orders better than conventional DRAM. Of course in single thread use, not much benefit and to keep as many threads in flight requires that thread selects pseudo randomly across the banks and not hit on the same bank successivly.This could be used as an extra layer between normal DRAM on slow DIMM packages and the first SRAM cache level. This RLDRAM layer is where it would be used in CAM modules or soldered. We are substituting The Memory Wall for a Thread Wall here. But we already are used to having dozen threads these days. The RLDRAM model could be applied one level lower down in an RLSRAM version which would be perhaps several times faster but allow bank cycles and latency near 1-2ns but still 8 clocks and 16 banks.

@walterk7934 29 дней назад

Make a memory joke but I don’t remember which one. 😂

@levieux1137 27 дней назад

These new analog memories can be super efficient for LLMs because we don't need exact values, just approximate ones, so each cell could store a weight. We still need the analog multipliers and then we'll finally have hardware neurons that will be way more efficient than the current systems which are solely bound by memory bandwidth and CPU power consumption.

@Robert-zc8hr 24 дня назад

32/64bit APUs and 1bit LLM are a thing. AI is done on GPUs anyway, or better TPU/NPU, so we already have chips dedicated for it.

@berndhaas431 17 дней назад

Great video - thank you Anastasi :-) I think if we stack much more memory as 3rd level cache chiplets on top of CPUs we may reach the size of gigabyte 3rd level cache. And this would eliminate the external DIMMs on the mainboard which makes future Notebooks and PC again cheaper and reduces not just the complexity of the mainboard but also of the operating system, drivers and firmware because data can be loaded directly via fast PCIe lanes connected SSDs to 3rd level cache.