Тёмный

4,000,000,000,000 Transistors, One Giant Chip (Cerebras WSE-3) 

TechTechPotato
Подписаться 130 тыс.
Просмотров 128 тыс.
50% 1

Опубликовано:

 

27 сен 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 524   
@shmookins
@shmookins 6 месяцев назад
I'm gonna need more thermal paste.
@henrik2117
@henrik2117 6 месяцев назад
😂👍
@nicknorthcutt7680
@nicknorthcutt7680 6 месяцев назад
😂😂
@j.lietka9406
@j.lietka9406 6 месяцев назад
It should have its own cooling system, like a freezer!
@cef-ym3gb
@cef-ym3gb 6 месяцев назад
I hear it's offered in 55 gal drums. 😂
@TechTechPotato
@TechTechPotato 6 месяцев назад
Tubes per chip, rather than chips per tube
@jackdoesengineering2309
@jackdoesengineering2309 6 месяцев назад
The yield is 100% because if it doesn't work you get a bangin cool frisbee !
@carstenraddatz5279
@carstenraddatz5279 6 месяцев назад
If a manufacturing defect knocks out a single core, you still have 900k minus 1 other cores. The design caters for that.
@RubixB0y
@RubixB0y 6 месяцев назад
It's called "catch" because when you don't catch it, the game is over 🙃
@richr161
@richr161 5 месяцев назад
​@@carstenraddatz5279if the average yield is 80% your going to have 20% of the chip be dead weight. Don't see the benefit of this design than just breaking the wafer down. You're not worried about size or space requirements at that scale.
@carstenraddatz5279
@carstenraddatz5279 5 месяцев назад
@@richr161 Worries exist, especially with this type of chip. However at that scale you are very worried if you are TSMC and only get 80% yield. Customers won't come back if you don't improve that. Realistically you're aiming for north of 97% yield or so, I hear.
@richr161
@richr161 5 месяцев назад
@@carstenraddatz5279 Tmsc yield is literally published average at 80% with peaks of greater than 90% on a leading node. I'd assume the nodes they keep around for companies who don't use leading edge are in that range, with all optimization going into yield rather than performance.
@jolness1
@jolness1 6 месяцев назад
This is such a cool idea. Never can get over what a wild idea it is to have a die that is a full wafer with the round parts lopped off.
@kellymoses8566
@kellymoses8566 6 месяцев назад
Imagine showing this video to someone 30 years ago.
@afc8981
@afc8981 6 месяцев назад
They would probably approve. It's like a giant AI mainframe.
@10lauset
@10lauset 6 месяцев назад
Imagine showing this video to someone in China today.
@JohnSmith762A11B
@JohnSmith762A11B 6 месяцев назад
Imagine showing it to Alan Turing. It would be like that scene where the archeologists get to Jurassic Park and see actual dinosaurs.
@Eugensson
@Eugensson 6 месяцев назад
Imagine showing this video to someone 30 years from now in the future?
@fracturedlife1393
@fracturedlife1393 6 месяцев назад
What someone? John Connor. What future? 1984.
@JanMagnusson72
@JanMagnusson72 6 месяцев назад
Moore's law is based on the observation that transistor density used to double every 18-24 months. This product does not even use the latest process. If anything it indicates that Moore's law is no longer applicable. Moore's law was never about performance.
@wombatillo
@wombatillo 6 месяцев назад
Strictly speaking Moore's law was originally about the actual number of transistors per chip. Originally the TTL and NMOS and whatever chips were 5x5 mm max. so the chip size was fairly limited. The process generation improvements are of course what kept this cycle going until maybe 2012 but after that it's been a combination of increasing the chip size and shrinking the transistors. Moore's law was never thought to apply to one square foot silicon chips.
@Poctyk
@Poctyk 6 месяцев назад
@@wombatillo Funny enough in 1975 article(?) Moore actually noted that increase in die size was part of how doubling transisor count was achieved
@BaBaNaNaBa
@BaBaNaNaBa 6 месяцев назад
bro is holding the holy grail casually in his arms 😱
@LemonsRage
@LemonsRage 6 месяцев назад
and literally taking a bite out it
@radugrigoras
@radugrigoras 6 месяцев назад
Lol “bro” at the minimum Dr. Bro.
@FodrMichalych
@FodrMichalych 6 месяцев назад
@@radugrigoras, esquire
@GuidedBreathing
@GuidedBreathing 6 месяцев назад
That golden coco bar looking thing is probably more expensive than a normal coco bar looking thing
@charleshendry5978
@charleshendry5978 6 месяцев назад
A La Monty Python 😂
@ProjectPhysX
@ProjectPhysX 6 месяцев назад
If only Cerebras hardware had OpenCL support and wouldn't need an own proprietary language! Would open doors to HPC/simulation workloads way beyond AI.
@RahulAhire
@RahulAhire 6 месяцев назад
They do support HPC simulation, right? I do see cerebras SDK supporting scientific computing. I might assume it will need some workaround.
@forceofphoenix
@forceofphoenix 6 месяцев назад
OpenCL? Vulkan is the real shit ;-)
@seeibe
@seeibe 6 месяцев назад
Don't think this was what Moore had in mind when he formulated the law 😅
@hrdcpy
@hrdcpy 6 месяцев назад
Correct. He imagined a trillion-dollar company limiting users to 64GB of storage in order to push cloud solutions.
@INFINITY-f1m
@INFINITY-f1m 6 месяцев назад
​@@hrdcpyGood old apple and its supporter
@jackdoesengineering2309
@jackdoesengineering2309 6 месяцев назад
Bitcoin miners are now selling the heat generated into an industrial process. Datacentres may soon follow suit. They really need a way to recapture the energy costs
@Ang3lUki
@Ang3lUki 6 месяцев назад
Our monthly reminder that it was never really a "law" in the scientific sense
@dixie_rekd9601
@dixie_rekd9601 6 месяцев назад
@@Ang3lUki thats why i always called it "Moores lore"
@leonardmilcin7798
@leonardmilcin7798 6 месяцев назад
This cooking range-sized CPU emits actually 10x more heat than a typical cooking range. That is just crazy.
@endeshaw1000
@endeshaw1000 6 месяцев назад
you can literally heat your house with it, even in deepest winter :)
@SaHaRaSquad
@SaHaRaSquad 6 месяцев назад
@@endeshaw1000I too am a fan of house heating that can do computation as side effect
@christopherleubner6633
@christopherleubner6633 6 месяцев назад
It is about the same as a clothes dryer, which is far less than I thought it would need. 24kW isn't too bad, a basic watercooling system with a pre chiller radiator would be fine. The 5V or 3V bussing would be nuts though, the amps would be 8000 at 3v and just under 5000 at 5V. 😮
@miemiemiedesu
@miemiemiedesu 6 месяцев назад
Best Device for Training AI Cooking
@wawaweewa9159
@wawaweewa9159 6 месяцев назад
Connect to a floor heating system 😂
@brodymiller9299
@brodymiller9299 6 месяцев назад
I need two of those, that way I can have 1 core for each pixel to get 100,000 fps
@gensteps923
@gensteps923 6 месяцев назад
Yesterday when this news broke I looked for a video on It, couldn't find it so I found your vid on WSE-2. Now today you deliver on the news regarding WSE-3. Nice work
@matthewsjc1
@matthewsjc1 6 месяцев назад
I remember at one point in the 90s changing the jumpers on the motherboard to overclock my pentium from 60 mhz to 66 mhz, but never finding a way to cool it enough to remain stable. At the time I would’ve been thrilled to have that 10% jump in performance. My brain may have melted knowing that in 2024 I’d have multiple machines (including portable ones!) that are note only multi-core, but run at BILLIONS of cycles per second.
@eonreeves4324
@eonreeves4324 6 месяцев назад
it's crazy to think of the amount of work that goes into creating these and then to sell 9 or 10 of them a year. it shows how niche the market is for this kind of processor
@gustamanpratama3239
@gustamanpratama3239 6 месяцев назад
Can't wait to see what kind of performance boost the next gen Wafer-Scale Engine 4 will bring us!!🤤 Imagine that it will be using 2nm Forksheet GAA or 1nm CFET tech
@TechTechPotato
@TechTechPotato 6 месяцев назад
I asked. Was told to wait
@Wingnut353
@Wingnut353 6 месяцев назад
@@TechTechPotato hold yer horses potato man they says!
@yancgc5098
@yancgc5098 6 месяцев назад
Considering they went from 7nm to 5nm for WSE-3, the next logical step will be TSMC 3nm for WSE-4
@cem_kaya
@cem_kaya 6 месяцев назад
This is one of the most interesting chips on the market. Happy to hear they are earned more money then they have raised.
@NoSpeechForTheDumb
@NoSpeechForTheDumb 6 месяцев назад
Moore's law is about logic DENSITY. It's not about more logic in a chip the size of a chess board LOL
@n00blamer
@n00blamer 6 месяцев назад
"the number of transistors in an integrated circuit (IC) doubles about every two years." -- Gordon Moore
@NoSpeechForTheDumb
@NoSpeechForTheDumb 6 месяцев назад
@@n00blamer LOL what you posted is NOT Moore's Law. It's the simplistic theme park version spreaded by the media. The actual law is distributed over his article "Cramming more components onto integrated circuits" from 1965, where he referenced to the complexity in two-mil squares. Please do your own homework LOL
@n00blamer
@n00blamer 6 месяцев назад
@@NoSpeechForTheDumb In his original article, Gordon Moore stated the essence of what became known as Moore's Law with the following quote: "The complexity for minimum component costs has increased at a rate of roughly a factor of two per year... Certainly over the short term this rate can be expected to continue, if not to increase." This statement captures the crux of Moore's Law, highlighting the exponential growth in the number of components (transistors) that can be integrated onto a semiconductor chip at minimal cost, with the expectation that this trend would persist into the foreseeable future.
@AlexSeesing
@AlexSeesing 6 месяцев назад
No potato for sir. With these kind of chips I can't get rid of the feeling I had around 1990. 80386 was kinda in our grasp but yet, RISC told us: Nope you won't. This feels kinda the same again.
@XIIchiron78
@XIIchiron78 6 месяцев назад
Imagine sending an entire 100amp residential service into a single chip
@cryptocsguy9282
@cryptocsguy9282 6 месяцев назад
I do remember cerebras claiming that their WSE 2 was better than Nvidia's offering at the time but Nvidia seems to have the most hype out of all the companies involved in AI. My understanding is that having all the processing units on one massive wafer just makes everything move faster instead of having many discrete GPUs connected together
@peoplez129
@peoplez129 5 месяцев назад
It uses a lot of power, but the sheer scale of it makes it more efficient. For example, the RTX 4090 tops out at 100tflops. That means it would take 10 of those to have 1 petaflop. This chip does 125x that, so you would need over a thousand RTX 4090's to equal the same processing power. Not to mention the 4090's would require over 400,000 watts of power, while this requires only 25,000. That alone gives it a huge win over Nvidia to the point that if anyone is even still using Nvidia, it's because they're laundering money, because ignoring the savings of 375,000 watts an hour to run it, can't be anything but a money laundering operation at that point.
@dakoderii4221
@dakoderii4221 6 месяцев назад
Since 2020, all the memes and parodies became reality.
@techman2553
@techman2553 6 месяцев назад
Can't wait for the laptop version of the chip !!
@andychow5509
@andychow5509 5 месяцев назад
Imagine if every cold country used these to heat buildings in winter. You could reduce heating costs to zero, and really get both compute and heat in virtually perfect harmony.
@nicknorthcutt7680
@nicknorthcutt7680 6 месяцев назад
Astonishingly powerful, one hell of a CPU 😳
@xl0xl0xl0
@xl0xl0xl0 6 месяцев назад
What kind of software / framework do they provide? I take it, it's not PyTorch or JAX? How hard is it actually to implement those models and the training code?
@TechTechPotato
@TechTechPotato 6 месяцев назад
Pytorch and tensor flow iirc. I didnt show the slide, but they stood up gigaGPT in 565 lines of code, vs 20000 for megatronLM. Both 175B parameters
@SpencerHHO
@SpencerHHO 5 месяцев назад
That single piece of silicon uses more power than my 200amp and 180 amp welders combined even when maxxed out. In fact it uses more than my entire house does 99% of the time. The stitching of rheticals is truly a remarkable innovation and something I want to learn more about. If I had to guess, I'd think there'd be a buffer in from the edge of the conventional masks then a second 'stitching' mask would be used to overlap rheticals and mark over them in a manner reminiscent of multi patterning in conventional lithography. Regardless of how it's done it's truly remarkable the level of precision and the fact that they can yield something this big on 5nm is actually insane. It seems they've exceeded their own expectations from what they initially set out to achieve. They were initially talking about being a few nodes behind but they're now basically on the leading edge and only one step from the absolute bleeding edge.
@Void_Glitcher
@Void_Glitcher 6 месяцев назад
one thing I really want to see are smaller AI chips that are for personal/commercial use. I've messed with ai image generation and some other ai stuff but you can't really go any higher than 520x520 image quality with a middle ground GPU. if there are any products already like this please tell me.
@gandalfgreyhame3425
@gandalfgreyhame3425 6 месяцев назад
Is that chip from a single silicon slice? Or, more likely, a 12 x 7 array of individual chiplets stitched together?
@unvergebeneid
@unvergebeneid 6 месяцев назад
It's one piece of silicon (not silicone, that's a polymer). Hence the name wafer-scale.
@TheReferrer72
@TheReferrer72 6 месяцев назад
Wafer Scale implies one piece of silicon. It has a lot of engineering to get around defects, plus the cores are tiny.
@salmiakki5638
@salmiakki5638 6 месяцев назад
They claim it is monolithic
@gandalfgreyhame3425
@gandalfgreyhame3425 6 месяцев назад
@@unvergebeneid OK, I corrected the spelling.
@gandalfgreyhame3425
@gandalfgreyhame3425 6 месяцев назад
@@unvergebeneid The yield rate for such a gigantic single piece of silicon with over a trillion transistors must be really low. I mean I think the yield rates for standard size CPUs are only in the range of 10-20%. The chances for one or more defects to be present on a giant chip that is 84x larger in size must be enormous.
@kellyeye7224
@kellyeye7224 6 месяцев назад
I remember my first PC - ETI Magazine DIY computer called the Transam Triton. 8080-based and 256 BYTES of memory! Cost me £300 in 1978.
@Bobby-fj8mk
@Bobby-fj8mk 6 месяцев назад
I still have an SDK 8085 kit.
@Veptis
@Veptis 6 месяцев назад
Wait? Moore's Law was never size limited? So it's not just density alone?? I am most excited about the Qualcomm Cloud AI100 Ultra card tbh. It seems to be the best solution for workstation/researchers who mainly care about running evals which purely require inference. And 128GB per card... Would take like two A100 to match. And those costs easily 30k+ Please let Qualcomm know we want them! I am almost ready to pay 10k for a single card... If they can sell it to me, proof the software works and finally release some accurate benchmarks. Like I want to know what a single card can do for throughput with like a 70B model at FP/BF 16 Can they donate a WSE (1,2 or 3) to Fritz for dieshots? Also the door behind you spells MOOR - surely that's on purpose
@Poctyk
@Poctyk 6 месяцев назад
It is/was not about density but the total transistor count.
@whyjay9959
@whyjay9959 6 месяцев назад
Do they also make chips out of single or a few tiles? Like from outside of the square. It's an interesting method, gets one thinking about how else it could be applied, like a CPU getting 2 or 4 still-attached tiles instead of 2 or 4 of the same chiplet. Also, imagine if we were using 450mm wafers; that might not have been a profitable transition for most uses, but for this and silicon interconnect fabrics it would've been different.
@Wingnut353
@Wingnut353 6 месяцев назад
Its a single wafer... normally chips are made from a wafer just like this and then diced up into smaller chips... the reason chips are normally limited to smaller sizes is the projection system they use to image the chip only covers a small portion the rectangular areas you see on this wafer.... since they are doing all this on teh same wafer though they can put ultra high bandwidth links between the normal rectical scan areas... and link it all together. there is far more bandwidth available here than you would normally get even through an interposer since all the layers are there.... instead of it just being one layer through an interposer. Making GPUs like this might actually make sense...that said planar latency on this thing is probably quite "bad", its part of the reason vertically stacked cache on Ryzen x3d has low latency is that going vertical is faster than going sideways twice as far.
@freedom_aint_free
@freedom_aint_free 6 месяцев назад
One day we will have a solid black monolith of nothing but transistors and memory, like the one in 2001 Space Odyssey !
@RedPillRachel
@RedPillRachel 6 месяцев назад
The only thing that this video, and all of your other videos for a while now, is the meme-worthy "What's your minimum specification?" jingle you used to have... is there anybody else missing that or just me?
@50shadesofbeige88
@50shadesofbeige88 6 месяцев назад
Now THATs a big chip.
@sunnohh
@sunnohh 6 месяцев назад
I cannot believe you held it that long without eating it 😊
@ernsailor9041
@ernsailor9041 6 месяцев назад
I might be wrong but are you sure that'll fit in my phone, looks like it might be a tad too big but things can look bigger on the screen so who knows.
@TechTechPotato
@TechTechPotato 6 месяцев назад
😂👌
@jorcyd
@jorcyd 6 месяцев назад
@5:06 was meant to be "a quarter of FP16 zettaflops", don't ?
@TechTechPotato
@TechTechPotato 6 месяцев назад
Yeah it was. Jet lag hitting hard!
@Safetytrousers
@Safetytrousers 6 месяцев назад
Have technology, must bite it.
@Idoldissr.11
@Idoldissr.11 6 месяцев назад
Still... will it break the 60 fps barrier in Skyrim SE/AE? LOL (Just thinking out loud.)
@nothinghere1996
@nothinghere1996 5 месяцев назад
anamartic and wafer scale memory. happy days.
@SharpsBox
@SharpsBox 5 месяцев назад
Boy, Witcher 3 with RT on will be sweet with this rig!
@the.bog.
@the.bog. 6 месяцев назад
okay but what's the gemm/W, how are you guys solving non-stationary dataflow. inter core communication has an incredible power overhead. not to mention the developer nightmare of having to debug and troubleshoot non-deterministic compilation tools.
@gerbil7771
@gerbil7771 6 месяцев назад
I can’t comprehend the scale of the capabilities these processors have anymore. It’s absolutely nuts.
@TheBestNameEverMade
@TheBestNameEverMade 6 месяцев назад
Why is it not circular and use the entire disk?
@TechTechPotato
@TechTechPotato 6 месяцев назад
You need shoreline bandwidth for external comms. That doesn't work well with rounded edges.
@ironicdivinemandatestan4262
@ironicdivinemandatestan4262 6 месяцев назад
According to Cerebras themselves, a rectangular design made I/O, cooling, and other things much more practical.
@TheBestNameEverMade
@TheBestNameEverMade 6 месяцев назад
@@ironicdivinemandatestan4262 thanks!
@isaacyonemoto
@isaacyonemoto 6 месяцев назад
Routing is not "just that simple". If you turn off a core, it will warp the wafer from thermal stress. Even with their thermal solutions I would bet that microfractures will render a wafer dead after a few months or a year.
@unvergebeneid
@unvergebeneid 6 месяцев назад
Why is it square and not round? Packaging reasons?
@TechTechPotato
@TechTechPotato 6 месяцев назад
I have a video that explains just that!
@unvergebeneid
@unvergebeneid 6 месяцев назад
@@TechTechPotato dammit, and here I thought I'm keeping up with your videos!
@zwe1l1nkehaende
@zwe1l1nkehaende 6 месяцев назад
@@TechTechPotato but in that video you said ~"thats why I don't expect to see round chips anytime soon, unless someone does a waferscale round chip". So since this is a waferscale chip, why not NOT trim the edges and use as much of the wafer as possible?
@TechTechPotato
@TechTechPotato 6 месяцев назад
The programming model changes a fair bit, especially with chained workloads. The edge corner cores end up burning power and being underutilised. Also cutting the thing would be trickier and more expensive. Then having similar cuts for power and IO. A rectangle keeps the shoreline identical and easier to design for
@whyjay9959
@whyjay9959 6 месяцев назад
Check the comments on [Cerebras @ Hot Chips 34 - Sean Lie's talk, "Cerebras Architecture Deep Dive"]
@DileepB
@DileepB 6 месяцев назад
Moore's Law is about transistor density in a monolithic piece of silicon. There are creative ways of driving performance despite the end of Moore's Law!
@jdogdarkness
@jdogdarkness 18 дней назад
I rly want to know what the cooling situation is with this. What it looks like & what temperature is like.
@mickeygallo6586
@mickeygallo6586 5 месяцев назад
That's makes one hell of a schematic
@wolftheai
@wolftheai 6 месяцев назад
Ok with that kind of power can we get a deep dive into the cooling system?
@Theodorus5
@Theodorus5 6 месяцев назад
2:15 I was waiting for him to do that 😄
@Blackvipe1
@Blackvipe1 5 месяцев назад
have they thought about cutting the chips then stacking them, and put cooling plates between the stacks.
@marsovac
@marsovac 6 месяцев назад
The numbers here are less important than "can you buy it? can you buy enough of it? can you easily deploy it instead of competing technologies like gpus?". I guess the answer is no, since GPU prices are going up, not down.
@TechTechPotato
@TechTechPotato 6 месяцев назад
Cerebras increased unit production 8x in 2023, and they're 10x this year again. One they've deployed over 200 and got an order for another 400 from one customer.
@Phantom_Communique
@Phantom_Communique 6 месяцев назад
I had to double check the zeros in the title. Holy moly.
@SimEon-jt3sr
@SimEon-jt3sr 6 месяцев назад
Amazing rundown thanks man
@exidy-yt
@exidy-yt 6 месяцев назад
This must play a mean game of Crysis. TBH this gives me hope I may just live long enough to be able to upload my mind to run on a CPU before I die.
@renevandenbosch9967
@renevandenbosch9967 6 месяцев назад
How old are you?
@GuidedBreathing
@GuidedBreathing 6 месяцев назад
4 Trillion transistors; what is the optimized use case for this computing chip? If we compare this one to Nvidias solutions; what’s the main differences ? Thanks 🙏
@fundiambb
@fundiambb 6 месяцев назад
does it run ark survival evolved tho?
@lucamatteobarbieri2493
@lucamatteobarbieri2493 6 месяцев назад
The specs are amazing. Did Cerebras reduce complexity like groq did?
@Alkanen
@Alkanen 6 месяцев назад
What's the yield on these behemoths?
@tomstech4390
@tomstech4390 6 месяцев назад
One of the few times Moores law is used correctly factoring in the cost, I'm not aware of another time it's actually kept true in the last 10 years.
@tuxjob
@tuxjob 6 месяцев назад
And again you have to bite it... xD
@talroitberg5913
@talroitberg5913 6 месяцев назад
I wonder if these sorts of Wafer Scale Engines can be combined with advanced packaging / memory stacking? To my understanding, large AI models are bottlenecked by memory capacity and throughput, so adding a closely-bundled cache or HBM stack on top could increase performance by a lot. That said, with the energy this thing uses, powering and cooling extra memory stacked directly on top might be a problem. Maybe if it has separate power delivery, fluidic cooling channels through and between the chips, etc? There's probably high-end customers who would want that if it gives significant advantages over H100s for their applications.
@dan-tv1kp
@dan-tv1kp 6 месяцев назад
Cool, but what if u gotta send data from one corner to the another/one corner to the center?
@Bob-of-Zoid
@Bob-of-Zoid 6 месяцев назад
Come on over, pop it on my Motherboard!! I want to try it!!
@Karthig1987
@Karthig1987 6 месяцев назад
Awesome video
@eightsprites
@eightsprites 5 месяцев назад
Had to create a motherboard to that one.
@El.Duder-ino
@El.Duder-ino 6 месяцев назад
One of the kind, very unique solution only from Cerebras. They found a hole in the market otherwise they would be out of business by now and with inference cards and renting model they can also monetize pretty good as well.
@catchnkill
@catchnkill 6 месяцев назад
You get it all wrong. Cerebras' wse-3 chips will be used for training primarily. They are not for inference. They sold it as a whole system as a supercomputer system.
@El.Duder-ino
@El.Duder-ino 6 месяцев назад
@@catchnkill "Greetings to Chinese state hackers!" - u got it wrong and obviously u r not reading that I mentioned "inference cards" mentioning Qualcomm ASICs. Nice trolling though...
@DreamCodeLove
@DreamCodeLove 6 месяцев назад
My next gen AI powered cook top...
@spectro742
@spectro742 5 месяцев назад
can't wait for 450mm wafers if they ever come along...
@Dragoon91786
@Dragoon91786 6 месяцев назад
The difference though-Mr. Potato, how much more computer power does that 80 MW get them compared with one of the national labs top systems (not integrating a Cerberus WSC unit (Wafer Scale Compute unit-I know *_technically_* they're called "engines", but 🙄 it's a MASSIVE compute unit, not simply an engine performing classical work; I know; I'm jaded at the marketing department) into their super computer? I have read that the national labs have started mixing these monsters into their design architecture.) That sucker looks about the same size as one of Seymour's custom CPUs from when he switched over to Gallium Arsenide for the Cray-3 & Cray-4. RIP. Way ahead of his time. I just wonder how much more powerful our tech would be today (and power efficient) had the national labs not had the budget cuts and cancelled their super computer orders (well, and had he not died in that car accident in 1996). We might all have GaN performance compute cores by now cranking 40+ GHz. 😅
@switzerland3696
@switzerland3696 6 месяцев назад
20 of the card in one chassis, how do you do the PCIe channel routing? 16x? How many CPU's?
@goodfodder
@goodfodder 6 месяцев назад
Question, how do they get 100% yeld when tsmc themselves can’t ?
@TechTechPotato
@TechTechPotato 6 месяцев назад
Yield doesn't mean defect free. Tsmc n5 is about 50 defects per wafer. WSE3 has around 9000 redundant cores, so cores with defects can be disabled and routed around. So there is logic redundancy.
@christopherleubner6633
@christopherleubner6633 6 месяцев назад
That is not only wafer scale, that looks like it used an entire 13 inch wafer. The fact that they can make a single IC die that big shows how far we have come. I can only imagine the amps required and heat extraction required for a CPU that size while running at full power. 😮😮😮
@wombatillo
@wombatillo 6 месяцев назад
How much does a 5nm process 300mm wafer cost at TSMC these days? $15000? That's a heck of an expensive chip even with no margins added for RDI, marketing, manufacturing outside the fab, distribution, sales, profit, etc.
@ironicdivinemandatestan4262
@ironicdivinemandatestan4262 6 месяцев назад
​​@@wombatilloThe WSE chips are sold for around $2 million, so the cost of the wafer is a drop in the bucket.
@wombatillo
@wombatillo 6 месяцев назад
@@ironicdivinemandatestan4262 The chip must really be worth it to have such valuation. The distributed memory and sheer bandwith is insane compared to h100 clusters and others.
@larrybuzbee7344
@larrybuzbee7344 6 месяцев назад
Cerberus (also spelt Kerberos) is a vicious three-headed dog in Greek mythology, who guards the entrance to the underworld. He allowed the souls of the dead to enter Hades but prevented the living (except for a few exceptions) from leaving. Coincidence ¿???¿ 🔥🥵😱👽🤖
@dfv671
@dfv671 5 месяцев назад
How's the failure rate of those giant chips? Cost of replacing would be much higher.
@TechTechPotato
@TechTechPotato 5 месяцев назад
You mean the warranty? At least 3yrs I'd assume.
@imurrx
@imurrx 6 месяцев назад
How many copies of Doom can it run at one time?
@Alexagrigorieff
@Alexagrigorieff 6 месяцев назад
Thermal expansion issues are gonna be a bish. CPU load across the "tile" will have to be carefully balanced to even out the temperatures. Oh, and you cannot mount it onto a single substrate, it would have to be split into smaller segments.
@danielgrayling5032
@danielgrayling5032 6 месяцев назад
No more room at the bottom? No problem, plenty of room at the top. It's a big universe.
@petergibson2318
@petergibson2318 6 месяцев назад
You could heat a village with that. Cooling it must be a nightmare.
@Capeau
@Capeau Месяц назад
What are the downsides of this vs Nvidia/AMD/intel their solutions?
@guytech7310
@guytech7310 6 месяцев назад
Seems pretty silly to make it from a single wafer. better option would be chiplets on a silcon base for interconnects. This would address defects since, if a chiplet has a defect you toss only the chiplet & not an entire 12 cm chip. Or are Cerebas WSE are actually composed of chiplets?
@TechTechPotato
@TechTechPotato 6 месяцев назад
The wafer is designed to route around defects. There's 45-50 per wafer, and any 'dead' cores get disabled. There are 1000+ extra redundant cores for this among tge 900k cores, so the actual yield is near 100%. Defects in SERDES or voltage/frequency takes some yield, but not a lot. I do say some of this in the video quite early on
@guytech7310
@guytech7310 6 месяцев назад
@@TechTechPotato Not really practical. Using chiplets would reduce the costs & provide defectless processors. Even with some extra cores, sometimes the deflects will be in something that isn't redundant & cannot be disabled\bypassed I betycha in a future release, they'll switch over to a chiplet design. Overall the biggest problem with super large processes is heat dissipation & getting enough power. I suspect that there is a limit that cannot be avoided.
@TechTechPotato
@TechTechPotato 6 месяцев назад
@guytech7310 they're showing it to be practical. It's a team with decades of experience, and they've sold over a billion dollars of hardware already. Yes, there will be some defects that can't be bypassed, hence why yield is *near*, not exactly 100%. Heat and power is also solved. They have patents.
@johnpereztwo6059
@johnpereztwo6059 6 месяцев назад
in 5 years sitting in desktops . 10 years sitting in tv sets .
@tibbydudeza
@tibbydudeza 6 месяцев назад
Holy smokes - what is the cooling and power requirements ???.
@TechTechPotato
@TechTechPotato 6 месяцев назад
24kW. They sell it as a system, self contained with cooling. Just plug in power and networking.
@tibbydudeza
@tibbydudeza 6 месяцев назад
Who would use such a beast - NSA ???@@TechTechPotato
@viralking7336
@viralking7336 5 месяцев назад
What is This chip price cost
@brookerobertson2951
@brookerobertson2951 6 месяцев назад
Can I swop it with the chip that's in my netbook or ?
@kcm624
@kcm624 Месяц назад
Wait what am I watching? These numbers are nuts.
@kayakMike1000
@kayakMike1000 6 месяцев назад
I don't understand why that big chip wouldn't be better as a bunch of chiplets that could be binned by quality, then you would need to worry about _rooooooting_ around defective cores. Might also help with heat dissipation.
@TechTechPotato
@TechTechPotato 6 месяцев назад
The extra routing is built into the architecture to streamline it. Cleverer people than us have figured it out, and the benefits are power and latency.
@WhiteDragon689
@WhiteDragon689 Месяц назад
That's a dead end tech. The future is organics.
@JorgetePanete
@JorgetePanete 6 месяцев назад
What are the plans from companies now that model parameters go as low as 1 bit instead of 16 or even 4?
@TechTechPotato
@TechTechPotato 6 месяцев назад
Lots of companies looking at INT4.
@iwmaxx
@iwmaxx 6 месяцев назад
Wonder how it compares to Tesla Dojo, it uses the full wafer scale chip design also.
@fiery_transition
@fiery_transition 6 месяцев назад
Can I have one? And whatever monster that chip plugs into? 😂
@Paberu85
@Paberu85 6 месяцев назад
But will it run Crysis?
@Awave3
@Awave3 6 месяцев назад
This is the kind of chip that is going to wake up and become conscious as soon as it is plugged in.
@SteveSperandeo
@SteveSperandeo 23 дня назад
Thanks!
@TheVigilantEye77
@TheVigilantEye77 6 месяцев назад
Each supercomputer will need its own SMR
@VMMark-IVCentral
@VMMark-IVCentral 6 месяцев назад
can i use cerebras compute power to do crypto mining?
@zaurenstoates7306
@zaurenstoates7306 6 месяцев назад
44GB of on chip memery seems pretty low for a chip of that size tbh
@xeode
@xeode 6 месяцев назад
out in the land of the 'on premise'
@gags730
@gags730 6 месяцев назад
They need to pack one up and send it over to Google for their Gemini... heard they have to fix a couple of things and that would speed up the process.
@DS-uy6jw
@DS-uy6jw 6 месяцев назад
How is it cooled?!
Далее
Intel's Newest $350 Million Machine
19:18
Просмотров 198 тыс.
Cerebras Co-Founder Deconstructs Blackwell GPU Delay
23:17
Se las dejo ahí.
00:10
Просмотров 661 тыс.
Avaz Oxun - Yangisidan bor
14:29
Просмотров 430 тыс.
We're not even at PCIe 6.0 Yet!
28:19
Просмотров 14 тыс.
The Gate-All-Around Transistor is Coming
15:44
Просмотров 469 тыс.
This 1mm-Thin Chip Cools In Tiny Places
17:58
Просмотров 74 тыс.
AnandTech has Closed.
26:09
Просмотров 96 тыс.
It’s like paper, but better - reMarkable Paper Pro
11:35
Hot Chips Preview, Zen 5 Reviews: The Tech Poutine #3
3:39:32
Se las dejo ahí.
00:10
Просмотров 661 тыс.