Тёмный

CPU? GPU? This new ARM chip is BOTH 

Coreteks
Подписаться 135 тыс.
Просмотров 253 тыс.
50% 1

Hostinger Deal:
Go to www.hostinger.com/coreteks and use code CORETEKS to get up to 91% OFF yearly web hosting plans. Succeed faster!
Support me on Patreon: / coreteks
Buy a mug or a t-shirt: teespring.com/stores/coreteks
Visit: coreteks.tech/
I now stream at:
/ coreteks_youtube
Follow me on Twitter: / coreteks
Follow me on Instagram: / hellocoreteks
Sources & Further Reading:
• "Opening Address: From...
• Striving towards high-...
• A64fx and Fugaku - A G...
www.titech.ac.jp/english/grad...
www.fujitsu.com/downloads/SUP...
www.fujitsu.com/downloads/SUP...
www.fujitsu.com/jp/solutions/...
Some footage from Charbax:
• Fujitsu A64FX Post-K S...
• Fujitsu at CEATEC 2019...
• Fujitsu Post-K ARM Sup...
- Videos from the AMD official RU-vid channel were used for illustrative purpose, all copyrights belong to the respective owners, used here under Fair Use.
- Videos from the Intel official RU-vid channel were used for illustrative purpose, all copyrights belong to the respective owners, used here under Fair Use.
- Videos from the NVidia official RU-vid channel were used for illustrative purpose, all copyrights belong to the respective owners, used here under Fair Use.
- A few seconds from several other sources on youtube (*including other youtubers*) are used with a transformative nature, for educational and illustrative purposes. If you haven't been credited please CONTACT ME directly and I will credit your work. Thanks!!
#A64FX #FUGAKU #FUJITSU

Наука

Опубликовано:

 

3 мар 2020

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 756   
@billykotsos4642
@billykotsos4642 4 года назад
RIP SATORU IWATA. A BRILLIANT AND UNIQUE MIND. His father never wanted him to pursue a games career.
@mix3k818
@mix3k818 4 года назад
There's only one video that comes to my mind at this point. ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-j2dxX5DIEMQ.html R.I.P. to both.
@masternobody1896
@masternobody1896 4 года назад
Intel is better
@perhapsyes2493
@perhapsyes2493 4 года назад
And I'm glad he didn't listen.
@MissMan666
@MissMan666 4 года назад
@@masternobody1896 intel is nr.2.
@nelsoncabrera6464
@nelsoncabrera6464 4 года назад
Nice Haiku
@MarcoGPUtuber
@MarcoGPUtuber 4 года назад
A64FX.....Why have I heard that name before? Oh yeah! Athlon 64 FX!
@arusenpai5957
@arusenpai5957 4 года назад
Yeah, tha name remind´s me that too XDD
@pflernak
@pflernak 4 года назад
So thats where the deja vu feeling came from
@zM-mc2tf
@zM-mc2tf 4 года назад
What goes round...
@jean-pierreraduocallaghan8422
@jean-pierreraduocallaghan8422 4 года назад
I knew I'd seen that somewhere before but I couldn't put my finger on it ! Thanks for the reminder ! :)
@xxdizannyxx
@xxdizannyxx 4 года назад
FX you...
@TechKerala
@TechKerala 4 года назад
Dedicated my life's 20 MINUTES.. Worth it as always..
@adnan4688
@adnan4688 4 года назад
Absolutely!
@Soul-Burn
@Soul-Burn 4 года назад
Only dedicated 10 minutes. x2 speed is great.
@johnnyxp64
@johnnyxp64 4 года назад
20:36 actually for me..cause I wanted to see my name on the Credits... 🤣😝
@AwesomeBlackDude
@AwesomeBlackDude 4 года назад
Always guarantee when you watch a (Jim) #AdoredTV video ❎
@miguelpereira9859
@miguelpereira9859 3 года назад
@@johnnyxp64 Being a Coreteks patreon means having big pp
@Zellonous
@Zellonous 4 года назад
This video sounds more like a nonfiction crime tvshow than something about processors.
@rcrotorfreak
@rcrotorfreak 3 года назад
can u share us ur pic?
@kcvriess
@kcvriess Год назад
You make me laugh but at the same time I'm annoyed. This dude has a wealth of knowledge and insights, but he's HORRIBLE to listen to.
@suibora
@suibora 4 года назад
17:28 sure, streaming today data would be instant with tomorrows technology, but what about tomorrows data? The extinction of load times is far away. More powerful computers? That will just be an excuse to use more detailed textures :'D
@EVOXSNES
@EVOXSNES 4 года назад
There will be a soft ceiling for textures butI think it won't be a limitation more a practical one. It'll be a similar argument to the difference between 8K and 4K. How much detail can you see before it's just FOMO.
@Mil-Keeway
@Mil-Keeway 4 года назад
loading nowadays is no longer limited by file size, it is limited by bad code. NVMe SSDs do many GiB/s, no game asset needs more than the blink of an eye to load. Sadly, developers have some of the fastest possible hardware available (especially in big-budget games and programs), so they have no need to optimize. Running the same code on an average PC then makes it unusable.
@redrock425
@redrock425 4 года назад
The biggest issue is poor telecoms infrastructure. Even in the UK it varies massively in speed, they're already trying to save cost and not put in full fibre.
@pflernak
@pflernak 4 года назад
@jayViant Talking of holograms: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-V7V05T4DhrU.html
@635574
@635574 4 года назад
@@Mil-Keeway compression and bad structuring of data makes for terrible load times rven on high end NVMes. Games before the next get werent optimized for this, maybe except star citizen and arkham knight.
@micronyaol
@micronyaol 4 года назад
Can't imagine a japanese chip without TOFU interface
@glasser2819
@glasser2819 4 года назад
iExplorer has SHAKRA engine (as shown in TaskMgr) and if it was coded in Germany it would have a SAUSAGE Cache pipe... LOL
@IngwiePhoenix
@IngwiePhoenix 4 года назад
@@glasser2819 No, it would have a Bierfass (beer jar) pipeline ;) I am german, I should know. ^.^
@minitntman1236
@minitntman1236 4 года назад
The driver of AE86 was a tofu delivery man
@prashanthb6521
@prashanthb6521 4 года назад
SUSHI coming up next.
@matthewcalifana488
@matthewcalifana488 3 года назад
They make the Best capitors .
@lawrencedoliveiro9104
@lawrencedoliveiro9104 4 года назад
7:55 Not quite. Supercomputing applications actually have limits to their parallelism. There is also a need for heavy communication traffic between cores. Hence the fast interconnect, which is a major component of the build cost of a super. For an example of a massively parallel application which doesn’t need such heavy interprocessor communication, consider rendering a 3D animation. The renderfarms that are deployed for such an application are somewhat cheaper than supercomputers.
@MrTrilbe
@MrTrilbe 4 года назад
So,, ARM, AMD and Fujitsu teamed up for a super APU, that's in some ways more epic then EYPC..., I will call this colab FARMeD!
@absoluterainbow
@absoluterainbow 3 года назад
Proof?
@MrTrilbe
@MrTrilbe 3 года назад
@@absoluterainbow it was a tongue in cheek summery of this video and an pun at the end
@kipronosoi
@kipronosoi 4 года назад
Woot !! Coreteks is back, feels like its been forever...
@StopMediaFakery
@StopMediaFakery 4 года назад
Don't you just love their Masonic logo? The honeycomb hexagon also known as the Cube, a reference to Saturn and the system we live in. Just so happens to also be in the beehive colours.
@lawrencedoliveiro9104
@lawrencedoliveiro9104 4 года назад
8:53 That doesn’t make sense. “Teraflops” is a unit of computation (“flops” = “floating-point operations per second”), not of data transfer. Data transfer rates would be measured in units of bits or bytes per second.
@blackdoveyt
@blackdoveyt 4 года назад
Yeah, A64FX has 1TB/s theoretical bandwidth and 840GB/s of actual bandwidth.
@wajihbleik436
@wajihbleik436 4 года назад
Thank you for doing what you're. I learn a lot from your videos.
@Battlebaconxxl
@Battlebaconxxl 4 года назад
What you describe sounds like a modern version of the PS3's cell chip.
@FrankHarwald
@FrankHarwald 4 года назад
Kind of, yes! The PS3 used several DSP-like processors connected onto a ring bus - except that rings, as well as other pure bus like topologies, while being the simplest way to interconnect multiple regions on a chip have several inherent limits which restrains this kind of topology to a limited amount of locally adjacent cells which is why the kind of processor presented here not only has one ring, but a hierachy of rings topology: See this paper as an example for examining & describing different hierachical ring topoligy variants as on-chip interconnection networks, also called NoC = "network on chip" "Design and Evaluation of Hierarchical Rings with Deflection Routing": pages.cs.wisc.edu/~yxy/pubs/hring.pdf This has been a hot reseach topic in hp & scientific computer engineering for several years now. Another really old, formerly rejected but increasingly interesting & related research topic is "computing-in-memory", also "processing-in-memory" or "near memory processing" because the costs to transfer data between processing units & memory is, as mentioned in this video, increasingly becoming a limiting factor, see "Computing In-Memory, Revisited": ieeexplore.ieee.org/document/8416393 but also semiengineering.com/in-memory-vs-near-memory-computing/ & while the recent emergence of array processors like Googles tensor cores & other forms of neuromorphic processing units is clearly at least partly due to that, this problem isn't limited to applications using AI but applies to a much broader category of problems - the "bandwidth wall" is a thing.
@SerBallister
@SerBallister 4 года назад
@@FrankHarwald One of the biggest headaches of working with the cell BB was the relatively tiny amounts of accessable memory each SPU had (256kb IIRC). This meant you couldn't use a lot of general purpose algorithms and instead had to modify them to be streamable with high locality of reference - for some algorithms it just isn't possible to optimise in such a way.
@FrankHarwald
@FrankHarwald 4 года назад
@@SerBallister indeed, but modifying algorithms so that they run with a high amount of locality is something that you'll have to do for all data intensive algorithms anyway - no matter how much of it is done automatically, profiler-assisted or by hand - regardless of what the underlying architecture is because while all shared memory architectures will start hitting the bandwidth wall at some point, distributed memory architectures will be the only way to circumvent these limitations. & yes, this also means that algorithm that access a lot of memory from the same chunk in a purely serial way will either have to be modified to access data in parallel from multiple chunks (if possible) or remain bandwidth limited (if this is acceptable or if the algorithm is inherently serial).
@SerBallister
@SerBallister 4 года назад
@@FrankHarwald You should aim for that yeah. The SPU local memory presented an addressing barrier instead of a cache miss like on a multicores, all data has to be present in that block. Take a ps3 game for example. Some systems like physics and pathfinding can be hard to compress your game world in 256kb, the PPU had to work on that stuff and you then had the headache of pipelining the output of that into the SPU (e.g. animation) if you want to avoid stalls. Interesting chip but can be hard work, task scheduling and synchronisation is also not straight forward. I would prefer working with modern desktop multicores with shared memory.
@thurfiann
@thurfiann 4 года назад
of course it is
@chafacorpTV
@chafacorpTV 4 года назад
I once heard that HAL got its name by grabbing IBM's and ticking the characters because they saw themselves as "one step ahead of IBM". Seeing this, I truly believe it.
@miketaratuta
@miketaratuta 4 года назад
ticking them back, not forwards
@chuuni6924
@chuuni6924 4 года назад
If you haven't already, you may want to look into RISC-V's upcoming Vector extension. It does all that SVE does, but better.
@Toothily
@Toothily 4 года назад
Better how?
@chuuni6924
@chuuni6924 4 года назад
@@Toothily There are a couple of independent things. For one thing, there's no architectural upper limit to the number of vector lanes. Another thing is that the dynamic configuration of the vector registers allows better utilization of the register file (for example, if only a couple of vector registers are used, they can subsume the register storage of the other registers to get much, much wider vectors). Also, while that part of the specification is still a bit up in the air, there is an aim to provide for polymorphic instructions based on said dynamic configurations, which means that it's far easier for it to adopt new data types with very small architectural changes. They also aim to provide not only 1D vector operations, but even 2D or 3D matrix operations, which could provide functionality similar to eg. nVidia's tensor cores, except in a more modular fashion. There are more examples too, but I think this post is running long enough as it is. I recommend reading the specification.
@Toothily
@Toothily 4 года назад
@@chuuni6924 that sounds really cool spec wise, but do they have working silicon yet?
@chuuni6924
@chuuni6924 4 года назад
@@Toothily The spec isn't even finalized yet, so no, there's definitely no silicon yet. However, the Hwacha research project is being carried out in parallel and I know there's a very strong connection between it and RV-V, and I believe they have working silicon in some sense of the word. It's a research project rather than a product, however, so not in the ordinary sense of the word.
@mrjean9376
@mrjean9376 3 года назад
Really wanted to know, what you guys think/opinion about this computer compared to nvidia dgx a100?? Does it has equal performance or something? I really excited to know this. Thx :)
@datsquazz
@datsquazz 4 года назад
Those chips are cool and all, but did you see THIS? 18:04 That truck has FOUR WHEEL STEERING, now THAT is innovation
@onebreh
@onebreh 4 года назад
They have been on the roads for years...
@carholic-sz3qv
@carholic-sz3qv 4 года назад
there is also all wheel steering at the wheels at the back too, look at this tatra video ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-U-ujpvOeydk.html
@Mil-Keeway
@Mil-Keeway 4 года назад
lots of 3-axle garbage trucks in europe have frontmost and rearmost steering, pivoting around the middle axle basically.
@keubis2132
@keubis2132 4 года назад
@@onebreh pog realy ?
@koolyman
@koolyman 4 года назад
You call that innovation? Get back to me after you google 'Spork'
@fanitriastowo
@fanitriastowo 4 года назад
I like that progress bar ads
@kentaaoki8064
@kentaaoki8064 4 года назад
17:01 Reseat that RAM!
@lahmyaj
@lahmyaj 4 года назад
Kenta Aoki lol I noticed that too
@miguelpereira9859
@miguelpereira9859 3 года назад
Oof
@TechdubberStudios
@TechdubberStudios 4 года назад
Loved this video so much, watched it twice in a row.
@arthurcuesta6041
@arthurcuesta6041 4 года назад
You're finally back. Thanks again for the amazing work.
@desjardinspeter1982
@desjardinspeter1982 4 года назад
your video presentations are so well done. I always look forward to watching them! such an interesting product. thank you for covering this!
@user-uy5xv8by5x
@user-uy5xv8by5x 4 года назад
Congratulations on 100.000 subscribers !! I love your videos and i came a long way in computer khnowlage because of you , i hope you have a great year ! Love you from EU Si ♥️😊
@rickbhattacharya2334
@rickbhattacharya2334 4 года назад
Man your videos always inspire me to read more computer architecture . I have computer architecture as a subject in my bachelor's and i don't like it but your videos always inspire me to read it more.
@seylaw
@seylaw 4 года назад
And ARM already has announced the SVE2 extension which is a replacement for their NEON instruction set (for home/multimedia usage instead of SVE1 which is tuned for HPC workloads). Interesting times are ahead and can't wait for ARM storming the PC desktop...
@VicenteSchmitt
@VicenteSchmitt 4 года назад
Glad to see your videos again!
@tipoomaster
@tipoomaster 4 года назад
"The future is Fusion", the slogan was just 12 years ahead of the technology
@N0N0111
@N0N0111 4 года назад
7:15 Finally the memory bottleneck is being some what addressed.
@aarneuuk9601
@aarneuuk9601 4 года назад
Thank you for yet more fantastic content! I read you make (at least some?) of your own background music (WOW!)
@InfinitePCGaming
@InfinitePCGaming 4 года назад
Was worrying you disappeared. Glad we got a new video.
@raymondobouvie
@raymondobouvie 4 года назад
I am no engineer in any shape, but with Coreteks videos I am getting such a digestible form of explanation that teaches me, even thaw i am 37yo) Thank you so much!
@mrlithium69
@mrlithium69 4 года назад
37 is not too late. God willing you will be learning well past 37 and even at 73.
@Seskoi
@Seskoi 4 года назад
I'm 101 years old and still learning!
@IARRCSim
@IARRCSim 4 года назад
@@Seskoi in base ten?
@raymondobouvie
@raymondobouvie 4 года назад
@@IARRCSim they opened schools on Mars - finally)
@pottingsoil723
@pottingsoil723 4 года назад
@@Seskoi I'm 1,009,843,000 seconds old and I push myself every nanosecond to learn more and more
@peacenaga7725
@peacenaga7725 4 года назад
I stumbled upon your channel when viewing your interview of Jon Masters and have binge watched 3 episodes losing sleep. Kudos. I am learning a lot! Thank you. I havent binge watched in a long time.
@andrew1977au
@andrew1977au 4 года назад
Awesome video bud, some very interesting info there. Thank you
@CitizenTechTalk
@CitizenTechTalk 4 года назад
Simply mind blown! Wow! Thank you, amazingly educational video!!!
@metallurgico
@metallurgico 4 года назад
Finally the first video since I subscribed! I watched all your previous videos lol
@accesser
@accesser 4 года назад
Fascinating documentary, you clearly put a lot of work into this
@materialburst983
@materialburst983 4 года назад
Hey, I was about to take a nap. ;D Thanks for the content!
@m_schauk
@m_schauk 9 месяцев назад
Damn this video had aged well... so good. Wish more videos like this were made and popular on RU-vid.
@zM-mc2tf
@zM-mc2tf 4 года назад
Thank you again for your insight, and all the info.
@benegesserit9838
@benegesserit9838 4 года назад
gratzz for 100k...
@greenempower1053
@greenempower1053 4 года назад
I've been seeing this coming for years now.
@DanafoxyVixen
@DanafoxyVixen 4 года назад
The comparison with the duel intel xeons is a little silly now that they have already been blown out of the water with eypc.. still an interesting CPU tho..
@stefangeorgeclaudiu
@stefangeorgeclaudiu 4 года назад
I think people are going to get surprised when AMD announces Milan this year. Also, the Frontier 1.5 exaFLOPS supercomputer will use a CPU chiplet + 4 GPU chiplets + memory in the same AMD chip.
@thomasjensen1590
@thomasjensen1590 4 года назад
The question is, what is more EPYC?
@BrianCroweAcolyte
@BrianCroweAcolyte 4 года назад
I agree. With how many problems Intel has been having for the last 4-5 years stagnating them on 14nm, comparing anything besides other x86 CPUs to Intel feels disingenuous. If they compared this ARM chip to the actual current x86 performance leader (a 2U Epyc Rome server with 128 cores) it would be beaten by at least 2-3X. Maybe performance per watt would be better on the ARM chip, but the performance density would almost definitively be unbeaten.
@aminorityofone
@aminorityofone 4 года назад
​@@BrianCroweAcolyte This isn't the first time ARM was expected to be dominate. It happened int he 90's as well. In fact Microsoft made Windows NT compatible with ARM back then. There was big promise that RiSC cpus would take over the world. Well, that didnt happen, and i still dont think it will happen today or in the future.
@defeqel6537
@defeqel6537 4 года назад
@@aminorityofone ARM will probably continue to dominate the market where chips are designed for purpose (unless RISC-V takes that market), mostly because x86 isn't licensed to anyone new.
@MrSchweppes
@MrSchweppes 3 года назад
Great Video! Thanks!
@jorgegomes83
@jorgegomes83 4 года назад
Impressive as always, sir. Thank you.
@TheJabberWockyy
@TheJabberWockyy 4 года назад
Awesome video man! Ty for the great content
@opteronprimext6867
@opteronprimext6867 4 года назад
Thank you for your educated entertaining info!
@lawrencedoliveiro9104
@lawrencedoliveiro9104 4 года назад
5:36 Actually, the plural of “die” is “dice”. Yes, those dice. As in the phrase “the die is cast”, which means instead of throwing several dice, you have thrown just one, and must stand by whatever it shows.
@ehp3189
@ehp3189 4 года назад
The "die is cast" comes from the middle high German/English Gutenberg printing. The printed page came from a single die cast, which is why it was slow and expensive (though cheaper than the Monks drawing each page by hand). This allowed Bibles to be printed, helped people learn how to read, and bring education to the people.
@lawrencedoliveiro9104
@lawrencedoliveiro9104 4 года назад
@@ehp3189 That can’t have been right. Guternberg’s innovation was the invention of movable-type printing, as in having separate pieces for each letter that were assembled to make up a page. Printing an entire page from a single block was a technique that had been invented by the Chinese centuries earlier.
@ehp3189
@ehp3189 4 года назад
@@lawrencedoliveiro9104 Granted, but the expression goes more towards the assembled type set being cast together in a block and any changes to that during a printing run were not to be allowed. It was difficult enough that breaking apart the group and then reassembling it for one letter change was more expensive than it was worth. At least that is my understanding. I liked philologogy in college but they only offered one class ...
@ClaudioSL619
@ClaudioSL619 4 года назад
congrats for your 100k subs!
@GarretSlarrity
@GarretSlarrity 4 года назад
Very excited for the video on neuroscience and computing!
@lordswaggity1213
@lordswaggity1213 4 года назад
YES YOURE BACK
@KyleWillsonDigital
@KyleWillsonDigital 4 года назад
Great video as always! Def a top fav tech channel! Keep em coming! ❤️👌🔥
@TLN-qu4rq
@TLN-qu4rq 4 года назад
Just got here. After watching this it's an instant sub. I can't wait for more.
@Connor3G
@Connor3G 4 года назад
Thanks for the fascinating video!
@liamtingle2762
@liamtingle2762 4 года назад
Nice vid dude. Any idea on power figures between V100 and the ARM chip?
@stefanosstamatiadis740
@stefanosstamatiadis740 4 года назад
Blew my mind, as every time.
@DarthAwar
@DarthAwar 9 месяцев назад
If the utilise the newest HBM version instead of traditional DRAM for Cache it would vastly increase its processing speed and reliability but also dramatically increase production costs
@UHDking
@UHDking 4 года назад
I am a fan. Good content man. Thanks for your research and sharing the knowledge.
@tdreamgmail
@tdreamgmail 4 года назад
The quality of production and quality content are next level. Always worth the wait. Remember guys quality takes time.
@danuuu101
@danuuu101 3 года назад
Your channel is a gold mind for computer engineers I really like your analysis and getting into details more then other channels do. In other note, I really want to see a video about RISC V and its future in personal computing and IoT I'm currently learning RISC V assembly and planning on building a small RISC V CPU on a FPGA but I'm very curious about its future and if it worth the effort.
@TECHN01200
@TECHN01200 4 года назад
When I heard "CPU and GPU" I was thinking AVX turned to 11
@kingphiltheill
@kingphiltheill 4 года назад
Amazing and interesting. Thanks for the video
@miguelangelriveiro
@miguelangelriveiro 4 года назад
Thanks for another fantastic video!
@winstonsmith430
@winstonsmith430 4 года назад
I've been waiting to see hbm used on a processor! Awesome job, it was exactly what I was predicting. As always great video Coreteks.
@p4nx844
@p4nx844 4 года назад
Damn great video dude
@aikanikuluksi4766
@aikanikuluksi4766 4 года назад
So that is where Arnold Schwarzenegger's Terminator got (or will be getting) its CPU from.
@studiosnch
@studiosnch 3 года назад
And a few months later we see many design choices here (especially the on-chip memory) in Apple Silicon M line.
@tech6294
@tech6294 4 года назад
Great video as always! ;)
@yanniskouretas8688
@yanniskouretas8688 4 года назад
The shape of things to come ...
@aziziphone9350
@aziziphone9350 4 года назад
Finally ! My favourite RU-vidr coreteks uploaded a video. Love your content man been watching you since the age of 15 in 2019 till now
@aziziphone9350
@aziziphone9350 4 года назад
JustVictor 17 hehhe nice man
@aziziphone9350
@aziziphone9350 4 года назад
JustVictor 17 this technology and semiconductor field is the only place were we all can be together without any bullshit politics and drama of this cruel world.
@Hazemann
@Hazemann 4 года назад
This advanced Fujitsu's A64FX chip variation will be in a Nintendo Switch revision in the future as a complement to the ARM's CPU and Nvidia's GPU
@deoxal7947
@deoxal7947 4 года назад
Where can I read about that?
@SelecaoOfMidas
@SelecaoOfMidas 4 года назад
The future with ARM processors looks great with this one. Interesting that Nintendo has an indirect connection to this too. One could imagine their NSO servers running on top of an A64FX processor, maybe a future console? 🤔
@wandersgion4989
@wandersgion4989 4 года назад
I used to live next to the 京コンピューター Kei Supercomputer in Kobe. Very cool to see these new advancements. 👍🏻
@Maisonier
@Maisonier 4 года назад
I'd love that ARM CPU in my smartphone...
@deoxal7947
@deoxal7947 4 года назад
Not all Arm processors are designed to go in phones. This one would draw way too much power to be usable. For example the i.MX chip in the Librem 5 and the Allwinner chip in the Pinephone. Both of them were not designed for phones but are being used anyway for their open nature.
@Maisonier
@Maisonier 4 года назад
@@deoxal7947 I know that dude ... but a man can dream ... The power of the sun in the palm of my hand.
@prashanthb6521
@prashanthb6521 4 года назад
@@deoxal7947 Dude, I want that chip in my wrist band.
@tbernesto0912
@tbernesto0912 4 года назад
Well, this is Absolutely Amazing.. Thank You Very Much for Sharing.. Greetings from México... !!!
@cdoublejj
@cdoublejj 4 года назад
another great video!
@denvera1g1
@denvera1g1 4 года назад
Consumer processors will probably use HBM as sort of an L4 cache, or a base memory with a tiering system, and then still have traditional memory channels, though maybe less channels
@notjulesatall
@notjulesatall 4 года назад
Incredible specs. All the computing power of a GPU with SIMD intrinsics and all the software support it already has available, I really look forward programming on these chips.
@RepsUp100
@RepsUp100 4 года назад
Informative af
@AlexSeesing
@AlexSeesing 4 года назад
The end sounds like Cygnus X - Positron but a bit different. Has that anything to do with the presumed change in computing you laid out in this video? If so, that is a masterful match!
@henrik5804
@henrik5804 4 года назад
Very interesting video and I believe you are spot on in your predictions.
@bigmike716
@bigmike716 4 года назад
Congratulations on 100K!!!!!!!!!!!!!!!!!!!!!!!
@lawrencedoliveiro9104
@lawrencedoliveiro9104 4 года назад
11:39 No, not all the other memory types did commoditize eventually. I can think of two things that Intel bet on, that flopped: bubble memory, and RAMBUS.
@artisan002
@artisan002 4 года назад
I'm intensely curious how it handles multimedia workloads. (My bias is towards things like digital music production, where the overall description of what this chip is and does would be ideal)
@the360gaming5
@the360gaming5 4 года назад
Amazing video
@rednammoc
@rednammoc 4 года назад
I wonder if Coreteks has any more material for a follow up video. For example, how does this architecture compare to Xeon Phi (Knights Landing) - and how do these differences matter?
@celsostarec6735
@celsostarec6735 4 года назад
@8:53 - 3TFlops of peak bandwidth? Floating Point Operations x Bandwidth? Wouldn't it be 3TB/s of Bandwidth ou 3TFlops of Throughput?
@Speak_Out_and_Remove_All_Doubt
@Speak_Out_and_Remove_All_Doubt 4 года назад
When are we going to see desktop CPUs with 3D stacked memory and realistically how much memory is it going to have? I can't see 32GB of system memory fitting on a normal desktop die but maybe I'm wrong. Or will it just be a small amount of on-die memory to be used like an extra cache layer but you still have your normal DDR system memory? Also, I can't be my head around heat dissipation in this new 3D stacking future. Given heat is the biggest issue in 2D chips and atm we can cool directly on to the only layer there then 3D stacking just doesn't sound like it's going to work on anything other than ultra low powered chips to me. I guess you could try to have some kind of micro coolant channels flowing between layers or maybe thin sheets of graphene but this will be expensive and complex to integrate plus If you have to go from CPU's running at 5GHz to them running at 1.8GHz then I can't see the benefits of closer system memory being enough to overcome this inherent drawback.
@mrlithium69
@mrlithium69 4 года назад
True story. Yes we will see 3D stacked CPUs with some kind of RAM on them. (My guess is 2023). We've seen GPUs already that have large HBM stacks "on-chip", and we will see them integrated to even more "on-die". Since Intel has invented its "Foveros" tech to avoid the Interposer layer and TSV's. It does seem like the RAM packages would be almost as large in die size as the CPU. (judging back to the HBM GPUs). Also, Remember the 128MB EDRAM L4 cache on the integrated GPU of Broadwell 5775C (2015)? Granted it was for the iGPU, but that was an early proof of concept of CPU+RAM, and took up a very large real estate of that chip. It wasnt market friendly, the proportions and cost were all wrong for mass-marketability, but it was interesting to say the least. Right now its probably easy enough to add a stack of 1 or 2 modern 16Gigabit DRAM die (=2 or 4 Giga BYTES), thats just my speculation. They've also been researching that micro-coolant-channel science as a real possibility lately. And you're right, the heat in the 3D stacks is the main issue. Everyones trying to stack the best way possible, its just a question of how. Then again, besides the real science of it, theres commercial viability concerns. The currently long-standing seperation between the CPU and RAM markets, means we get good prices on both. RAM as a commodity as we know it would be at risk if the 2 main CPU makers are integrating it onto a CPU. The move away from "memory modules" we all know and love would be too hard to do. Plus, think Apple laptops and planned obsolescence; they already solder the RAM on the motherboard, I'm sure they'd love it soldered right on the CPU. So we can buy $1000 CPU+RAM combo-chips and throw them away in 3 years when Chrome starts using 128GB of ram. I'd be more convinced if it was a small L4 cache idea, and leave system RAM alone.
@johannajohnson310
@johannajohnson310 4 года назад
he does NOT want to mentiion the downfalls , we can barely keep heat off our current cpus and gpus, so 3d stacked? nah the amount of power needed, and cost to switch a billions of severs ,no happening
@IARRCSim
@IARRCSim 4 года назад
Having things as simple as the video suggests would be a dream. Unfortunately, we'll keep having multiple levels of memory like we always have for over 50 years and different speeds, capacity, and prices per capacity for each. That means any software developers eager for better performance will need to continue optimizing for that complicated layout and be mindful of how it all works. Drastically more L3 cache would be great but there will always be demand for different capacities that strike a different balance on speed, price per unit of capacity, and capacity. RAM Drives introduced at ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-6pp_krChw_A.html do a great job of explaining some of the tradeoffs between speed, price, and others between SSD-like memory vs RAM.
@alexsiniov
@alexsiniov 4 года назад
When it hits consumer it will probably have HBM3 memory with up to 64gb on die. Since you won't need ram and GPU, PCs will become micro form factor, imagine how many sockets for these monsters can u fit on standart ATX motherboard without chipset, pci-e slots, etc? :)) and u just need to watercool that bitch with AIO or custom loop. U can add for example 12tflops another chip when u run out of resources or add smaller performance one and they will stack together :) and then add most powerfull one when u get money :) and continue your life with 3 cpus and add one more in 5-6 years :)
@pottingsoil723
@pottingsoil723 4 года назад
@@johannajohnson310 I'm inclined to agree, there's only so much efficiency you can squeeze out of silicon no matter how it's designed.
@apoorvgupta9680
@apoorvgupta9680 4 года назад
A great video bro. I am a novice, I am into processors , but really confused in risc and cisc , and how they work??
@TheJabberWockyy
@TheJabberWockyy 4 года назад
I wonder why everyone isn't talking about this. This is fascinating and exciting.
@zamundaaa776
@zamundaaa776 4 года назад
I always thought that this might be possible but I never imagined it coming so soon. Just... wow.
@lyxames8509
@lyxames8509 4 года назад
Fascinating !
@schmitzi99
@schmitzi99 4 года назад
Great video :)
@snakeatwar
@snakeatwar 4 года назад
OMG Coreteks I finally realized that you're Chyrosan22! I've thought for the longest time that you sound just like him and *finally* I looked at your recommended channels list and put 2 and 2 together. I'm a big fan of all your work. I just want to say that I very much appreciate your content!
@peterchang3998
@peterchang3998 3 года назад
I don't see a lot of Inter-connection and intergrated memory with mixture so-so proccessing units are NOT always garantee scalability and reliability. This is the good old Transputer idea in a new bottle...
@bach907
@bach907 3 года назад
This is defo not Thomas.
@paulblair7515
@paulblair7515 4 года назад
Another great video.
@Anonyminded
@Anonyminded 4 года назад
Great ep worth sharing, thank you!👏👍 A64FX is truly something very special (GO ARM)!
@AmxCsifier
@AmxCsifier 4 года назад
100K subs soon, nice
@saricubra2867
@saricubra2867 3 года назад
The PS3's CELL processor: "A rival!!!"
@TheUchihasparky
@TheUchihasparky 3 года назад
PS3 was RISC but that doesn't mean it was like an ARM CPU, for example the PS3 was very power hungry. More like a powerPC chip
@saricubra2867
@saricubra2867 3 года назад
@@TheUchihasparky "powerPC" It is a PowerPC chip.
@Speak_Out_and_Remove_All_Doubt
@Speak_Out_and_Remove_All_Doubt 4 года назад
Great video Celso, sounds like the A64FX chip could make for a great games console processor too?? When do you think we will see memory on chip on desktop CPU? Zen 4? Would this be more like another cache level and we will keep regular DDR system memory as I just can't see 32GB fitting on a CPU die.
@redrumtm3435
@redrumtm3435 4 года назад
By embedding the hbm on the soc, the processor has a direct link to it and can therefore shift data at much faster rates. It eliminates a huge bottleneck by shortening the bridge between the two. With all of these extra cores, specific tasks can be assigned, and change more dynamically than ever. Think of it as a logic function: the various cores can change form based on whatever task they are doing, and without it bogging down the system like emulation, for example. All computing is algorithmic, this is just a more advanced form of it.
@Speak_Out_and_Remove_All_Doubt
@Speak_Out_and_Remove_All_Doubt 4 года назад
@@redrumtm3435 Sure, I get that, and that is a great feature/asset to have which is one of the reasons I've been super excited about this chip and following it since the mid 2018 but what I am saying is 32GB of HBM per die seems kind of small for data centre work loads so when you have a data set or even single file that is more than 32GB you will have to split it and send it to different dies which will surely be much slower than have the whole of the data sat in normal system memory where any free CPU can work on the data?
@redrumtm3435
@redrumtm3435 4 года назад
@@Speak_Out_and_Remove_All_Doubt The entire system works so efficiently that data can be transferred much faster. It can achieve much more, but with a lower buffer. The cores are designed to work as one, and vary when and where necessary to suit a particular task. Basically, the entire thing is the beginning of ARM's transition into a hybridised cloud system that works across all devices. This is how we will cheat Moore's law, and continue to push the envelope so to speak.
@Speak_Out_and_Remove_All_Doubt
@Speak_Out_and_Remove_All_Doubt 4 года назад
@@redrumtm3435 I know Coreteks is a firm believer that ARM is coming for x86, and I agree it will take over certain areas, but I think the PC space is so heavily entrenched in x86 I really can't see it happening but it's great that it is happening on the server side if its faster and using less energy. Coreteks also loves RISC V, i'm not sure which is has the better chance of challenging x86's dominance.
@swampking666
@swampking666 4 года назад
It is amazing, this should be how marketing looks like, information, context and possibilities. Fujitsu should thank you with a load of money. Gonna check out this one today. Great job.
@JamesLee-mp2qz
@JamesLee-mp2qz 4 года назад
I didn't think it was humanly possible for your voice to get any lower... You proved me wrong :)
@tma2001
@tma2001 4 года назад
Will the amd chiplets always be so far apart from the IO die (any benefit from stacking etc).
@siddeshpatil8810
@siddeshpatil8810 3 года назад
I didn't understand a thing in this video but I am curious to learn..where to start ?
Далее
Neuromorphic: BRAINLIKE Computers
35:52
Просмотров 90 тыс.
The RTX 5090 is going to DISAPPOINT A LOT of gamers
19:37
What's New in FreeBSD 14.1?
13:54
Просмотров 1,5 тыс.
The FUTURE of GPUs: PCM
18:47
Просмотров 28 тыс.
Qualcomm Takes on Apple: The Battle For Chip Speed
13:24
Intel is in serious trouble. ARM is the Future.
25:04
The fight that made computers cheaper
20:59
Просмотров 181 тыс.
Arm vs x86 - Key Differences Explained
20:38
Просмотров 380 тыс.
Мой странный компьютер 2024
18:33
КАК GOOGLE УКРАЛ ANDROID?
17:44
Просмотров 93 тыс.