Most people don't seem to know that if you play a game that requires a lot of smart ai components, npu will take on that workload from GPU and make it run smoother.
An NPU wastes a lot of die area? Have you looked at the die plot of an Apple M-series chip? You are just wrong with that statement. The smaller the process the more difficult the routing? What? Do you understand that process progression is accompanied, typically, by an increase the number of metal layers? Memory bandwidth? Again, have you looked at Apple M-series chips? They have plenty of memory bandwidth and they use a very cost-effective design and implementation strategy. The Apple M4--the lowest-end member of the coming M4 family--increased memory bandwidth by 20% to keep pace with the clock rate. It's about 120GB/s. That's a lot for a low-end chip.