Тёмный
No video :(

The Art of SIMD Programming by Sergey Slotin 

Performance Summit
Подписаться 1,7 тыс.
Просмотров 10 тыс.
50% 1

Modern hardware is highly parallel, but not only in terms of multiprocessing. There are many other forms of parallelism that, if used correctly, can greatly boost program efficiency - and without requiring more CPU cores. One such type of parallelism actively adopted by CPUs is "Single Instruction, Multiple Data" (SIMD): a class of instructions that can perform the same operation on a block of 16, 32, or 64 bytes of data in one go, yielding a proportional speedup over scalar code.
While SIMD shares many similarities with classic multiprocessor computing, it is quite different and often requires creative use of the instruction set. In this talk, we will give a general introduction to the technology (focusing on x86/AVX2), derive and implement several state-of-the-art SIMD algorithms, and discuss their use in impactful open-source projects.
skillsmatter.c...

Опубликовано:

 

27 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 6   
@yuangchen905
@yuangchen905 Год назад
great video. Thank very much for your lightening example and insightful explanation!
@martingeorgiev999
@martingeorgiev999 Год назад
I don't understand why these architecture specific instructions are not recognized directly by gcc on O3.
@bouazzase4202
@bouazzase4202 Год назад
they are, when you give the -march= argument, otherwise the compiler doesn't know which instruction sets are allowed and will fall back to a default (usually x86-64 without avx)
@Roxas99Yami
@Roxas99Yami Год назад
Thanks very appreciated. Especially the examples in C. Is this directky compatible in Cython ?
@Roxas99Yami
@Roxas99Yami Год назад
The intrinsics i mean
@petrvset1960
@petrvset1960 17 часов назад
Hard to understand English and unpleasantly small text...
Далее
Where Have All the Cycles Gone? by Sean Parent
59:20
Просмотров 2,5 тыс.
HOW DID SHE DECIDE TO DO THIS?!
00:27
Просмотров 8 млн
SPONGEBOB POWER-UPS IN BRAWL STARS!!!
08:35
Просмотров 21 млн
CPU Cache Effects - Sergey Slotin - Meeting C++ 2022
1:06:16
HOW DID SHE DECIDE TO DO THIS?!
00:27
Просмотров 8 млн