StackOverflow Questions for Tag: avx2

Robinson
Robinson

Reputation: 10132

How to reorder interleaved 8-bit values across AVX2 lanes efficiently?

Score: 0

Views: 71

Answers: 1

Read More
MustafaVisys
MustafaVisys

Reputation: 233

C++ to C# memory alignment issue

Score: 1

Views: 207

Answers: 2

Read More
Maj mac
Maj mac

Reputation: 51

AVX(2)/SIMD way to get/set (to 1) a single bit in a 256 bit register

Score: 3

Views: 190

Answers: 1

Read More
Vastar
Vastar

Reputation: 89

AVX2 integer shuffle with types other than byte?

Score: 0

Views: 99

Answers: 2

Read More
user12316363
user12316363

Reputation:

How to understand this AVX addition of two _m256i variables?

Score: 3

Views: 85

Answers: 1

Read More
Z boson
Z boson

Reputation: 33679

Shifting SSE/AVX registers 32 bits left and right while shifting in zeros

Score: 10

Views: 8745

Answers: 3

Read More
Froglegs
Froglegs

Reputation: 1155

AVX2 what is the most efficient way to pack left based on a mask?

Score: 53

Views: 26834

Answers: 6

Read More
draco1111
draco1111

Reputation: 300

extract non-zero elements from __m512i/__m256i vector

Score: 1

Views: 82

Answers: 1

Read More
vev01
vev01

Reputation: 15

Why does msvc not vectorize?

Score: 1

Views: 148

Answers: 1

Read More
Bulat
Bulat

Reputation: 2466

AVX2 code to find the first longest match of 4-byte string among 8 4-byte targets

Score: 2

Views: 141

Answers: 2

Read More
fwefew 4t4tg
fwefew 4t4tg

Reputation: 59

SIMD: reduce/convert/project/compress/align __m256i to __mmask8

Score: 0

Views: 78

Answers: 2

Read More
Infinity77
Infinity77

Reputation: 1449

Threshold on absolute values on large float array in C

Score: 3

Views: 200

Answers: 2

Read More
Devvy
Devvy

Reputation: 45

How to perform parallel addition using AVX with carry (overflow) fed back into the same element (PE checksum)?

Score: 0

Views: 302

Answers: 2

Read More
Y-Jiechao
Y-Jiechao

Reputation: 5

Why does '_mm256_fmadd_ps' cause precision loss?

Score: 0

Views: 99

Answers: 1

Read More
convoluxon
convoluxon

Reputation: 81

AVX2 Code crashes when instructions are combined

Score: 1

Views: 112

Answers: 0

Read More
Eddie-Wang
Eddie-Wang

Reputation: 21

How to implement _mm256_sign_epi8 in ARM NEON?

Score: 1

Views: 171

Answers: 1

Read More
Tin
Tin

Reputation: 23

6-bit lookup using SIMD AVX2

Score: 0

Views: 140

Answers: 1

Read More
Asher Sajid
Asher Sajid

Reputation: 21

Handling Borrow Propagation in AVX-512 Subtraction of 32-bit Chunks

Score: 2

Views: 88

Answers: 1

Read More
creativergk
creativergk

Reputation: 65

AVX2 MaskLoad/MaskStore of ushorts?

Score: 1

Views: 95

Answers: 1

Read More
creativergk
creativergk

Reputation: 65

AVX2 computing of byte array

Score: 0

Views: 151

Answers: 1

Read More
PreviousPage 2Next