StackOverflow Questions for Tag: avx2

Thomas42
Thomas42

Reputation: 21

Shuffle in between two ymm registers and fill with zeroes?

Score: 2

Views: 45

Answers: 0

Read More
geza
geza

Reputation: 29970

What can cause an AVX2 dot product to perform differently for different runs?

Score: 8

Views: 186

Answers: 1

Read More
fabian
fabian

Reputation: 1881

Why does _mm256_unpacklo "jump" a double-word and where does it says so in the documentation?

Score: 4

Views: 87

Answers: 1

Read More
xiver77
xiver77

Reputation: 2322

Best way to mask a single bit in AVX2?

Score: 4

Views: 1022

Answers: 3

Read More
Srihari S
Srihari S

Reputation: 97

Do all processors supporting AVX2 support F16C?

Score: 1

Views: 142

Answers: 1

Read More
Steve Burns
Steve Burns

Reputation: 319

What is the inverse of "_mm256_cvtepi16_epi32"

Score: 6

Views: 1596

Answers: 1

Read More
Kevin Meier
Kevin Meier

Reputation: 2582

AVX2: Get every second int32

Score: 2

Views: 117

Answers: 2

Read More
Etienne Sauvage
Etienne Sauvage

Reputation: 43

How to transform SSE assembly code to AVX1/2 assembly code?

Score: 1

Views: 116

Answers: 1

Read More
kdh
kdh

Reputation: 61

I need more performance for int8 vector multiplication (Intel AVX-512)

Score: 1

Views: 243

Answers: 1

Read More
einpoklum
einpoklum

Reputation: 132128

Counting 1 bits (population count) on large data using AVX-512 or AVX-2

Score: 9

Views: 8065

Answers: 2

Read More
E_1996
E_1996

Reputation: 89

C++ AVX2 custom functions (e.g., "exp") not working on Windows (but work on Linux)

Score: 0

Views: 112

Answers: 0

Read More
E_1996
E_1996

Reputation: 89

C++ AVX2 Function Pointers/std::function not working on Windows (but work on Linux)

Score: 1

Views: 113

Answers: 0

Read More
Christoph Diegelmann
Christoph Diegelmann

Reputation: 2034

Fallback implementation for conflict detection in AVX2

Score: 13

Views: 1147

Answers: 1

Read More
Kevin Meier
Kevin Meier

Reputation: 2582

AVX2 / gcc: Improve CPU-level parallelism by using different registers

Score: 2

Views: 74

Answers: 1

Read More
Levi Gibson
Levi Gibson

Reputation: 61

How to vectorise multiplication of an int8 array by an int16 constant, widening to int32 result array, in C (AVX2)

Score: 5

Views: 820

Answers: 2

Read More
degski
degski

Reputation: 672

How to implement lane crossing logical bit-wise shift/rotate (left and right) in AVX2

Score: 7

Views: 1179

Answers: 2

Read More
user1196549
user1196549

Reputation:

Emulating byte-shifts on 32 bytes with AVX (lane-crossing)

Score: 12

Views: 5531

Answers: 3

Read More
Sam
Sam

Reputation: 137

AVX 32-bit integer to double precision float best practice

Score: -1

Views: 115

Answers: 2

Read More
MouseWarrior
MouseWarrior

Reputation: 61

Differences between AVX and AVX2

Score: 5

Views: 16052

Answers: 1

Read More
OC87
OC87

Reputation: 41

SIMD unpack 12-bit fields to 16-bit

Score: 4

Views: 1816

Answers: 1

Read More
PreviousPage 1Next