StackOverflow Questions for Tag: avx512

Sacha
Sacha

Reputation: 883

AVX512 dot product of 64-bit vector of booleans with 512-bit vector of bytes

Score: 2

Views: 48

Answers: 1

Read More
wrgj
wrgj

Reputation: 41

Does AVX-512 scatter work efficiently on Zen 5?

Score: 0

Views: 51

Answers: 0

Read More
vitsoft
vitsoft

Reputation: 5805

AVX512 vector length and SAE control

Score: 5

Views: 976

Answers: 3

Read More
kdh
kdh

Reputation: 61

I need more performance for int8 vector multiplication (Intel AVX-512)

Score: 1

Views: 243

Answers: 1

Read More
einpoklum
einpoklum

Reputation: 132128

Counting 1 bits (population count) on large data using AVX-512 or AVX-2

Score: 9

Views: 8065

Answers: 2

Read More
nckm
nckm

Reputation: 133

How to merge two YMM registers into single ZMM but interleave?

Score: 0

Views: 46

Answers: 0

Read More
Johan Daniel
Johan Daniel

Reputation: 103

How to use SIMD effectively to count 4-character matches in a large word-search grid (including vertical and diagonal)?

Score: 2

Views: 280

Answers: 3

Read More
fabian
fabian

Reputation: 1881

How to Load and Store data for the new AVX-VNNI and Arm Neon MMLA instructions efficiently?

Score: 1

Views: 104

Answers: 1

Read More
Christoph Diegelmann
Christoph Diegelmann

Reputation: 2034

Fallback implementation for conflict detection in AVX2

Score: 13

Views: 1147

Answers: 1

Read More
kdh
kdh

Reputation: 61

Efficient way for using int8 AVX512-VNNI instruction, especially about loading the data to zmm register

Score: 2

Views: 124

Answers: 1

Read More
Rom098
Rom098

Reputation: 2603

Enabling AVX512 support on compilation significantly decreases performance

Score: 9

Views: 7840

Answers: 1

Read More
Joe Doliner
Joe Doliner

Reputation: 2240

AVX512 assembly breaks when called concurrently from different goroutines

Score: 11

Views: 761

Answers: 1

Read More
pratikpc
pratikpc

Reputation: 680

Why do GCC, ICX and Clang not auto-vectorize using AVX-512 based instructions on Intel processors but do the same on AMD?

Score: 2

Views: 205

Answers: 0

Read More
JB_User
JB_User

Reputation: 3267

What exactly do the gcc compiler switches (-mavx -mavx2 -mavx512f) do?

Score: 7

Views: 14725

Answers: 2

Read More
user12316363
user12316363

Reputation:

How to understand this AVX addition of two _m256i variables?

Score: 3

Views: 85

Answers: 1

Read More
RTC222
RTC222

Reputation: 2323

Emulate AVX512 VPCOMPRESSB byte packing without AVX512_VBMI2

Score: 2

Views: 1007

Answers: 1

Read More
Baaing Cow
Baaing Cow

Reputation: 1512

Multiply vectors of 32 bit integers, taking only high 32 bits

Score: 2

Views: 1375

Answers: 2

Read More
user24200147
user24200147

Reputation: 1

What is considered as "2 FMAs"?

Score: 0

Views: 106

Answers: 0

Read More
Hasturnius
Hasturnius

Reputation: 13

What is the alternative method for Avx2.MoveMask in Vector512<T>

Score: 1

Views: 138

Answers: 2

Read More
VarianceOfOne
VarianceOfOne

Reputation: 21

enable avx512 zmm registers in gdb

Score: 2

Views: 104

Answers: 0

Read More
PreviousPage 1Next