StackOverflow Questions for Tag: avx

user2088790

Reputation:

How to use Fused Multiply-Add (FMA) instructions with SSE/AVX

Score: 48

Views: 45059

Answers: 2

superb owl

Reputation: 185

Does SSE/AVX provide a means of determining if a result was rounded up?

Score: 4

Views: 737

Answers: 1

Bioheater

Reputation: 31

Can you disable AVX in deepfacelab?

Score: 0

Views: 504

Answers: 1

xiver77

Reputation: 2322

Best way to mask a single bit in AVX2?

Score: 4

Views: 1010

Answers: 3

plasmacel

Reputation: 8538

How to efficiently perform double/int64 conversions with SSE/AVX?

Score: 28

Views: 11146

Answers: 4

Steve Burns

Reputation: 319

What is the inverse of "_mm256_cvtepi16_epi32"

Score: 6

Views: 1595

Answers: 1

Kevin Meier

Reputation: 2582

AVX2: Get every second int32

Score: 2

Views: 116

Answers: 2

invertedPanda

Reputation: 328

How to optimize cell-width measuring with SIMD (find the first column to have a non-zero byte in an 8x8 block of bytes)

Score: 3

Views: 138

Answers: 2

kdh

Reputation: 43

I need more performance for int8 vector multiplication (Intel AVX-512)

Score: 1

Views: 240

Answers: 1

Sofiane Benzait

Reputation: 49

How to have access to pointer to pointer to varying data (T**) using ISPC intel compiler

Score: 1

Views: 64

Answers: 1

Johan Daniel

Reputation: 103

How to use SIMD effectively to count 4-character matches in a large word-search grid (including vertical and diagonal)?

Score: 2

Views: 277

Answers: 3

E_1996

Reputation: 89

C++ AVX2 custom functions (e.g., "exp") not working on Windows (but work on Linux)

Score: 0

Views: 110

Answers: 0

elvencode

Reputation: 91

Have different optimizations (plain, SSE, AVX) in the same executable with C/C++

Score: 7

Views: 2680

Answers: 3

fabian

Reputation: 1881

Why are consecutive AVX2 loads producing so many `vmovaps` operations?

Score: 1

Views: 68

Answers: 0

namea hang

Reputation: 11

Is fusing shuffle(tbl)+cvt+add possible at the same throughput as a MAD (multiply-add) SIMD instruction on AVX or NEON?

Score: 0

Views: 46

Answers: 0

kdh

Reputation: 43

Efficient way for using int8 AVX512-VNNI instruction, especially about loading the data to zmm register

Score: 2

Views: 117

Answers: 1

Sam

Reputation: 137

AVX 32-bit integer to double precision float best practice

Score: -1

Views: 115

Answers: 2


Have I written these sha256 #define's the correct way?

Score: 1

Views: 83

Answers: 1

namea hang

Reputation: 11

Which execution ports can SIMD shuffles use for AVX2 and NEON?

Score: 1

Views: 94

Answers: 1

BeeOnRope

Reputation: 65006

What is the difference between shuffle and permute

Score: 6

Views: 4676

Answers: 1
