StackOverflow Questions for Tag: sse

Edison von Myosotis
Edison von Myosotis

Reputation: 643

double to float NaN-conversion

Score: 0

Views: 127

Answers: 1

Read More
Michał Brodziak
Michał Brodziak

Reputation: 11

Vectorized function slower than reference when compiled without optimization (multiple 4-element FP dot-products)

Score: 0

Views: 94

Answers: 0

Read More
user2088790
user2088790

Reputation:

How to use Fused Multiply-Add (FMA) instructions with SSE/AVX

Score: 48

Views: 45105

Answers: 2

Read More
Baba Dan Constantin
Baba Dan Constantin

Reputation: 123

SSE4.1 slower than SSE3 on 4x4 matrix multiplication?

Score: 3

Views: 100

Answers: 1

Read More
superb owl
superb owl

Reputation: 185

Does SSE/AVX provide a means of determining if a result was rounded up?

Score: 4

Views: 738

Answers: 1

Read More
song xs
song xs

Reputation: 81

Is floating point math determinstic for all intel/amd cpus?

Score: 1

Views: 144

Answers: 2

Read More
Filip Navara
Filip Navara

Reputation: 4828

Write access violation on read instruction (MOVQ load on old Athlon XP)

Score: 2

Views: 1085

Answers: 1

Read More
Brendan McKay
Brendan McKay

Reputation: 138

Portably testing for the POPCNT instruction

Score: 2

Views: 1758

Answers: 2

Read More
NeomerArcana
NeomerArcana

Reputation: 2311

What series of intrinsics will complete this paeth prediction code?

Score: 0

Views: 277

Answers: 2

Read More
tdenniston
tdenniston

Reputation: 3519

Calculating constants for CRC32 using PCLMULQDQ

Score: 2

Views: 2219

Answers: 2

Read More
Gordon Kindlmann
Gordon Kindlmann

Reputation: 61

Classification of x86 instructions according to floating point rounding mode sensitivity?

Score: 5

Views: 875

Answers: 1

Read More
St.Antario
St.Antario

Reputation: 27435

Why do x86 FP compares set CF like unsigned integers, instead of using signed conditions?

Score: 4

Views: 847

Answers: 2

Read More
formateu
formateu

Reputation: 167

Intel x86_64 assembly compare signed double precision floats

Score: 3

Views: 4027

Answers: 3

Read More
kalwalt
kalwalt

Reputation: 492

How correctly convert _mm_hadd_epi32 to equivalent C++ code (wasm_simd128.h) with Emscripten?

Score: 1

Views: 128

Answers: 1

Read More
plasmacel
plasmacel

Reputation: 8538

How to efficiently perform double/int64 conversions with SSE/AVX?

Score: 28

Views: 11163

Answers: 4

Read More
Mathai
Mathai

Reputation: 839

Is there a way to utilize all XMM registers?

Score: 6

Views: 3000

Answers: 3

Read More
Gabriel Lim
Gabriel Lim

Reputation: 13

Output errors when using libmvec intrinsics for trigo functions manually (like cosf)

Score: 1

Views: 40

Answers: 1

Read More
invertedPanda
invertedPanda

Reputation: 328

How to optimize cell-width measuring with SIMD (find the first column to have a non-zero byte in an 8x8 block of bytes)

Score: 3

Views: 139

Answers: 2

Read More
user18490
user18490

Reputation: 3839

Is worth using SSE or should I just rely on the compiler?

Score: 3

Views: 551

Answers: 2

Read More
Ovinus Real
Ovinus Real

Reputation: 568

Why does x86 ldmxcsr/stmxcsr take a memory operand?

Score: 2

Views: 557

Answers: 1

Read More
PreviousPage 1Next