StackOverflow Questions for Tag: fma

user24200147
user24200147

Reputation: 1

What is considered as "2 FMAs"?

Score: 0

Views: 94

Answers: 0

Read More
Z boson
Z boson

Reputation: 33659

Optimize for fast multiplication but slow addition: FMA and doubledouble

Score: 12

Views: 2232

Answers: 3

Read More
einpoklum
einpoklum

Reputation: 131405

Is floating point expression contraction allowed in C++?

Score: 10

Views: 1507

Answers: 2

Read More
Y-Jiechao
Y-Jiechao

Reputation: 5

Why does '_mm256_fmadd_ps' cause precision loss?

Score: 0

Views: 94

Answers: 1

Read More
Anili
Anili

Reputation: 140

Achieving More FMA3 Performance Than The Theoretical Maximum

Score: 1

Views: 102

Answers: 0

Read More
Anili
Anili

Reputation: 140

High Variance In Manual Vectorization Performance

Score: 2

Views: 66

Answers: 1

Read More
cyrusbehr
cyrusbehr

Reputation: 1301

AVX2: Computing dot product of 512 float arrays

Score: 23

Views: 13028

Answers: 3

Read More
Serge Rogatch
Serge Rogatch

Reputation: 15020

Can we replace XOR with multiply-add?

Score: 0

Views: 215

Answers: 0

Read More
Martin Brown
Martin Brown

Reputation: 2649

Why this AVX2 slowdown with FMA x86 MS C Compiler?

Score: 0

Views: 110

Answers: 0

Read More
MSalters
MSalters

Reputation: 179779

How to get data out of AVX registers?

Score: 16

Views: 14470

Answers: 5

Read More
Fedor
Fedor

Reputation: 20997

Clang fused multiply-add depends on constancy of expression arguments

Score: 5

Views: 290

Answers: 0

Read More
Christian Sarofeen
Christian Sarofeen

Reputation: 2250

Accurate method to calculate double FMA and Shared memory latency

Score: 3

Views: 394

Answers: 1

Read More
anna
anna

Reputation: 59

v4fmaddps instructions for packed 32-bit integers

Score: 1

Views: 137

Answers: 0

Read More
BadHellie
BadHellie

Reputation: 307

GCC 12 (minGW 64): how to enable fused multiply add code generation

Score: 0

Views: 237

Answers: 0

Read More
xiaohuihui
xiaohuihui

Reputation: 45

How should I implement a generic FMA/FMAF instruction in software?

Score: 2

Views: 549

Answers: 1

Read More
A.nechi
A.nechi

Reputation: 541

FMA intrinsics not working: is it Hardware or Compiler?

Score: 4

Views: 2191

Answers: 2

Read More
pem
pem

Reputation: 465

Fast fixed-size polynomial evaluation: MSVC vs GCC

Score: 3

Views: 110

Answers: 0

Read More
Peter
Peter

Reputation: 38455

Fastest way to multiply and sum/add two arrays (dot product) - unaligned surprisingly faster than FMA

Score: 5

Views: 1435

Answers: 1

Read More
tigertang
tigertang

Reputation: 457

Deleteing initialization leads to avx2 fma performance drop. Why?

Score: 1

Views: 95

Answers: 0

Read More
Mirco Mannino
Mirco Mannino

Reputation: 35

Latency and number of FMA units

Score: 1

Views: 227

Answers: 0

Read More
PreviousPage 1Next