StackOverflow Questions for Tag: micro-optimization

Chayim Friedman

Reputation: 71430

What is causing the store latency in this program?

Score: 4

Views: 189

Answers: 2

no comment

Reputation: 10465

Why is `if x is None: pass` faster than `x is None` alone?

Score: 5

Views: 237

Answers: 2

njuffa

Reputation: 26195

Optimized 53->32 bit modulo computation on 32-bit processors

Score: 4

Views: 226

Answers: 1

Guy B

Reputation: 415

Is using AVX2 can implement a faster processing of LZCNT on a word array?

Score: 9

Views: 1545

Answers: 2

MaiaVictor

Reputation: 53037

Is it possible to check if 2 sets of 3 ints have at least one element in common with less than 9 comparisons?

Score: 12

Views: 366

Answers: 5

user15150266

Reputation: 65

How can I g_signal_connect() by ID rather than string name?

Score: 1

Views: 384

Answers: 1

ugo_capeto

Reputation: 338

x86 assembly abs() implementation? (revisited)

Score: 0

Views: 170

Answers: 0

Jimbo

Reputation: 3294

what's the difference between _mm256_lddqu_si256 and _mm256_loadu_si256

Score: 17

Views: 4818

Answers: 1

matrixMule

Reputation: 11

How to group static/global variables together for improved cache locality in C++

Score: 1

Views: 108

Answers: 0

Viktor Sehr

Reputation: 13099

Why doesn't the C++ standard library utilize likely/unlikely attributes?

Score: 3

Views: 418

Answers: 1

Juliean

Reputation: 1180

LEA vs MOV imm64 for loading address-constant into register

Score: 5

Views: 220

Answers: 2

sadljkfhalskdjfh

Reputation: 797

Test whether a register is zero with CMP reg,0 vs OR reg,reg?

Score: 23

Views: 12591

Answers: 2

Thomas O

Reputation: 6240

Divide by 10 using bit shifts?

Score: 65

Views: 89691

Answers: 11

Peter Cordes

Reputation: 365517

How exactly do partial registers on Haswell/Skylake perform? Writing AL seems to have a false dependency on RAX, and AH is inconsistent

Score: 53

Views: 4532

Answers: 2

DarkAtom

Reputation: 3171

How to zero certain bytes of a register?

Score: 3

Views: 138

Answers: 0

Forward

Reputation: 965

Why does mulss take only 3 cycles on Haswell, different from Agner's instruction tables? (Unrolling FP loops with multiple accumulators)

Score: 67

Views: 7529

Answers: 1

DenverCoder9

Reputation: 495

Compiler optimization of if (loop invariant) if statement inside a loop

Score: 2

Views: 3813

Answers: 5

njuffa

Reputation: 26195

Converting nucleobase representation from ASCII to UCSC .2bit

Score: 5

Views: 418

Answers: 3

CPlus

Reputation: 4848

Can packing variables or parameters into structures/unions introduce unforeseen performance penalties?

Score: 1

Views: 127

Answers: 1

sum1stolemyname

Reputation: 4550

Floating point division vs floating point multiplication

Score: 110

Views: 101902

Answers: 8
