Stack Overflow Questions for Tag: micro-optimization

Venusaur

Reputation: 385

Is it "too clever" for using LEA to load constant to register?

Score: 6

Views: 183

Answers: 2

Armen Michaeli

Reputation: 9170

Use two loop bodies or one (result identical)?

Score: 13

Views: 1865

Answers: 7

vengy

Reputation: 2257

Using the operand-size override prefix 0x66 for instruction alignment

Score: 4

Views: 1049

Answers: 1

ListigerLurch

Reputation: 73

Fast copy and replicate (or fill) byte array with another byte array

Score: 3

Views: 643

Answers: 1

Jeremy Salwen

Reputation: 8418

Extract fractional part of double *efficiently* in C

Score: 25

Views: 47092

Answers: 7

12345ieee

Reputation: 509

Optimize std::vector comparison to memcmp

Score: 3

Views: 198

Answers: 0

eof

Reputation: 557

Parsing 12 byte buffer into two u64

Score: 0

Views: 92

Answers: 0

Per

Reputation: 39

optimize indexing while inside of a loop

Score: 1

Views: 136

Answers: 0

Michael Liu

Reputation: 55499

Why is my operator ++ more than twice as fast as its equivalent instance method?

Score: 4

Views: 236

Answers: 2

BeeOnRope

Reputation: 65046

Is performance reduced when executing loops whose uop count is not a multiple of processor width?

Score: 46

Views: 11532

Answers: 3

micycle

Reputation: 3820

Optimal algorithm for distance between one point and all others in a grid

Score: 2

Views: 1496

Answers: 2

doraemon

Reputation: 2512

what is the purpose of using index caches in rigtorp's SPSCQueue

Score: 1

Views: 178

Answers: 2

njuffa

Reputation: 26185

Branchless count-leading-zeros on 32-bit RISC-V without Zbb extension

Score: 8

Views: 442

Answers: 3

Greg Kennedy

Reputation: 644

Performance of local variable vs. array access

Score: 5

Views: 3175

Answers: 2

Alex Guteniev

Reputation: 13719

What is the correct and optimal implementation of atomic_ref and atomic with padding bits?

Score: 3

Views: 531

Answers: 1

njuffa

Reputation: 26185

Fast BCD addition

Score: 14

Views: 902

Answers: 1

Joseph

Reputation: 401

Which is the best way, in C, to see if a number is divisible by another?

Score: 4

Views: 32873

Answers: 6

billpg

Reputation: 3293

What is the most optimal way to use a C# struct as the key of a dictionary?

Score: 8

Views: 2862

Answers: 2

qwark

Reputation: 533

Very fast approximate Logarithm (natural log) function in C++?

Score: 20

Views: 28654

Answers: 8

user2059893

Reputation: 447

Is there any data on the latency of an AVX2 gather instruction?

Score: 5

Views: 1722

Answers: 3
