Encoding ADC EAX, ECX - 2 different ways to encode? (arch x86)

Question

I'm looking through an Intel Instruction Set manual, and it looks like there are 2 different forms of ADC that would match/encode ADC EAX, ECX as follows:

ADC r/m32, r32  (11 /r , which encodes to 11C8)

or

ADC r32, r/m32  (13 /r, which encodes to 13C1)

My question is (given I did the math correctly), are 11C8 and 13C1 equivalent? What are the factors that an assembler would consider in selecting one encoding over another? The question is from a perspective of implementing an assembler, so the question is in general, not about this particular hypothetical instruction.

If it's a lengthy answer, please point me in a right direction as my attempts at googling it failed.

phuclv · Accepted Answer

This is redundancy of instruction encoding. Any architecture that use multiple parameters in the instruction has this.

Think of a RISC architecture that have add rx, ry, rz that assigns the sum of ry and rz into rx then you can encode add rx, ry, rz or add rx, rz, ry, they'll all be equivalent.

In x86 we (normally) have only 2 parameters for each instruction but you can select the direction between them since you can store to or read from memory. If you don't use memory then you can choose the direction between the 2 registers, so there are 2 encoding ways

You can use this to identify some compilers/assemblers. For some assemblers you can choose which encoding to use. In GAS you can use .s suffix to force it to emit the alternate encoding

10 de   adcb   %bl,%dh
12 f3   adcb.s %bl,%dh

Encoding ADC EAX, ECX - 2 different ways to encode? (arch x86)

Answers (2)

Related Questions