MerickOWA
MerickOWA

Reputation: 7602

Alignment 16 not respected when using auto keyword?

Compiling with VS2012 and working with the DirectXMath library, I encountered an issue where it appeared that the compiler wasn't aligning my XMMATRIX. I simplified the issue down to the following.

#include <DirectXMath.h>
using namespace DirectX;

int _tmain(int argc, _TCHAR* argv[])
{
  auto m1 = XMMatrixIdentity();
  auto m2 = XMMatrixIdentity();

  auto t1 = XMMatrixTranspose( m1 ); // sometimes access violation occurs here
  auto t2 = XMMatrixTranspose( m2 ); // or sometimes here

  return 0;
}

Re-running the code over and over will sometimes cause an "Access violation reading location 0xFFFFFFFF" on the first transpose, sometimes on the second.

I've figured out this is due to the fact that m1 and m2 are not being properly aligned. Replacing "auto" with "XMMATRIX" seems to solve the issue, so my suspicion is a compiler bug, but its also possible that I'm doing something wrong, or not enabling some option.

Is there something wrong with my code or is this a compiler bug?

Upvotes: 14

Views: 1221

Answers (3)

user3728706
user3728706

Reputation: 11

For Visual Studio 2012, I was able to implement a "work-around" it by splitting the statement in two lines:

XMMATRIX mtxMyWorldTrnspd = mtxMyWorld;
mtxMyWorldTrnspd = XMMatrixTranspose(mtxMyWorldTrnspd);

Upvotes: 0

Roman Zavalov
Roman Zavalov

Reputation: 585

This is definitely a compiler bug. I can reproduce it too with my own math classes. I've opened a ticket here so vote for it. You can avoid using the auto keyword, as was mentioned. Or you can compile to x64 where the bug is not present (or at least i was unable to reproduce it; my not-so-simple program works perfectly).

Upvotes: 2

jleahy
jleahy

Reputation: 16875

The definition for XMMATRIX has the following in the header file (xnamath.h), although this could be different in your version.

// Matrix type: Sixteen 32 bit floating point components aligned on a
// 16 byte boundary and mapped to four hardware vector registers
#if (defined(_XM_X86_) || defined(_XM_X64_)) && defined(_XM_NO_INTRINSICS_)
typedef struct _XMMATRIX
#else
typedef _DECLSPEC_ALIGN_16_ struct _XMMATRIX
#endif

So XMMATRIX is defined with __declspec(align(16)) (if you look through the header files it does reduce to this), which is a Microsoft specific extension. It's not a macro. This means it's a compiler bug, the compiler is failing to propagate these proprietary attributes to variables defined with the auto keyword.

It's probably best to just avoid the use of the auto keyword in this case, it's probably neater than explicitly adding the declspec yourself.

Upvotes: 3

Related Questions