Kernel Mode memory size for an x86 LARGEADDRESSAWARE program on an x64 machine?

Question

The title pretty much sums it up. I have an application compiled for x86 platforms, which has the /largeaddressaware flag set. Running it on an x64 system, I get the expanded 4GB User Mode virtual memory "for free", without having to specify the /3GB boot option. On an x86 system this would mean that the Kernel Mode memory would only be 1GB, but since x64 systems can address more memory, does the Kernel Mode retain the 2GB or even get bumped up to 3GB?

Edit: To be clear, I'm wondering about per-process limits. The question stems from reading this article.

Edit 2: This question is not a duplicate of How much memory can a 32 bit process access on a 64 bit operating system? because that question addresses only the application-accessible memory, not the system accessible memory. If I have been misunderstanding and there is no memory reserved for the system for each process, I'd appreciate it if someone could write that up as an answer. I'm sure I'm not the first to be confused about this.

Luaan · Accepted Answer

There's a few misconceptions that are confusing you.

First, let's look at 32-bit Windows. The virtual address space for each process has a certain part allocated to the process itself, and a certain part for whatever the kernel needs. However, all the processes share the same kernel memory - the fact that you even have kernel memory in your own virtual address space is basically a performance optimization to avoid having to switch address spaces when dealing with kernel objects and data in your application.

By default, this is a 1:1 split, so you get 2 GiB of user address space and 2 GiB of kernel address space. This was (ab)used by early 32-bit Windows software (when your computer might have had as little as 4 MiB of memory total with a 486 CPU or similar), because due to the way the memory was laid out, your user address space never had any pointers above the 2 GiB barriers - effectively giving you the highest bit of any pointer free for your own data. Often this was used to allow for a hybrid "if it fits, this is a value, otherwise it's a pointer to a structure" approach, saving memory and a bit of indirection. Since this is so wide-spread, the default has been the same split as in the early days to prevent compatibility issues. However, you also have a way to opt-in to a different split - 3 GiB of user space and 1 GiB of kernel space. This is what the /3GB option does. But that's not enough - your application must also opt-in using /LARGEADDRESSAWARE. This basically says "I don't do weird stuff with my pointers".

It should be noted that 32-bit OS or process doesn't necessarily mean you can only address 4 GiB of memory - it just limits what's directly accessible to the CPU at any point. For memory intensive server software, even the "32-bit" versions may have support for addressing a lot more memory - for example, the 32-bit MS SQL Server supports up to 64 GiB through AWE. This is basically another layer of virtualization which allows remapping the virtual address' physical addresses. In theory, there is no limit to the amount of memory you can address, with or without AWE - after all, nothing is preventing you from having your own hardware that acts as a memory mapped file, effectively giving you unlimited address space. Of course, like the days of segmented memory, it's not very easy to work with or practical :)

On 64-bit Windows, the /3GB no longer makes any sense and is ignored. The default address space split depends on the exact version of Windows, but is in the "terabytes and more" range, way out of the 32-bit limits. For modern Windows, this is usually 128 TiB user + 128 TiB kernel. 32-bit applications still have to use /LARGEADDRESSAWARE as before. However, since the kernel is now 64-bit, it can't be in the same address space as the user process anyway, so a 32-bit application on a 64-bit OS has full access to the 4 GiB of address space.

Of course, those limits are still well below what 64-bit is theoretically capable of addressing. However, most 64-bit CPUs actually can't address the whole 64-bit address space - the most common the last time I checked was just 48-bit. And surprise, surprise - that gives you 256 TiB of address space, the limit in Windows. Not a Microsoft conspiracy after all! :) This isn't something new, actually. The fact that Intel x86's 32-bit ALU is associated with a 32-bit address space is quite an outlier in CPU history - CPUs often have either higher and lower address space (for either virtual addressing or physical addressing) width than their ALU-width. The MS DOS typical limit of 1 MiB of addressable memory (with 640 kiB left over to user applications) comes from this as well - the "32-bit" CPUs of the time could only use 20-bit addresses.

Kernel Mode memory size for an x86 LARGEADDRESSAWARE program on an x64 machine?

Answers (2)

Related Questions