Basic confusion with buffer overflow math

Question

I'm following along with a Youtube Computerphile buffer overflow tutorial to learn how it works. The tutorial says its in Kali, and I'm running Kali 64-bit to test it (I think he's running 32-bit).

He writes a simple program like this:

#include 
#include 

int main(int argc, char** argv) {

    char buffer[500];        
    strcpy(buffer, argv[1]);

    return 0;
}

Then after starting the program in GDB he runs:

(gdb) run $(python -c 'print "\x41" * 506')

and the result is a seg fault which shows that the return address was half overwritten with two 41's.

When I try to duplicate this, I need to change 506 to 522 in order to produce the same result. So my questions are:

Why does 506 only rewrite two bytes instead of three when he runs it?
Why do I need to write 522 bytes to overwrite 2 bytes in the return address? I think it has to do with him probably using 32-bit instead of 64-bit Kali, but I don't really understand how this difference adds up mathematically.
When I do disassemble main I see that after the function prologue is the instruction sub rsp, 0x210, so it looks like buffer is allocated to 528 bytes. Why this number in particular (his instead subs 0x1f4 which is exactly 500) and how does it relate to the above where greater than 520 bytes is needed to start rewriting the instruction pointer?
What is happening in the range of writing [500,520] bytes where it's more than the buffer size, but not yet writing over top of the instruction pointer?

Basic confusion with buffer overflow math

Answers (1)

Related Questions