Section 11.2 The Assignment Operator
The C/C++ assignment operator, ‘=
’, causes the expression on the right-hand side of the operator to be evaluated and the result to be associated with the variable that is named on the left-hand side. Subsequent uses of the variable name in the program will evaluate to this same value. For example,
int x; ..... x = 123;
will assign the integer \(123\) to the variable x
. If x
is later used in an expression, the value assigned to x
will be used in evaluating the expression. For example, the expression
2 * x;
would evaluate to \(246\text{.}\)
We now explore what assignment means at the assembly language level. The variable declaration,
int x;
causes a location to be allocated and that location to be given the name “x
.” That is, other parts of the program can refer to the location where the value of x
is stored by using the name “x
.”
The type name in the declaration, int
, tells the compiler how many bytes to allocate and the code used to represent the data stored at this location. The int
type uses the two's complement code. So the assignment statement,
x = 123;
would set the bit pattern in the location named x
to \(\hex{0x0000007b}\text{,}\) the two's complement code for the signed integer \(+123\text{.}\) Similarly, the assignment statement
x = -123;
would set the bit pattern in the location named x
to \(\hex{0xffffff85}\text{,}\) the two's complement code for the signed integer \(-123\text{.}\)
The program in Listing 11.2.1 uses the assignment operator to store values in the x
, y
, and z
variables. We use the register
type modifier to “advise” the compiler to use a register for the x
and y
variables.
/* assignment1.c * Assign a 32-bit pattern to a register * * 2017-09-29: Bob Plantz */ #include <stdio.h> int main(void) { register int x, y; int z; x = 123; y = 4567; z = x + y; printf("%i + %i = %i\n", x, y, z); return 0; }
The compiler-generated assembly language shown in Listing 11.2.2 shows the assignment operation implemented in three different ways.
.arch armv6 .file "assignment1.c" .section .rodata .align 2 .LC0: .ascii "%i + %i = %i\012\000" .text .align 2 .global main .syntax unified .arm .fpu vfp .type main, %function main: @ args = 0, pretend = 0, frame = 8 @ frame_needed = 1, uses_anonymous_args = 0 push {r4, r5, fp, lr} add fp, sp, #12 sub sp, sp, #8 @@ local var in stack frame mov r4, #123 @@ x = 123; ldr r5, .L3 @@ y = 4567; add r3, r4, r5 @@ x + y str r3, [fp, #-16] @@ z is in stack frame ldr r3, [fp, #-16] @@ load z mov r2, r5 @@ y mov r1, r4 @@ x ldr r0, .L3+4 @@ address of format string bl printf mov r3, #0 mov r0, r3 sub sp, fp, #12 @ sp needed pop {r4, r5, fp, pc} .L4: .align 2 .L3: .word 4567 .word .LC0 .ident "GCC: (Raspbian 6.3.0-18+rpi1) 6.3.0 20170516"
The compiler honored our request to use registers for both the x
and y
variables, and the z
variable is allocated in the stack frame.
Listing 11.2.3 shows my assembly language solution. It is essentially the same as what the compiler generated, but I have used names for labels and constants that will help with the explanation of the code.
@ assignment2.s @ Assignment three ways. @ 2017-09-29: Bob Plantz @ Define my Raspberry Pi .cpu cortex-a53 .fpu neon-fp-armv8 .syntax unified @ modern syntax @ Useful source code constants .equ z,-16 .equ local,8 @ Constant program data .section .rodata .align 2 formatMsg: .asciz "%i + %i = %i\n" @ Program code .text .align 2 .global main .type main, %function main: sub sp, sp, 12 @ space for saving regs str fp, [sp, 0] @ save fp str lr, [sp, 4] @ lr str r5, [sp, 8] @ r5 str r4, [sp, 12] @ and r4 add fp, sp, 12 @ our frame pointer sub sp, sp, local @ allocate memory for local var mov r5, 123 @ x = 123; ldr r4, yValue @ y = 4567; add r3, r5, r4 @ x + y str r3, [fp, z] @ z = x + y; ldr r0, formatMsgAddr @ printf("%i + %i = %i\n", mov r1, r5 @ x, mov r2, r4 @ y, ldr r3, [fp, z] @ z); bl printf mov r0, 0 @ return 0; add sp, sp, local @ deallocate local var ldr fp, [sp, 0] @ restore fp ldr lr, [sp, 4] @ lr ldr r5, [sp, 8] @ r5 ldr r4, [sp, 12] @ r4 add sp, sp, 12 @ and sp bx lr @ return .align 2 yValue: .word 4567 formatMsgAddr: .word formatMsg
First, notice that the values in the r4
and r5
registers must be saved on the stack in the prologue:
stmfd sp!, {r4, r5, fp, lr} @ save caller's info
and restored in the epilogue:
ldmfd sp!, {r4, r5, fp, lr} @ restore caller's info
as is specified in Table 10.1.1.
After setting up our frame pointer, we move the stack pointer to allocate space on the stack for the local variable:
add fp, sp, 12 @ our frame pointer sub sp, sp, local @ allocate memory for local var
where the value of local
was computed to (a) allow enough memory space for the int
variable, and (b) make sure the stack pointer is always on an eight-byte addressing boundary, as required by the protocol when calling a public function (printf
in this case).
You have already seen the first two assignment implementations:
mov r5, 123 @ x = 123; ldr r4, yValue @ y = 4567;
in Listing 10.1.4. The integer value, \(123\text{,}\) is within the range that can be moved directly into a register. However, \(4567\) cannot, so it is stored in memory and loaded into a register from memory.
The compiler honored our request to use registers for both the x
and y
variables. However, the z
variable is allocated in the stack frame. So after the addition is performed, the sum is stored in memory at a location relative to the frame pointer:
str r3, [fp, z] @ z = x + y;
Recall from Section 9.2 that [fp, z]
specifies the address obtained by adding the value of z
to the value contained in the fp
register. In this function z
is an offset of \(-16\) bytes from the address in fp
.
In Section 11.3 we discuss the machine code for the instructions that implement these assignment statements. In particular, we will be looking at how the location of each variable is encoded in the machine language.