Procedure Computer Organization and Assembly Languages Yung-Yu Chuang 2007/12/24 with slides by Kip Irvine Overview • • • • • Stack Operations Defining and Using Procedures Stack frames, parameters and local variables Recursion Related directives 2 Stack operations Stacks • LIFO (Last-In, First-Out) data structure. • push/pop operations • You probably have had experiences on implementing it in high-level languages. • Here, we concentrate on runtime stack, directly supported by hardware in the CPU. It is essential for calling and returning from procedures. 4 Runtime stack • Managed by the CPU, using two registers – SS (stack segment) – ESP (stack pointer) * : point to the top of the stack usually modified by CALL, RET, PUSH and POP SS stack segment ESP memory * SP in Real-address mode 5 PUSH and POP instructions • PUSH syntax: – PUSH r/m16 – PUSH r/m32 – PUSH imm32 • POP syntax: – POP r/m16 – POP r/m32 6 PUSH operation (1 of 2) • A push operation decrements the stack pointer by 2 or 4 (depending on operands) and copies a value into the location pointed to by the stack pointer. 0FEC 0FEC 0FF0 0FF0 0FF4 PUSH 0A5h 0FF8 0FF8 ESP 0FFC ESP 1000 0FF4 00000006 0FFC 1000 000000A5 00000006 7 PUSH operation (2 of 2) • The same stack after pushing two more integers: 0FEC 0FEC 0FF0 0FF0 ESP 0FF4 ESP 0FF8 0FFC 1000 00000001 000000A5 00000006 PUSH 01h 0FF4 0FF8 0FFC 1000 00000002 00000001 000000A5 00000006 PUSH 02h 8 POP operation • Copies value at stack[ESP] into a register or variable. • Adds n to ESP, where n is either 2 or 4, depending on the attribute of the operand receiving the data ESP 0FEC 0FEC 0FF0 0FF0 0FF4 0FF4 0FF8 0FFC 1000 00000002 00000001 ESP 0FFC 000000A5 00000006 0FF8 1000 POP EAX 00000001 000000A5 00000006 EAX=00000002 9 When to use stacks • • • • • Temporary save area for registers To save return address for CALL To pass arguments Local variables Applications which have LIFO nature, such as reversing a string 10 Example of using stacks Save and restore registers when they contain important values. Note that the PUSH and POP instructions are in the opposite order: push esi push ecx push ebx ; push registers mov esi,OFFSET dwordVal ; starting OFFSET mov ecx,LENGTHOF dwordVal; number of units mov ebx,TYPE dwordVal ;size of a doubleword call DumpMem ; display memory pop ebx pop ecx pop esi ; opposite order 11 Example: Nested Loop When creating a nested loop, push the outer loop counter before entering the inner loop: mov ecx,100 L1: push ecx ; set outer loop count ; begin the outer loop ; save outer loop count mov ecx,20 L2: ; ; loop L2 ; set inner loop count ; begin the inner loop pop ecx loop L1 ; repeat the inner loop ; restore outer loop count ; repeat the outer loop 12 Example: reversing a string .data aName BYTE "Abraham Lincoln",0 nameSize = ($ - aName) – 1 .code main PROC ; Push the name on the stack. mov ecx,nameSize mov esi,0 L1: movzx eax,aName[esi] ; get character push eax ; push on stack inc esi Loop L1 13 Example: reversing a string ; Pop the name from the stack, in reverse, ; and store in the aName array. mov ecx,nameSize mov esi,0 L2: pop eax ; get character mov aName[esi],al ; store in string inc esi Loop L2 exit main ENDP END main 14 Related instructions • PUSHFD and POPFD – push and pop the EFLAGS register – LAHF, SAHF are other ways to save flags • PUSHAD pushes the 32-bit general-purpose registers on the stack in the following order – EAX, ECX, EDX, EBX, ESP, EBP, ESI, EDI • POPAD pops the same registers off the stack in reverse order – PUSHA and POPA do the same for 16-bit registers 15 Example MySub PROC pushad ... ; modify some register ... popad Do not use this if your procedure uses ret registers for return values MySub ENDP 16 Defining and using procedures Creating Procedures • Large problems can be divided into smaller tasks to make them more manageable • A procedure is the ASM equivalent of a Java or C++ function • Following is an assembly language procedure named sample: sample PROC . . ret sample ENDP A named block of statements that ends with a return. 18 Documenting procedures Suggested documentation for each procedure: • A description of all tasks accomplished by the procedure. • Receives: A list of input parameters; state their usage and requirements. • Returns: A description of values returned by the procedure. • Requires: Optional list of requirements called preconditions that must be satisfied before the procedure is called. For example, a procedure of drawing lines could assume that display adapter is already in graphics mode. 19 Example: SumOf procedure ;----------------------------------------------SumOf PROC ; ; Calculates and returns the sum of three 32-bit ; integers. ; Receives: EAX, EBX, ECX, the three integers. ; May be signed or unsigned. ; Returns: EAX = sum, and the status flags ; (Carry, Overflow, etc.) are changed. ; Requires: nothing ;----------------------------------------------add eax,ebx add eax,ecx ret SumOf ENDP 20 CALL and RET instructions • The CALL instruction calls a procedure – pushes offset of next instruction on the stack – copies the address of the called procedure into EIP • The RET instruction returns from a procedure – pops top of stack into EIP • We used jl and jr in our toy computer for CALL and RET, BL and MOV PC, LR in ARM. 21 CALL-RET example (1 of 2) 0000025 is the offset of the instruction immediately following the CALL instruction 00000040 is the offset of the first instruction inside MySub main PROC 00000020 call MySub 00000025 mov eax,ebx . . main ENDP MySub PROC 00000040 mov eax,edx . . ret MySub ENDP 22 CALL-RET example (2 of 2) The CALL instruction pushes 00000025 onto the stack, and loads 00000040 into EIP ESP The RET instruction pops 00000025 from the stack into EIP 00000040 00000025 EIP 00000025 ESP 00000025 EIP 23 Nested procedure calls 0050 0100 0150 0200 0250 0300 main PROC . . call Sub1 exit main ENDP Sub1 PROC . . call Sub2 ret Sub1 ENDP EIP Sub2 PROC . . call Sub3 ret Sub2 ENDP Sub3 PROC . . ret Sub3 ENDP Stack 24 Local and global labels A local label is visible only to statements inside the same procedure. A global label is visible everywhere. main PROC jmp L2 L1:: exit main ENDP sub2 PROC L2: jmp L1 ret sub2 ENDP ; error! ; global label ; local label ; ok 25 Procedure parameters (1 of 3) • A good procedure might be usable in many different programs • Parameters help to make procedures flexible because parameter values can change at runtime • General registers can be used to pass parameters 26 Procedure parameters (2 of 3) The ArraySum procedure calculates the sum of an array. It makes two references to specific variable names: ArraySum PROC mov esi,0 mov eax,0 ; array index ; set the sum to zero L1: add eax,myArray[esi] ; add each integer to sum add esi,4 ; point to next integer loop L1 ; repeat for array size mov theSum,eax ret ArraySum ENDP ; store the sum 27 Procedure parameters (3 of 3) This version returns the sum of any doubleword array whose address is in ESI. The sum is returned in EAX: ArraySum PROC ; Recevies: ESI points to an array of doublewords, ; ECX = number of array elements. ; Returns: EAX = sum ;-----------------------------------------------push esi push ecx mov eax,0 ; set the sum to zero L1: add eax,[esi] ; add each integer to sum add esi,4 ; point to next integer loop L1 ; repeat for array size pop ecx pop esi ret ArraySum ENDP 28 Calling ArraySum .data array DWORD 10000h, 20000h, 30000h, 40000h theSum DWORD ? .code main PROC mov esi, OFFSET array mov ecx, LENGTHOF array call ArraySum mov theSum, eax 29 USES operator • Lists the registers that will be saved (to avoid side effects) (return register shouldn’t be saved) ArraySum PROC USES esi ecx mov eax,0 ; set the sum to zero ... MASM generates the following code: ArraySum PROC push esi push ecx . . pop ecx pop esi ret ArraySum ENDP 30 Stack frames, parameters and local variables Stack frame • Also known as an activation record • Area of the stack set aside for a procedure's return address, passed parameters, saved registers, and local variables • Created by the following steps: – Calling procedure pushes arguments on the stack and calls the procedure. – The subroutine is called, causing the return address to be pushed on the stack. – The called procedure pushes EBP on the stack, and sets EBP to ESP. – If local variables are needed, a constant is subtracted from ESP to make room on the stack. – The registers needed to be saved are pushed. 32 Stack frame ESP saved registers [EBP-4] EBP [EBP+4] ESP EBP ebp local variables ebp ebp ret addr [EBP+8] parameters ebp 33 Explicit access to stack parameters • A procedure can explicitly access stack parameters using constant offsets from EBP. – Example: [ebp + 8] • EBP is often called the base pointer or frame pointer because it holds the base address of the stack frame. • EBP does not change value during the procedure. • EBP must be restored to its original value when a procedure returns. 34 Parameters • Two types: register parameters and stack parameters. • Stack parameters are more convenient than register parameters. pushad mov esi,OFFSET array mov ecx,LENGTHOF array mov ebx,TYPE array call DumpMem popad register parameters push push push call TYPE array LENGTHOF array OFFSET array DumpMem stack parameters 35 Parameters call by value call by reference int sum=AddTwo(a, b); int sum=AddTwo(&a, &b); .date a DWORD b DWORD push b push a call AddTwo 5 6 5 6 push OFFSET b push OFFSET a call AddTwo ESP offset(a) ESP offset(b) 36 Stack frame example .data sum DWORD ? .code push 6 push 5 call AddTwo mov sum,eax ; ; ; ; AddTwo PROC push ebp mov ebp,esp . . second argument first argument EAX = sum save the sum ebp ESP EBP ret addr [EBP+4] 5 [EBP+8] 6 [EBP+12] 37 Stack frame example AddTwo PROC push ebp mov ebp,esp ; base of stack frame mov eax,[ebp + 12] ; second argument (6) add eax,[ebp + 8] ; first argument (5) pop ebp ret 8 ; clean up the stack AddTwo ENDP ; EAX contains the sum Who should be responsible to remove arguments? It depends on the language model. ebp EBP ret addr [EBP+4] 5 [EBP+8] 6 [EBP+12] 38 RET Instruction • Return from subroutine • Pops stack into the instruction pointer (EIP or IP). Control transfers to the target address. • Syntax: – RET – RET n • Optional operand n causes n bytes to be added to the stack pointer after EIP (or IP) is assigned a value. 39 Passing arguments by reference • The ArrayFill procedure fills an array with 16-bit random integers • The calling program passes the address of the array, along with a count of the number of array elements: .data count = 100 array WORD count DUP(?) .code push OFFSET array push COUNT call ArrayFill 40 Passing arguments by reference ArrayFill can reference an array without knowing the array's name: ArrayFill PROC push ebp mov ebp,esp pushad mov esi,[ebp+12] mov ecx,[ebp+8] . . ebp EBP ret addr [EBP+4] count [EBP+8] offset(array) [EBP+12] 41 Passing 8-bit and 16-bit arguments • When passing stack argements, it is best to push 32-bit operands to keep ESP aligned on a doubleword boundary. Uppercase PROC push ebp mov ebp, esp mov al, [ebp+8] cmp al, ‘a’ jb L1 cmp al, ‘z’ ja L1 sub al, 32 L1: pop ebp ret 4 Uppercase ENDP push Call ‘x’ ; error Uppercase .data charVal BYTE ‘x’ .code movzx eax, charVal push eax Call Uppercase 42 Saving and restoring registers • When using stack parameters, avoid USES. MySub2 push mov mov pop ret MySub2 PROC USES ecx, edx ebp ebp, esp eax, [ebp+8] ebp 4 ENDP ESP,EBP ebp edx ecx MySub2 push push push mov mov pop pop pop ret MySub2 PROC ecx edx ebp ebp, esp eax, [ebp+8] ebp edx ecx 4 ENDP [EBP+8] ret addr parameter [EBP+16] 43 Local variables • The variables defined in the data segment can be taken as static global variables. visibility=the whole program lifetime=program duration • A local variable is created, used, and destroyed within a single procedure (block) • Advantages of local variables: – – – – Restricted access: easy to debug, less error prone Efficient memory usage Same names can be used in two different procedures Essential for recursion 44 Creating local variables • Local variables are created on the runtime stack, usually above EBP. • To explicitly create local variables, subtract their total size from ESP. MySub PROC push ebp mov ebp,esp sub esp,8 mov [ebp-4],123456h mov [ebp-8],0 . [EBP-4] ESP EBP [EBP+4] [EBP+8] ebp ret addr … . [EBP-8] 45 Local variables • They can’t be initialized at assembly time but can be assigned to default values at runtime. MySub PROC push ebp void MySub() mov ebp, esp sub esp, 8 { mov DWORD PTR [ebp-4], 10 int X=10; mov DWORD PTR [ebp-8], 20 int Y=20; ... ... mov esp, ebp } pop ebp ret MySub ENDP 20 10 EBP return address stack ESP EBP 46 Local variables X_local EQU DWORD PTR [ebp-4] Y_local EQU DWORD PTR [ebp-8] MySub PROC push ebp mov ebp, esp sub esp, 8 X_local, mov DWORD PTR 10 [ebp-4], 10 Y_local, mov DWORD PTR 20 [ebp-8], 20 ... mov esp, ebp pop ebp ret MySub ENDP 47 LEA instruction (load effective address) • The LEA instruction returns offsets of both direct and indirect operands at run time. – OFFSET only returns constant offsets (assemble time). • LEA is required when obtaining the offset of a stack parameter or local variable. For example: CopyString PROC, count:DWORD LOCAL temp[20]:BYTE mov mov lea lea edi,OFFSET count; invalid operand esi,OFFSET temp ; invalid operand edi,count ; ok esi,temp ; ok 48 LEA example void makeArray() makeArray PROC { push ebp char myString[30]; mov ebp, esp for (int i=0; i<30; i++) sub esp, 32 myString[i]=‘*’; lea esi, [ebp-30] } mov ecx, 30 L1: mov BYTE PTR [esi], ‘*’ inc esi loop L1 add esp 32 pop ebp ret makeArray ENDP 49 ENTER and LEAVE • ENTER instruction creates stack frame for a called procedure – pushes EBP on the stack push ebp – set EBP to the base of stack frame mov ebp, esp – reserves space for local variables sub esp, n • ENTER nbytes, nestinglevel – nbytes (for local variables) is rounded up to a multiple of 4 to keep ESP on a doubleword boundary – nestinglevel: 0 for now MySub PROC enter 8,0 MySub PROC push ebp mov ebp,esp sub esp,8 50 ENTER and LEAVE • LEAVE reverses the action of a previous ENTER instruction. MySub PROC enter 8, 0 . . . . leave ret MySub ENDP MySub PROC push ebp mov ebp, esp sub esp, 8 . . mov esp, ebp pop ebp ret MySub ENDP 51 LOCAL directive • The LOCAL directive declares a list of local variables – immediately follows the PROC directive – each variable is assigned a type • Syntax: LOCAL varlist Example: MySub PROC LOCAL var1:BYTE, var2:WORD, var3:SDWORD 52 MASM-generated code BubbleSort PROC LOCAL temp:DWORD, SwapFlag:BYTE . . . ret BubbleSort ENDP MASM generates the following code: BubbleSort PROC push ebp mov ebp,esp add esp,0FFFFFFF8h ; add -8 to ESP . . . mov esp,ebp pop ebp ret BubbleSort ENDP 53 Non-Doubleword Local Variables • Local variables can be different sizes • How are they created in the stack by LOCAL directive: – 8-bit: assigned to next available byte – 16-bit: assigned to next even (word) boundary – 32-bit: assigned to next doubleword boundary 54 MASM-generated code [EBP-8] ESP SwapFlag temp [EBP-4] EBP ebp mov mov eax, temp bl, SwapFlag mov mov eax, [ebp-4] bl, [ebp-5] 55 Reserving stack space • .STACK 4096 • Sub1 calls Sub2, Sub2 calls Sub3, how many bytes will you need in the stack? Sub1 PROC LOCAL array1[50]:DWORD ; 200 bytes Sub2 PROC LOCAL array2[80]:WORD ; 160 bytes Sub3 PROC LOCAL array3[300]:WORD ; 300 bytes 660+8(ret addr)+saved registers… 56 Recursion Recursion • The process created when . . . – A procedure calls itself – Procedure A calls procedure B, which in turn calls procedure A • Using a graph in which each node is a procedure and each edge is a procedure call, recursion forms a cycle: A E B D C 58 Calculating a factorial This function calculates the factorial of integer n. A new value of n is saved in each stack frame: int factorial(int n) { if (n == 0) return 1; else return n*factorial(n-1); } recursive calls backing up 5! = 5 * 4! 5 * 24 = 120 4! = 4 * 3! 4 * 6 = 24 3! = 3 * 2! 3*2=6 2! = 2 * 1! 2*1=2 1! = 1 * 0! 1*1=1 0! = 1 1=1 factorial(5); (base case) 59 Calculating a factorial Factorial PROC push ebp mov ebp,esp mov eax,[ebp+8] cmp eax,0 ja L1 mov eax,1 jmp L2 L1:dec eax push eax call Factorial ReturnFact: mov ebx,[ebp+8] mul ebx L2:pop ebp ret 4 Factorial ENDP ; ; ; ; get n n > 0? yes: continue no: return 1 ; Factorial(n-1) ; get n ; edx:eax=eax*ebx ; return EAX ; clean up stack 60 Calculating a factorial ReturnFact: mov ebx,[ebp+8] mul ebx L2:pop ebp ret 4 Factorial ENDP ebp ret Factorial 0 … Factorial PROC push ebp mov ebp,esp mov eax,[ebp+8] cmp eax,0 ja L1 mov eax,1 jmp L2 L1:dec eax push eax call Factorial push 12 call Factorial ebp ret Factorial 11 ebp ret main 12 61 Related directives .MODEL directive • .MODEL directive specifies a program's memory model and model options (language-specifier). • Syntax: .MODEL memorymodel [,modeloptions] • memorymodel can be one of the following: – tiny, small, medium, compact, large, huge, or flat • modeloptions includes the language specifier: – procedure naming scheme – parameter passing conventions • .MODEL flat, STDCALL 63 Memory models • A program's memory model determines the number and sizes of code and data segments. • Real-address mode supports tiny, small, medium, compact, large, and huge models. • Protected mode supports only the flat model. Small model: code < 64 KB, data (including stack) < 64 KB. All offsets are 16 bits. Flat model: single segment for code and data, up to 4 GB. All offsets are 32 bits. 64 Language specifiers • STDCALL (used when calling Windows functions) – procedure arguments pushed on stack in reverse order (right to left) – called procedure cleans up the stack – _name@nn (for example, _AddTwo@8) • C – procedure arguments pushed on stack in reverse order (right to left) – calling program cleans up the stack (variable number of parameters such as printf) – _name (for example, _AddTwo) • PASCAL – arguments pushed in forward order (left to right) – called procedure cleans up the stack • BASIC, FORTRAN, SYSCALL 65 INVOKE directive • The INVOKE directive is a powerful replacement for Intel’s CALL instruction that lets you pass multiple arguments • Syntax: INVOKE procedureName [, argumentList] • ArgumentList is an optional comma-delimited list of procedure arguments • Arguments can be: – – – – immediate values and integer expressions variable names address and ADDR expressions register names 66 INVOKE examples .data byteVal BYTE 10 wordVal WORD 1000h .code ; direct operands: INVOKE Sub1,byteVal,wordVal ; address of variable: INVOKE Sub2,ADDR byteVal ; register name, integer expression: INVOKE Sub3,eax,(10 * 20) ; address expression (indirect operand): INVOKE Sub4,[ebx] 67 INVOKE example .data val1 DWORD 12345h val2 DWORD 23456h .code INVOKE AddTwo, val1, val2 push val1 push val2 call AddTwo 68 ADDR operator • Returns a near or far pointer to a variable, depending on which memory model your program uses: • Small model: returns 16-bit offset • Large model: returns 32-bit segment/offset • Flat model: returns 32-bit offset • Simple example: .data myWord WORD ? .code INVOKE mySub,ADDR myWord 69 ADDR example .data Array DWORD 20 DUP(?) .code ... INVOKE Swap, ADDR Array, ADDR [Array+4] push OFFSET Array+4 push OFFSET Array Call Swap 70 PROC directive • The PROC directive declares a procedure with an optional list of named parameters. • Syntax: label PROC [attributes] [USES] paramList •paramList is a list of parameters separated by commas. Each parameter has the following syntax: paramName:type type must either be one of the standard ASM types (BYTE, SBYTE, WORD, etc.), or it can be a pointer to one of these types. • Example: foo PROC C USES eax, param1:DWORD 71 PROC example • The AddTwo procedure receives two integers and returns their sum in EAX. • C++ programs typically return 32-bit integers from functions in EAX. AddTwo PROC, val1:DWORD, val2:DWORD mov eax,val1 add eax,val2 ret AddTwo ENDP AddTwo PROC, push ebp mov ebp, esp mov eax, dword ptr [ebp+8] add eax, dword ptr [ebp+0Ch] leave ret 8 AddTwo ENDP 72 PROC example Read_File PROC USES eax, ebx, pBuffer:PTR BYTE LOCAL fileHandle:DWORD mov esi, pBuffer mov fileHandle, eax . . ret Read_File ENDP Read_File PROC push ebp mov ebp, esp add esp, 0FFFFFFFCh push eax push ebx mov esi, dword ptr [ebp+8] mov dword ptr [ebp-4], eax . . pop ebx pop eax ret Read_File ENDP 73 PROTO directive • Creates a procedure prototype • Syntax: – label PROTO paramList • Every procedure called by the INVOKE directive must have a prototype • A complete procedure definition can also serve as its own prototype 74 PROTO directive • Standard configuration: PROTO appears at top of the program listing, INVOKE appears in the code segment, and the procedure implementation occurs later in the program: MySub PROTO ; procedure prototype .code INVOKE MySub ; procedure call MySub PROC . . MySub ENDP ; procedure implementation 75 PROTO example • Prototype for the ArraySum procedure, showing its parameter list: ArraySum PROTO, ptrArray:PTR DWORD, ; points to the array szArray:DWORD ; array size ArraySum PROC USES esi, ecx, ptrArray:PTR DWORD, ; points to the array szArray:DWORD ; array size 76 Multimodule programs Multimodule programs • A multimodule program is a program whose source code has been divided up into separate ASM files. • Each ASM file (module) is assembled into a separate OBJ file. • All OBJ files belonging to the same program are linked using the link utility into a single EXE file. – This process is called static linking 78 Advantages • Large programs are easier to write, maintain, and debug when divided into separate source code modules. • When changing a line of code, only its enclosing module needs to be assembled again. Linking assembled modules requires little time. • A module can be a container for logically related code and data • encapsulation: procedures and variables are automatically hidden in a module unless you declare them public 79 Creating a multimodule program • Here are some basic steps to follow when creating a multimodule program: – Create the main module – Create a separate source code module for each procedure or set of related procedures – Create an include file that contains procedure prototypes for external procedures (ones that are called between modules) – Use the INCLUDE directive to make your procedure prototypes available to each module 80 Multimodule programs • MySub PROC PRIVATE sub1 PROC PUBLIC • EXTERN sub1@0:PROC • PUBLIC count, SYM1 SYM1=10 .data count DWORD 0 • EXTERN name:type 81 INCLUDE file The sum.inc file contains prototypes for external functions that are not in the Irvine32 library: INCLUDE Irvine32.inc PromptForIntegers PROTO, ptrPrompt:PTR BYTE, ptrArray:PTR DWORD, arraySize:DWORD ; prompt string ; points to the array ; size of the array ArraySum PROTO, ptrArray:PTR DWORD, count:DWORD ; points to the array ; size of the array DisplaySum PROTO, ptrPrompt:PTR BYTE, theSum:DWORD ; prompt string ; sum of the array 82 Main.asm TITLE Integer Summation Program INCLUDE sum.inc .code main PROC call Clrscr INVOKE PromptForIntegers, ADDR prompt1, ADDR array, Count ... call Crlf INVOKE ExitProcess,0 main ENDP END main 83 Example: ArraySum Program • Let's review the ArraySum program from Chapter 5. Summation Program (main) Clrscr PromptForIntegers WriteString ReadInt ArraySum DisplaySum WriteString WriteInt Each of the four white rectangles will become a module. 85 AddTwo Procedure (1 of 2) • Recall the AddTwo Procedure AddTwo PROC, val1:DWORD, val2:DWORD mov eax,val1 add eax,val2 ret AddTwo ENDP 86 AddTwo Procedure (2 of 2) • MASM generates the following code when we assemble AddTwo (from the previous panel): AddTwo PROC, val1:DWORD, val2:DWORD push ebp mov ebp, esp mov add The LEAVE instruction is shorthand for: mov pop esp,ebp ebp eax,val1 eax,val2 leave ret 8 AddTwo ENDP p.229 87 Passing by value • When a procedure argument is passed by value, a copy of a 16-bit or 32-bit integer is pushed on the stack. Example: .data myData WORD 1000h .code main PROC INVOKE Sub1, myData MASM generates the following code: push myData call Sub1 88 Passing by reference • When an argument is passed by reference, its address is pushed on the stack. Example: .data myData WORD 1000h .code main PROC INVOKE Sub1, ADDR myData MASM generates the following code: push OFFSET myData call Sub1 89 Statistics for midterm • Average=52.82, Std. dev.=18.01 90 91 Stack Parameters • • • • • • • Register vs. Stack Parameters INVOKE Directive PROC Directive PROTO Directive Passing by Value or by Reference Parameter Classifications Example: Exchanging Two Integers • Trouble-Shooting Tips 92 Recursion • What is recursion? • Recursively Calculating a Sum • Calculating a Factorial 93 Recursively Calculating a Sum The CalcSum procedure recursively calculates the sum of an array of integers. Receives: ECX = count. Returns: EAX = sum CalcSum PROC cmp ecx,0 jz L2 add eax,ecx dec ecx call CalcSum L2: ret CalcSum ENDP Stack frame: ; ; ; ; ; check counter value quit if zero otherwise, add to sum decrement counter recursive call View the complete program 94 Inspect Individual Modules • • • • Main PromptForIntegers ArraySum DisplaySum Custom batch file for assembling and linking. 95 Sample Program output Enter a signed integer: -25 Enter a signed integer: 36 Enter a signed integer: 42 The sum of the integers is: +53 96 Stack Frames • • • • • Memory Models Language Specifiers Explicit Access to Stack Parameters Passing Arguments by Reference Creating Local Variables 97 Your turn . . . • Create a procedure named Difference that subtracts the first argument from the second one. Following is a sample call: push 14 push 30 call Difference Difference PROC push ebp mov ebp,esp mov eax,[ebp + 8] sub eax,[ebp + 12] pop ebp ret 8 Difference ENDP ; first argument ; second argument ; EAX = 16 ; second argument ; first argument 98 PROC examples Swap PROC, pValX:PTR DWORD, pValY:PTR DWORD . . . Swap ENDP ReadFile PROC, pBuffer:PTR BYTE LOCAL fileHandle:DWORD . . . ReadFile ENDP 99 Trouble-Shooting Tips • Save and restore registers when they are modified by a procedure. – Except a register that returns a function result • When using INVOKE, be careful to pass a pointer to the correct data type. • For example, MASM cannot distinguish between a DWORD argument and a PTR BYTE argument. • Do not pass an immediate value to a procedure that expects a reference parameter. • Dereferencing its address will likely cause a generalprotection fault. 100 Recursively calculating a sum The CalcSum procedure recursively calculates the sum of 1 to n. Receives: ECX = count. Returns: EAX = sum mov ecx, 5 mov eax, 0 call CalcSum CalcSum PROC cmp ecx,0 jz L2 add eax,ecx dec ecx call CalcSum L2: ret CalcSum ENDP ; ; ; ; ; check counter value quit if zero otherwise, add to sum decrement counter recursive call Stack frame: 101 PROC examples FillArray receives a pointer to an array of bytes, a single byte fill value that will be copied to each element of the array, and the size of the array. FillArray PROC, pArray:PTR BYTE, fillVal:BYTE arraySize:DWORD mov ecx,arraySize mov esi,pArray mov al,fillVal L1:mov [esi],al inc esi loop L1 ret FillArray ENDP 102 LOCAL directive Examples: LOCAL flagVals[20]:BYTE ; array of bytes LOCAL pArray:PTR WORD ; pointer to an array myProc PROC, p1:PTR WORD LOCAL t1:BYTE, t2:WORD, t3:DWORD, t4:PTR DWORD ; procedure ; parameter ; local variables 103 Program design using procedures • Top-Down Design (functional decomposition) involves the following: – design your program before starting to code – break large tasks into smaller ones – use a hierarchical structure based on procedure calls – test individual procedures separately 104 Integer summation program (1 of 4) Spec.: Write a program that prompts the user for multiple 32-bit integers, stores them in an array, calculates the sum of the array, and displays the sum on the screen. Main steps: • Prompt user for multiple integers • Calculate the sum of the array • Display the sum 105 Procedure design (2 of 4) Main Clrscr PromptForIntegers WriteString ReadInt ArraySum DisplaySum WriteString WriteInt ; clear screen ; display string ; input integer ; sum the integers ; display string ; display integer 106 Structure chart (3 of 4) Summation Program (main) Clrscr PromptForIntegers WriteString ReadInt ArraySum DisplaySum WriteString WriteInt 107 PromptForIntegers ;--------------------------------------------------PromptForIntegers PROC ; ; Prompts the user for an array of integers, and ; fills the array with the user's input. ; Receives: ESI points to the array, ; ECX = array size ; Returns: nothing ;-------------------------------------------------pushad ; save all registers mov cmp jle edx,OFFSET prompt1 ; address of the prompt ecx,0 ; array size <= 0? L2 ; yes: quit 108 PromptForIntegers L1: call call call mov add loop WriteString ReadInt Crlf [esi],eax esi,4 L1 ; ; ; ; ; display string read integer into EAX go to next output line store in array next integer L2: popad ; restore all registers ret PromptForIntegers ENDP 109 PromptForIntegers ;--------------------------------------------------DisplaySum PROC ; Displays the sum on the screen ; Receives: EAX = the sum ; Returns: nothing ;--------------------------------------------------push edx mov edx,OFFSET prompt2 ; display message call WriteString call WriteInt ; display EAX call Crlf pop edx ret DisplaySum ENDP 110 Code fragment IntegerCount = 3 ; array size .data prompt1 BYTE "Enter a signed integer: ",0 prompt2 BYTE "The sum of the integers is: ",0 array DWORD IntegerCount DUP(?) .code main PROC call Clrscr mov esi,OFFSET array mov ecx,IntegerCount call PromptForIntegers call ArraySum call DisplaySum exit main ENDP 111 Sample output (4 of 4) Enter a signed integer: 550 Enter a signed integer: -23 Enter a signed integer: -96 The sum of the integers is: +431 112 113 Offset 00001000 00000006 ESP 00000FFC 00000FF8 00000FF4 00000FF0 114 Example: reversing a string • Use a loop with indexed addressing • Push each character on the stack • Start at the beginning of the string, pop the stack in reverse order, insert each character into the string • Source code • Q: Why must each character be put in EAX before it is pushed? Because only word (16-bit) or doubleword (32-bit) values can be pushed on the stack. 115 Your turn . . . • Using the String Reverse program as a starting point, • #1: Modify the program so the user can input a string of up to 50 characters. • #2: Modify the program so it inputs a list of 32-bit integers from the user, and then displays the integers in reverse order. 116 Your Turn . . . • Write a program that does the following: – Assigns integer values to EAX, EBX, ECX, EDX, ESI, and EDI – Uses PUSHAD to push the general-purpose registers on the stack – Using a loop, the program pops each integer from the stack and displays it on the screen 117 Defining and using procedures • • • • • • • • • Creating Procedures Documenting Procedures Example: SumOf Procedure CALL and RET Instructions Nested Procedure Calls Local and Global Labels Procedure Parameters Flowchart Symbols USES Operator 118 Your turn . . . Draw a flowchart that expresses the following pseudocode: input exam grade from the user if( grade > 70 ) display "Pass" else display "Fail" endif 119 . . . (Solution) begin input exam grade yes no grade > 70? display "Pass" display "Fail" end 120 Your turn . . . • Modify the flowchart in the previous slide to allow the user to continue to input exam scores until a value of –1 is entered 121 When not to push a register The sum of the three registers is stored in EAX on line (3), but the POP instruction replaces it with the starting value of EAX on line (4): SumOf PROC ; sum of three integers push eax ; line 1 add eax,ebx ; line 2 add eax,ecx ; line 3 pop eax ; line 4 ret SumOf ENDP 122 Flowchart symbols • The following symbols are the basic building blocks of flowcharts: begin / end manual input process (task) procedure call display decision yes no 123 ArraySum Procedure begin Flowchart for the ArraySum Procedure push esi, ecx eax = 0 push esi push ecx mov eax,0 add eax,[esi] add esi, 4 cx = cx - 1 AS1: add eax,[esi] add esi,4 loop AS1 pop pop yes ecx esi CX > 0? no pop ecx, esi end 124 Overview • Stack Frames (a communication protocol between high-level-language procedures) • Stack Parameters (passing by value, passing by reference, memory model and language specifiers) • Local Variables (creating and initializing on the stack, scope and lifetime, LOCAL) • Recursion • Related directives: INVOKE, PROC, PROTO • Creating Multimodule Programs 125 WriteStackFrame Procedure • Displays contents of current stack frame WriteStackFrame PROTO, numParam:DWORD, ; # of passed parameters numLocalVal: DWORD, ; # of DWordLocal ; variables numSavedReg: DWORD ; # of saved registers 126 WriteStackFrame Example main PROC mov eax, 0EAEAEAEAh mov ebx, 0EBEBEBEBh INVOKE aProc, 1111h, 2222h exit main ENDP aProc PROC USES eax ebx, x: DWORD, y: DWORD LOCAL a:DWORD, b:DWORD PARAMS = 2 LOCALS = 2 SAVED_REGS = 2 mov a,0AAAAh mov b,0BBBBh INVOKE WriteStackFrame, PARAMS, LOCALS, SAVED_REGS 127 WriteStackFrame Example Stack Frame 00002222 ebp+12 (parameters) 00001111 ebp+8 (parameters) 00401083 ebp+4 (return address) 0012FFF0 ebp+0 (saved ebp) ebp 0000AAAA ebp-4 (local variable) 0000BBBB ebp-8 (local variable) EAEAEAEA ebp-12 (saved register) EBEBEBEB ebp-16 (saved register) esp 128 Parameter classifications • An input parameter is data passed by a calling program to a procedure. – The called procedure is not expected to modify the corresponding parameter variable, and even if it does, the modification is confined to the procedure itself. • An output parameter is created by passing a pointer to a variable when a procedure is called. • The procedure does not use any existing data from the variable, but it fills in a new value before it returns. • An input-output parameter represents a value passed as input to a procedure, which the procedure may modify. • The same parameter is then able to return the changed data to the calling program. 129 Example: exchanging two integers The Swap procedure exchanges the values of two 32-bit integers. pValX and pValY do not change values, but the integers they point to are modified. Swap PROC USES eax esi edi, pValX:PTR DWORD, ; pointer to first integer pValY:PTR DWORD ; pointer to second integer mov esi,pValX ; get pointers mov edi,pValY mov eax,[esi] ; get first integer xchg eax,[edi] ; exchange with second mov [esi],eax ; replace first integer ret ; MASM changes it to ret 8 due to PROC Swap ENDP 130