COM181 Computer Hardware Lecture 5: MIPs Assembly language "Adapted from Computer Organization and Design, 4th Edition, Patterson & Hennessy, © 2008.” This material may not be copied or distributed for commercial purposes without express written permission of the copyright holders. Also drawn from the work of Mary Jane Irwin ( www.cse.psu.edu/~mji ) and Jie Xu at (http://people.cs.pitt.edu/~xujie/index.html) 08/08/12 www.eej.ulster.ac.uk/~ian/modules/COM181/files Types of Instructions • There are 3 main types of assembly instructions – Arithmetic - add, sub, mul, shifts, and, or, etc. – Load/store – Conditional - branches Arithmetic Instructions add a, b, c add a, a, d add a, a, e a = b+c a = d+a = d+b+c a = e+a = e+d+b+c Example: Translate the following instructions to assembly code a = b+c d = a-e Solution: add a, b, c sub d, a, e Arithmetic Instructions Example: Translate the following instructions to assembly code. Remember with RISC, only 1 operation per instruction! HINT - you may need temporary variables f = (g+h) - (i+j) Solution: add t0, g, h add t1, i, j sub f, t0, t1 Operands • In assembly code, you can’t use variables such as a, b, c, etc • In RISC instruction sets, operands must be registers such as r1, r2, r3, etc – r0 is typically reserved to hold the immediate value of 0 – There is a limited number of registers • MIPS has 32 Arithmetic Instructions Using Registers Example: Translate the following instructions to assembly code. Assume that g, h, i, and j are already stored in r1, r2, r3, and r4. Store f in r5 f = (g+h) - (i+j) Solution: add r6, r1, r2 add r7, r3, r4 sub r5, r6, r7 What about more data?? • With only a limited number of registers, not all data can be stored in registers at the same time. – Registers only store data that is currently being operated on • Variables are stored in memory and then loaded into registers when needed using data transfer instructions – Load word (lw) and store word (sw) Load and store word • Load word format – lw destination register, memory location • Store word format – sw source register, memory location • Memory location format – Offset(base address) • Base address = starting location of data in memory • Offset = how far away is location to access from base address – Values are added together LW Example Example: Assume that A is an array of 100 words. Assume the base address is stored in r3, g is in r1 and h is in r2 g = h + A[8] Solution: Offset lw r4, 8(r3) add r1, r2, r4 Base Address This is simplified, more details later… Data in Memory • All variables/data are stored in memory – You will need to do this in your assembler – Your ISS will need a data structure to hold main memory • Array is fine Addressing Data • Architecture usually addresses data in bytes (byte addressable) – 32-bit architecture = 4 bytes = 1 word • lw/sw load/store 1 word or 4 bytes – Thus, data/inst addresses are multiples of 4 • Data is word aligned to be more efficient Data in Memory . . . . . . 12 8 4 0 100 10 101 1 Address Data LW/SW Example Example: Assume that A is an array of 100 words. Assume the base address is stored in r3 and h is stored in r2. You may directly calculate the offset. Remember, each data piece is 4 bytes when calculating the offset A[12] = h+A[8] Solution: lw r1, 32(r3) add r4, r2, r1 sw r4, 48(r3) LW/SW Example Example: Assume that A is an array of 100 words. Assume the base address is stored in r3 and g, h, and i are in r1, r2, and r4 respectively. Calculate the offset using assembly instructions but don’t use multiplication yet (mult instruction is different) g = h + A[i] Solution: add r5, r4, r4 add r5, r5, r5 add r5, r5, r3 lw r6, 0(r5) add r1, r6, r2 # Temp reg r5=2*i # Temp reg r5=4*i # t1 = addr of A[i] (4*i+r3) # Temp reg r0=a[i] # g=h+a[i] Translating MIPS Assm Language to Machine Language • Translate human readable assembly code to machine readable code (binary) – I will show examples in decimal for readability – This is what you assembler will do but it will output in binary. MIPS -> Machine Language Example: Show the real MIPS language version of the following instruction in both decimal and binary add r0, r1, r2 Solution: decimal 0 0 1 2 0 32 00010 00000 100000 5 bits 6 bits binary 000000 00000 00001 6 bits 5 bits 5 bits 5 bits Each segment is referred to as a field. Details to come…. MIPS Fields • MIPS fields are giving names to make them easier to discuss op rs rt 6 bits 5 bits 5 bits rd 5 bits shamt funct 5 bits 6 bits • op: Basic operation of the instruction, typically called the opcode • rs: The first register source operand • rt: The second register source operand • rd: The register destination operand, it gets the result of the operation • shamt: Shift amount (0 if not shift instruction) • funct: Function. This field selects the specific variant of the operation in the op field, and is sometimes called the function code MIPS Fields • Problem occurs with an instruction needs a longer field than that showed on the previous slide – I.e. LW must specify 2 registers and a constant. Limited to 5-bit constant if use previous format. • Solution: There are different formats for different types of instructions – Previous slide is R-type (R-format): • R=register MIPS Fields op rs rt 6 bits 5 bits 5 bits address 16 bits • I-type (I-format) – I=immediate – Now LW can specify an address up to 16bits • Opcode determines the format MIPS Instruction Encoding MIPS Asm -> Machine Language Example: Assume r1 is the base of A and r2 corresponds to h, the C statement: A[300] = h + A[300] is compiled to: lw r0, 1200(r1) add r0, r2, r0 sw r0, 1200(r1) What is the MIPS machine code for these three instructions? (Use figure 3.5) MIPS Asm -> Machine lw r0, 1200(r1) add r0, r2, r0 Language sw r0, 1200(r1) Solution: decimal op rs rt 35 0 1 0 0 2 43 0 1 rd Address /shamt funct 1200 0 0 32 1200 binary 100011 00000 00001 000000 00000 00010 101011 00000 00001 0000 0100 1011 0000 00000 00000 0000 0100 1011 0000 32 Decision Instructions • Branch/jump instructions – Conditional branches • beq register1, register2, Label • bne register1, register2, Label – Unconditional branches • j Label Decision Instructions Example: Assume f->r0, g->r1, h->r2, i->r3, j->r4 L1: if ( i==j ) goto L1 f = g+h f = f-i Solution: L1: beq r3, r4, L1 add r0, r1, r2 sub r0, r0, r3 Labels will need to be translated to instruction address in your assembler Decision Instructions Example: Assume f->r0, g->r1, h->r2, i->r3, j->r4 L1: if ( i==j ) f = g+h else f = g-h L2: Solution: L1: L2: bne r3, r4, L1 add r0, r1, r2 j L2 sub r0, r1, r2 Decision Instructions Example: A is 100 elements with the base address in r5. g->r1, h->r2, i->r3, j->r4 Loop: g = g+A[i] i = i+j if ( i!=h ) goto Loop Solution: Loop: add r6, r3, r3 add r6, r6, r6 add r6, r6, r5 lw r7, 0(r6) add r1, r1, r7 add r3, r3, r4 bne r3, r2, Loop While Loop • Goto statements are bad, just used them as an example. • You will want to use while loops – Or for loops but I am just showing you while loops While Loop Example: Base address of save is in r6. i->r3, j->r4, k->r5 while ( save[i] == k ) i = i+j Solution: Loop: Exit: add r1, r3, r4 add r1, r1, r1 add r1, r1, r6 lw r0, 0(r1) bne r0, r5, Exit add r3, r3, r4 j Loop Other Styles of MIPS Addressing • Constant or immediate operands – Programs often use constant values – I.e. incrementing to the next data element while scanning an array • addi instruction - adds an immediate value to a register Immediate Operands Example: What is the machine code for the following? (Remember the I-format instruction) addi r4, r4, 4 Solution: decimal op 8 rs 4 rt 4 Immediate 4 binary 001000 00100 00100 0000 0000 0000 0100 Addressing in Branches and Jumps • Last instruction format - J-type (Jformat) opcode Target address • Branches do not use J-type. – Must specify 2 registers to compare – Use I-type Implementing Conditional Statements We're going to translate some easy conditional statements. if ( i == j ) i++ ; j-- ; Translating conditional statements is interesting. In C, for example, when the condition is true, you execute the body. This is the fall-through case. That is, you execute the next statement. When the condition is false, you don't execute the body, you jump over it. This is the jump case. Therefore, you jump when the condition is false. In ISA programming, you jump when the condition is true. Thus, we often need to negate the condition.Here‘s the translation of the above if-statement, assuming $r1 stores i and $r2 stores j. bne $r1, $r2, L1 # branch if ! ( i == j ) addi $r1, $r1, 1 # i++ L1: addi $r2, $r2, -1 # j– The label L1 has the same address as the instruction immediately following the colon. Thus, the above code is the same as: bne $r1, $r2, L1 # branch if ! ( i == j ) addi $r1, $r1, 1 # i++ L1: addi $r2, $r2, -1 # j-- Even though it appears that label L1 has an empty instruction, it doesn't. It is still associated with the second addi instruction. Taken from http://www.cs.pitt.edu/~xujie/cs447/AccessingArray.htm Translating if-else if ( i == j ) i++ ; else j-- ; j += i ; Let's think about what happens. As before, if the condition is false, we want to jump. This time, we want to jump to the else. Thus, we write the code like: ELSE: L1: bne $r1, $r2, ELSE addi $r1, $r1, 1 j L1 addi $r2, $r2, -1 add $r2, $r2, $r1 # # # # # branch if ! ( i == j ) i++ jump over else j– j += i Taken from http://www.cs.pitt.edu/~xujie/cs447/AccessingArray.htm Translating if-else with && Translating && is interesting because there's short-circuiting involved. To see how this works, let's translate: if ( i == j && i == k ) i++ ; // if-body else j-- ; // else-body j = i + k ; let <cond1> stand for i == j and <cond2> stand for i == k. if ( <cond1> && <cond2> ) i++ ; // if-body else j-- ; // else-body j = i + k ; Short-circuiting occurs when <cond1> evaluates to false. The control-flow then jumps over <cond2> (that is, <cond2> is not evaluated), and continues executing in the else-body.If <cond1> evaluates to true, we want to fall-through and check <cond2>. If <cond2> evaluates false, we again jump, this time over the ifbody, and to the else-body.If <cond2> is true, we fall-through to the if-body. Notice that we jump when the condition evaluates to false for both cases, so we'll be interested in jumping on negations of conditions. Taken from http://www.cs.pitt.edu/~xujie/cs447/AccessingArray.htm Here's the translated code, assuming $r3 stores k. bne $r1, $r2, ELSE # cond1: branch if ! ( i == j ) bne $r1, $r3, ELSE # cond2: branch if ! ( i == k ) addi $r1, $r1, 1 # if-body: i++ j L1 # jump over else ELSE: addi $r2, $r2, -1 # else-body: j– L1: add $r2, $r1, $r3 # j = i + k From the C Code if ( i == j && i == k i++ ; // if-body else j-- ; // else-body j = i + k ; ) Taken from http://www.cs.pitt.edu/~xujie/cs447/AccessingArray.htm Translating if-else with || if ( i == j || i == k ) i++ ; // if-body else j-- ; // else-body j=i+k; Again, let's use <cond1> to stand for i == j and <cond2> to stand for i == k. if ( <cond1> || <cond2> ) i++ ; // if-body else j-- ; // else-body j=i+k; Short-circuiting occurs when <cond1> evaluates to true. That is, we want to jump over checking the second condition and into the if-body. Notice that we go to the ifbody when the condition evaluates to true. If <cond1> is false, we want to fallthrough and check <cond2>. If <cond2> is false, we now jump to the else-body. If <cond2> is true, we fall-through to the if-body. Notice that we jump when <cond1> evaluates to true (to the if-body) and when <cond2> evaluates to false (to the elsebody). Taken from http://www.cs.pitt.edu/~xujie/cs447/AccessingArray.htm Translating if-else with || /cont if ( i == i++ ; // else j-- ; // j = i + k j || i == k ) if-body else-body ; Here's the translated code: beq $r1, $r2, IF bne $r1, $r3, ELSE IF: addi $r1, $r1, 1 j L1 ELSE: addi $r2, $r2, -1 L1: add $r2, $r1, $r3 # # # # # # cond1: branch if (i==j) cond2: branch if !(i==k) if-body: i++ jump over else else-body: j– j = i + k Taken from http://www.cs.pitt.edu/~xujie/cs447/AccessingArray.htm switch statements Switch statements are interesting. Note switch works on a very limited number of types (int and char, primarily). It doesn't work on strings (even if students wish they did). Switch evaluates case-by-case. When one condition fails, the next is checked. When a condition is true, the code associated with that condition is run. However, if you don't put break the code for the next condition will also run. Unfortunately, most people expect a break to occur when the condition is done. They don't expect a fallthrough case. Here's an example of a switch. switch( i ) { case 1: i++ ; // falls through case 2: i += 2 ; break; case 3: i += 3 ; } Taken from http://www.cs.pitt.edu/~xujie/cs447/AccessingArray.htm Here's the translated code addi $r4, $r0, 1 bne $r1, $r4, C2_COND j C1_BODY C2_COND: addi $r4, $r0, 2 bne $r1, $r4, C3_COND C2_BODY C3_COND: addi $r4, $r0, 3 bne $r1, $r4, EXIT j C3_BODY C1_BODY: addi $r1, $r1, 1 C2_BODY: addi $r1, $r1, 2 j EXIT C3_BODY: addi $r1, $r1, 3 EXIT: # # # # switch( i ) { case 1: i++ ; // falls through case 2: i += 2 ; break; case 3: i += 3 ; } set temp to 1 case 1 false: branch to case 2 cond case 1 true: branch to case 1 set temp to 2 # case 2 false: branch to case 2 cond j # case 2 true: branch to case 2 body # # # # # # # set temp to 3 case 3 false: branch to exit case 3 true: branch to case 3 body case 1 body: i++ case 2 body: i += 2 break case 3 body: i += 3 When case 1 is true, you want to run i++, then i += 2. You don't want to test for case 2, because that's now how the semantics of switch works. bge, bgt, blt, ble bge, bgt, blt, ble are all pseudo-instructions. That is, there is no corresponding machine code to these instructions. Let's see an example of how the assembler might translate bge. The key is to use slt which means "set on less than". Here is the syntax of slt. slt $r1, $r2, $r3 # R[1] = R[2] < R[3] ? 1 : 0 The semantics are shown in the comments: if R[2] < R[3] false, we now jump to the elsebodythen R[1] = 1, otherwise it's assigned to 0. Here is the syntax and semantics of bge: bge $r1, $r2, LABEL # jump to LABEL if R[1] >= R[2] If R[1] >= R[2] we know that this is equivalent to !( R[1] < R[2]). Thus, if we check R[1] < R[2] using slt, we expect it to be false. Here's the translation of bge. slt $r3, $r1, $r2 # check if R[1] < R[2] beq $r3, $r0, LABEL # branch if previous condition is false As an exercise, you should translate the other three pseudo-instructions Implementing Loops Unlike conditional statements, which have assembly instructions that support them more-orless directly (i.e., beq), loops do not have similar support.To translate loops, it's easier to convert the loops to if-statements with goto statements, prior to translation. Doing so also gives you insight into how a loop really behaves. We'll translate a while-loop, then a for-loop. You can do the do-while loop as an exercise (and you should!). Here's a generic while-loop: while ( <cond> ) { <while-body> } This is how it's translated to an if-statement with goto's. L1: if ( <cond> ) { <while-body> goto L1 ; } Taken from http://www.cs.pitt.edu/~xujie/cs447/AccessingArray.htm Here's an example with real code. while ( i < j ) { k++ ; i = i * 2 ; } Then translate it to if-statements with goto's. L1: if ( i < j ) { k++ ; i = i * 2 ; goto L1 ; } This should be easy to convert to MIPS. Assume $r1 stores i, $r2 stores j, and $r3 stores k. L1: bge $r1, $r2, EXIT addi $r3, $r3, 1 add $r1, $r1, $r1 j L1 # branch if ! ( i < k ) # k++ # i = i * 2 # jump back to top of loop EXIT: Taken from http://www.cs.pitt.edu/~xujie/cs447/AccessingArray.htm Translating for-loops To translate a for-loop, we'll only go part way, and translate it to if-statements and goto's. You can do the rest on your own. Here's a generic for-loop: for ( <init> ; <cond> ; <update> ) { <for-body> } This is how it's translated to an if-statement with goto's. <init> L1: if ( <cond> ) { <for-body> UPDATE: <update> // should affect condition! goto L1 ; } EXIT: There's only one special case. Suppose the <for-body> has a continue statement. Then, you need to jump to the <update> code, which is at the UPDATE label. break should still work fine, because you can jump to the EXIT label. Taken from http://www.cs.pitt.edu/~xujie/cs447/AccessingArray.htm Accessing Array Data in MIPS Since arrays can store LOTS of data, and since we have only a small (~32) number of registers, it is infeasible to use the registers for long-term storage of the array data. Hence, arrays are stored in the Data Segment of a MIPS program. Fundamentally, there are three operations which one can perform on an array: –Getting the data from an array cell, e.g, x = list[i]; –Storing data into an array cell, e.g. list[i] = x; –Determining the length of an array, i.e. list.length. To access the data in the array requires that we know the address of the data and then use the load word (lw) or store word (sw) instructions. Words (which is how integers are stored) in MIPS take up 32 bits or 4 bytes. Therefore, if we have a declaration in the .data segment such as: list: .word 3, 0, 1, 2, 6, -2, 4, 7, 3, 7 the address that is loaded by the instruction la $t3, list is the address of the first '3' in the list. The address of the '0' is 4 greater than that number, and the address of the '6' is 16 greater than that number. Taken from http://www.cs.pitt.edu/~xujie/cs447/AccessingArray.htm Accessing Array Data in MIPS The following snippet of code will place the value of list[6] into the $t4: la $t3, list li $t2, 6 add $t2, $t2, $t2 add $t2, $t2, $t2 add $t1, $t2, $t3 lw $t4, 0($t1) # # # # # # put address of list into $t3 put the index into $t2 double the index double the index again (now 4x) combine both parts of the address get the value from the array cell If we wish to assign to the contents of $t4 to list[6] instead, the last line would simply be: sw $t4, 0($t1) # store value in the array cell Taken from http://www.cs.pitt.edu/~xujie/cs447/AccessingArray.htm Some notes on arrays Most assembly languages, like MIPS, do not have built-in capabilities for sophisticated data structures. Even the most commonly available structure, the array, is not available in MIPS. As we shall see shortly, we can set aside a block of memory in MIPS in assembly language and treat it similar to an array. What is an array anyway? One way to think about an array is a set of values that can simultaneously be considered a single collective entity and many individual elements. In high level languages, the array is a set of references using an identifier like any other variable. To access each individual element, one uses an integer to specify a particular element. In most high-level languages, arrays are stored as a contiguous block of n memory cells starting with a base address. In java, we might declare and use an array as follows. int a[] = new int[10]; int sum = 0; ... for (int i = 0; i < 10; i++) sum += a[i]; In this example, the elements in the array are added and the result is stored in the variable sum. Note a is the identifier that refers to the array and [i] refers to the ith element of the array. Here a is the base location of the array and i indicates the offset into memory from the base address. From http://zeta.albion.edu/~dreimann/Spring2012/courses/cs354/projects/primes.php In MIPS, there is no formal array construct. The first issue to resolve is the association of a block of memory with a particular identifier. This can be done using a label and a .space directive in the .data section. For example .data a: .space 40 reserves 40 bytes (10 words) of memory of data associated with the label a. The memory location where this label is stored (by the assembler) becomes the base address of the array. To access the ith element from the array, we need to determine the memory offset from the beginning address and add the number of bytes per element. For simplicity, let's assume the array stores elements that each require on word of storage and $t0 is the register that represents i. Then la $t1, a muli $t2, $t0, 4 # or use two adds... add $t2, $t2, $t1 lw $t3, 0($t2) will load the ith element from the array into register $t3. Here $t1 contains the base address of the array and $t2 is the address of the ith element of the array. From http://zeta.albion.edu/~dreimann/Spring2012/courses/cs354/projects/primes.php While this will work, it is not the only way to access the elements of the array. Consider storing the integers 0-9 in the first ten elements of an array then reading the elements and adding them together. The program sum10.s shown below illustrates one way to do this. .text main: la $t4, n lw $t4, 0($t4) and $t0, $0, $0 la $t1, a loop1: sll $t2, $t0, 2 add $t2, $t2, $t1 sw $t0, 0($t2) addi $t0, $t0, 1 slt $t5, $t0, $t4 bne $t5, $0, loop1 # # # # # # # # # # # Fill the array address of n t4 = n i = 0 address of a byte offset of ith element address of a[i] put i into a[i] increment i is $t0 < $t4 ? branch if so #/CONT (note sll is the same as multiplying by 2) From http://zeta.albion.edu/~dreimann/Spring2012/courses/cs354/projects/primes.php # Sum the array values and $s0, $0, $0 and $t0, $0, $0 add $t2, $t1, $0 loop2: lw $t3, 0($t2) add $s0, $s0, $t3 addi $t0, $t0, 1 addi $t2, $t2, 4 slt $t5, $t0, $t4 bne $t5, $0, loop2 # # # # # # # # # sum = 0 i = 0 address of a[i] load a[i] increment sum increment i increment address of a[i] is $t0 < $t4 ? branch if so # Output Sum li $v0, 1 # Load 1=print_int into $v0 add $a0, $s0, $zero # Load first number into $a0 syscall # Output prompt via syscall li $v0, 10 syscall # exit .data n: a: .word 10 # n = 10 .align 4 .space 40 # Allocate 10 words (40 bytes) From http://zeta.albion.edu/~dreimann/Spring2012/courses/cs354/projects/primes.php