Machine Instructions/Assembly Language

advertisement
Machine Instructions
Operations
1
ITCS 3181 Logic and Computer Systems 2015 B. Wilkinson Slides4-1.ppt
Modification date: March 18, 2015
Instructions
We will use a simple instruction formats of a so-called reduced
instruction set computer (RISC), which has the characteristics:
• Simple, fixed length instruction format
• A few addressing modes
• Limited number of operations
• Designed to achieve high speed of execution
It was recognized in the 1980’s that such processors would
actually execute programs faster that the prevalent CISCs
(complex instruction set computers).
2
Instruction format
Our instruction format is quite similar to the G4/G5 PowerPC
and SUN Sparc processors and also similar but not identical to
that used in the assembly language simulator in the labs.
Intel 64/IA32 processor family -- uses a very complex and
archaic instruction format, based upon the early 8086
processor (which was themselves loosely based upon even
earlier Intel processors).
However Intel processors now convert this complex instruction
format (CISC) internally to simpler RISC formats for
performance reasons.
3
Some Key Features of Processor Used
1.Thirty-two 32-bit integer registers called R0 to R31
2. One register, R0, holds zero permanently
R31
R30
R29
R28
3 All arithmetic done only between registers:
- Three-register format
op-code, destination register, source register 1, source register 2
or
- Immediate addressing
op-code destination register, source register, constant
4. Memory operations limited to:
load register, and
R4
R3
R2
R1
R0
0000 … 0000
R31 holds return
address for
procedures.
R29 is a stack
pointer see later.
store register
using register indirect addressing plus offset only:
LD/LB destination register, offset[source register]
ST/SB source register, offset[source register]
4
Data Transfer
Instructions that copy the contents of one location to another
location.
Examples
Instruction
MOV R1,R2
LD R3,100[R2]
ST [R5],R4
Comments
;R1 = R2
;Contents of memory whose address is
;given by 100 + R2 copied to R3.
;Contents of R4 copied to memory loc.
;whose address held in R5. (Offset = 0)
Note LD and ST cause 32-bit transfers.
Use LB and SB to cause 8-bit transfers.
5
Arithmetic Instructions
Performs an arithmetic operation such as addition, subtraction,
multiplication or division.
Examples
ADD R1,R2,R3
SUB R5,R4,3
;R1 = R2 + R3
;R5 = R4 – 3
For literals (immediate addressing) differences in assembly language
notation. Might be written as:
SUB R5,R4,#3
;R5 = R4 - 3
SBI R5,R4,3
;R5 = R4 - 3
or
depending upon assembly language.
6
Logical Instructions
Performs bit-wise logical operation such as AND, OR, exclusiveOR, or NOT. AND, OR, exclusive-OR operate upon pairs of bits of
two operands.
Bit-wise AND, OR, Ex-OR, and NOT are available in C and Java
(although you probably did not come across them!):
C/Java Language Examples
y=y&1
z=z|2
;bit-wise AND y with the number 1
;bit-wise OR z with the number 2
7
Machine Instruction Examples
AND R1,R2,R3
if
then
R2 = 10100010100101011000010100001111
R3 = 01011011111010100010100010111010
R1 = 00000010100000000000000000001010
OR R1,R2,R3
if
then
;R1 = R2 “AND” R3
;R1 = R2 “OR” R3
R2 = 10100010100101011000010100001111
R3 = 01011011111010100010100010111010
R1 = 11111011111111111010110110111111
8
Shift Instructions
Moves the bits of a location one or more places left or right.
Again available in C/Java (although you probably did not come across
them!):
C/Java language Examples
y = y << 1
;shift y 1 place left 1
z = z >> 2
;shift z 2 places right
9
Machine Instruction Examples
Examples
SHL R1,R1,1
X = 0 or 1 see next slide
;Shift R1 left one place
1 0 1 0 1 1 1 1 0 1 0 1 ... 1 1 0 1
0 1 0 1 1 1 1 0 1 0 1 ... 1 1 0 1 X
SHR R1,R1,1 ;Shift R1 right one place
1 0 1 0 1 1 1 1 0 1 0 1 ... 1 1 0 1
X 1 0 1 0 1 1 1 1 0 1 0 1 ... 1 1 0
10
Arithmetical and Logical Shifts
Two types of shift usually provided:
“Logical” shift (SHL, SHR)
Fill free end with 0, i.e. X = 0.
“Arithmetic” shift (SAL, SAR)
Arithmetic shifts multiple/divide by 2.
Arithmetic shift right maintains value of sign bit, i.e. X = value of
original sign bit.
11
Example
Starting with a number 9
000 ... 0001001
Shift arithmetic left one place. Get 18
000 ... 0010010
Shift arithmetic right two places. Get 4
i.e. lost the 0.5.
000 ... 0000100
12
Note: Java has logical shift right - called unsigned right
shift, >>>.
Example
x = x >>> 2;
13
Question
What is the difference, if any, between arithmetic shift left and
logical shift left, i.e. what is the difference, if any, between:
SHL R1, R1, 1
and
SAL R1, R1, 1
Answer
Arithmetic shift left same as logical shift left (except
arithmetic overflow may be detected).
14
Question
What is the effect of the sequence?
SAR R1, R1, 1
SAL R1, R1, 1
Answer
Makes number even if odd, i.e. 5 would become 4
15
Rotate Instructions
Moves bits of location one or more places left of right in a circular
fashion.
Examples
ROL R1,R1,1 ;Rotate R1 left one place
1 0 1 0 1 1 1 1 0 1 0 1 ... 1 1 0 1
0 1 0 1 1 1 1 0 1 0 1 ... 1 1 0 1 1
ROR R1,R1,1 ;Rotate R1 right one place
1 0 1 0 1 1 1 1 0 1 0 1 ... 1 1 0 1
Version of
rotate exists
where the
rotate
passes thro a
Carry flag
within the
condition
code
register, see
later about
the CCR
1 1 0 1 0 1 1 1 1 0 1 0 1 ... 1 1 0
16
Control Flow
Compilers must translate statements such as:
if ((x != y) && (z < 0)) {
a = b + 5;
b = b + 1;
}
into machine instructions.
Unreasonable to try to provide a unique machine instruction for this
IF statement because of the vast number of possible IF statements.
Need to extract essential primitive operations for machine
instructions.
17
Decompose into simple IF statements of the form:
if (x relation y) goto L1;
where:
relation is any of usual relations allowed in high level languages
(<, >, >=, <=, ==, !=)
L1 is a label prefixed to an instruction to identify it.
18
i.e. translate
if ((x != y) && (z < 0)) {
a = b + 5;
b = b + 1;
}
...
into
Label
L1:
if (x == y) goto L1;
if (z >= 0) goto L1;
a = b + 5;
b = b + 1;
...
19
Several ways of implementing the IF statement.
if (x relation y) goto L1;
Here we will consider two ways:
1. Using one branch instruction
Used in our design and lab.
2 Using two instructions, one to determine whether relationship
is true, and another to branch to L1 if true.
Used by Intel and based upon a very old instruction set, so we
have to know about it.
20
1. Using one branch instruction
Single “branch” machine instruction compares two registers and
goes to the labeled location if the condition is true, i.e.
Bcond Rs1, Rs2, L1
changes the execution sequence to the instruction labeled L1 if
Rs1 cond Rs2 is true, where cond can be any of six possible
conditions, and Rs1 and Rs2 can be any of the 32 registers.
21
Conditional Branch Instruction op-codes
Bcond
Bcond
BL
BG
BGE
BLE
BE
BNE
Condition
High level language notation
Branch if less than
<
Branch if greater than
>
Branch if greater or equal to
>=
Branch if less or equal to
<=
Branch if equal
==
Branch if not equal
!=
22
Example
To implement
if (x == y) goto L1;
.
.
.
L1:
by a single machine instruction, we get:
BE R1,R2,L1
.
.
.
L1:
where the compiler allocates R1 for x and R2 for y.
23
Machine Instruction Encoding
The instruction
Bcond Rs1,Rs2,L1
requires an op-code (Bcond), the two source registers Rs1 and Rs2,
and L1 to be specified.
Op-code
Bcond
Rs1
Rs2
L1
Note: Bcond is either BL, BG, BGE, BLE, or BNE
24
Specifying “target” location L1
Several ways L1 could be specified in instruction:
(a) Absolute (direct) addressing
Address of L1 held in instruction.
(b) (PC) Relative addressing Distance from branch instruction to
labeled instruction stored in instruction.
Program counter, PC, holds the location of the instruction to be fetched from
memory next, see later.
25
(a) Absolute (direct) addressing
BE R1,R2,120
.
.
.
L1:
;location 120 say
Op-code
Bcond
Rs1
Rs2
120
Note: Absolute addressing will not used in our design.
26
(b) (Program counter) Relative Addressing
Specify target location as number of instructions from branch
instruction.
Reasoning:
• Mostly, conditional branch instructions used to implement small
changes in sequences or program loops of relatively short length,
so distance from branch to target (label) quite small compared to
full address of the labeled instruction.
• Also good programming practice to limit sequence changes to
short distance from current location to avoid difficult to read code.
• Also makes code relocatable. (i.e. code can be loaded anywhere in
memory without altering branch instructions.)
27
PC-Relative Addressing
The number of locations to the target is held in the instruction as
an offset.
Branch op-code
Offset
Offset is added to the program counter to obtain the “effective
address” of the target location.
28
(PC) Relative Addressing
Offset/displacement
BE R1,R2,+30 ; location 90 say
.
.
.
30
L1:
; location 120 (90 + 30)
Op-code
Bcond
Rs1
Rs2
We will deal with how to implement this later. Note: PC incremented by the size of
instruction, 4 here, after instruction fetched from memory in most implementations.
30
29
Question
How would one code:
if (x > y) x = 10;
with machine instructions where x and y are stored in R2 and R3
respectively?
Answer
BG R2, R3, L1
MOV R2, 10
L1:
30
2. Using two instructions, one to determine whether
relationship is true, and another to branch to L1 if true.
In this approach, we determine whether the Boolean condition in
if (x relation y) goto L1;
is true by subtracting y from x and recognizing whether the result is
positive or negative, zero, or not zero:
relation
<
>
>=
<=
==
!=
x-y
negative
positive and not zero
positive or zero
negative or zero
zero
not zero
31
Condition Code Register (CCR)
Contains a set of flags (single bits) used to be able to recognize the
different possible relationships (<, >, >=, <=, ==, !=) after the
previous arithmetic operation.
Flags in condition code register indicate a particular aspect of the
result of the last arithmetic instruction, i.e. positive or negative,
zero or not zero, …
32
Sign Flag (S or N flag)
(positive or negative)
Indicates whether previous arithmetic result is negative or positive.
S = 1 for negative result
S = 0 for positive result.
S is actually the most significant (sign) bit of the result of the
previous arithmetic operation
33
Zero Flag
Zero flag, Z:
Z = 1 for result = 0
Z = 0 for result != 0
(Note zero is a positive number.)
34
Condition Code Register
Condition Code register normally closely linked to the ALU
(Arithmetic and Logic Unit):
Processor
Internal buses
S – Sign flag
Z – Zero flag
Source operands
ALU
Answer
Condition code
register
S
Z
Sample allocation of bits
(There are other bits used not yet described)
35
Using Condition Code Register
Decompose IF statement such as:
if (x relation y) goto L1
into two sequential operations:
1. Subtract y from x which sets condition code register according
to result
2. Read condition code register and “branch” to L1 if specific
condition indicated
36
Step 1 Subtract and Set Condition Code Register
All arithmetic instructions set condition code register according to
the result of the arithmetic operation, but a compare instruction
specifically provided, similar to a subtract instruction except result is
not stored anywhere, only the CCR flags set according to the result.
Example
CMP R1, R2
;R1 - R2, sets CCR flags
37
Step 2 Reading Condition Code Register and
Branching
A conditional branch instruction used to examine the values stored in
the condition code register to determine whether the specific
condition exists and to branch if it does.
All six conditions usually available:
BL
BG
BGE
BLE
BE (or BZ)
BNE (or BNZ)
Branch if less than
Branch if greater than
Branch if greater or equal to
Branch if less or equal to
Branch if equal
Branch if not equal
38
Example
Suppose x held in R1 and y held in R2. The if statement:
if (x == y) goto L1;
could be implemented by sequence of two instructions, CMP and
BE:
CMP R1,R2
BE L1
;Compare contents of R1, R2 (x, y)
;Go to L1 if equal
L1:
39
Example
Suppose x held in R1 and y held in R2. The if statement:
if (x == y) goto L1;
could be implemented by sequence of two instructions, CMP and BE:
Condition code register
Z = 1 if R1 - R2 = zero otherwise Z = 0
S
Z
Write
CMP R1,R2
BE L1
;Compare contents of R1, R2 (x, y)
;Go to L1 if equal
L1:
40
Example
Suppose x held in R1 and y held in R2. The if statement:
if (x == y) goto L1;
could be implemented by sequence of two instructions, CMP and
BE:
Condition code register
Z = 1 if R1 - R2 = zero otherwise Z = 0
S
Z
Read
Write
CMP R1,R2 ;Compare contents of R1, R2 (x, y)
BE L1 ;Go to L1 if equal
L1:
41
More complex constructions
if - then - else
Suppose we had to implement:
if (a < b) c = a; else a = c;
...
assuming that the variables a, b, and c are assigned to registers R1,
R2, and R3 respectively.
42
Leads to:
CMP R1,R2
BL L1
MOV R1, R3
L1:
L2:
MOV R3,R1
...
;Compare R1, R2 (x, y)
;Go to L1 if a less than b
;a = c
Skip over c = a, unconditionally
;
c = a;
We need to alter the instruction sequence unconditionally. With the
instructions we know about so far, it could be done with:
CMP R1, R1
BE L2
;if (x == x) goto L2;
but there is a special instruction called a jump instruction to do it.
43
JUMP Instructions
Causes an unconditional change of execution sequence to a new
location.
Necessary to implement more complicated IF constructs, FOR, and
WHILE loops.
Using J as the opcode mnemonic, the jump instruction is:
J L1
;goto to L1
44
With jump instruction:
L1:
L2:
CMP R1,R2
BL L1
MOV R1, R3
J L2
Go to L2
MOV R3,R1
...
;Compare R1, R2 (x, y)
;Go to L1 if a less than b
;a = c
;c = a;
45
For loops
Suppose we had to implement the C/Java code sequence:
for (i = 1; i < 10; i++)
a = a * i;
...
Let us assume i is held in register R1, and x is held in register R2.
Possible solution
L2:
L1:
MOV R1, 1
CMP R1,10
BGE L1
MUL R2,R2,R1
ADD R1,R1,1
J L2
...
;i = 1
;Compare R1 with 10
;Go to L1 if end of loop
;a = a * i
;increment i
46
Jump Instruction with Register-Indirect
Addressing
An “address register” specified in instruction holds the address of
the location holding the next instruction to be executed.
Examples
J [R1]
;jump to location whose address in R1
J 100[R1]
;jump to location whose address given
;by R1 + 100
Target address specified as absolute address, rather than relative
address.
Used to implement high level language SWITCH statements. Also
return from procedures see next section.
47
Avoiding Condition Code Register
It turns out that the CCR approach has disadvantages when one
tries to implement a high performance processor, and does not
really suit RISC designs.
Makes it much more difficult to design a high-performance
processor.
CCR approach requires two sequential instructions with no
instructions allowed in between that affect the CCR. (All
arithmetic/logic instruction affect CCR.)
For the most part we will use single instruction in our designs.
An alternative to the CCR is to use a general purpose register in
place of CCR to hold “conditions.”
48
Simplified version of branch instruction
Bcond Rs1, L1
where Rs1 is compared against zero rather than Rs2.
Do subtract operation previous to this instruction, placing result
in Rs1.
49
Question
How would one code:
if (x > y) goto L1;
where x and y are stored in R2 and R3 respectively using previous
simplified version of branch instruction?
Answer
SUB R4, R2, R3
BG R4, L1
50
One version of branch instruction in MIPS
processor (Lab)
Use a general purpose register set to 1 of 0 if one register is less
than another.
SLT Rs1,Rs2,Rs3
;Rs1 = 1 if Rs2 < Rs3
together with a simple branch instruction that only tests condition
equal or not equal.
51
Questions
52
Download