CS 630: Advanced Microcomputer Programming Fall 2006 Professor Allan B. Cruse

advertisement
CS 630: Advanced
Microcomputer Programming
Fall 2006
Professor Allan B. Cruse
University of San Francisco
Course Synopsis
•
•
•
•
We study the IA32 processor architecture
It’s implemented in our Pentium 4 CPUs
Also implemented in some earlier CPUs
Not only Intel, but also by its competitors
(e.g., present as ‘legacy mode’ in AMD64)
• IA32 architecture adopted by newer Macs
• IA32 architecture continues in Core 2 Duo
Point-of-View
• For study purposes we can pretend we’re
studying a ‘bare machine’ (i.e., it just has
standard PC hardware for doing I/O, and
ROM-BIOS firmware supplied by vendor,
but lacks any operating system software).
• So we get to ‘build our own’ miniature OS
• Doing this will bring us face-to-face with
the CPU’s most fundamental capabilities
Methodology
• Our interactive computer classroom lets us
take a ‘hands on’ approach to our studies
(i.e., we combine ‘theory’ with ‘practice’)
• Typically we’ll devote first part each class
to a ‘lecture’ about aspects of IA32 theory
• Then we’ll take time in the second part of
class for ‘laboratory exercises’ that put the
newly learned ideas into ‘working code’
Course prerequisites
• Experience with C / C++ programming
• Familiarity with use of Linux / UNIX OS
• Acquaintance with x86 assembly language
– Knowledge of the x86 general registers
– Awareness of the x86’s instruction-set
• Understand the CPU’s fetch-execute cycle
• Recall the ways memory is addressed
Review of System Components
Central
Processing
Unit
Main
Memory
system bus
I/O
device
I/O
device
I/O
device
I/O
device
Review of the x86 API
EAX
CS
EBX
DS
ECX
ES
EDX
FS
ESI
GS
EDI
SS
EBP
ESP
General Registers (32-bits)
Segment Registers (16-bits)
EIP
EFLAGS
Program Control and Status Registers (32 bits)
Review of Instruction-Set
•
•
•
•
•
•
•
Data-transfer instructions (mov, xchg, …)
Control-transfer instructions (jmp, call, …)
Arithmetic/Logic instructions (add, or, …)
Shift/Rotate instructions (shr, rol, …)
String-manipulation instructions (movs, …)
Processor-control instructions (cli, hlt, …)
Floating-point instructions (fldpi, fmul, …)
Review “Fetch-Execute” Cycle
main memory
central processor
Temporary
Storage
(STACK)
ESP
Program
Variables
(DATA)
Program
Instructions
(TEXT)
EAX
EAX
EAX
EAX
EIP
the system bus
Steps in ‘Fetch-Execute Cycle’
Fetch next instruction
Advance instruction-pointer
Decode fetched instruction
Execute decoded instruction
no
INTR
?
yes
Interrupt
Service
Routine
Review of operand addressing
• Implicit addressing
(e.g. pushf, cbw, scasb, cli, xlat, …)
• Direct addressing
(e.g., incl salary, movw $0, counter, …)
• Indirect addressing
(e.g., add %dx, 0x14(%ebx, %esi, 2) )
Course Textbook
• Tom Shanley, Protected Mode Software
Architecture, Addison-Wesley (1996)
Initial reading assignment:
Week 1: Read Part One (Chapters 1-3)
Week 2: Read Part Two (Chapters 4-5)
Instructor Contact Information
• Office: Harney Science Center – 212
• Hours: Mon-Wed-Fri 12:30pm-1:15pm
and Tues-Thurs 6:15pm-7:15pm
• Phone: (415) 422-6562
• Email: cruse@usfca.edu
• Webpage: <http://cs.usfca.edu/~cruse>
CPU Execution Modes
POWER-ON / RESET
REAL
MODE
PROTECTED
MODE
SYSTEM
MANAGEMENT
MODE
VIRTUAL
8086
MODE
Early Intel Processors
•
•
•
•
•
•
•
1971: 4004 (first 4-bit processor)
1972: 8008 (first 8-bit processor)
1974: 8080 (widely used by CP/M)
1978: 8086/8088 (first 16-bit processor)
1982: 80286: (introduced protected mode)
1985: 80386: (first 32-bit processor)
1989: 80486: (integrated floating-point)
Later Intel Processors
•
•
•
•
•
•
•
•
1993: Pentium processor (dual CPUs)
1995: Pentium Pro (for high-end servers)
1996: Pentium II (single-edge connector)
1998: Pentium II Xeon (multiple CPUs)
1999: Celeron (stripped down Pentium II)
1999: Pentium III (1GHz, 512K L2 cache)
1999: Pentium III Xeon (high-end servers)
2001: Pentium 4 (new SIMD instructions)
Even newer Intel Processors
• 2003: Pentium-M (‘mobile’ -- for laptops)
• 2005: Pentium-D (‘dual core’ -- for ‘smp’)
• 2006: Core 2 Duo (released this summer)
• Newest CPUs support ‘EM64T’ and ‘VT’
– EM64T: Extended Memory 64-bit Technology
– VT: Intel’s ‘Virtualization Technology’
Backward Compatibility
• From its first commercial success onward,
“backward compatibility” (i.e., support for
the software legacy) has been viewed by
Intel as an engineering design imperative
• So the first 16-bit processors (8086/8088),
used in IBM-PCs, were designed in a way
that would let them run the vast number of
CP/M programs written for 8-bit 8080 CPU
Real Mode
• 8086/8088 had only one execution mode
• It used “segmented” memory-addressing
• Physical memory on 8086 was subdivided
into overlapping “segments” of fixed-size
• The length of any “segment” was 64KB, to
match the size of an 8080s address-space
• This scheme supported CP/M applications
• (Our Pentium CPUs continue that support)
64KB Memory-Segments
• Fixed-size segments partially overlap
• Segments start on paragraph boundaries
• Segment-registers serve as “selectors”
stack
data
code
SS
DS
CS
Real-Mode Address-Translation
Logical address:
0x12340
+ 0x06789
---------------0x18AC9
16-bit segment-address
16-bit offset-address
0x1234
0x6789
x 16
+
20-bit bus-address
Physical address:
0x18AC9
Protected Mode
• Any Pentium CPU starts up in ‘Real Mode’
• While in real mode, its behavior is like an 8086
(i.e., any program can do anything it wants, as
the CPU’s protection mechanisms are disabled)
• But software can enter ‘protected mode’ (on a
80286 or higher) using a special instruction to
modify a bit within a processor control-register
• Once in protected mode, the segment-sizes can
be adjusted, accesses to physical memory (or to
peripheral devices) can be restricted, and tasks
can be isolated from interfering with one another
Enabling Protection
15 14 13 12
11 10
9
8
7
6
5
4
3
2
1
0
N E T E M P
E T S M P E
80286 Machine Status Word
Code-fragment that
enables protection
smsw %ax
or $1, %ax
lmsw %ax
PE (Protection Enabled) 0=no, 1=yes
Protected-Mode Segments
• Segments can have varying lengths
• Segments may, or may not, overlap
• Segments are assigned ‘access-attributes’
operating system
stack
data
code
GS
SS
DS
CS
Our ‘bare machine’
• If we want to do a “hands on” study of our
CPU, without any operating system getting
in our way, we have to begin by exploring
‘Real Mode’ (it’s the CPU’s startup state)
• We will need to devise a mechanism by
which our program-code can get loaded
into memory (since we won’t have an OS)
• This means we must write a ‘boot loader’
What’s a ‘boot loader’
• A ‘boot loader’ is a small program that is
resident in the starting sector of a disk (or
tape or other non-volatile storage medium)
• After testing and initializing the machine’s
essential hardware devices, the startup
program in the ROM-BIOS firmware will
read the ‘boot loader’ into memory, at an
assigned location, and then jump there
PC ROM-BIOS BOOT_LOCN
Vendor’s Firmware
ROM-BIOS
No installed memory
Video Display Memory
Volatile Program Memory
VRAM
RAM
0x00007E00
0x00007C00
BOOT_LOCN 512 bytes
IVT and BDA
8086 memory-map
1-MB
Some Requirements
• A ‘boot loader’ has to be 512 bytes in size
(because it has to fit within a disk sector)
• Must begin with executable machine-code
• Must end with a special ‘boot signature’
• Depending on the type of storage medium,
it may need to share its limited space with
certain other data-structures (such as the
‘partition table’ on a hard disk, or the Bios
Parameter Block’ on a MS-DOS diskette)
Writing a ‘boot loader’
• Not practical to use a high-level language
• We need to use 8086 assembly language
(our classroom/lab systems provides ‘as’)
• This assembler’s syntax differ’s from the
standard set by Intel and Microsoft, but it
follows a tradition, established in 1970s at
AT&T, for its original versions of UNIX
• That ‘as’ syntax is documented online
Using ROM-BIOS functions
• Our system firmware provides many basic
service-functions that real mode programs
can invoke (this includes ‘boot-loaders’):
– Video display functions
– Keyboard input functions
– Disk access functions
– System query functions
– A machine ‘re-boot’ function
Example: Write_String function
• Setup parameters in designated registers
– AH = function ID-number (e.g. 0x13)
– AL = cursor handling method (e.g. 0x01)
– BH = display page-number (e.g., 0x00)
– BL = color attributes (e.g., 0x0A)
– CX = length of the character-string
– DH, DL = row-number, column-number
– ES:BP = string’s starting-address (seg:off)
• Call BIOS via software interrupt (int-0x10)
Downloading a class demo
• You can ‘download’ a program source-file
from our CS 630 course-website to your
own ‘present working directory’ by using
the Linux file-copy command, like this:
$ cp /home/web/cruse/cs630/bootmsw.s .
(Here the final period-character (‘.’) is the Linux
shell’s symbol for your ‘current directory’).
Compiling and Installing
• Compiling our ‘boot loader’ using ‘as’ is a
two-step operation (and requires use of a
linker-script, named ‘ldscript’):
$ as bootload.s –o bootload.o
$ ld bootload.o –T ldscript –o bootload.b
• Installing our bootloader into the starting
sector of a floppy diskette is very simple:
$ dd if=bootload.b of=/dev/fd0
No floppy drive!
• Our workstations no longer have diskettedrives, but we have devised alternatives:
– Copy the bootloader to a hard disk partition
– Install the bootloader on a diskette-image file
• Tonight we can use the first alternative:
$ dd if=bootloader.b of=/dev/sda4
• The ‘grub’ menu includes an option that
will let you ‘boot’ from this ‘cs630 partition’
Executing a ‘boot-loader’
• You need to perform a system ‘reboot’
• Our classroom machines will load GRUB
(the Linux GRand Unified Boot-loader)
• GRUB will display a menu of Boot Options
• You can choose ‘boot from a disk-partition’
• Or you can boot from a diskette-image file
In-class Exercise #1
• Look at our CS 630 class website:
<http://cs.usfca.edu/~cruse/cs630>
• Download, assemble, and install our demo
‘bootmsw.s’
• Copy the ‘binary-executable’ (i.e., bootmsw.b’) to
the first sector of the hard-disk’s partition #4:
$ dd if=bootmsw.b of=/dev/sda4
• Reboot machine and use GRUB’s menu to boot
our demo-program from the ‘cs630 partition’
In-class Exercise #2
• Now modify our demo so it will permit a
user to ‘reboot’ just by pressing any key
• This exercise will require you to edit your
copy of our demo-program’s source-file
(adding a few lines that invoke two further
ROM-BIOS service-functions), and then
reassemble, relink, and reinstall your work
A valuable Online Reference
• Professor Ralf Brown’s Interrupt List
(see webpage link under ‘Resources’)
• It tells how to make BIOS system-calls, to
perform numerous low-level services from
within Real-Mode 8086 applications (such
as ‘boot loader’ programs)
Programming Details
• It’s easy to include ‘await keypress’:
mov $0, %ah ; function-ID
int $0x16
; BIOS keyboard service
• It’s easy to include ‘reboot system’:
int $0x19
; BIOS reboot service
Download