Threads and Critical Sections Vivek Pai / Kai Li Princeton University

advertisement
Threads and Critical Sections
Vivek Pai / Kai Li
Princeton University
Gedankenthreads
What happens during fork?
We need particular mechanisms, but do
we have options about what to do?
2
Mechanics
Midterm grading finished!
Tabulating, recording not done
Available this afternoon, really!
Let’s talk a little about the midterm
Start threads & critical sections
Readings will be updated & mentioned
in followup (so you’re responsible  )
3
Thread and Address Space
Thread
A sequential execution stream within a
process (also called lightweight process)
Address space
All the state needed to run a program
Provide illusion that program is running on
its own machine (protection)
There can be more than one thread per
address space
4
Concurrency and Threads
I/O devices
Overlap I/Os with I/Os and computation (modern
OS approach)
Human users
Doing multiple things to the machine: Web
browser
Distributed systems
Client/server computing: NFS file server
Multiprocessors
Multiple CPUs sharing the same memory: parallel
program
5
Typical Thread API
Creation
Fork, Join
Mutual exclusion
Acquire (lock), Release (unlock)
Condition variables
Wait, Signal, Broadcast
Alert
Alert, AlertWait, TestAlert
6
User vs. Kernel-Level Threads
Question
What is the difference between user-level and
kernel-level threads?
Discussions
When a user-level thread is blocked on an I/O
event, the whole process is blocked
A context switch of kernel-threads is expensive
A smart scheduler (two-level) can avoid both
drawbacks
7
Thread Control Block
Shared information
Processor info: parent process, time, etc
Memory: segments, page table, and stats, etc
I/O and file: comm ports, directories and file
descriptors, etc
Private state
State (ready, running and blocked)
Registers
Program counter
Execution stack
8
Threads Backed By Kernel Threads
Each thread has
a user stack
a private kernel stack
Pros
concurrent accesses
to system services
works on a
multiprocessor
Cons
More memory
Each thread has
a user stack
a shared kernel stack with
other threads in the same
address space
Pros
less memory
Does not work on a
multiprocessor
Cons
serial access to system
services
9
“Too Much Milk” Problem
Person A
Look in fridge: out of milk
Leave for Wawa
Arrive at Wawa
Buy milk
Arrive home
Person B
Look in fridge: out of milk
Leave for Wawa
Arrive at Wawa
Buy milk
Arrive home
Don’t buy too much milk
Any person can be distracted at any point
10
A Possible Solution?
if ( noMilk ) {
if (noNote) {
leave note;
buy milk;
remove note;
}
}
if ( noMilk ) {
if (noNote) {
leave note;
buy milk;
remove note;
}
}
Thread can get context switched after
checking milk and note, but before buying
milk
11
Another Possible Solution?
Thread A
Thread B
leave noteA
if (noNoteB) {
if (noMilk) {
buy milk
}
}
remove noteA
leave noteB
if (noNoteA) {
if (noMilk) {
buy milk
}
}
remove noteB
Thread A switched out right after
leaving a note
12
Yet Another Possible Solution?
Thread A
leave noteA
while (noteB)
do nothing;
if (noMilk)
buy milk;
remove noteA
Thread B
leave noteB
if (noNoteA) {
if (noMilk) {
buy milk
}
}
remove noteB
Safe to buy
If the other buys, quit
13
Remarks
The last solution works, but
Life is too complicated
A’s code is different from B’s
Busy waiting is a waste
Peterson’s solution is also complex
What we want is:
Acquire(lock);
if (noMilk)
buy milk;
Release(lock);
}
Critical section
14
What Is A Good Solution
Only one process inside a critical
section
No assumption about CPU speeds
Processes outside of critical section
should not block other processes
No one waits forever
Works for multiprocessors
15
Primitives
We want to avoid thinking (repeatedly)
So, we want some “contract” that
provides certain behavior
Low-level behavior encapsulated in
“primitives”
Application uses primitives to construct
more complex behavior
16
The Simplistic Acquire/Release
Acquire() {
disable interrupts;
}
Release() {
enable interrupts;
}
Kernel cannot let users disable interrupts
Critical sections can be arbitrarily long
Used on uniprocessors, but won’t work on
multiprocessors
17
Disabling Interrupts
Done right, serializes activity
People think sequentially – easier to reason
Guarantees code executes without
interruption
Delays handling of external events
Used throughout the kernel
18
Using Disabling Interrupts
Acquire(lock) {
disable interrupts;
while (lock != FREE){
enable interrupts;
disable interrupts;
}
lock = BUSY;
enable interrupts;
}
Release(lock) {
disable interrupts;
lock = FREE;
enable interrupts;
}
Why do we need to disable interrupts at all?
Why do we need to enable interrupts inside
the loop in Acquire?
19
Using Disabling Interrupts
Acquire(lock) {
disable interrupts;
while (lock == BUSY) {
enqueue me for lock;
block;
} else
lock = BUSY;
enable interrupts;
}
Release(lock) {
disable interrupts;
if (anyone in queue) {
dequeue a thread;
make it ready;
}
lock = FREE;
enable interrupts;
}
When does Acquire re-enable interrupts in
going to sleep?
Before enqueue?
After enqueue but before block?
20
Hardware Support for Mutex
Mutex = mutual exclusion
Early software-only approaches limited
Hardware support became common
Various approaches:
Disabling interrupts
Atomic memory load and store
Atomic read-modify-write
L. Lamport, “A Fast Mutual Exclusion
Algorithm,” ACM Trans. on Computer
Systems, 5(1):1-11, Feb 1987. – use Google
to find
21
The Big Picture
Concurrent Applications
High-Level Locks
Atomic API
Low-Level
Atomic Ops
Semaphores Monitors Send/Receive
Load/Store Interrupt disable Test&Set
Interrupt (timer or I/O completion), Scheduling, Multiprocessor
22
Atomic Read-Modify-Write
Instructions
Test&Set: Read value and write 1 back to memory
Exchange (xchg, x86 architecture)
Swap register and memory
Compare and Exchange (cmpxchg, 486+)
If Dest = (al,ax,eax), Dest = SRC;
else (al,ax,eax) = Dest
LOCK prefix in x86
Load link and conditional store (MIPS, Alpha)
Read value in one instruction, do some operations
When store, check if value has been modified. If not, ok;
otherwise, jump back to start
23
A Simple Solution with Test&Set
Acquire(lock) {
while (!TAS(lock))
;
}
Release(lock) {
lock = 0;
}
Waste CPU time
Low priority threads may never get a
chance to run
24
Test&Set, Minimal Busy Waiting
Acquire(lock) {
while (!TAS(lock.guard))
;
if (lock.value) {
enqueue the thread;
block and lock.guard = 0;
} else {
lock.value = 1;
lock.guard = 0;
}
}
Release(lock) {
while (!TAS(lock.guard))
;
if (anyone in queue) {
dequeue a thread;
make it ready;
} else
lock.value = 0;
lock.guard = 0;
}
Why does this work?
25
Download