Uploaded by vijaysteam

Write A Linux Driver Subsystem

advertisement
So you want to write a Linux driver subsystem?
Michael Turquette <mturquette@baylibre.com>
Who am I? And why am I here?
CEO of BayLibre, Inc
● Previously at Texas Instruments, Linaro, San Francisco start-up
● Contributor to various power management-related topics upstream
Author and co-maintainer of the common clk framework
● Merged in 3.4
● Maintenance for 3 years (and counting)
Lots of mistakes, rewrites and lessons learned.
This talk is a work in progress. Thanks, guinea pigs!
Agenda
1. Overview: what makes a good subsystem?
2. Design considerations
3. (Very) Brief review of CCF design
4. Maintenance
1. What makes a good Linux driver
subsystem?
What is a Linux driver subsystem?
● Frameworks and libraries; common code implementing a
standard protocol, interface or behavior
● Providers are Linux kernel drivers that plug into the
framework and provide access to hardware
● Consumers are Linux kernel drivers or subsystems that
access the framework through a common API
● A Linux driver can be both a provider and a consumer
Some common subsystems...
●
●
●
●
●
●
genirq
clocksource
clockevent
pinctrl
regulator
clk
●
●
●
●
●
●
cpufreq
cpuidle
pm runtime
genpd
alsa/asoc
v4l2
What makes a good subsystem?
●
Solves for the most common cases
●
Maintainable
●
Concurrency / locking correctness
●
Respects the Driver Model
●
Architecture/platform independent
●
Module safe
●
Continuous testing
Linux is not special
●
Use good programming practices
●
Consolidate code
●
Provide helpers and accessors only as needed
●
Use coccinelle to find bad patterns and fix them
●
Read an algorithm and data structures book
2. Design considerations
Patterns and pitfalls
1.
Consumer/provider API split
2.
Consumers should not know about the hardware
3.
Device versus Resource
4.
Follow the Linux Driver Model
5.
Locking and concurrent access
6.
Protecting your internal data structures
7.
Synchronous and async behavior
Consumer/provider API split
●
Consumers want to get devices and resources and call
functions on them
○
●
Providers register the devices and resources
○
●
clk_get(), clk_set_rate(), clk_enable(), etc
clk_register(), clk_unregister(), struct clk_ops, etc
Split them into separate headers
○
include/linux/clk-provider.h, include/linux/clk.h (consumer)
Example: OMAP3 ISP
cam_xclka
cam_mclk
OMAP3ISP
cam_xclkb
drivers/media/platform/omap3isp/isp.c
Knowledge not required!
The framework is incorrectly designed if consumer drivers
need to know details about the underlying hardware
Write-only APIs are useful for this
Rusty’s API Levels
10. It's impossible to get wrong.
9. The compiler/linker won't let you get it wrong.
8. The compiler will warn if you get it wrong.
7. The obvious use is (probably) the correct one.
http://goo.gl/SmNqN8
6. The name tells you how to use it.
http://goo.gl/yc6E4X
5. Do it right or it will always break at runtime.
4. Follow common convention and you'll get it right.
3. Read the documentation and you'll get it right.
2. Read the implementation and you'll get it right.
1. Read the correct mailing list thread and you'll get it right.
Example: MMC controller
osc
Switch
parent
mux
mmc_clk
pll
Propagate
rate
change
MMC Controller
Device versus Resource
CCF manages clock tree hierarchies
●
Should clocks be Linux devices?
○
●
Bring Your Own Device
vs
Hundreds of clocks…
Framework Provided Device
Does it match the data sheet?
○
○
Clock controller IP blocks expose
hundreds of clock nodes
IP block roughly == Linux device
●
●
struct regulator.dev versus struct clk
CCF does not create a struct device
Purely a matter of taste
Reference counting
kobject
kref
● creates sysfs object
● includes kref object for
reference counting
● get this “for free” with
struct device
● lightweight alternative to
kobject
● struct clk_core uses this
to keep things sane
around module
unloading
Don’t forget the release() method!
Follow the Linux driver model
gross.
void __init nomadik_clk_init(void)
{
struct clk *clk;
clk = clk_register_fixed_rate(NULL,
"apb_pclk", NULL,
CLK_IS_ROOT, 0);
...
Locking and concurrent access
● Drivers will do crazy shit.
● Protect yourself!
○
○
Define strict entry points into the framework
Wrap all data structure accesses in a sane locking scheme
● Do you need to access the framework in interrupt context?
○
○
Provide irq-safe entry points using spinlocks
Otherwise use mutexes
Example: clk_prepare & clk_enable
CCF has competing needs:
1. clk_enable/clk_disable can be called from interrupt context
2. Some enable ops may have delays/sleeps
clk_{un}prepare supplements clk_{en,dis}able
Mutex protects prepare ops, spinlock protects enable ops
Consumer drivers must always call both, in-order, and do not
need to know the details of underlying hardware
Breaking i2c
xtal
crap.
Clock Controller
(i2c Clock Expander
or PMIC)
i2c_out_clk
foo Device
Example: Reentrant locking
#define get_current() (current_thread_info()->task)
#define current get_current()
static void clk_prepare_lock(void)
{
if (!mutex_trylock(&prepare_lock)) {
if (prepare_owner == current) {
prepare_refcnt++;
return;
}
mutex_lock(&prepare_lock);
}
Protect data structures & bookkeeping
Internal bookkeeping and private data structures should not
be defined in headers
● Expose opaque handles to consumer drivers
● Think long and hard before giving provider drivers access
to struct definitions and private pointers
● Reference count accesses to these resources
Drivers will muck with data structures and bookkeeping that
they have no business touching
Example: per-user reference counting
struct clk {
};
struct clk_core {
struct clk_core *core;
const char *name;
const char *dev_id;
const struct clk_ops *ops;
const char *con_id;
struct clk_hw *hw;
unsigned long min_rate;
struct module *owner;
unsigned long max_rate;
struct clk_core *parent;
struct hlist_node clks_node;
unsigned int enable_count;
unsigned int prepare_count;
unsigned int prepare_count;
unsigned int enable_count;
…
};
Beware: get/put abuse
static struct clk *foo;
void probe()
void module_exit()
{
{
foo = clk_get(dev, “foo”);
clk_prepare_enable(foo);
clk_put(foo);
}
clk_unprepare_disable(foo);
}
Sync vs Async consumer API behavior
Sync
Async
● execution blocked until
operation completes
● The right choice for
some low-level
operations where
sequence is critical
● Examples: i2c and clk
consumer APIs
● execution proceeds after
operation is initiated
● Increases performance
in many use cases
● Requires a model where
waiting on a completion
event makes sense
● Example: spi consumer
APIs
Where does the data come from?
● Provide helper functions for the primary source of driver
data
● In the embedded world this is often Device Tree
● Continuously scan provider drivers and consolidate
common open-code solutions into helpers
● Design with a firmware interface in mind, but …
● … also do not design only for a single firmware interface
Misc tips & pitfalls
● Test for memory leaks caused by module
load/unload/reload
● Pass pointers to structs as arguments to functions
exposed by your subsystem
● Merge tests and hide them behind
CONFIG_FOO_TEST
● Sort Makefiles lexicographically
● Always terminate array initialization with a comma
3. (Very) Brief review of CCF design
Background on CCF
●
●
●
clk.h API is pretty old
○
Consumer side of the API
○
pre-dates CCF
○
Multiple implementations
Single implementation desirable
○
One definition of struct clk
○
Single zImage for ARM (and other arch’s)
○
Code consolidation
Coincided with other developments
CCF design (in a single slide)
●
It is a library
○
BYO(P)D
●
Re-entrant for the same context
●
Mixed use of mutex and spinlock
●
Per-user, opaque handles
●
Per-user reference counting kia kref
●
Strict consumer/provider API split
●
Internal data structures hidden
●
Big global locks
●
No async api
●
Consumer API is shared with competing implementations
4. Maintenance
So now what?
● Merging a new Linux driver subsystem is the beginning
of the work, not the end
● Set aside 50% of your time to maintain it
Maintaining sanity
● Find a co-maintainer
● Participate in linux-next
● Setup subsystem-specific mailing list and irc channel
● Automate your life
Best advice ever
● Say “No” all the time
○
○
○
This is your primary job now!
You amy stop being the top contributor to the code that you wrote!
A weak reason to not merge a patch is enough reason
Thanks
● Linus Walleij
● Greg Kroah-Hartman
● Mark Brown
● Rafael J. Wysocki
● … many others for hallway advice
Download