Berkeley NOW - Frontiers in Distributed Information Systems

advertisement
Intel Research @ Berkeley
and
Extreme Networked Systems
www.intel-research.net/berkeley
David Culler
8/12/2002
Where this presentation might go...
aka Outline
• new models of industry/academic research
collaboration
• vast networks of tiny devices in the physical
world
• open infrastructure for emerging planetary-scale
services
8/12/2002
IRB/XIS
2
New model for ind/acad collaboration
• Key challenges ahead in EECS are fundamentally
problems of scale
– require level of investigation and engineering beyond what is
sustainable within the university and beyond what a company
can commit outside product scope
– industry possesses key technology and expertise
– requires insights from many perspectives
• A new lab stucture built around deep research
collaboration and intimate ties to the EECS
department
–
–
–
–
industry contributes substantial effort of high quality
projects span boundaries
faculty co-direct lab
student / faculty cycles drive the continuous motion
• Operate in uniquely open fashion
8/12/2002
IRB/XIS
3
Intel Network of Lablets Concept
• Network of small labs working closely with top
computer science departments around the world
on deeply collaborative projects.
–
–
–
–
Berkeley
– extreme network systems
Washington – HCI
CMU
– distributed storage
Cambridge
• Complement the corporate labs
– explore off the roadmap, long range, high risk
• Complement the external-research council
– drive projects of significant scale and impact
• Expand the channel
– Bi-directional transfer of people, ideas, technology
8/12/2002
IRB/XIS
4
lablet mission
• Leadership role in emerging and important areas
• Combining the unique strengths of Intel and Univ.
• Bi-directional exchange of breakthough ideas,
technology and people
University
Advance of the
research ecosystem
Lablet
Novel component
technology
SRPs
Advanced
Applications
Intel Labs
8/12/2002
IRB/XIS
5
Berkeley Emphasis
• Cross-cutting problems of scale.
• Extreme Interconnected Systems
• “endonets”
– dense, fine-grain networked systems deeply embedded in or
interacting with physical environment
– sensor networks
– ubiquitous computing architectures
– computational fabrics, surfaces, structures
• “exonets”
–
–
–
–
8/12/2002
broad coverage networked systems at societal scale
world-wide storage systems
composable infrastructure services
massive servers for millions of users
IRB/XIS
6
Scale and structure
Active day-to-day involvement
• ~20 full-time Intel Researchers and Engineers
– currently 13
• ~5 part-time Intel folks
• 20 faculty, students, visitors, research
consultants
Two-in-a-box co-directors
• University Director + Intel Director
• Report to David Tennenhouse, VP Research
Project focused
• ~6-year projects starting about every two years
8/12/2002
IRB/XIS
7
Two Major Lab Projects
• Define and Develop complete ‘network system
stack’ for deeply embedded sensor/effector
networks
–
–
–
–
enabling technology
create the community
core architecture, OS, networking, service foundations
demonstrate revolutionary applications
• Create an Open Laboratory for Widelydistributed “Planetary Scale” Services to explore
architecture, services and applications
–
–
–
–
8/12/2002
enabling resource catalyzes community
distributed development effort
foundations: scalable, secure slice-able platform
infra and service design trade-offs (DHT, Dist-storage)
IRB/XIS
8
Open Collaborative Research Agreement
• Master Agreement states
– intent: Open
– terms, conditions (IP addendum)
• Research Project Descriptions
– what, who, where
• scope of work defines boundary of openness!
– an openness agreement is all about defining reach-through
8/12/2002
IRB/XIS
9
System Stack for Deeply Embedded
Networks
8/12/2002
IRB/XIS
10
Bridging the Technology-Appln Gap
service
network
system
architecture
algorithm / theory
data
mgmt
mgmt / diag / debug
application
technology
8/12/2002
IRB/XIS
11
prog / data model
Monitoring & Managing Spaces and Things
Deeply Embedded Networks
• # nodes >> # people
• sensor/actuator data stream
• unattended
• inaccessible
• prolonged deployment
• energy constrained
• operate in aggregate
• in-network processing necessary
• what they do changes over time
=> must be programmed over the network
8/12/2002
IRB/XIS
12
Project Activities
• Core Platform
– architecture, TinyOS, Networking
– simulation and debugging tools
• Programming Support
– NesC (TinyOS modularity and concurrency)
– Cooperating FSMs, atomicity
– Macroprogramming
• Sensor-Network databases
– streaming, noisy data, with in-network query processing
• Delay Tolerant Networking
– overlay for diverse, challenged internets
• Interactive Environments and Things
– ambient displays, remote physical communication
– context-aware tools for the handicapped
• Habitat and Environmental Monitoring
– dense sensor networks in the hands of life scientists
• Generic Sensor Kit
8/12/2002
IRB/XIS
13
Platform Architecture
• Goal
– create a small wireless device that would enable us to explore
the system design space, applns to be attempted, and a new
research community
– develop the architecture in response to observed system
design
application
• Approach
– joined in the series of UCB COTS mote designs
» WeC -> Rene -> iDot -> MICA
– look to silicon for full architecture
• New ideas
data
mgmt
service
network
system
– rich interfaces allow radical system optimizations
architecture
» analog wake-up, Tx-Rx time synch
– federation of accelerators, not dedicate protocol proc. technology
– HW/SW multithreading for low power, passive vigilance
8/12/2002
IRB/XIS
14
Berkeley Wireless Sensor ‘Motes’
Mote Type
Date
WeC
Rene
Rene2
Dot
Mica
Sep-99
Oct-00
Jun-01
Aug-01
Feb-02
Microcontroller (4MHz)
Type
Prog. Mem. (KB)
RAM (KB)
AT90LS8535
ATMega163
ATMega103/128
8
16
128
0.5
1
4
Communication
Radio
Rate (Kbps)
Modulation Type
8/12/2002
RFM TR1000
10
10/40
OOK
OOK/ASK
IRB/XIS
15
application
TinyOS Application Graph
Route map
router
sensor appln
packet
Radio byte
bit
Radio Packet
byte
Active Messages
RFM
8/12/2002
Serial Packet
UART
Temp
photo
SW
HW
ADC
clocks
IRB/XIS
Example: self-organized adhoc, multi-hop routing of
photo sensor readings
3450 B code
226 B data
Graph of cooperating
state machines
on shared stack
16
It is a noisy world after all...
• Get to rethink each of the layers in
a new context
–
–
–
–
–
–
–
–
–
–
–
coding, framing
mac
routing
transport,
rate control
discovery
multicast
aggregation
naming
security
...
• Resource constrained, power
aware, highly variable, ...
• Every node is also a router
• No entrenched ‘dusty packets’
probability of reception from center
node vs xmit strength
8/12/2002
IRB/XIS
17
Example “epidemic” tree formation
8/12/2002
IRB/XIS
18
Habitat Monitoring
Acadia National Park
Mt. Desert Island, ME
Ongoing research
Great Duck Island
Nature Conservancy
LAN
WAN
(satcast)
http://www.greatduckisland.net
sensor nets
8/12/2002
IRB/XIS
19
8/12/2002
IRB/XIS
application
service
data mgmt
network
system
architecture
algorithm / theory
Programming environments
Deep & scalable simulation
Algorithm behavior at scale
Operating on prob.
distributions
• Fine-Grain Inverse problems
• Pseudo-imaging
• Constructive foundations of
self-organization
mgmt / diag / debug
•
•
•
•
prog / data model
Cross-cutting issues?
technology
20
The Other Extreme
- Planetary Scale Services
www.planet-lab.org
8/12/2002
IRB/XIS
21
Motivation
• A new class of services & applications is emerging
that spread over a sizable fraction of the web
– CDNs as the first examples
– Peer-to-peer, ...
• Architectural components are beginning to emerge
– Distributed hash tables to provide scalable translation
– Distributed storage, caching, instrumentation, mapping, events ...
• The next internet will be created as an overlay on the
current one
– as did the last one
– it will be defined by its services, not its transport
» translation, storage, caching, event notification, management
• There will soon be vehicle to try out the next n great
ideas in this area
8/12/2002
IRB/XIS
22
Confluence of Technologies
• Cluster-based scalable distribution, remote execution,
management, monitoring tools
– UCB Millennium, OSCAR, ..., Utah Emulab, ModelNet...
• CDNS and P2Ps
– Gnutella, Kazaa, ... ,Pastry, Chord, CAN, Tapestry
• Proxies routine
• Virtual machines & Sandboxing
– VMWare, Janos, Denali,...
web-host slices (EnSim)
• Overlay networks becoming ubiquitous
– XBONE, RON, Detour...
Akamai, Digital Island, ....
• Service Composition Frameworks
– yahoo, ninja, .net, websphere, Eliza
•
•
•
•
Established internet ‘crossroads’ – colos
Web Services / Utility Computing
Grid authentication infrastructure
Packet processing,
The Time is NOW
– Anets, .... layer 7 switches, NATs, firewalls
• 8/12/2002
Internet instrumentation
IRB/XIS
23
Guidelines (1)
• Thousand viewpoints on “the cloud” is what matters
– not the thousand servers
– not the routers, per se
– not the pipes, per se
8/12/2002
IRB/XIS
24
Guidelines (2)
• and you miust have the vantage points of the crossroads
– primarily co-location centers
8/12/2002
IRB/XIS
25
Guidelines (3)
• Each service needs an overlay covering many
points
– logically isolated
• Many concurrent services and applications
– must be able to slice nodes => VM per service
– service has a slice across large subset
• Must be able to run each service / app over long
period to build meaningful workload
– traffic capture/generator must be part of facility
• Consensus on “a node” more important than
“which node”
8/12/2002
IRB/XIS
26
Guidelines (4)
Management, Management, Management
• Test-lab as a whole must be up a lot
– global remote administration and management
» mission control
– redundancy within
• Each service will require its own remote management
capability
• Testlab nodes cannot “bring down” their site
– generally not on main forwarding path
– proxy path
– must be able to extend overlay out to user nodes?
• Relationship to firewalls and proxies is key
8/12/2002
IRB/XIS
27
Guidelines (5)
• Storage has to be a part of it
– edge nodes have significant capacity
• Needs a basic well-managed capability
– but growing to the seti@home model should be considered at
some stage
– may be essential for some services
8/12/2002
IRB/XIS
28
Initial Researchers (mar 02)
Washington
Tom Anderson
Steven Gribble
David Wetherall
MIT
Frans Kaashoek
Hari Balakrishnan
Robert Morris
David Anderson
Berkeley
Ion Stoica
Joe Helerstein
Eric Brewer
John Kubi
8/12/2002
Intel Research
David Culler
Timothy Roscoe
Sylvia Ratnasamy
Gaetano Borriello
Satya
Milan Milenkovic
http://www.planet-lab.org/
Rice
Peter Druschel
Utah
Jay Lepreau
CMU
Srini Seshan
Hui Zhang
UCSD
Duke
Amin Vadat
Jeff Chase
Princeton
Larry Peterson
Randy Wang
Vivek Pai
IRB/XIS
Stefan Savage
Columbia
Andrew Campbell
ICIR
Scott Shenker
Mark Handley
Eddie Kohler
29
Initial Planet-Lab Candidate Sites
UBC
UW
WI
Chicago
UPenn
Harvard
Utah
Intel Seattle
Intel
MIT
Intel OR
Intel Berkeley
Cornell
CMU
ICIR
Princeton
UCB
St. Louis
Columbia
Duke
UCSB
Washu
KY
UCLA
Rice GIT
UCSD
UT
ISI
Uppsala
Copenhagen
Cambridge
Amsterdam
Karlsruhe
Barcelona
Beijing
Tokyo
Melbourne
8/12/2002
IRB/XIS
30
Approach:Service-Centric Virtualization
• Virtual Machine Technology has re-emerged for hosting
complete desktop environments on non-native OS’s and
potentially on machine monitors.
– ex. VMWare, ...
• Sandboxing has emerged to emulate multiple virtual machines
per server with limited /bin, (no /dev)
– ex. ENSim web hosting
• Network Services require fundamentally simpler virtual
machines, can be made far more scalable (VMs per PM),
focused on service requirements
– ex. Jail, Denali, scalable and fast, but no full legacy OS
– access to overlays (controlled access to raw sockets)
– allocation & isolation
» proportional scheduling across resource container - CPU, net, disk
– foundation of security model
– fast packet/flow processing puts specific design pressures
• Instrumentation and management are additional virtualized
‘slices’
– distributed workload generation,
data collection
8/12/2002
IRB/XIS
31
Hard problems/challenges
• “Slice-ability” – multiple experimental services deployed over many
nodes
–
–
–
–
Distributed Virtualization
Isolation & Resource Containment
Proportional Scheduling
Scalability
• Security & Integrity - remotely accessed and fully exposed
– Authentication / Key Infrastructure proven, if only systems were bug free
– Build secure scalable platform for distributed services
» Narrow API vs. Tiny Machine Monitor
• Management
– Resource Discovery, Provisioning, Overlay->IP
– Create management services (not people) and environment for innovation
in management
» Deal with many as if one
• Building Blocks and Primitives
– Ubiquitous overlays
• Instrumentation
8/12/2002
IRB/XIS
32
Emerging Extreme Internet
Wide-Area Broad-Coverage Services
DeeplyEmbedded
Networks
8/12/2002
Traditional pt-pt Internet
IRB/XIS
33
backup
8/12/2002
IRB/XIS
34
Mission for the Network of Labs
• Bold new form of Industry-University collaboration that
reflects the changing nature of the information age.
• Conduct the highest quality research in emerging,
important areas of CS and IT.
• Join the unique strengths of Universities and the company
in concurrent, collaborative efforts that are both broad in
scope and deeply penetrating in exploration.
• Operate in a uniquely open fashion, promoting a powerful,
bidirectional exchange of groundbreaking ideas,
technology, and people.
• Leadership role in the creation of new research
ecosystems spanning the continuum from academic study
to product development.
• Labs will be project-focused with an active, constantly
evolving agenda involving Intel researchers, University
researchers, and members of the larger research
community
8/12/2002
IRB/XIS
35
Berkeley Focus
Extreme Interconnected Systems
• Invent, develop, explore, analyze, and understand
highly interconnected systems at the extremes of the
computing and networking spectrum - the very large,
the very small, and the very numerous
• Do leading-edge Computer Science on problems of
scale, cutting across traditional areas of architecture,
operating systems, networks, and languages to
enable a wide range of explorations in ubiquitous
computing, both embedded in the environment or
carried easily on moving objects and people
8/12/2002
IRB/XIS
36
Current Research Team
•
•
Hans Mulder – co-director, IA64
Kevin Fall: UCSD, ISI, UCB,
NetBoost, Intel
•
– Operating systems, Distributed
Computing, Infrastructure Services
– high speed ip networking
•
•
Alan Mainwaring: TMC, UCB,
Sun, Intel
•
– virtual networks, deep scalable
network systems
•
•
David Gay: UCB
•
Wei Hong, UCB, Illustra, Cohera,
PeopleSoft
Su Ping: Intel
Phil Buonodonna, UCB (abd
intern)
– Storage Area Networks, networks
Silvia Ratnasamy, UCB/ICSI (abd)
– Networking, P2P
•
– Federated databases
•
Matt Welsh, UCB (Post Doc)
– Operating Systems, internet service
design
Anind Dey: Georgia Tech, aware
house
– Prog. Lang. design/Imp for novel
comm. layers
•
Brent Chun: UCB, CIT
– cluster systems, resource management
– framework for context aware applns,
ubicom
•
Timothy Roscoe: Cambridge,
Sprint
Justin Tomilson, Part Time
– optimization, IEOR PhD Student
•
Earl Hines – operations mgr
– Software Engineering, embedded
systems
•
Eric Paulos: UCB
– HCI, robotics, ubicomp
8/12/2002
IRB/XIS
37
Additional Researchers
• Joe Hellerstein, Faculty Consultant (next AD)
– streaming database, sensor database, P2P
• Eric Brewer, Faculty Consultant
– systems, language design
• Larry Peterson, Consultant/Sabattical
• Deborah Estrin, Faculty consultant
– internet, multicast, rsvp,...sensor nets
• Paul Wright, Former Faculty consultant
– infopad, BWRC, cybercut
8/12/2002
IRB/XIS
38
Current Faculty Research Associates
• James Demmel
large-scale comp. sci
• Michael Franklin
Sensor Databases
• Steven Glaser
structural dynamics
• Joe Hellerstein
Streaming Databases
• John Kubiatowicz planetary storage
• James Landay
HCI
• David A Patterson Architecture
• Kris Pister
MEMS, Smart Dust
• Jan Rabaey
Low power systems
• Satish Rao
Distr. Systems Theory
• Ion Stoika
Networking
• Vivek Subramanian Disposable devices
• David Wagner
Security
• Kathy Yelick
Parallel Languages
• Jennifer Mankoff
HCI
8/12/2002
IRB/XIS
• Shankar Sastry
Distributed
Robotics
39
Download