Telepresence: An Umbrella Research Topic Jim Gray Microsoft Research

advertisement
Telepresence:
An Umbrella Research Topic
Jim Gray
Microsoft Research
Gray@Microsoft.com
http://research.Microsoft.com/~Gray/
1
NSF: Nerve Center of Science
If it’s not broke, don’t fix it.
But….




US Science is the engine of progress
BUT…..
Best and brightest are spending
increasing time fundraising
Seems excessive to me.
Venture capital community is
richer and
more generous
than NSF
2
Outline (ambitious!)

Microsoft Research (census)
Tele-Presentations (Gordon Bell, Jim Gemmell)
Microsoft Research initiative on Telepresence

What if you could record everything you see & hear?

The architecture revolution:
processing moves to transducers


3
Microsoft Research -- 1991



Founded in 1991
Goal:
pursue strategic technologies
for Microsoft
Original research groups:
– Natural Language Processing
– Operating Systems
– Programming Languages

Overall size < 20 at the end of 1992
4
Microsoft Research -- 1998

280 Researchers in 25 areas
– Operating systems to Statistical Physics

Research lab locations:
– Redmond, Cambridge, San Francisco

Internationally recognized research
teams
– Hundreds of publications, presentations
– Leadership roles in professional societies,
journals, conferences
5
MS Research Areas



Operating systems, languages,
compilers, virtual machines, networking,
wireless computing, fault-tolerance,
large scale servers, security
Natural language, speech, vision,
graphics, decision theory, information
retrieval, UI, collaboration, statistics,
signal processing
Cryptography, statistical physics and
discrete mathematics
6
Growing Fast


Grew 4x from ‘94 to ‘97
Decided in ‘97 to grow by a 3x in 3 years
– 200 in FY97 => 600 in FY00,
primarily in Redmond

Major impact on MS products
– Virtually all MS products shipped today use
technology from MS Research

Critical role in MS growth
– Pioneering research in software that allows
computers to see, hear, speak and understand
7
Microsoft Research
Philosophy

University organizational model
– Flat structure, critical mass groups

Open research environment
– Aggressive publication of research results
in literature and on world wide web
– Frequent visitors, daily seminars
– Over 70 visiting professors and interns in
1997
– Over 110 visiting researchers in 1998
8
Some Key Senior Researchers




Systems
– Rick Rashid, Butler Lampson, Gordon Bell
– Anoop Gupta, Roger Needham, Chuck Thacker
Databases & Data Mining
– David Lomet, Jim Gray, Usama Fayyad
Graphics
– Jim Kajiya, Jim Blinn, Alvy Ray Smith, Michael Cohen
Speech & Language
– Karen Jensen, George Heidorn, X.D. Huang, Alex
Acero, Hsiao-Wuen Hon, Scott Meredith
9
Some Key
Senior Researchers




UI Design, Intelligent Systems, IR
– George Robertson, Linda Stone, Susan Dumais, David
Heckerman, Eric Horvitz, Jack Breese
Computer Vision & Signal Processing
– Steve Shafer, Rick Szeliski, P. Anandan, Rico Malvar
Cryptography & Theory
– Yacov Yacobi, Jennifer Chayes, Christian Borg,
Michael Freedman
Languages & Compilers
– Daniel Weise, Chris Fraser, Amitabh Srivastava, Luca
Cardelli, David Hanson, Charles Simonyi, Todd
Proebsting
10
Microsoft Research

1997 BusinessWeek Poll of Academia:
– Voted #7 lab (overall) in Computer Science
– Voted #3 industrial research lab
(after Bell Labs and IBM Research)
– Voted #2 most desirable lab to work
(after Stanford)
11
Outline (ambitious!)

Microsoft Research (census)
Tele-Presentations (Gordon Bell, Jim Gemmell)
Microsoft Research initiative on Telepresence

What if you could record everything you see & hear?

The architecture revolution:
processing moves to transducers


12
Gordon Bell on
Tele Presentations
http://research.microsoft.com/barc/GBell/
13
Motivation:
Telepresentations
• Presenter and/or
audience
telepresent
NOT: meeting or collaboration
settings
Forget the nasty social
issues!
Mostly one-way
14
Telepresentation
Elements
Slides
 Audio
 Video
 Script,
text
comments,
hyperlinks,
etc.

15
Telepresentations:
The Essentials



Slide and audio a must
Add some video
(low quality)
to make us feel good
Storage and
transmission costs low
16
Telepresentations:
The Killer App



Increased attendance & lower
travel costs
Practical and low-cost NOW
e.g. ACM97 - 2,000 visitors in real
space, 20,000 visitors on Internet
http://research.microsoft.com/acm97
17
Today’s
Experiment


Would you like to pause, rewind, browse?
Do you wish you could have seen this
– At home?
– At another time?

How much does a present speaker add?
How much would you pay for real
presence?
18
Outline (ambitious!)





Microsoft Research (census)
Tele-Presentations (Gordon Bell, Jim Gemmell)
Microsoft Research initiative on Telepresence
What if you could record everything you see &
hear?
The architecture revolution:
processing moves to transducers
19
Changing role of computation

Past: Computers for:
– computing (Cray)
– business data processing (IBM)
– “document” creation (PC)

Future: Computers for:
– understanding & learning
– communicating
– consuming & entertaining

Requires new User Interface to machines
20
Flows
Making “Flows” a Reality

Computer Graphics
– Creating realistic looking environments,
people

Computer Vision
– Analyzing posture, gaze, gestures


Speech input/output
Natural Language
– Analysis, IR

Implicit requests for information
22
Building life-like human
characters
Recognizing gestures
Live video
Area of
motion
H flow
V flow
Generating life-like speech
from textual data

Data-driven stochastic speech
– Natural sounding
– Rapid, automatic customizability

Examples
– Synthetic voice w/ transplanted speech
contours
25
Artificial singing

AT&T Voder, 1962, by Homer Dudley
– Daisy (Inspiration for HAL’s voice in 2001)

Microsoft Research Whistler, 1997
– Scarborough Fair
26
Analyzing language


Language recognition shipped in Word 97
General purpose text-critiquing,
summarization, Japanese word-breaking
27
Inside The Office
Grammar Checker
28
Understanding language:
MindNet





A huge language
knowledge base
Automatically created
from dictionaries
Words (nodes) linked
by relationships
Millions of links
Recently added
(Encarta)
encyclopedia
knowledge
29
MindNet -- “Going to the birds”
chicken
Is_a
clean
Is_a
smooth
Is_a
poultry
Means
chatter
hen
Is_a
gaggle
Is_a
Is_a
beak
strike
Is_a
bill
egg
bird
sound
Is_a
goose
Part_of
face
Locn_of
feather
wing
Is_a
claw
hawk
Is_a
Is_a
Is_a
Typ_obj
leg
turtle
mouth
limb
Is_a
Part
Typ_subj_of
Is_a
creature
Is_a
Part
fly
Typ_subj
plant
Not_is_a
Is_a
Is_a
Typ_subj_of
Means
meat
Is_a
Is_a
Typ_obj
keep
Typ_0bj_of
Purpose
animal
Typ_subj
make
peck
duck
Cause
quack
Is_a
Typ_obj
Quesp
Typ_obj
Typ_subj
preen
supply
Purpose
catch
Is_a
opening
arm
Changing balance between
user & software systems

Yesterday:
– Applications were single programs running in
isolation
– Users used to (more or less) understand systems
that they used

Today:
– Componentized applications operate in concert
– Sophisticated users understand only small
percentage of systems they use
31
Tomorrow’s Systems and
Applications

Users will not be able to predict
– where computations will be performed,
– when they will be performed or
– by what software components

Gap between system capabilities and
user understanding will grow to the
point that the only way user will be able
to use system is through assisting
agents
32
Examples of user agents &
implicit actions

Lumiere (Office 97)
– Monitoring user and program events to
provide user help and assistance

Implicit queries
– Inferring information needs from browsing

Lookout/SpamKiller
– Monitoring mail activity to auto-categorize it
33
User Modeling

Models of a user’s informational goals
– User’s query (when available…)
– User’s background
– Acute and long-term search activity
– Acute actions with objects and documents
– Program data structures

Explicit and implicit information access
and display
Outline (ambitious!)





Microsoft Research (census)
Tele-Presentations (Gordon Bell, Jim Gemmell)
Microsoft Research initiative on Telepresence
What if you could record everything you see &
hear?
The architecture revolution:
processing moves to transducers
35
Some Tera-Byte Databases







The Web: 1 TB of HTML
TerraServer 1 TB of images
Several other 1 TB (file) servers
Hotmail: 7 TB of email
Sloan Digital Sky Survey:
40 TB raw, 2 TB cooked
EOS/DIS (picture of planet each week)
Mega
Federal Clearing house: images of checks
Exa
– 15 PB by 2007
– 15 PB by 2006 (7 year history)

Kilo
Nuclear Stockpile Stewardship Program
– 10 Exabytes (???!!)
Giga
Tera
Peta
Zetta
Yotta
36
Info Capture



You can record
everything you see
or hear or read.
What would you do
with it?
How would you
organize & analyze
it?
A letter
A novel
Kilo
Mega
A
Movie
Giga
Library of
Congress (text)
Tera
LoC (image)
Peta
All Disks
Exa
Video
8 PB per lifetime (10GBph)
All Tapes
Audio
30 TB (10KBps)
Read or write: 8 GB (words)
Zetta
See: http://www.lesk.com/mlesk/ksg97/ksg.html
Yotta
37
A novel
Kilo
A letter
Mega
Library of
Congress (text)
LoC
(sound +
cinima)
All Disks
All Tapes
Giga
Tera
Peta
Exa
A
Movie
LoC
(image)
All
Photos
Zetta
All Information!
Yotta
38
Michael Lesk’s Points
www.lesk.com/mlesk/ksg97/ksg.html

Soon everything can be recorded and kept

Most data will never be seen by humans

Precious Resource: Human attention
Auto-Summarization
Auto-Search
will be a key enabling technology.
39
Outline (ambitious!)





Microsoft Research (census)
Tele-Presentations (Gordon Bell, Jim Gemmell)
Microsoft Research initiative on Telepresence
What if you could record everything you see &
hear?
The architecture revolution:
processing moves to transducers
40
Put Everything
in Future (Disk) Controllers
(it’s not “if”, it’s “when?”)
Acknowledgements:
Dave Patterson explained this to me a year ago
Kim Keeton
Erik Riedel
Helped me sharpen
these arguments
Catharine Van Ingen
41
Remember Your Roots
42
Kilo
Mega
Giga
Tera
Peta
Exa
Technology Drivers: Disks




Disks on track
100x in 10 years
2 TB 3.5” drive
Shrink to 1” is 200GB
Disk replaces tape?
Zetta
Yotta

Disk is super
computer!
43
Data Gravity
Processing Moves to Transducers



Move Processing to data sources
Move to where the power (and sheet metal) is
Processor in
– Modem
– Display
– Microphones (speech recognition)
& cameras (vision)
– Storage: Data storage and analysis
44
It’s Already True of Printers
Peripheral = CyberBrick


You buy a printer
You get a
– several network interfaces
– A Postscript engine




cpu,
memory,
software,
a spooler (soon)
– and… a print engine.
45
All Device Controllers will be Cray 1’s

TODAY
– Disk controller is 10 mips risc engine
with 2MB DRAM
– NIC is similar power

SOON
– Will become 100 mips systems
with 100 MB DRAM.

Central
Processor &
Memory
They are nodes in a federation
(can run Oracle on NT in disk
controller).

Advantages
–
–
–
–
–
Uniform programming model
Great tools
Security
economics (CyberBricks)
Move computation to data (minimize
traffic)
Tera Byte
Backplane
46
Basic Argument for x-Disks

Future disk controller is a super-computer.
– 1 bips processor
– 128 MB dram
– 100 GB disk plus one arm

Connects to SAN via high-level protocols
– RPC, HTTP, DCOM, Kerberos, Directory Services,….
– Commands are RPCs
– Management, security,….
– Services file/web/db/… requests
– Managed by general-purpose OS with good dev
environment

Apps in disk saves data movement
– need programming environment in controller
47
The Slippery Slope
Nothing =
Sector Server
If you add function to server
 Then you

add more function to server

Function gravitates to
data.
Everything =
App Server
48
Why Not a Sector Server?
(let’s get physical!)


Good idea, that’s what we have today.
But
– cache added for performance
– Sector remap added for fault tolerance
– error reporting and diagnostics added
– SCSI commends (reserve,.. are growing)
– Sharing problematic (space mgmt, security,…)

Slipping down the slope to a 2-D block server
49
Why Not a 1-D Block Server?
Put A LITTLE on the Disk Server

Tried and true design
– HSC - VAX cluster
– EMC
– IBM Sysplex (3980?)

But look inside
– Has a cache
– Has space management
– Has error reporting & management
– Has RAID 0, 1, 2, 3, 4, 5, 10, 50,…
– Has locking
– Has remote replication
– Has an OS
– Security is problematic
– Low-level interface moves too many bytes
50
Why Not a 2-D Block Server?
Put A LITTLE on the Disk Server

Tried and true design
– Cedar -> NFS
– file server, cache, space,..
– Open file is many fewer msgs

Grows to have
– Directories + Naming
– Authentication + access control
– RAID 0, 1, 2, 3, 4, 5, 10, 50,…
– Locking
– Backup/restore/admin
– Cooperative caching with client

File Servers are a BIG hit: NetWare™
– SNAP! is my favorite today
51
Why Not a File Server?
Put a Little on the Disk Server

Tried and true design
– Auspex, NetApp, ...
– Netware

Yes, but look at NetWare
– File interface gives you app invocation interface
– Became an app server

Mail, DB, Web,….
– Netware had a primitive OS

Hard to program, so optimized wrong thing
52
Why Not Everything?
Allow Everything on Disk Server
(thin client’s)

Tried and true design
– Mainframes, Minis, ...
– Web servers,…
– Encapsulates data
– Minimizes data moves
– Scaleable


It is where everyone ends up.
All the arguments against are short-term.
53
The Slippery Slope
Nothing =
Sector Server
If you add function to server
 Then you

add more function to server

Function gravitates to
data.
Everything =
App Server
54
Disk = Node




has magnetic storage (100 GB?)
has processor & DRAM
has SAN attachment
has execution
Applications
environment
Services
DBMS
RPC, ...
File System
SAN driver
Disk driver
OS Kernel
55
Technology Drivers:
System on a Chip

Integrate Processing with memory on one chip
– chip is 75% memory now
– 1MB cache >> 1960 supercomputers
– 256 Mb memory chip is 32 MB!
– IRAM, CRAM, PIM,… projects abound

Integrate Networking with processing on one
chip
– system bus is a kind of network
– ATM, FiberChannel, Ethernet,.. Logic on chip.
– Direct IO (no intermediate bus)

Functionally specialized cards shrink to a chip.
56
Technology Drivers: What if
Networking Was as Cheap As Disk IO?
 Disk
 TCP/IP
– Unix/NT
100% cpu @ 40MBps
– Unix/NT
8% cpu @ 40MBps
Why the Difference?
Host does
TCP/IP packetizing,
checksum,…
flow control
small buffers
Host Bus Adapter does
SCSI packetizing,
checksum,…
flow control
57
DMA
Technology Drivers:
The Promise of SAN/VIA:10x in 2 years
http://www.ViArch.org/

Today:
– wires are 10 MBps (100 Mbps Ethernet)
– ~20 MBps tcp/ip saturates 2 cpus
– round-trip latency is ~300 us

In the lab
– Wires are 10x faster Myrinet, Gbps Ethernet,
ServerNet,…
– Fast user-level communication


tcp/ip ~ 100 MBps 10% of each processor
round-trip latency is 15 us
58
SAN:
Standard Interconnect
Gbps Ethernet: 110 MBps

PCI: 70 MBps


UW Scsi: 40 MBps

LAN faster than
memory bus?
1 GBps links in
lab.
100$ port cost
soon
Port is computer
FW scsi: 20 MBps
scsi: 5 MBps
59
Technology Drivers:
100 GBps Ethernet replaces SCSI

Why I love SCSI
– Its fast (40MBps)
– The protocol uses little processor power

Why I hate SCSI
– Wires must be short
– Cables are pricey
– pins bend
60
Functionally Specialized Cards

Storage
P mips processor
ASIC
Today:
P=50 mips

Network
M MB DRAM
M= 2 MB
In a few years
ASIC
P= 200 mips
M= 64 MB

Display
ASIC
61
Technology Drivers
Plug & Play Software

RPC is standardizing: (DCOM, IIOP, HTTP)
– Gives huge TOOL LEVERAGE
– Solves the hard problems for you:
 naming,
 security,
 directory service,
 operations,...

Commoditized programming environments
–
–
–
–


FreeBSD, Linix, Solaris,…+ tools
NetWare + tools
WinCE, WinNT,…+ tools
JavaOS + tools
Apps gravitate to data.
General purpose OS on controller runs apps.
62
Basic Argument for x-Disks

Future disk controller is a super-computer.
– 1 bips processor
– 128 MB dram
– 100 GB disk plus one arm

Connects to SAN via high-level protocols
– RPC, HTTP, DCOM, Kerberos, Directory Services,….
– Commands are RPCs
– management, security,….
– Services file/web/db/… requests
– Managed by general-purpose OS with good dev
environment

Move apps to disk to save data movement
– need programming environment in controller
63
Summary





Microsoft Research (census)
Tele-Presentations (Gordon Bell, Jim Gemmell)
Microsoft Research initiative on Telepresence
What if you could record everything you see &
hear?
The architecture revolution:
processing moves to transducers
64
Download