Pen-Centric Shorthand - Seidenberg School of Computer Science

advertisement
Pen-Centric
Shorthand Interfaces
Charles C. Tappert
Seidenberg School of CSIS, Pace University
1
Themes of Presentation



Online Handwriting Recognition and Pen
Computing Tutorial
Historical Research – undertaken for the
Palm-Xerox Patent Infringement Lawsuit
Recent Research - Enhanced Pen-Centric
Shorthand Interfaces can have benefits

DPS dissertation could extend M.S. thesis
2
Enhanced Pen-Centric
Shorthand Interfaces




Can use word/phrase shorthand to speed
text input
Can provide critical infrastructure for many
pen-centric applications
Can enhance natural pen-centric
interactions for many applications
Will have greatest impact on the utility of
applications running on small mobile devices
3
Part 1: Online (Pen-Centric)
Handwriting Recognition








Written Languages and Handwriting Properties
The Fundamental Property of Writing
Handwriting Recognition Difficulties
Online (Pen-Centric) Handwriting Recognition
Online more accurate than Offline Recognition
Online Info Can Complicate Recognition Process
Design Tradeoffs / Design Decisions
Computer Problems in English
4
Written Language
and Handwriting Properties

Alphabet


Letters, digits, punctuation, special symbols
Writing is a time sequence of strokes



Stroke – writing from pen down to pen up
Usually complete one character before
beginning the next
Spatial order – e.g., in English left to right
5
Fundamental Property of Writing



Differences between different characters
are more significant than differences
between different drawings of the same
character
This makes handwritten communication
possible
Can there be exceptions – say, different
characters written identically?
6
Fundamental Property of Writing
in English



Property holds within subalphabets of uppercase,
lowercase, and digits, but not across them
“I”, “l”, and “1” written with single vertical stroke
“O” and “0” written similarly with an oval
7
Handwriting Recognition Difficulties



Shape, size, and slant variation
Similarly shaped characters – U and V
Careless writing


in the extreme, almost illegible writing
Resolving difficult ambiguities requires
sophisticated recognition algorithms,
syntax/semantics
8
Online (Pen-Centric)
Handwriting Recognition

Electronic tablets invented in late 1950s


Digitizer and display in separate surfaces
Pen Computers arrived in 1980s




Combined digitizer and display
Brought input and output into one surface
Immediate feedback via electronic ink
Created paper-like interface
9
Tablet Digitizers – Dynamic Information


Pen down – indication of inking
X-Y coordinates as function of time


Sampling rate: 100 points per second
Resolution: 200 points per inch
10
Early Pen-Centric Interface


Different surfaces for
input and output
Rand system, about 1959
11
Pen Computers

IBM vision


Paper-like interface, 1992
Microsoft Tablet PC

Launched, 2001
12
Pen-Centric PDAs


Early Palm Pilot
Palm Tungsten T3 and
Sony Clié TH55
13
Online (Pen-Centric)
Handwriting Recognition


Machine recognizes the writing as the user writes
Digitizer equipment captures the dynamic
information of the writing


Stroke number, order, direction, speed
A stroke is the writing from pen down to pen up
14
Online (Pen-Centric) more accurate
than Offline (Static) Recognition


Can use both dynamic and static information
Can often distinguish between similarly shaped
characters
 E.g., 5 versus S where the 5 is usually written
with two strokes and the S with one stroke
15
Online Information Can Complicate
Recognition Process

Large number of possible variations


E can be written with one, two, three, or four
strokes, and with various stroke orders and directions
A four-stroke E has 384 variations (4! stroke orders x
24 stroke directions)
16
Online Information Can Complicate
Recognition Process

Other variations
17
Online Information Can Complicate
Recognition Process

Segmentation ambiguities


character-within-character problem
lowercase d might be recognized as a cl if drawn
with two strokes that are somewhat separated
from one another
18
Design Tradeoffs/Decisions

No constraints on the user


Machine recognizes user's normal writing
User severely constrained


Must write in particular style such as handprint
Must write strokes in particular order,
direction, and graphical specification
19
English Writing Styles

Handprint



Uppercase – about 2 strokes per letter
Lowercase- about 1 stroke per letter
Cursive Script


Usually less than 1 stroke per letter
Delayed crossing and dotting strokes
20
Computer Problems in English

Constrained Handprint



Unconstrained Handprint



Printing one symbol per box – form filling
Printing on lines – symbols can touch or
overlap
No lines and symbols can touch or overlap
Cursive Script
Mixed Printing and Cursive
21
Computer Problems in English
22
Pencept Commercial Product 1980s
23
Part 2
Shorthand in Pen-Centric PDAs





Famous Uses of Shorthand
Historical Shorthand Alphabets
Pen-Centric Shorthand Alphabets
Pen-Centric Word/Phrase Shorthand
Allegro/Chatroom Shorthand System

M.S. thesis that could be extended into a
DPS dissertation
24
Background

Famous writings throughout history were
effectively written in a style of shorthand





Cicero’s orations
Martin Luther’s sermons
Shakespeare’s and George Bernard Shaw’s
plays
Samuel Pepys’ diary
Sir Isaac Newton’s notebooks
25
Historical Shorthand Alphabets



We first review the history of shorthand
systems prior to pen computing
Shorthand is “a method of writing rapidly
by substituting characters, abbreviations,
or symbols for letters, words, or
phrases”
Shorthand can be traced back to the
Greeks in 400 B.C.
26
Historical Shorthand Alphabets


We focus on shorthand alphabets that
might be appropriate for PDAs
We review two types of shorthand

Geometric shorthand



Small number of basic shapes
Shapes reused in multiple orientations
Non-geometric shorthand shorthand
27
Historical Shorthand Alphabets





Ancient Greeks – 400 BC
Tironian Alphabet – 63 BC
John Willis’s Stenography – 1602
Gabelsberger Alphabet – 1834
Moon Alphabet – 1845
28
Tironian Alphabet, 63 B.C.
Non-Geometric
29
Stenography Alphabet, 1602
30
Stenography Alphabet, 1602

Basic Shapes and Orientations
31
Gabelsberger Cursive-Style, 1834
Non-Geometric Alphabet
32
Moon Geometric Alphabet, 1845
33
Other Historical Shorthand Systems

Phonetic alphabets



Pitman (1837), was popular in UK
Gregg (1888), was popular in USA
Systems for the blind

Braille (1821)
34
Pen-Centric Shorthand Alphabets

Some of the earliest were for CAD/CAM



symbols represent graphical items and
commands
Others developed for text input on small
consumer devices like PDAs that have
limited computing power
We review geometric and non-geometric
shorthands appropriate for small devices
35
Pen-Centric Shorthand Alphabets

Historical alphabets presented above could
be used for machine recognition
symbols drawn with a single stroke
(except “K” in Tironian and “+” in Stenography)


In addition to shape and orientation,
online systems can use stroke direction to
differentiate among symbols
36
Pen-Centric Shorthand Alphabets

Geometric Pen-Centric Shorthands




Organek – 1991
Allen – filed 1991, patent 1993
Goldberg (Xerox) – filed 1993, patent 1997
Non-Geometric Pen-Centric Shorthands


Graffiti (Palm Computing) – 1995
Allegro (Papyrus) – 1995
37
Organek Alphabet, 1991
38
Organic Alphabet, 1991
Basic Shapes and Orientations
One shape in 4 orientations.
This gives 8 directions that
together with 3 lengths
provide 24 symbols.
A second wheel provides
additional symbols.
39
Allen patent, filed 1991
40
Allen patent, filed 1991
Basic Shapes and Orientations
41
Goldberg patent, filed 1993
(“unistroke symbols”)
42
Goldberg patent, filed 1993
Basic Shapes and Orientations
43
Goldberg patent, filed 1993

5 Basic shapes
4 Orientations
2 Stroked Directions
40 Possible Symbols
Designed for Speed of Input and
Maximum Symbol Separation
44
Shorthand Alphabet Design


How would you design a shorthand
alphabet?
What would be the design criteria?
45
Design of Graffiti Alphabet
for the Palm Pilot

Small alphabet



Uppercase, digits, special symbols
One stroke per symbol to avoid
segmentation difficulty
Separate writing areas for letters and
digits to avoid same-shape confusions
46
Graffiti Alphabet, 1995
47
Graffiti Mimics Keyboard Input


Character by character input
Mode shifts for



Uppercase
Special characters
Eyes can focus on application’s insertion
point rather than on input area
48
Graffiti Alphabet Design

What was the additional design criterion?
49
Graffiti Alphabet Design

Designed for ease of learning


20 letters exactly match the Roman
alphabet
6 remaining ones match partially
50
Graffiti Alphabet: 11 of 26 characters
have alternate variations
51
Frequently Confused Characters
52
Other Low Performance Characters
53
Symbol Overlap Comparison
54
Graffiti Recognition Accuracy Study
55
Allegro Alphabet (Papyrus), 1995
(now Microsoft)
56
Simplified Design Tradeoffs/Decisions
for Graffiti and Allegro PDA Alphabets

Small alphabet


One stroke per character (character = stroke)


preferably only one
Separate writing areas for letters and digits


allows machine to recognize each character upon pen lift
Small number of writing variations per letter


one case rather than both upper and lowercase
avoids confusion of similarly shaped letters and digits
High correspondence to Roman alphabet for ease
of learning

non-geometric, might not actually qualify as shorthand
57
Commercially Successful Shorthands

Similar to the Roman alphabet

Easy to learn
Graffiti used in Palm OS devices
 notably the Palm Pilot & Handspring models

Allegro used in Microsoft Windows devices


Geometric alphabets not successful
58
Current Commercial Systems
Company/System
Writing Style
Palm Computing/Graffiti*
Special Shorthand Alphabet
Microsoft/Papyrus Allegro
Special Shorthand Alphabet
CIC/Jot
Relatively Unconstrained Handprint
Microsoft
Relatively Unconstrained Handprint
and Cursive
*A few years ago Palm switched from Graffiti to Graffiti2,
Graffiti2 is basically Jot licensed from CIC.
59
Jeff Hawkins, innovator












1979 BSEE Cornell, 1979-1986 worked at Intel and GRiD
1986-1987 ABD BioPhysics doctoral program, U.C. Berkeley
1987- back at GRiD he created GRiDPAD, first pen computer
1992 formed Palm Computing, 1993 created first handwriting reco software
product for a mobile handheld - Casio’s Zoomer
1995 Palm Computing bought by U.S. Robotics
1996 created PalmPilot, first PDA with Graffiti shorthand alphabet (over a
million shipped in 18 months, a 66% market share, and the fastest growth
of any computing product in history, faster than the TV and the VCR)
1997 U.S. Robotics bought by 3Com (sued by Xerox for patent infringement)
1998 left Palm to form Handspring, 1999 launched the Visor handheld
2000 Palm Computing spun off by 3Com
2002 created what is now the Redwood Center for Theoretical Neuroscience
2003 Handspring (with Hawkins, et al.) acquired by Palm Computing
2005 Founded Numenta to build the ultimate brain-like machine
60
Palm-Xerox
Patent Infringement Lawsuit



The nine-year old battle between Palm and Xerox over
handwriting recognition ends in 2006, see article.
Palm pays Xerox $22.5 million for a fully paid-up license
for Xerox patents covering its text input Unistrokes
technology
Xerox first sued Palm predecessor Palm Computing back
in April 1997, claiming that the Graffiti text-entry system
used in its PDAs infringed on patents for Unistrokes,
which allows users to input letters and numbers into
personal data units with basic, one stroke movements.
61
Conclusions
Palm-Xerox Patent Infringement Lawsuit

Invalidity



Infringement


Historical research showed that Goldberg alphabet
not so unique
Even though the patent was accepted as valid,
these arguments narrowed the scope of the patent
Analyses and comparisons of the Goldberg and
Graffiti alphabets showed major differences
Result was favorable settlement for Palm
62
Pen-Centric Word/Phrase Shorthand
such as Chatroom Shorthand


Further increase speed of text entry
Potential applications
Where input speed important
 Where word/phrase abbreviations
occur frequently – e.g., email

63
Chatroom Shorthand Examples
CU
CM
@TEOTD
^5
2nite
LOL
ASAP
B/C or BC
See you, or Cracking up
Call me
At the end of the day
High five
Tonight
Laughing out loud
As soon as possible
Because
64
Allegro/Chatroom Shorthand System

Developed for M.S. dissertation


Student was hearing impaired
Developed as output component of
communication system


Handwriting to text to speech
Two input writing areas


One for Allegro (all-purpose)
One for chatroom-like or user-defined words/phrases
65
Allegro/Chatroom Shorthand System
Stroke acquisition GUI
a single stroke
is it
word/phrase
character
allegro stroke
library
allegro stroke
recognition
other stroke
recognition
alphabet
user-defined
stroke library
meaning
sentence accumulator
done?
no
yes
Sentence display
and spoken output
66
Allegro/Chatroom Shorthand System
67
Allegro/Chatroom Shorthand System
M.S. Thesis Experimental Results

Allegro/Chatroom pen-centric
shorthand input is faster than typing
text and is comparable to typing text
and chatroom shorthand characters
68
Conclusions
Pen-Centric Shorthands


Pen-centric interfaces should use shorthand,
and especially word/phrase shorthand where
appropriate, for fast text input
Benefit of shorthand interfaces



Provides critical infrastructure for many pen-centric
applications
Enhances natural pen-centric interactions for many
applications
Has greatest impact on the utility of applications
running on small mobile devices
69
Conclusions
Handwriting Recognition


Graffiti and Allegro greatly simplified the
recognition problem
Handprint problem not completely solved


Even with IBM’s ThinkWrite, CIC’s Jot, and
Microsoft products
Cursive script problem clearly not solved
70
References





W.B. Huber, S.-H. Cha, C.C. Tappert, and V.L. Hanson, "Use of
Chatroom Abbreviations and Shorthand Symbols in Pen Computing,"
Proc. 9th Int. Workshop on Frontiers in Handwriting Recognition,
IWFHR 2004, Tokyo, Japan, October 2004.
W. Huber, V. Hanson, S. Cha, and C.C. Tappert, "Common Chatroom
Abbreviations Speed Pen Computing," Proc. 11th Int. Conf. HumanComputer Interaction, Las Vegas, NV, July 2005.
C.C. Tappert and S. Cha, "Handwriting Recognition Interfaces," Chapter
6, pp. 123-137, in Text Entry Systems, Scott MacKenzie and Kumiko
Tanaka-Ishii (Eds.), Morgan Kaufmann, 2007.
C.C. Tappert, C.Y. Suen, and T. Wakahara, "The state-of-the-art in online handwriting recognition," IEEE Trans. Pattern Analysis Machine
Intelligence, Vol. PAMI-12, pp. 787-808, August 1990.
C.C. Tappert and J.R. Ward, "Pen-Centric Shorthand Handwriting
Recognition Interfaces," Proc. 1st Int. Workshop on Pen-Based
Learning Technologies, Catania, Italy, May 2007.
71
Download