Ambiguous Keyboards 4/8/2010 -- Nate Bodenstab • Text Entry for Mobile Computing: Models and Methods, Theory and Practice. MacKenzie and Soukoreff. Human-Computer Interaction, 2002. 17:147198 • Optimal Character Arrangements for Ambiguous Keyboards. Lesher, Moulton, and Higginbotham. IEEE Rehabilitation Engineering, 1998. • The One-Key Challenge: Searching for a Fast OneKey Text Entry Method. Scott MacKenzie. ACM ASSETS, 2009. The Problem • What is the most effective method of text input? Given constraints? – Users lacking fine motor coordination to use standard keyboard – Physical device too small for full keyboard (mobile phones) • Full Keyboard (last week) – How many keys? What arrangement? Word/Char prediction? • Reduced Keyboard – How many keys? What arrangement? Disambiguation? • Stylus or Pointer – How are letters displayed and chosen? Soft Keyboard? • Speech – MacKenzie says no, but 300 WPM is tempting. – Privacy and convenience concerns. Telepathy? BCI? Big Ideas • MacKenzie and Soukoreff (2002) – Good intro and lit review of text input on mobile devices. Asks the right questions. – Present model of text input speed given keyboard layout – No mention of AAC • Lesher et al. (1998) – Model the “cost” of placing 2+ characters on the same key – Can evaluate key arrangements very quickly – Assumes character prediction model. • MacKenzie (2009) – Single button input device – Scanning ambiguous soft keyboard with word disambiguation – Four virtual keys “optimal” with small vocabulary (~9,000 words) • No one talks about spelling correction • AAC and mobile device communities should share more Two Important Groups Expert Novice Paper #1: MacKenzie and Soukoreff (2002) Text Entry for Mobile Computing: Models and Methods, Theory and Practice. MacKenzie and Soukoreff (2002) • Can text entry rates improve with… – Button arrangement? – Character or word prediction? – Different interface or modality? • Proper evaluation is critical – – – – Reproducible and generalizable (standard science) Spontaneous text generation (not text copy) Novice vs. Expert users Human trials to measure speed, accuracy, fatigue, subjective impressions of actual usage (not optimal number of clicks) • Pen or Keyboard: “which will prevail?” – Keyboard winning in 2010 – Pen has been replaced by finger (iPhone) HP95LX (1991) Apple Message Pad (1993) Palm Pilot (1996) PageWriter (2000) Cheap phone, BlackBerry, and iPhone (2008) 3-key text entry (left, right, select) 5-key text entry (up, down, left, right, select) 12-key text entry • Multi-tap – “lazy” = 5-5-5 2 9-9-9-9 # 9-9-9 – Time-out or “done” key • Two-key method – “lazy” = 5-3 2-1 9-4 9-3 • One-key w/ disambiguation (T9) – “lazy” = 5 2 9 9 # – first guess was “jazz” • WordWise (1=shift key) – “lazy” = 1-5 2 9 1-9 – 1- or 2-key w/ disambiguation Two Important Groups • Why does T9 use a unigram language model? – Uni-gram disambiguation accuracy ~95% – Bi-gram disambiguation accuracy ~97% • Novice users would see a big accuracy increase • Expert users memorize the predicted position of ambiguous words and can text without looking – “book” vs “cool” Expert Novice …although it looks like novice users are winning out in the current release (class bi-gram, recent history, etc) 18- (or 20-) key text entry • Not mentioned by MacKenzie (not invented yet) • Qwerty keyboard layout • Two letters per key • Much better disambiguation Other Ideas … Half-Qwerty keyboard One handed keyboard (Good for one-handed typing, not really for mobile devices) Other Ideas … Fold-up keyboard Chorded Keyboard MacKenzie and Soukoreff (2002) • Stylus / Pointer / Finger Input – Handwriting Recognition • Pretty accurate today, but human handwriting rate is slow (~15 WPM) • Letter input: Speed vs. Familiarity – Gesture-Based Text Input • Can be faster that letter symbols, but easy to learn? Intuitive? Unistroke (1993) – fast; more accurate; hard to learn Graffiti (1996) – Palm’s magic bullet Jot (1998) – slow; less accurate; easy to learn Convergence to Jot-like input. Graffiti 2 (2003) based on Jot. Two Important Groups Unistroke Jot Gesture-Based Text Input Cirrin (<15 WPM) Quikwriting (<15 WPM) similar to 2-key method Gesture-Based Text Input Dasher (20-30 WPM) T-Cube (21 WPM) MacKenzie and Soukoreff (2002) • Movement Minimization (WPM model) – Predicts an estimate of how efficient a keyboard arrangement will be – Model components: • Physical distance from key1 to key2 • Reaction time to find a key (expert=0) • Weighted average over all possible two-letter sequences given frequency in English – Implicit assumption of single pointer input • What about two fingers? Five fingers? – Implicit assumption of 100% accuracy (no modeling of deletions) Soft Keyboards • Brian talked about soft keyboards last week – MacKenzie and Soukoreff (2002) present a number of keyboard configurations and predicted WPM using their model – Not based on actual use! – How is distance computed on hex grid? Is this an unfair advantage? OPTI 2 Metropolis 2 Other Ideas… Projected Keyboard Ambiguous Soft (Full) Keyboard (still problems with fitting full keyboard onto mobile device) Paper #2: Lesher et al. (1998) Optimal Character Arrangements for Ambiguous Keyboards Lesher et al. (1998) Lesher et al. (1998) • Maximize power of character disambiguation (on 9 keys) – Not worrying about minimizing finger movement, user’s previous knowledge of alphabetic or qwerty arrangement, etc. • Confusion Matrix of “costs” to place two characters on same key, estimated from corpus • Can compute effectiveness of an ambiguous keyboard layout given this matrix • Still 3.0 x 1015 possible layouts… • Use “n-optimization” to find good layout – Poor man’s Genetic Algorithm • Claims to find the globally optimal solution … because convergence is so fast • Assumes we’re doing character prediction (costs matrix only works for char prediction), but word prediction is much more effective • No real human trials, results based on optimal key presses per character • Amazing that TOC does well (or others do poorly compared to TOC) • What about the standard 9-key phone arrangement? Paper #3: MacKenzie (2009) The One-Key Challenge: Searching for a Fast One-Key Text Entry Method MacKenzie (2009) • Soft keyboard with single button input – Requires scanning – Scanning unambiguous layout is slow (ie. row column scanning) • Borrows from the ambiguous keyboard literature – – – – How many soft keys? What letter arrangement? Word selection / disambiguation? Deletion and correction? • Not as many references to AAC literature… • Find optimal number of buttons and alphabet split by counting required button presses on a test corpus MacKenzie (2009) • SAK: What it does – Allows “multiple-selection” • press select key multiple times during a single scan interval • SAK does not revert to home position immediately after select key is pressed – – – – – Vocabulary size is only 9,025 words and still need 20+ word candidates Word deletion by long key-press (not really binary anymore) Faster scanning interval != faster text input ~5 WPM standard set-up ~8 WPM with “timer restart on selection” • Not sure about this one … needs comparison with immediate position restart • SAK: What it doesn’t do – – – – Disambiguation only allowed at end of word Why limit to alphabetic ordering? What about dynamic letter arrangement? Should have run parallel human trails with other methods (ie. row column scanning, Koester’s model, etc). Instead, MacKenzie just sites results from other papers with different users, setup, etc. thanks.