Interaction Devices Human Computer Interaction CIS 6930/4930 Section 4188/4186 Interaction Performance ► 60s vs. Today Performance ► Hz -> GHz Memory ►k -> GB Storage ►k -> TB Input ► punch cards -> ► Keyboards, Pens, tablets, mobile phones, mice, digital cameras, web cams Output ► 10 character/sec ► Megapixel displays, color laser, surround sound, force feedback, VR ► Substantial bandwidth increase! Interaction Performance ► Future? Gestural input Two-handed input 3D I/O Others: voice, wearable, whole body, eye trackers, data gloves, haptics, force feedback Engineering research! Entire companies created around one single technology ► Current trend: Multimodal (using car navigation via buttons or voice) Helps disabled (esp. those w/ different levels of disability) Keyboard and Keypads ► QWERTY keyboards been around for a long time (1870s – Christopher Sholes) Cons: Not easy to learn Pros: Familiarity Stats: ► Beginners: 1 keystroke per sec ► Average office worker: 5 keystrokes (50 wpm) ► Experts: 15 keystrokes per sec (150 wpm) ► Is it possible to do better? Suggestions? Keyboard and Keypads ► ► Look at the piano for possible inspiration Court reporter keyboards (one keypress = multiple letters or a word) 300 wpm, requires extensive training and use ► Keyboard properties that matter Size large - imposing for novices, appears more complex ► mobile devices ► Adjustable ► Reduces RSI, better performance and comfort Mobile phone keyboards, blackberry devices, etc. ► QWERTY Keyboard Layouts Frequently used pairs far apart Fewer typewriter jams Electronic approaches don’t jam.. why use it? ► DVOARK (1920s) ► 150 wpm->200 wpm Reducing errors Takes about one week to switch Stops most from trying ABCDE – style Easier for non-typists Studies show no improvement vs. QWERTY ► Number pads What’s in the top row? Look at phones (slight faster), then look at calculators, keypads ► Those for disabled Split keyboards KeyBowl’s orbiTouch (screenshot) Eyetrackers, mice Dasher - 2d motion with word prediction Keys ► Current keyboards have been extensively tested ► ► Size Shape Required force Spacing Speed vs. error rates for majority of users Distinctive click gives audio feedback Why membrane keyboards are slow (Atari 400?) Environment hazards might necessitate ► Usually speed is not a factor ► Keys Guidelines ► ► ► ► ► Special keys should be denoted State keys (such as caps, etc.) should have easily noted states Special curves or dots for home keys for touch typists Inverted T Cursor movement keys are important (though cross is easier for novices) Auto-repeat feature Improves performance, but only if repeat is customizable (motor impaired, young, old) ► Two thinking points: Why are home keys fastest to type? Why are certain keys larger? (Enter, Shift, Space bar) ► This is called Fitt’s Law Keypads for small devices ► ► ► ► ► ► PDAs, Cellphones, Game consoles Fold out keyboards Virtual keyboard Cloth keyboards (ElekSen) Haptic feedback? Mobile phones Combine static keys with dynamic soft keys Multi-tap a key to get to a character Study: Predictive techniques greatly improve performance Ex. LetterWise = 20 wpm vs 15 wpm multitap ► Draw keyboard on screen and tap w/ pen Speed: 20 to 30 wpm (Sears ’93) ► Handwriting recognition (still hard) Subset: Graffiti2 (uses unistrokes) Pointing Devices Direct manipulation needs some pointing device ► Factors: ► Size of device Accuracy Dimensionality ► Interaction Tasks: Select – menu selection, from a list Position – 1D, 2D, 3D (ex. paint) Orientation – Control orientation or provide direct 3D orientation input Path – Multiple poses are recorded ► ex. to draw a line Quantify – control widgets that affect variables Text – move text ► ► Faster w/ less error than keyboard Two types (Box 9.1) Direct control – device is on the screen surface (touchscreen, stylus) Indirect control – mouse, trackball, joystick, touchpad Direct-control pointing ► First device – lightpen Point to a place on screen and press a button Pros: ► ► Easy to understand and use Very fast for some operations (e.g. drawing) Cons: ► ► ► ► Hand gets tired fast! Hand and pen blocks view of screen Fragile Evolved into the touchscreen Pros: Very robust, no moving parts Cons: Depending on app, accuracy could be an issue ► 1600x1600 res with acoustic wave Must be careful about software design for selection (land-on strategy). ► If you don’t show a cursor of where you are selecting, users get confused User confidence is improved with a good lift-off strategy Direct-control pointing ► Primarily for novice users or large user base ► Case study: Disney World ► Need to consider those who are: disabled, illiterate, hard of hearing, errors in usage (two touch points), etc. Indirect-Control Pointing ► Pros: Reduces hand-fatigue Reduces obscuration problems ► Cons: Increases cognitive load Spatial ability comes more into play ► Mouse Pros: ► ► ► ► ► Familiarity Wide availability Low cost Easy to use Accurate Cons: ► ► ► ► Time to grab mouse Desk space Encumbrance (wire), dirt Long motions aren’t easy or obvious (pick up and replace) Consider, weight, size, style, # of buttons, force feedback Indirect-Control Pointing ► Trackball Pros: ► Small physical footprint ► Good for kiosks ► Joystick Easy to use, lots of buttons Good for tracking (guide or follow an on screen object) Does it map well to your app? ► Touchpoint Pressure-sensitive ‘nubbin’ on laptops Keep fingers on the home position Indirect-Control Pointing ► Touchpad Laptop mouse device Lack of moving parts, and low profile Accuracy, esp. those w/ motor disabilities ► Graphics Tablet Screen shot comfort good for cad, artists Limited data entry Comparing pointing devices ► Direct pointing Study: Faster but less accurate than indirect (Haller ’84) ► ► ► ► Lots of studies confirm mouse is best for most tasks for speed and accuracy Trackpoint < Trackballs & Touchpads < Mouse Short distances – cursor keys are better Disabled prefer joysticks and trackballs If force application is a problem, then touch sensitive is preferred Vision impaired have problems with most pointing devices ► Use multimodal approach or customizable ► Read Vanderheiden ’04 for a case study ► ► cursors Designers should smooth out trajectories Large targets reduce time and frustration Example ► Five fastest places to click on for a righthanded user? Example ► What affects time? Fitts’s Law Paul Fitts (1954) developed a model of human hand movement ► Used to predict time to point at an object ► What are the factors to determine the time to point to an object? ► ► Just from your own experience, is this function linear? ► D – distance to target W – size of target No, since if Target A is D distance and Target B is 2D distance, it doesn’t take twice as long What about target size? Not linear there either MT = a + b log2(D/W + 1) a = time to start/stop in seconds (empirically measured per device) b = inherent speed of the device (empirically measured per device) Ex. a = 300 ms, b = 200 ms/bit, D = 14 cm, W = 2 cm ► Ans: 300 + 200 log2(14/2 + 1) = 900 ms Really a slope-intercept model Fitts’s Law ► MT = a + b log2(D/W + 1) a = time to start/stop in seconds (empirically measured per device) b = inherent speed of the device (empirically measured per device) Ex. a = 300 ms, b = 200 ms/bit, D = 14 cm, W = 2 cm ► Ans: 300 + 200 log2(14/2 + 1) = 900 ms Question: If I wanted to half the pointing time (on average), how much do I change the size? ► ► Proven to provide good timings for most age groups Newer versions taken into account Direction (we are faster horizontally than vertically) Device weight Target shape Arm position (resting or midair) 2D and 3D (Zhai ’96) Very Successfully Studied ► Applies to ► Limitations ► ► Feet, eye gaze, head mounted sights Many types of input devices Physical environments (underwater!) User populations (even retarded and drugged) Drag & Drop and Point & Click Dimensionality Software accelerated pointer motion Training Trajectory Tasks (Accot-Zhai Steering Law) Decision Making (Hick’s Law) Results (what does it say about) Buttons and widget size? Edges? Popup vs. pull-down menus Pie vs. Linear menus iPhone/web pages (real borders) vs. monitor+mouse (virtual borders) http://particletree.com/features/visualizing-fittss-law/ http://www.asktog.com/columns/022DesignedToGiveFitts.html http://www.yorku.ca/mack/GI92.html Interesting readings: Precision Pointing Movement Time ► Study: Sears and Shneiderman ’91 Broke down task into gross and fine components for small targets PPMT = a + b log2(D/W+1) + c log2(d/W) ►c – speed for short distance movement ► d – minor distance Notice how the overall time changes with a smaller target. ► Other factors Age (Pg. 369) ► Research: How can we design devices that produce smaller constants for the predictive equation Two handed Zooming Novel Devices ► Themes: Make device more diverse Users ► Task ► Improve match between task and device Improve affordance Refine input Feedback strategies ► Foot controls Already used in music where hands might be busy Cars Foot mouse was twice as slow as hand mouse Could specify ‘modes’ Novel Devices ► Eye-tracking Accuracy 1-2 degrees selections are by constant stare for 200-600 ms How do you distinguish w/ a selection and a gaze? Combine w/ manual input ► Multiple degree of freedom devices Logitech Spaceball and SpaceMouse Ascension Bird Polhemus Liberty and IsoTrack Novel Devices ► Boom Chameleon Pros: Natural, good spatial understanding Cons: limited applications, hard to interact (very passive) ► DataGlove Pinch glove Gesture recognition American Sign Language, musical director Pros: Natural Cons: Size, hygiene, accuracy, durability Novel Devices ► Haptic Feedback Why is resistance useful? SensAble Technology’s Phantom Cons: limited applications Sound and vibration are easier and can be a good approximation ► ► Rumble pack Two-Handed input Different hands have different precision Non-dominant hand selects fill, the other selects objects ► Ubiquitous Computing and Tangible User Interface Active Badges allows you to move about the house w/ your profile Which sensors could you use? Elderly, disabled Research: Smart House Myron Kruger – novel user participation in art (Lots of exhibit art at siggraph) Novel Devices ► Paper/Whiteboards Video capture of annotations Record notes (special tracked pens Logitech digital pen) ► Handheld Devices PDA Universal remote Help disabled ► Read LCD screens ► Rooms in building ► Maps Interesting body-context-sensitive. ► Ex. hold PDA by ear = phone call answer. Novel Devices ► Miscellaneous Shapetape – reports 3D shape. ► Tracks limbs ► Engineer for specific app (like a gun trigger connected to serial port) Pros: good affordance Cons: Limited general use, time Speech and Auditory Interfaces ► There’s the dream ► Then there’s reality ► Practical apps don’t really require freeform discussions with a computer Goals: ► Low cognitive load ► Low error rates ► Smaller goals: Speech Store and Forward (voice mail) Speech Generation Currently not too bad, low cost, available Speech and Auditory Interfaces ► ► ► ► Bandwidth is much lower than visual displays Ephemeral nature of speech (tone, etc.) Difficulty in parsing/searching (Box 9.2) Types ► Discrete-word recognition Continuous speech Voice information Speech generation Non-speech auditory If you want to do research here, lots of research in the audio, audio psychology, and DSP field you should understand Discrete-Word Recognition ► ► ► ► Individual words spoken by a specific person Command and control 90-98% for 100-10000 word vocabularies Training Speaker speaks the vocabulary Speaker-independent ► Still requires Low noise operating environment Microphones Vocabulary choice Clear voice (language disabled are hampered, stressed) Reduce most questions to very distinct answers (yes/no) Discrete-Word Recognition ► Helps: ► Disabled Elderly Cognitive challenged User is visually distracted Mobility or space restrictions Apps: Telephone-based info ► ► Study: much slower for cursor movement than mouse or keyboard (Christian ’00) Study: choosing actions (such as drawing actions) improved performance by 21% (Pausch ’91) and word processing (Karl ’93) However acoustic memory requires high cognitive load (> than hand/eye) ► ► Toys are successful (dolls, robots). Accuracy isn’t as important Feedback is difficult Continuous Speech Recognition ► ► ► ► Dictation Error rates and error repair are still poor Higher cognitive load, could lower overall quality Why is it hard? Recognize boundaries (normal speech blurs them) Context sensitivity “How to wreck a nice beach” ► ► ► Much training Specialized vocabularies (like medical or legal) Apps: Dictate reports, notes, letters Communication skills practice (virtual patient) Automatic retrieval/transcription of audio content (like radio, CC) Security/user ID Voice Information Systems Use human voice as a source of info ► Apps: ► Tourist info Museum audio tours Voice menus (Interactive Voice Response IVR systems) ► Use speech recognition to also cut through menus If menus are too long, users get frustrated Cheaper than hiring 24 hr/day reps ► Voice mail systems Interface isn’t the best ► Get email in your car Also helps with non-tech savvy like the elderly ► Potentially aides with Learning (engage more senses) Cognitive load (hypothesize each sense has a limited ‘bandwidth’) ► Think ER, or fighter jets Speech Generation ► Play back speech (games) ► Combine text (navigation systems) ► Careful evaluation! Speech isn’t always great ► Door is ajar – now just a tone ► Use flash ► Supermarket scanners Often times a simple tone is better Why? Cognitive load ► Thus cockpits and control rooms need speech ► Competes w/ human-human communication Speech Generation ► ► Ex: Text-to-Speech (TTS) Latest TTS uses multiple syllabi to make generated speech sound better Robotic speech could be desirable to get attention All depends on app Thus don’t assume one way is the best, you should user test ► ► Apps: TTS for blind, JAWS Web-based voice apps: VoiceXML and SALT (tagged web pages). Good for disabled, and also for mobile devices ► Use if Message is short Requires dynamic responses Events in time ► Good when visual displays aren’t that useful. When? Bad lighting, vibrations (say liftoff) Non-speech Auditory Interface ► Audio ► Major tones that provide information Research Area Sonification – converting information into audio Audiolization Auditory Interfaces ► Browsers link produced a click when you clicked on a Increases confidence Can do tasks without visual cognitive load Helps figure out when things are wrong Greatly helps visually impaired Non-speech Auditory Interface ► Terms: Auditory icons – familiar sounds (record real world sound and play it in your app) Earcons – new learned sounds (door ajar) ► Role in video games is huge Emotions, Tension, set mood ► To create 3D sound Need to do more than stereo Take into account Head-related transfer function (HRTF) ► ► Ear and head shape New musical instruments Theremin ► New ways to arrange music Displays ► ► Primary Source of feedback Properties: Physical Dimension Resolution Color Depth and correctness Brightness, contrast, glare Power Refresh rate Cost Reliability # of users Display Technology ► Monochrome displays (single color) Low cost Greater intensity range (medical) ► Color Raster Scan CRT LCD – thin, bright Plasma – very bright, thin LED – large public displays Electronic Ink – new product w/ tiny capsules of negative black particles and positive white Braille – refreshable cells with dots that rise up Large Displays ► Wall displays Informational ► Control rooms, military, flight control rooms, emergency response ► Provides System overview Increases situational awareness Effective team review ► Old: Array of CRTs Interactive ► Require new interaction methods (freehand sketch, PDAs) ► Local and remote collaboration ► Art, engineering Large Displays ► Multiple Desktop Displays Multiple CRTs or Flat panels for large desktops Cheap Familiar Spatial divide up tasks Comparison tasks are easier Too much info? ► HMD ► Eventually pixel -> Every surface a Mobile device displays ► Applications Personal ► Reprogrammable frames picture Digital family portrait (GaTech) Business ► PDAs, Medical cellphones ► Monitor patients Research: Modality Translation Services (Trace Center – University of Wisconsin) ► As you move about it auto converts data, info, etc. for you Mobile device displays ► Actions on mobile devices Monitor information and alert (calendar) Gather then spread out information (phone) Participate in groups and relate to individual (networked devices) Locate services and identify objects (GPS car system) Capture and then share info (phone) Mobile device displays ► Guidelines for design Bergman ’00, Weiss, ’02 Industry led research and design case studies (Lindholm ’03) Typically short in time usage (except handheld games) Optimize for repetitive tasks (rank functions by frequency) Research: new ways to organize large amounts of info on a small screen Study: Rapid Serial Visual Presentation (RSVP) presents text at a constant speed (33% improvement Oquist ’03) Searching and web browsing still very poor performance Promising: Hierarchical representation (show full document and allow user to select where to zoom into) Animation, Image, and Video Content quality has also greatly increased ► 3D rendering is near life-like ► Digital Photography is common ► Scanned documents ► Video compression ► Multimedia considerations for the disabled ► Printers ► 3D Printers create custom objects from 3D models