Heuristic Evaluation 1 Interface Hall of Shame or

  • Slides: 34
Download presentation
Heuristic Evaluation 1

Heuristic Evaluation 1

Interface Hall of Shame or Fame? l Standard MS calculator l on all Win

Interface Hall of Shame or Fame? l Standard MS calculator l on all Win 95/98/NT/2000/XP 2

Interface Hall of Shame! l l l What is the empty button above MC

Interface Hall of Shame! l l l What is the empty button above MC for? Can’t resize Blue for numbers! l goes against all we know l hard to focus on l combined with red eye strain 3

Outline l l l l Discount usability engineering Heuristic evaluation overview Administrivia Heuristics How

Outline l l l l Discount usability engineering Heuristic evaluation overview Administrivia Heuristics How to perform a HE HE vs. user testing How well does HE work 4

Discount Usability Engineering (? ) l Cheap l l l Fast l l l

Discount Usability Engineering (? ) l Cheap l l l Fast l l l no special labs or equipment needed the more careful you are, the better it gets on order of 1 day to apply standard usability testing may take a week Easy to use l can be taught in 2 -4 hours 5

Examples l Walkthroughs l l Low-fi prototyping Action analysis l l l put yourself

Examples l Walkthroughs l l Low-fi prototyping Action analysis l l l put yourself in the shoes of a user like a code walkthrough GOMS (add times to formal action analysis) On-line, remote usability tests Heuristic evaluation 6

Heuristic Evaluation l l l Developed by Jakob Nielsen Helps find usability problems in

Heuristic Evaluation l l l Developed by Jakob Nielsen Helps find usability problems in a UI design Small set (3 -5) of evaluators examine UI l l independently check for compliance with usability principles (“heuristics”) different evaluators will find different problems evaluators only communicate afterwards l findings are then aggregated Can perform on working UI or on sketches l I’ve found in this class it works better on working UI 7

Why Multiple Evaluators? l l Every evaluator doesn’t find every problem Good evaluators find

Why Multiple Evaluators? l l Every evaluator doesn’t find every problem Good evaluators find both easy & hard ones 8

Heuristic Evaluation Process l Evaluators go through UI several times l l Usability principles

Heuristic Evaluation Process l Evaluators go through UI several times l l Usability principles l l l inspect various dialogue elements compare with list of usability principles consider other principles/results that come to mind Nielsen’s “heuristics” supplementary list of category-specific heuristics l competitive analysis & user testing of existing products Use violations to redesign/fix problems 9

Heuristics (original) l l l H 1 -1: Simple & natural dialog H 1

Heuristics (original) l l l H 1 -1: Simple & natural dialog H 1 -2: Speak the users’ language H 1 -3: Minimize users’ memory load H 1 -4: Consistency H 1 -5: Feedback l l l H 1 -6: Clearly marked exits H 1 -7: Shortcuts H 1 -8: Precise & constructive error messages H 1 -9: Prevent errors H 1 -10: Help and documentation 10

Heuristics (revised set) searching database for matches l H 2 -1: Visibility of system

Heuristics (revised set) searching database for matches l H 2 -1: Visibility of system status l l keep users informed about what is going on example: pay attention to response time l 0. 1 sec: no special indicators needed, why? l 1. 0 sec: user tends to lose track of data l 10 sec: max. duration if user to stay focused on action l for longer delays, use percent-done progress bars 11

Heuristics (cont. ) l Bad example: Mac desktop l l Dragging disk to trash

Heuristics (cont. ) l Bad example: Mac desktop l l Dragging disk to trash l should delete it, not eject it H 2 -2: Match between system & real world l l speak the users’ language follow real world conventions 12

Heuristics (cont. ) l Wizards l l H 2 -3: User control & freedom

Heuristics (cont. ) l Wizards l l H 2 -3: User control & freedom l l “exits” for mistaken choices, undo, redo don’t force down fixed paths l like that BART machine… l must respond to Q before going to next for infrequent tasks l (e. g. , modem config. ) not for common tasks good for beginners l have 2 versions (Win. Zip) 13

Heuristics (cont. ) l H 2 -4: Consistency & standards 14

Heuristics (cont. ) l H 2 -4: Consistency & standards 14

Heuristics (cont. ) l l MS Web Pub. Wiz. Before dialing l l asks

Heuristics (cont. ) l l MS Web Pub. Wiz. Before dialing l l asks for id & password When connecting l asks again for id & pw l l H 2 -5: Error prevention H 2 -6: Recognition rather than recall l make objects, actions, options, & directions visible or easily retrievable 15

Heuristics (cont. ) Edit Cut Copy Paste l H 2 -7: Flexibility and efficiency

Heuristics (cont. ) Edit Cut Copy Paste l H 2 -7: Flexibility and efficiency of use l l accelerators for experts (e. g. , gestures, kb shortcuts) allow users to tailor frequent actions (e. g. , macros) 16

Heuristics (cont. ) l H 2 -8: Aesthetic and minimalist design l no irrelevant

Heuristics (cont. ) l H 2 -8: Aesthetic and minimalist design l no irrelevant information in dialogues 17

Heuristics (cont. ) l H 2 -9: Help users recognize, diagnose, and recover from

Heuristics (cont. ) l H 2 -9: Help users recognize, diagnose, and recover from errors l l l error messages in plain language precisely indicate the problem constructively suggest a solution 18

Heuristics (cont. ) l H 2 -10: Help and documentation l l easy to

Heuristics (cont. ) l H 2 -10: Help and documentation l l easy to search focused on the user’s task list concrete steps to carry out not too large 19

Administrivia l Please turn in the homework l l it must also be online

Administrivia l Please turn in the homework l l it must also be online Wai-ling will help get you info on how to move to our Web server by next week We will be webcasting by next Monday Other questions? 20

Phases of Heuristic Evaluation 1) Pre-evaluation training l give evaluators needed domain knowledge and

Phases of Heuristic Evaluation 1) Pre-evaluation training l give evaluators needed domain knowledge and information on the scenario 2) Evaluation l individuals evaluate and then aggregate results 3) Severity rating l determine how severe each problem is (priority) l can do this first individually & then as a group 4) Debriefing l discuss the outcome with design team 21

How to Perform Evaluation l At least two passes for each evaluator l l

How to Perform Evaluation l At least two passes for each evaluator l l l If system is walk-up-and-use or evaluators are domain experts, no assistance needed l l first to get feel for flow and scope of system second to focus on specific elements otherwise might supply evaluators with scenarios Each evaluator produces list of problems l l explain why with reference to heuristic or other information be specific and list each problem separately 22

Examples l Can’t copy info from one window to another l l l violates

Examples l Can’t copy info from one window to another l l l violates “Minimize the users’ memory load” (H 1 -3) fix: allow copying Typography uses mix of upper/lower case formats and fonts l l violates “Consistency and standards” (H 2 -4) slows users down probably wouldn’t be found by user testing fix: pick a single format for entire interface 23

How to Perform Evaluation l Why separate listings for each violation? l l l

How to Perform Evaluation l Why separate listings for each violation? l l l risk of repeating problematic aspect may not be possible to fix all problems Where problems may be found l l single location in UI two or more locations that need to be compared problem with overall structure of UI something that is missing l hard w/ paper prototypes so work extra hard on those l note: sometimes features are implied by design docs and just haven’t been “implemented” – relax on those 24

Severity Rating l l l Used to allocate resources to fix problems Estimates of

Severity Rating l l l Used to allocate resources to fix problems Estimates of need for more usability efforts Combination of l l l frequency impact persistence (one time or repeating) Should be calculated after all evals. are in Should be done independently by all judges 25

Severity Ratings (cont. ) 0 - don’t agree that this is a usability problem

Severity Ratings (cont. ) 0 - don’t agree that this is a usability problem 1 - cosmetic problem 2 - minor usability problem 3 - major usability problem; important to fix 4 - usability catastrophe; imperative to fix 26

Debriefing l l l Conduct with evaluators, observers, and development team members Discuss general

Debriefing l l l Conduct with evaluators, observers, and development team members Discuss general characteristics of UI Suggest potential improvements to address major usability problems Dev. team rates how hard things are to fix Make it a brainstorming session l little criticism until end of session 27

Severity Ratings Example 1. [H 1 -4 Consistency] [Severity 3][Fix 0] The interface used

Severity Ratings Example 1. [H 1 -4 Consistency] [Severity 3][Fix 0] The interface used the string "Save" on the first screen for saving the user's file, but used the string "Write file" on the second screen. Users may be confused by this different terminology for the same function. 28

HE vs. User Testing l HE is much faster l l l HE doesn’t

HE vs. User Testing l HE is much faster l l l HE doesn’t require interpreting user’s actions User testing is far more accurate (by def. ) l l l 1 -2 hours each evaluator vs. days-weeks takes into account actual users and tasks HE may miss problems & find “false positives” Good to alternate between HE & user testing l l find different problems don’t waste participants 29

Results of Using HE l Discount: benefit-cost ratio of 48 [Nielsen 94] l l

Results of Using HE l Discount: benefit-cost ratio of 48 [Nielsen 94] l l l cost was $10, 500 for benefit of $500, 000 value of each problem ~15 K (Nielsen & Landauer) how might we calculate this value? l l l in-house -> productivity; open market -> sales customer calls to your customer service center Correlation between severity & finding w/ HE 30

Results of Using HE (cont. ) l Single evaluator achieves poor results l l

Results of Using HE (cont. ) l Single evaluator achieves poor results l l l only finds 35% of usability problems 5 evaluators find ~ 75% of usability problems why not more evaluators? ? 10? 20? l l adding evaluators costs more many evaluators won’t find many more problems 31

Decreasing Returns problems found l benefits / cost Caveat: graphs for a specific example

Decreasing Returns problems found l benefits / cost Caveat: graphs for a specific example 32

Simple HE Homework l l Given a poorly designed web page Find at least

Simple HE Homework l l Given a poorly designed web page Find at least 15 distinct heuristic violations l l l number the violations on the diagram list violation with number on another sheet give a solution to fix Individual assignment Due next Monday in class 33

Summary l l l Heuristic evaluation is a discount method Have evaluators go through

Summary l l l Heuristic evaluation is a discount method Have evaluators go through the UI twice Ask them to see if it complies with heuristics l l l note where it doesn’t and say why Combine the findings from 3 to 5 evaluators Have evaluators independently rate severity Discuss problems with design team Alternate with user testing 34