Cognitive Models Contents n n n Cognitive Models

Contents n n n Cognitive Models Device Models Cognitive Architectures 2

Cognitive Models n Cognitive models are used to represent the users of interactive systems

Unit Tasks n n The models of tasks and goals all decompose these into

GOMS n n n This models goal and task hierarchies It stands for Goals,

GOMS n Operators n n n These are the simplest actions the user performs

GOMS n Selection n n Whenever there is more than one method to achieve

GOMS models goals as a hierarchy GOAL: Iconize-window n n [select GOAL: use-close-method n

GOMS n n The dots indicate the hierarchical level of each goal GOMS uses

GOMS Uses n The analysis of GOMS goal structures can be used to create

GOMS Uses n The selection rules can be used to predict the actual commands

Cognitive Complexity Theory n n This is an extension of the GOMS model which

Cognitive Complexity Theory n n These rules are written in a LISP-like language Let’s

Cognitive Complexity Theory (select-insert-space IF(AND (TEST-GOAL perform unit task) (TEST-TEXT task is insert space)

Cognitive Complexity Theory (INSERT-SPACE-DONE IF (AND (TEST-GOAL perform unit task) (TEST-NOTE executing insert space)

Cognitive Complexity Theory THEN ((ADD-GOAL move cursor to %LINE %COL))) (INSERT-SPACE-2 IF (AND (TEST-GOAL

Cognitive Complexity Theory n n n CCT allows you to model GOMS like hierarchies

Cognitive Complexity Theory n n n CCT can be used to model the system

Linguistic Models n n The user’s interaction with a computer is similar to a

BNF n n n Backus-Naur Form was originally developed to describe the syntax of

BNF draw-line : : = select-line + choose-points + last-point select-line : : =

BNF n n n BNF represents the users action but not the systems responses

Task-action Grammar n While BNF can represent the structure of a language, it cannot

Task-action Grammar n Consider using BNF for the UNIX copy, move, and link commands

Task-action Grammar n The TAG description of the same commands makes the consistency far

Task-action Grammar n TAG can also represent world knowledge Command Interface 1 n movement[Direction]

Task-action Grammar n n n The previous interface could represent addresses of functions to

Task-action Grammar n n The second form of the interface is preferable and takes

Task-action Grammar movement[Direction] : = command[Direction] + distance + RETURN command[Direction] : = known-item[Type=word,

Contents n n n Cognitive Models Device Models Cognitive Architectures 30

GUI Systems n n BNF and TAG were designed for command line interfaces While

Keystroke-level Model n n n This is used for modeling simple interaction sequences on

Keystroke-level Model n K n n B n n Pressing a mouse button P

Keystroke-level Model n D n n M n n Drawing lines with the mouse

Keystroke-level Model n Consider using a mouse based editor to correct a single character

Keystroke-level Model 1. 2. 3. 4. 5. 6. n n n Move hand to

Three-state Model n n Pointing devices like mice, trackballs, and light pens all behave

Three-state Model n State 2 n n n Depressing a button over an icon

Three-state Model n n A touch screen behaves like a light pen with no

Three-state Model Mouse Transitions Button down State 1 tracking State 2 dragging Button up

Fitt’s Law n Fitt’s law states that the time to move a pointer to

Contents n n n Cognitive Models Device Models Cognitive Architectures 42

Cognitive Architectures n n n The models we have looked at up to this

The Problem Space Model n n n Rational behaviour is defined as behaviour directed

The Problem Space Model n n This model can be adapted to the rational

The Problem Space Model n To solve a problem in this model n n

Slides: 47

Download presentation

Cognitive Models

Contents n n n Cognitive Models Device Models Cognitive Architectures 2

Cognitive Models n Cognitive models are used to represent the users of interactive systems n n Models of user’s tasks and goals Models of the user-system grammar Models of human motor skills Cognitive architectures which underlie these models 3

Unit Tasks n n The models of tasks and goals all decompose these into simpler parts One is always faced with the question of to what depth the decomposition should proceed This is a question of granularity and it can proceed to the lowest level operations We define the unit task as the most abstract task a user can perform that does not require any problem solving on the part of the user 4

GOMS n n n This models goal and task hierarchies It stands for Goals, Operators, Methods, and Selection Goals n n These describe what the user wants to achieve They also represent a memory point which can be used to evaluate what has been achieved 5

GOMS n Operators n n n These are the simplest actions the user performs to use the system Pressing the ‘X’ key would be an operator Methods n n n Often there is more than one way to accomplish a goal Help could be by hitting F 1 or by clicking the help button These are referred to as two methods for the same goal 6

GOMS n Selection n n Whenever there is more than one method to achieve a goal, a selection must be made The choice of methods usually depends on the state of the system and the particular user 7

GOMS models goals as a hierarchy GOAL: Iconize-window n n [select GOAL: use-close-method n n Move-mouse-to-window-header Pop-up-menu Click-close-option GOAL: use-L 7 method n Press-L 7 -key] 8

GOMS n n The dots indicate the hierarchical level of each goal GOMS uses this to decompose large goals into sub-goals Note the use of select to indicate that there is a choice of methods A typical GOMS analysis breaks a high-level goal into unit tasks which are further decomposed into basic operators 9

GOMS Uses n The analysis of GOMS goal structures can be used to create measures of performance n n Assigning a time to each operator and summing the result yielded estimates within 33% of the actual values The depth of the hierarchy can be used as a measure of how much the user must store in short-term memory 10

GOMS Uses n The selection rules can be used to predict the actual commands which will be used n n n In practice this allowed predictions of commands that were 90% accurate The GOMS model has served as a basis for other models It can be combined with other models to make more advanced predictions 11

Cognitive Complexity Theory n n This is an extension of the GOMS model which provides improved prediction It provides two parallel descriptions n n n Of the user’s goals Of the system The descriptions consist of a series of production rules of the form n If condition then action 12

Cognitive Complexity Theory n n These rules are written in a LISP-like language Let’s look at the description of how we would insert a missing space in text using the vi text editor 13

Cognitive Complexity Theory (select-insert-space IF(AND (TEST-GOAL perform unit task) (TEST-TEXT task is insert space) (NOT TEST-GOAL insert space) (NOT (TEST-NOTE executing insert space)) ) THEN ( (ADD-GOAL insert space) (ADD-NOTE executing insert space) (LOOK-TEXT task is at %LINE %COL) )) 14

Cognitive Complexity Theory (INSERT-SPACE-DONE IF (AND (TEST-GOAL perform unit task) (TEST-NOTE executing insert space) (NOT (TEST-GOAL insert space)) ) THEN ( (DELETE-NOTE executing insert space) (DELETE-GOAL perform unit task) (UNBIND %LINE %COL) )) (INSERT-SPACE-1 IF (AND (TEST-GOAL insert space) (NOT (TEST-GOAL move cursor)) (NOT (TEST-CURSOR %LINE %COL)) ) 15

Cognitive Complexity Theory THEN ((ADD-GOAL move cursor to %LINE %COL))) (INSERT-SPACE-2 IF (AND (TEST-GOAL insert space) (TEST-CURSOR %LINE %COL) ) THEN ((DO-KEYSTROKE ‘I’) (DO-KEYSTROKE space) (DO-KEYSTROKE ESC) (DELETE-GOAL insert space))) 16

Cognitive Complexity Theory n n n CCT allows you to model GOMS like hierarchies CCT also allows you to model concurrent goals since more than one rule can be matched at the same time However, the main use of CCT is in measuring the complexity of the interface 17

Cognitive Complexity Theory n n n CCT can be used to model the system as well If this is done, it can be used to predict the difficulty in translating from the user’s model to the system model The sheer size of the CCT description is a predictor of the complexity of the operations necessary to achieve a goal 18

Linguistic Models n n The user’s interaction with a computer is similar to a language Therefore, several modeling techniques have been built on interaction as a language 19

BNF n n n Backus-Naur Form was originally developed to describe the syntax of programming languages It can be used equally well to describe the interaction between a user and a computer Consider the case of drawing a line in a graphics system 20

BNF draw-line : : = select-line + choose-points + last-point select-line : : = position-mouse + CLICK-MOUSE choose-points: : = choose-one | choose-one + choose-points choose-one : : = position-mouse + CLICK-MOUSE last-point : : = position-mouse + DOUBLE-CLICK-MOUSE position-mouse : : = empty | MOVE-MOUSE + position-mouse 21

BNF n n n BNF represents the users action but not the systems responses The complexity of the description provides a crude measure of the complexity of the task BNF is also a good way to unambiguously specify how a user interacts with a system 22

Task-action Grammar n While BNF can represent the structure of a language, it cannot represent n n n consistency in commands or Any knowledge the user has of the world The task-action grammar (TAG) addresses both of these problems 23

Task-action Grammar n Consider using BNF for the UNIX copy, move, and link commands copy move link : : = ‘cp’ + filename | ‘cp’ + filenames + directory : : = ‘mv’ + filename | ‘mv’ + filenames + directory : : = ‘ln’ + filename | ‘ln’ + filenames + directory 24

Task-action Grammar n The TAG description of the same commands makes the consistency far more apparent file-op[Op] : = command[Op] + filename | command[Op] + filenames + directory command[Op=copy] : = ‘cp’ command[Op=move] : = ‘mv’ command[Op=link] : = ‘ln’ 25

Task-action Grammar n TAG can also represent world knowledge Command Interface 1 n movement[Direction] : = command[Direction] + distance + RETURN command[Direction=forward] : = ‘go 395’ command[Direction=backward] : = ‘go 013’ command[Direction=left] : = ‘go 712’ command[Direction=right] : = ‘go 956’ 26

Task-action Grammar n n n The previous interface could represent addresses of functions to call to perform actions Let’s look at a second version of the interface Command Interface 2 movement[Direction] : = command[Direction] + distance + RETURN command[Direction=forward] : = ‘FORWARD’ command[Direction=backward] : = ‘BACKWARD’ command[Direction=left] : = ‘LEFT’ command[Direction=right] : = ‘RIGHT’ 27

Task-action Grammar n n The second form of the interface is preferable and takes advantage of the words (forward, back, etc. ) the user already knows We can rewrite the previous TAG to show the information that the user already knows and does not have to learn 28

Task-action Grammar movement[Direction] : = command[Direction] + distance + RETURN command[Direction] : = known-item[Type=word, Direction] * command[Direction=forward] : = ‘FORWARD’ * command[Direction=backward] : = ‘BACKWARD’ * command[Direction=left] : = ‘LEFT’ * command[Direction=right] : = ‘RIGHT’ n The rules with asterisks can be generated from the second rule combined with the user’s knowledge 29

Contents n n n Cognitive Models Device Models Cognitive Architectures 30

GUI Systems n n BNF and TAG were designed for command line interfaces While pressing a button is a reasonable action, moving a mouse one pixel is less obvious In GUI systems, the buttons are virtual and depend on what is displayed at a particular screen position The keystroke model allows us to model lowlevel interaction with a device 31

Keystroke-level Model n n n This is used for modeling simple interaction sequences on the order of a few seconds It does not extend to more complex operations such as producing an entire diagram The model decomposes actions into 5 motor operators, a mental operator and a response operator 32

Keystroke-level Model n K n n B n n Pressing a mouse button P n n Keystroke operator Pointing or moving the mouse over a target H n Homing or switching the hand between mouse and keyboard 33

Keystroke-level Model n D n n M n n Drawing lines with the mouse Mentally preparing for a physical action R n n System response User does not always wait for this as happens in continuous typing 34

Keystroke-level Model n Consider using a mouse based editor to correct a single character error n n n Point at the error Delete the character Retype it Return to the previous typing point The following notation will capture this 35

Keystroke-level Model 1. 2. 3. 4. 5. 6. n n n Move hand to mouse H[mouse] Position after bad character PB[LEFT] Return to keyboard H[keyboard] Delete character MK[DELETE] Type correction K[char] Reposition insert point H[mouse]MPB[LEFT] Timings for individual operations can be measured These timings can then be summed to create the total time for the overall operation Alternative ways of performing an action can have their times computed and compared to find which one is more efficient 36

Three-state Model n n Pointing devices like mice, trackballs, and light pens all behave differently as far as the user is concerned The three-state model is used to capture the behaviour of these devices n State 1 n n Moving the mouse with no buttons pressed This usually moves the pointer on the screen 37

Three-state Model n State 2 n n n Depressing a button over an icon and then moving the mouse This is usually thought of as dragging an object State 0 n n This is for a light pen when it is not touching the screen In this state the location of the pen is not tracked at all 38

Three-state Model n n A touch screen behaves like a light pen with no button to press This means that a touch screen is in state 0 when the finger is off the screen When the finger touches the screen, it can be tracked and is in state 1 Thus, n n a touch screen is a state 0 -1 device A mouse is a state 1 -2 device 39

Three-state Model Mouse Transitions Button down State 1 tracking State 2 dragging Button up Light pen Transitions Touch screen Button down State 1 tracking State 0 No tracking Remove pen State 2 dragging Button up 40

Fitt’s Law n Fitt’s law states that the time to move a pointer to a target of size S at a distance D from the starting point is n n a + b log 2(D/S + 1) Where a and b are constants dependent on the type of pointing device and the skill of the user The insight provided by the three state model is that a and b also depend on the state This is due to dragging being more accurate than the original pointing which does not have as good feedback 41

Contents n n n Cognitive Models Device Models Cognitive Architectures 42

Cognitive Architectures n n n The models we have looked at up to this point have implied a model of the mental processes of the user For example, GOMS implied a divide and conquer approach We will now look at a different model of the user’s cognitive processes 43

The Problem Space Model n n n Rational behaviour is defined as behaviour directed to achieving a specific goal This is the behaviour you would expect of a human or a knowledge based system This is in contrast to the problem solving modeled as a search of a solution space until a solution is found This search is performed by traversing the space until a solution is found This is a brute force search and is not rational behaviour in seeking a solution to a goal 44

The Problem Space Model n n This model can be adapted to the rational behaviour of humans A problem space consists of n n n A set of states A set of operations to go from one state to another A goal is a subset of states which must be reached for the goal to be achieved 45

The Problem Space Model n To solve a problem in this model n n Identify the current state Identify the goal Devise a set of operations which will move from the current state to the goal state This model is inherently recursive n If you cannot find the operations to achieve the goal then this becomes a new recursive problem to be solved 46