For Friday Finish reading chapter 7 Homework Chapter

Genetic Algorithms • Have a population of k states (or individuals) • Have a

Other Issues • What issues arise from continuous spaces? • What issues do online

Game Playing in AI • Long history • Games are well-defined problems usually considered

Games and Search • Search spaces can be very large: • Chess – Branching

Game Playing Problem • Instance of general search problem • States where game has

Minimax Algorithm • Method for determining the optimal move • Generate the entire search

Recursive Minimax Algorithm function Minimax-Decision(game) returns an operator for each op in Operators[game] do

Making Imperfect Decisions • Generating the complete game tree is intractable for most games

Evaluation Functions • Evaluation function needs to – Agree with the utility function on

Cutting Off Search • Search to uniform depth • Use iterative deepening to search

Alpha-Beta Pruning • Concept: Avoid looking at subtrees that won’t affect the outcome •

General Principle • If a node has value n, but the player considering moving

function Max-Value (state, game, , ) returns the minimax value of state if Cutoff-Test(state)

Effectiveness • Depends on the order in which siblings are considered • Optimal ordering

Chance • What if we don’t know what the options are? • Expectiminimax uses

Imperfect Knowledge • What issues arise when we don’t know everything (as in standard

State of the Art • • • Chess – Deep Blue and Fritz Checkers

Knowledge • Knowledge Base – Inference mechanism (domain-independent) – Information (domain-dependent) • Knowledge Representation

Getting Knowledge • We can TELL the agent everything it needs to know •

The Wumpus World • Simple computer game • Good testbed for an agent •

Wumpus Percepts • The wumpus’s square and squares adjacent to it smell bad. •

Wumpus Actions • • • Go forward Turn left Turn right Grab (picks up

Consequences • Entering a square containing a live wumpus is deadly • Entering a

Knowledge Representation • Two sets of rules: – Syntax: determines what atomic symbols exist

Slides: 28

Download presentation

For Friday • Finish reading chapter 7 • Homework: – Chapter 6, exercises 1 (all) and 3 (a-c only)

Program 1 • Any questions?

Genetic Algorithms • Have a population of k states (or individuals) • Have a fitness function that evaluates the states • Create new individuals by randomly selecting pairs and mating them using a randomly selected crossover point. • More fit individuals are selected with higher probability. • Apply random mutation. • Keep top k individuals for next generation.

Other Issues • What issues arise from continuous spaces? • What issues do online search and unknown environments create?

Game Playing in AI • Long history • Games are well-defined problems usually considered to require intelligence to play well • Introduces uncertainty (can’t know opponent’s moves in advance)

Games and Search • Search spaces can be very large: • Chess – Branching factor: 35 – Depth: 50 moves per player – Search tree: 35100 nodes (~1040 legal positions) • Humans don’t seem to do much explicit search • Good test domain for search methods and pruning methods

Game Playing Problem • Instance of general search problem • States where game has ended are terminal states • A utility function (or payoff function) determines the value of the terminal states • In 2 player games, MAX tries to maximize the payoff and MIN is tries to minimize the payoff • In the search tree, the first layer is a move by MAX and the next a move by MIN, etc. • Each layer is called a ply

Minimax Algorithm • Method for determining the optimal move • Generate the entire search tree • Compute the utility of each node moving upward in the tree as follows: – At each MAX node, pick the move with maximum utility – At each MIN node, pick the move with minimum utility (assume opponent plays optimally) – At the root, the optimal move is determined

Recursive Minimax Algorithm function Minimax-Decision(game) returns an operator for each op in Operators[game] do Value[op] <- Mimimax-Value(Apply(op, game) end return the op with the highest Value[op] function Minimax-Value(state, game) returns a utility value if Terminal-Test[game](state) then return Utility[game](state) else if MAX is to move in state then return highest Minimax-Value of Successors(state) else return lowest Minimax-Value of Successors(state)

Making Imperfect Decisions • Generating the complete game tree is intractable for most games • Alternative: – Cut off search – Apply some heuristic evaluation function to determine the quality of the nodes at the cutoff

Evaluation Functions • Evaluation function needs to – Agree with the utility function on terminal states – Be quick to evaluate – Accurately reflect chances of winning • Example: material value of chess pieces • Evaluation functions are usually weighted linear functions

Cutting Off Search • Search to uniform depth • Use iterative deepening to search as deep as time allows (anytime algorithm) • Issues – quiescence needed – horizon problem

Alpha-Beta Pruning • Concept: Avoid looking at subtrees that won’t affect the outcome • Once a subtree is known to be worse than the current best option, don’t consider it further

General Principle • If a node has value n, but the player considering moving to that node has a better choice either at the node’s parent or at some higher node in the tree, that node will never be chosen. • Keep track of MAX’s best choice ( ) and MIN’s best choice ( ) and prune any subtree as soon as it is known to be worse than the current or value

function Max-Value (state, game, , ) returns the minimax value of state if Cutoff-Test(state) then return Eval(state) for each s in Successors(state) do <- Max( , Min-Value(s , game, , )) if >= then return end return function Min-Value(state, game, , ) returns the minimax value of state if Cutoff-Test(state) then return Eval(state) for each s in Successors(state) do <- Min( , Max-Value(s , game, , )) if <= then return end return

Effectiveness • Depends on the order in which siblings are considered • Optimal ordering would reduce nodes considered from O(bd) to O(bd/2)--but that requires perfect knowledge • Simple ordering heuristics can help quite a bit

Chance • What if we don’t know what the options are? • Expectiminimax uses the expected value for any node where chance is involved. • Pruning with chance is more difficult. Why?

Imperfect Knowledge • What issues arise when we don’t know everything (as in standard card games)?

State of the Art • • • Chess – Deep Blue and Fritz Checkers – Chinook Othello – Logistello Backgammon – TD-Gammon (learning) Go – Computers are very bad Bridge

What about the games we play?

Knowledge • Knowledge Base – Inference mechanism (domain-independent) – Information (domain-dependent) • Knowledge Representation Language – Sentences (which are not quite like English sentence) – The KRL determine what the agent can “know” – It also affects what kind of reasoning is possible • Tell and Ask

Getting Knowledge • We can TELL the agent everything it needs to know • We can create an agent that can “learn” new information to store in its knowledge base

The Wumpus World • Simple computer game • Good testbed for an agent • A world in which an agent with knowledge should be able to perform well • World has a single wumpus which cannot move, pits, and gold

Wumpus Percepts • The wumpus’s square and squares adjacent to it smell bad. • Squares adjacent to a pit are breezy. • When standing in a square with gold, the agent will perceive a glitter. • The agent can hear a scream when the wumpus dies from anywhere • The agent will perceive a bump if it walks into a wall. • The agent doesn’t know where it is.

Wumpus Actions • • • Go forward Turn left Turn right Grab (picks up gold in that square) Shoot (fires an arrow forward--only once) – If the wumpus is in front of the agent, it dies. • Climb (leave the cavern--only good at the start square)

Consequences • Entering a square containing a live wumpus is deadly • Entering a square containing a pit is deadly • Getting out of the cave with the gold is worth 1, 000 points. • Getting killed costs 10, 000 points • Each action costs 1 point

Possible Wumpus Environment

Knowledge Representation • Two sets of rules: – Syntax: determines what atomic symbols exist in the language and how to combine them into sentences – Semantics: Relationship between the sentences and “the world”--needed to determine truth or falsehood of the sentences