EE 562 ARTIFICIAL INTELLIGENCE FOR ENGINEERS Lecture 9
- Slides: 30
EE 562 ARTIFICIAL INTELLIGENCE FOR ENGINEERS Lecture 9, 5/4/2005 University of Washington, Department of Electrical Engineering Spring 2005 Instructor: Professor Jeff A. Bilmes 5/4/2005 EE 562
Game Playing & Adversarial Search 5/4/2005 EE 562
Material • Reading: Read chapters 6 • Read chapter 7 (for Monday). – expect new homework on Monday (this will be a long one). • Today’s lecture covers chapter 6. • Announcements: Still working on makeup class times. Send me email if you haven’t already for Friday schedules. 5/4/2005 EE 562
Outline • Optimal decisions • α-β pruning • Imperfect, real-time decisions 5/4/2005 EE 562
Games vs. search problems • "Unpredictable" opponent solution is a strategy specifying a move for every possible opponent reply • Time limits unlikely to find goal, must approximate • History: – Computer considers possible lines of play (Babbage, 1846) – Algorithm for perfect play (Zermelo, 1912; Von Neumann, 1944) – Finite horizon, approximate evaluation (Zuse, 1945; Wiener, 1948; Shannon, 1950) – First Chess Program (Turing, 1951) – Machine learning to improve evaluation accuracy (Samuel, 195257) – Pruning to allow deeper search 5/4/2005 EE 562 (Mc. Carthy, 1956)
Types of Games 5/4/2005 EE 562
Game tree (2 -player, deterministic, turns) 5/4/2005 EE 562
Minimax • Perfect play for deterministic games • Idea: choose move to position with highest minimax value = best achievable payoff against best play • E. g. , 2 -ply game (ply == move by one player): 5/4/2005 EE 562
Minimax algorithm 5/4/2005 EE 562
Properties of minimax • Complete? Yes, if tree is finite. Finite strategy can exist even if tree is infinite. • Optimal? Yes (against an optimal opponent). – for sub-optimal opponent, better strategies might exist. • Time complexity? O(bm) • Space complexity? O(bm) (depth-first exploration) – For chess, b ≈ 35, m ≈100 for "reasonable" games exact solution completely infeasible. 5/4/2005 EE 562 • But do we need to explore every path?
α-β pruning example 5/4/2005 EE 562
α-β pruning example 5/4/2005 EE 562
α-β pruning example 5/4/2005 EE 562
α-β pruning example 5/4/2005 EE 562
α-β pruning example 5/4/2005 EE 562
Why is it called α-β? • α is the value of the best (i. e. , highestvalue) choice found so far at any choice point along the path for max • If v is worse than α, max will avoid it prune that branch 5/4/2005 EE 562
The α-β algorithm 5/4/2005 EE 562
The α-β algorithm 5/4/2005 EE 562
Properties of α-β • Pruning does not affect final result • Good move ordering improves effectiveness of pruning • With "perfect ordering, " time complexity = O(bm/2) doubles the solvable tractable depth of search • A simple example of the value of reasoning about which computations are relevant (a form of metareasoning) • For chess, 3550 is still impossible! 5/4/2005 EE 562
Resource limits Standard approach: • cutoff test: use a cutoff-test rather than a terminal -test. e. g. , depth limit (perhaps add quiescence search) quiescence == stop when nothing is going to make drastic changes (e. g. , taking a queen) • evaluation function: use Eval instead of Utility = eval estimates desirability of position (but is a heuristic, since the exact true evaluation requires a full search) Suppose we have 100 secs to move, can explore 104 nodes/sec 106 nodes per move ¼ 358/2 - reaches depth 8, relatively good chess program 5/4/2005 EE 562
Evaluation functions • For chess, typically linear weighted sum of features Eval(s) = w 1 f 1(s) + w 2 f 2(s) + … + wn fn(s) e. g. , w 1 = 9 with f 1(s) = (number of white queens) – (number of black queens), etc. 5/4/2005 EE 562
Exact values don’t’ matter Behavior is preserved under any monotonic transformation of Eval function. Only the order matters: payoff in deterministic games acts as an ordinal utility function (in game theory) 5/4/2005 EE 562
Cutting off search Minimax. Cutoff is identical to Minimax. Value except 1. Terminal? is replaced by Cutoff? 2. Utility is replaced by Eval Does it work in practice? bm = 106, b=35 m=4 4 -ply lookahead is a hopeless chess player! 5/4/2005 – EE 562 4 -ply ≈ human novice
Deterministic games in practice 5/4/2005 EE 562
Non-Deterministic games: backgammon 5/4/2005 EE 562
Non-Deterministic games in general • In nondeterministic games, chance is introduced by dice, card-shuffling, or coin flipping. In these cases, we have search nodes for “chance” which can be seen as a random player in the game. In the coin case: 5/4/2005 EE 562
Algorithm for non-deterministic games • Expectedminimax gives perfect play • Just like Minimax, except we also handle chance nodes. 5/4/2005 EE 562
non-deterministic games in practice • Dice rolls increase b: 21 possible rolls with 2 dice • Backgammon ¼ 20 legal moves (can be 6000 with 1 -1 roll). – depth 4 = 20(21*20)3 ¼ 1. 2 £ 109 • As depth increases, probability of reaching given node shrinks value of lookahead is diminished • - pruning much less effective. • TDGammon: uses depth-2 search + very good Eval (learned automatically), world champion level 5/4/2005 EE 562
Exact values DO matter. • Behavor is preserved only by positive linear transformation of Eval • Hence, Eval should be proportional to expected payoff. 5/4/2005 EE 562
Summary • Games are fun to work on! • They illustrate several important points about AI • perfection is unattainable must approximate • good idea to think about what to think about 5/4/2005 EE 562
- Site:slidetodoc.com
- What is ai?
- Evaluate the postfix expression: 562 +124/
- 562-247-8422
- Round 271 to the nearest hundred
- 562-247-8422
- Differentiate between strips and adl
- Conceptual graph in artificial intelligence tutorial
- Ethics of artificial intelligence
- Compare a* and ao* algorithm
- References for artificial intelligence
- Knowledge manipulation in ai
- Artificial intelligence: a modern approach
- Informed and uninformed search in artificial intelligence
- Artificial intelligence devices
- Optimal decisions in games in artificial intelligence
- A programmable electronic device designed to accept data
- Math and artificial intelligence
- Inference by enumeration in artificial intelligence
- Artificial intelligence applications institute
- Artificial intelligence assessment
- Steps toward artificial intelligence
- Inference by enumeration in artificial intelligence
- Artificial intelligence devices
- Prolog iterative deepening search
- Ai is a branch of computer science
- Artificial intelligence applications institute
- What is the alan turing test
- The fuzzy proposition if x is e then y is f is a
- Artificial intelligence devices
- Optimal decisions in games in artificial intelligence