CSC 148H - Assignment #3 - Trees
Due Friday July 11, 1pm

Tic-Tac-Toe AI

Tic-Tac-Toe is a game that, if played "perfectly" by both players, will always result in a tie. In other words, you can implement an algorithm that will never lose at Tic-Tac-Toe.

Game play is on a 3x3 board that is initially empty:

("None" represents an empty marker).

There are two players: The 'X' player and the 'O' player. The 'X' player moves first. When a player makes a move, he places his letter on an empty cell in the board, and then it becomes the other player's turn. This back-and-forth play proceeds until either one player wins or a tie occurs. A player wins when he creates a row, column, or diagonal that contains only his letter. A tie occurs when there are no more empty cells on the board, and there is no row, column or diagonal consisting entirely of X's or O's

The Tic-Tac-Toe playing board is stored internally as a list of lists (a list with 3 lists, of which each has 3 entries) in the Board class and the entries of the board are either Marker.NONE, Marker.X or Marker.O(see board.py for the Board class and the Marker enumerations.)

Computer players are represented by the AmateurPlayer and ProfessionalPlayer classes, described in Part 1 and 2, respectively, below. Given the current state of the game (as a Board object), the player places the next X or O on the board and returns an updated Board object. The Tournament class takes two players and pits them against each other in a game, or series of games, of Tic-Tac-Toe.

Part 1 - Rule-Based Player AI (40%)

Part 1A: A Naive Tic-Tac-Toe Player

In the first part of this assignment, you will implement a very naïve, rule-based Tic-Tac-Toe playing algorithm. You will then pit your algorithm against itself in a Tic-Tac-Toe tournament.

Your player, who happens to be an amateur, will follow these simple rules:

Firstly, if there's a move that will win you the game, play it.
If not, check if there's a move that will block a win from the opposing player. If so, place your marker in the blocking spot.
Otherwise, just place your marker in a random empty cell.

Implementation Details:

The AmateurPlayer class' highest-level method, play_turn, takes in the current board state and returns an updated Board object with the next board state.

In order to know where to place its marker, play_turnneeds to call helper functions to decide which rule to apply, and where to apply them. One helper function is used for each rule above: play_win, play_block andplay_random.

In order to decide which rule to apply, each of these helper functions must scan the board to find their potential move, and if they find one, they return 2-element (row, column) tuple signaling the position of the move. If no move exists, then the helper function returns None.

Example: If the current board state is

O | None | O
---------------------
X | None | O
---------------------
O | None | X

and it's O's turn to play, the following are the valid return values for the play_win function called by O’s AmateurPlayer object: (0,1) and (1, 1). play_turnshould return a board state of either

O | O | O
---------------------
X | None | O
---------------------
O | None | X

O | None | O
---------------------
X | O | O
---------------------
O | None | X

When implementing each helper function, think about identifying different conditions which would satisfy the rule. For example, one of the few conditions play_win should search for is: "for each row, are there two of my pieces and none of the opponents? If so, return the empty position."

Hint: Blocking your opponent's winning move requires the same strategy as scoring a win yourself. Think about analyzing columns, rows and diagonals.

Part 1B: Unit Testing the Board and AmateurPlayer classes

Create a number of suitable unit tests to convince yourself (and us) that you have implemented AmateurPlayer and Board correctly. Create your tests in testamateur.py and testboard.py, respectively. These files already contain some simple tests to get you started. You should write tests to ensure each method defined in AmateurPlayer and Board is implemented correctly (see "Summary of Deliverables" for exceptions to this). When testing the play_turn and corresponding play_* helper methods in AmateurPlayer, for our marking purposes, we don't expect you to exhaustively test the methods with all possible board configurations. You should test 2 non-identical cases in which a player makes a winning move, 2 non-identical cases in which a player blocks his opponent, and a single random move. However, adding more unit tests for your own testing purposes is fine and probably necessary.

Part 1C: Managing a Tic-Tac-Toe Game

Once you have your AmateurPlayer class up and running, it's time to pit two amateur players against each other. The Tournament class has arun_gamemethod that sets up two players and gets them to play a game of Tic-Tac-Toe until the game is over. X always plays first.

The Board class' is_game_over method might come in handy here. Once a game is over, the Tournamentclass records who has won or whether there was a tie. The run_tournament method runs many games and prints the final stats for win/losses/ties.

Part 2 - Search-Based Player AI (60%)

Part 2A: A Professional Tic-Tac-Toe Player

In mathematical literature, the game of Tic-Tac-Toe it said to have been 'solved'. This is because mathematicians and computer scientists have been able to map out the entire space of possible board layouts for any Tic-Tac-Toe game and derive a strategy that will always lead to either a victory of a draw.

In this part of the assignment, we will map out the entire space of all possible Tic-Tac-Toe games using a special type of tree data structure called a Game Tree. Each node in the tree represents a potential state of a Tic-Tac-Toe game. In particular, each node stores:

a Board object with the current game state it represents,
a marker for which player last played a turn on the board,
a rating of either 'X Wins', 'O Wins', or 'Tie' which we will discuss shortly,
and a list of its child nodes.

A node is represented by the Node class and is implemented in node.py.

The game tree will be created using the generate_game_tree() module function, located in professionalplayer.py. It is up to you to implement this function according to the procedure described below.

An example of the first two plies of a (simplified) Tic-Tac-Toe game tree are illustrated below:

The tree in the image takes symmetry into account, however you will not be required to do so in this assignment. So, for example, the root node of your game tree will contain 9 children, each of which will contain 8 children, each of which will contain 7 children, and so on until a game is over (i.e. a node with a board layout representing a finished game will not have any children.)

More specifically, to construct the game tree you must perform the following tasks:

Generate the full Game Tree: start by creating the root node and populating its Board object with the empty Board, then create its children, populating their Board objects appropriately, and so on until you reach the leaf nodes. A leaf node is a node whose board represents a game in which some player has won or a tie has occurred.
Identify the winning strategy at each node of the game tree, which amounts to appropriately setting the rating variable of each node in the tree. To do so, you will implement the following recursive ranking algorithm:

If a node N is a leaf node, set the node's rating as Marker.X, Marker O, or Marker.NONE to indicate that X has won, O has won, or there was a tie, respectively, in the node's associated board.
If a node N is neither a leaf node nor a root node, then set its rating as follows:

if at least one of N's children has a rating that is opposite to N's turn identifier, rate N with the value opposite to its turn identifier. For example, if N's turn identifier is Marker.X, and N has a child with rating Marker.O, then rate N with Marker.O.
if all of N's children have a rating equal to N's turn identifier, rate N with its turn identifier.
otherwise, rate this node as a tie (i.e., Marker.NONE).

Note that we don't assign a ranking to the root node.

The number of nodes in the game tree constructed according to the above procedure turns out to be quite large (well over 100,000 nodes). You may find this surprising if you consider the number of possible boards that actually exist in Tic-Tac-Toe. To figure out the number of possible boards, observe that there are only 3 markers that can be placed in each cell (Marker.X, Marker.O, or Marker.NONE). So, there are 3 markers that can be placed in the first position. For each of these choices, there are 3 markers that can be placed in the second position. For each of the 3x3 possible configurations of the first two positions, there are 3 markers that can be placed in the third position... and so forth. Since there are 9 positions on the board, there are 3^9 = 19683 possible board configurations, some of which can't even arise in valid game play (e.g. all X's or all O's).

The reason that there are so many more nodes in the game tree than there are Tic-Tac-Toe boards is because the game tree contains many duplicate nodes. For example, the same board will arise in the following two situations: (i) X plays (0,0) then O plays (0,1) then X plays (0,2); (ii) X plays (0,2) then O plays (0,1) then X plays (0,0). Because there are a lot of duplicate nodes in the tree, this means that if the tree is constructed naively, you will be doing a lot of repeated work.

Although naively constructing the tree is feasible (it will take several minutes if you implement it correctly), there is a much faster way of building the game tree. Essentially, you need to use memoization. (See the lab from week 4, where you were introduced to memoization.) Whenever you want to construct the subtree rooted at some node, you first check in a "memoization dictionary" whether that node already exists. If it does exist, then the subtree rooted at that node is already built and you don't have to build it again. Building the game tree using memoization means that more than one node in a given level may reference the same child node, and that some nodes may have more than one parent. Strictly speaking, such a structure is not a "tree" according to the definition of a tree we discussed in lecture. We will ignore this technicality, as it does not cause us any serious problems for the way we use the game tree.

In order to use memoization, each node needs to have a key associated with it, which will be used for looking up a node in the memoization dictionary. The most natural choice for a node's key is its board. However, the board is a list of lists, and python does not let you use lists as keys in dictionaries. Instead, the key will be a numeric representation of the board. This representation can be obtained by calling the get_key method in the Board class. (This method is already implemented for you.)

The ProfessionalPlayerclass has a play_turn method, that (as with the AmateurPlayer class) takes as input the current state of the game (again, a Board object) and uses the game tree to make the next move: this method searches through the game tree for a node with a Board object with the same configuration as the current state (you can use the == operator to test equality between boards, since the__eq__ method is defined in Board). Once the node is identified, set the next move by searching through the node's children and identifying a board with a rating equal to the player's piece (or a tie, if no winning options exist.) There's actually a much quicker way of determining the next move that doesn't require you search through all the nodes in the game tree one by one; if you can see how to do this, feel free to do things that way instead.

The function generate_game_tree(),if implemented correctly, returns the same game tree every time it is invoked. For this reason, and the fact that generating the game tree is a computationally intensive procedure, you should only generate the tree once during execution. To this end, the ProfessionalPlayer class generates the tree when the class is defined and stores the root of the tree as the static attribute gt_root. You should make use of this static attribute whenever accessing the game tree, rather than repeatedly generating a new game tree.

There are a few important notes, guidelines and recommendations that should be considered while you design and code the ProfessionalPlayer class:

Unlike any of the other classes you have to implement in this assignment, we have not outlined any helper or intermediate methods that might be useful. In general, dividing a complicated task into several smaller tasks is a good design methodology and should be used. For example, it's probably a bad idea to put all the logic for building the full game-tree into the generate_game_tree method. Instead, think about the main steps of the algorithm (e.g. populating the tree with valid board nodes, and rating each board node) and code/test these separately.
A board that corresponds to a game that is over should not have any children since there are no moves afterwards.
Recursion is recommended for building the game tree as well as rating the nodes of the tree.
We already discussed using memoization to avoid doing repeated work when building the game tree. You should also be careful about not doing repeated work when rating the nodes in the game tree. To this end, observe that a node's rating is initially -1 (see the Node constructor), which is not equal to any of Marker.X, Marker.O or Marker.NONE. When generating the ratings for nodes, you can easily check whether a rating has already been assigned to a node by checking if its value is -1.
Writing a (recursive) function to search the tree for a node given a board is probably a useful thing to do, not only for debugging your tree generation, but also for use in the play_turn method.

Part 2B: Unit Testing your Professional Tic-Tac-Toe Player

Write unit tests for the play_turn method of the ProfessionalPlayer class, and write unit tests for the generate_game_tree() method. All these unit tests are to be placed in the file testprofessional.py.

Testing play_turn for every possible board configuration in the game tree would be very intensive; instead, test 3 cases that you feel are important and, in the comments of your unit test, explain why you chose these cases.

Testing that the game tree is properly constructed by generate_game_tree() requires some creativity. One way is to create and verify the entire game tree by hand and then ensure that it matches the tree generated by your code. This is obviously unrealistic. Instead be creative and think of heuristic ways of testing that the game tree is constructed properly. For example, what should the 'turn' value of a node be in each subsequent level of the tree? What should the board method is_game_over() return for boards associated with leaf nodes? What should the height of the tree be? And so forth. Again, you don't need to go overboard, but show that you've put some thought into verifying that the game tree is constructed correctly.

Summary of Deliverables:

The following is a list of files you need to submit and the methods that are defined in the starter code. Most of these methods still need to be implemented by you. Be careful to not change any method definitions or attribute names in the starter code. Also be careful to obey the instructions with regards to what you should be returning from these functions, and what kinds of arguments the functions expect. This is important for automarking. You are free to add any methods/functions/classes (or any other files) that you wish.

board.py

_init__(self, board = None) : create and initialize the 3x3 list with Marker.NONE entries, unless a second parameter is passed in, in which case initialize the board with a copy of board passed in.
__str__(self) : returns a string representing the board state. (* You do not have to test this in testboard.py)
__eq__(self, board2) : return True if the two boards are identical. Return False otherwise.
get_key(self): return a numeric representation of the board. (* Already implemented. You do not have to test this in testboard.py)
is_empty(self) : return True if every cell in the board is Marker.NONE, and False otherwise.
is_full(self) : return True if the board has no cell marked with Marker.NONE, and False otherwise. (this is handy to determine whether the game ends in a tie.)
is_game_over(self) : return a 2-element tuple that indicates whether the game is over and who won. The first element is True if the game is over, and False otherwise. If the first element is True, then the second element indicates who won the game: Marker.X, Marker.O, or Marker.NONE (for a tie). If the first element is False, then the second element is Marker.NONE. In general, assume that the board objects you are dealing with are valid Tic-Tac-Toe boards (for example, having a board with 7 X's and 2 O's is impossible and you don't need to test for these invalid board setups.)

amateurplayer.py

__init__(self, marker) : Initialize the player with the given marker (Marker.X or Marker.O) as well as their opponent's marker.
play_win(self, board) : Return a 2-element (row, column) tuple indicating the position of a winning move, or None if no such move exists. The passed in board is an instance of Board.
play_block(self, board) : Return a 2-element (row, column) tuple indicating the position of a blocking move, or None if no such move exists. The passed in board is an instance of Board.
play_random(self, board) : Return a 2-element (row, column) tuple indicating the position of a random move, or None if no such move exists. The passed in board is an instance of Board.
play_turn(self, board) : Return a new instance of Board after a single move has been played according to the rules. The rules are as follows: play a winning move if possible. If there is no winning move, then play a blocking move if possible. If there is neither a winning or blocking move, then play a random move. This function returns a new Board object with the updated game state. The passed in board is an instance of Board.

tournament.py

__init__(self, playerX, playerO, numgames) : Intialize a new tournament. playerX and playerO are the X and O players, respectively. Each is either an instance of AmateurPlayer or ProfessionalPlayer. numgames is the number of games to be played in the tournament.
run_game(self) : plays a single game of Tic-Tac-Toe. X always plays first. Returns the winner of the game (either Marker.X, Marker.O, or Marker.NONE for a tie)
run_tournament(self) : runs self.numgames games and prints the final tally of wins/losses/ties.

testboard.py

write unit tests for the functions listed in the board.py deliverable above. (You don't have to write tests for the exceptions that were noted).

testamateur.py

write unit tests for the functions listed in the amateurplayer.py deliverable above

node.py

__init__(self, turn, board) : Initialize this node with a new Board object, the given turn, an empty list of children, and a rating of -1.
add_child(self, node): add a child node to the node's list of children

professionalplayer.py

generate_game_tree(): Generate the game tree, along with the ratings for nodes. This method returns the root node, which is an instance of Node.
__init__(self, marker) : Initialize the player with their marker: either Marker.X or Marker.O.
play_turn(self, board) : play the best move possible according to the game tree. This function returns a new Board object with the updated game state.

testprofessionalplayer.py

write unit tests for the functions in the ProfessionalPlayer class
write unit tests for the generate_game_tree() module function

We have also included a main.py file with example code for running the Amateur vs. Amateur player tournament, as well as the Professional vs. Amateur player tournament.

Start this assignment early, it’ll take a while. Expect the ProfessionalPlayer coding tasks to consume more than 50% of the programming time.

Mark Breakdown

Part 1 (40%)
    Automarking 50%
    Unit tests       20%
    Tournament (Part 1C -- this will be tested manually) 10%
    Commenting/Code Style: 20%

Part 2 (60%)
     Automarking 50%
     Unit tests 25%
     Commenting/Code Style/Design: 25%

CSC 148H - Assignment #3 - Trees Due Friday July 11, 1pm