CS 747: Programming Assignment 3

Microchess is a smaller version of standard 8x8 chess, played on a 5×4 board. Games are faster and simpler than regular chess because there are fewer pieces and the board is smaller, but careful planning and strategy are still needed to win. Its compact size makes it ideal for experimenting with intelligent agents while keeping the game interesting and challenging. For more details, check out this link.

5×4 Microchess Board

Pieces

Each side has the following pieces: King, Rook, Bishop, Knight, and Pawn. The movement and capture rules for each piece follow the standard rules of classical 8x8 chess, except where noted, and are summarized below:

King: Moves one square in any direction (horizontally, vertically, or diagonally). It cannot move into a square that is under attack by an opponent’s piece.
Rook: Moves any number of vacant squares horizontally or vertically.
Bishop: Moves any number of vacant squares diagonally, as long as no other piece blocks its path. A bishop always remains on the same color square it started on.
Knight: Moves in an “L” shape — two squares in one direction (horizontal or vertical) followed by one square perpendicular to that direction. Knights can jump over other pieces.
Pawn: Moves forward by one square (toward the opponent’s side). This also applies to its first move: unlike standard chess, there is no two-square first move. Pawns capture diagonally forward by one square.
Queen*: Combines the powers of the Rook and Bishop. It can move any number of vacant squares vertically, horizontally, or diagonally, as long as no other piece blocks its path. The queen is the most powerful piece on the board due to its versatile movement.

Note: The Queen is not present in the initial setup of Microchess but can be obtained through pawn promotion (explained later).
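The movement rules above translate naturally into offset tables. As a minimal sketch (not the environment's implementation), the function below lists the squares a knight could jump to on an otherwise empty board. It assumes squares are addressed as (row, col) with 5 rows and 4 columns; the names ROWS, COLS, KNIGHT_OFFSETS, and knight_targets are hypothetical.

```python
ROWS, COLS = 5, 4  # assumed orientation: 5 rows, 4 columns

# The eight "L"-shaped jumps: two squares along one axis, one along the other.
KNIGHT_OFFSETS = [(2, 1), (2, -1), (-2, 1), (-2, -1),
                  (1, 2), (1, -2), (-1, 2), (-1, -2)]


def knight_targets(row, col):
    """Return the on-board squares a knight on (row, col) could jump to.

    Occupancy is ignored here; a real move generator would also exclude
    squares held by the knight's own side.
    """
    targets = []
    for d_row, d_col in KNIGHT_OFFSETS:
        r, c = row + d_row, col + d_col
        if 0 <= r < ROWS and 0 <= c < COLS:
            targets.append((r, c))
    return sorted(targets)
```

On a board this small, a knight in a corner has only two destinations, which is one reason piece mobility can be a useful signal when evaluating positions.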

The videos below demonstrate the legal moves for each chess piece in different scenarios:

King Moves

Rook Moves

Bishop Moves

Knight Moves

Pawn Moves

Queen Moves

Players

Microchess is played between two players, White and Black. White always moves first, followed by Black, and players alternate turns. The goal is to checkmate the opponent’s King, that is, to attack it in such a way that the opponent has no legal move that removes the threat. All rules for piece movements, captures, and game termination apply equally to both players.

Game Termination Conditions

The game ends under any of the following conditions:

Checkmate (Win):
A player wins if the opponent’s King is in check and has no legal moves to escape. In simple words: the King is trapped and cannot move to safety. The image below illustrates one such position where White has checkmated Black. The Black King is under attack by the White Queen and cannot move to any allowed square without being captured.

Checkmate Position: White wins
Stalemate (Draw):
The game is a draw if the player whose turn it is has no legal moves, but their King is not in check. The image below shows a position where the game is a stalemate. It is Black's turn to move, and although the Black King has three available squares on the board, each would place it in check so each of these is an illegal move. As a result, Black has no legal moves available, yet the King is not currently in check.

Stalemate Position: Draw
Draw by inactivity:
If no captures occur and no pawns move forward for more than 20 consecutive moves, the game is declared a draw.
Draw by insufficient material:
The game is a draw if neither player has enough pieces to force a checkmate — specifically, if both players only have their King, or at most a King + Bishop or King + Knight combination (no pawns or rooks remaining). In the image below, both players only have their Kings left, making it impossible to checkmate.

Draw by insufficient material
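The two rule-based draw conditions can be sketched as simple predicates. This is an illustrative sketch only, not the environment's code: it assumes pieces are given as lists of letters ("K", "R", "B", "N", "P", "Q"), it reads the insufficient-material rule as "each side has a lone King, or a King plus a single Bishop or Knight", and it assumes the caller maintains a counter of consecutive moves without a capture or pawn move. The function names are hypothetical.

```python
def insufficient_material(white_pieces, black_pieces):
    """True if neither side has more than King + one Bishop or one Knight."""
    def king_and_at_most_one_minor(pieces):
        others = sorted(p for p in pieces if p != "K")
        return others in ([], ["B"], ["N"])

    return (king_and_at_most_one_minor(white_pieces)
            and king_and_at_most_one_minor(black_pieces))


def inactivity_draw(quiet_moves):
    """True once more than 20 consecutive moves had no capture or pawn move."""
    return quiet_moves > 20
```

Whether a "move" in the inactivity rule counts a single turn or a pair of turns is not stated here, so the counter's unit should be checked against the environment.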

Pawn Promotion

When a pawn advances all the way to the last row on the side of the opponent, it can be promoted to any piece (except King) — Rook, Bishop, Knight, or Queen. This allows a player to strategically gain stronger pieces during the game. The video below demonstrates a Black Pawn being promoted to a Black Queen upon reaching the last row.

Pawn Promotion

In this simplified version of Chess, complex rules like castling and en passant are ignored.

Agents

In this assignment, you will design an agent to play Microchess. Given a board position and the set of legal moves, the agent chooses the most appropriate move at each step. The agent should be able to play as either White or Black. When multiple games are played, the two agents swap colors after every game, so that each plays White and Black equally often.
You are given three tasks and you need to design agents to accomplish them. You may choose to have the same agent for all the tasks or different agents for each task.

In this setup, the state of the agent is the current board situation along with the player's turn, and the action is a legal move out of all possible legal moves in that state.
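To make the state and action framing concrete, here is a minimal sketch of the simplest possible policy: one that ignores the state and picks uniformly among the legal moves. The names random_policy, state, and legal_moves are hypothetical and are not part of the provided environment; the actual agent interface is the move() function of the skeleton files.

```python
import random


def random_policy(state, legal_moves, rng=random):
    """Map a state to an action by choosing uniformly among the legal moves.

    `state` is unused here; a stronger policy would inspect the board and
    the side to move before choosing.
    """
    return rng.choice(legal_moves)
```

Any agent, however sophisticated, has this same shape: it receives a state and returns one of the legal actions available in that state.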

Game Environment and Code Structure

The compressed files given below contain the Microchess environment and supporting files. This environment is based on the Explainable-Minichess repository, with necessary modifications for this assignment. You do not need to modify the core environment; you only need to focus on implementing your agent in the provided skeleton files. Download the file corresponding to your OS.

1. Setup Instructions

You must ensure that you have Python 3.9.6. You can install Python 3.9.6 from here. Your code will be tested using a python virtual environment. You can set up a virtual environment using this link.

2. Code Structure

Inside the directory, you are provided with the following (along with the core game environment files):

The microchess board used for this assignment is defined in minichess/boards/5x4microchess.board.

3. Agent Interface & Helper Functions

The chess_object parameter input to the move() function is an instance of the Chess class defined in minichess/chess/fastchess.py. It contains all the information you may need to design your agent. Important attributes and methods you may need are described below:

Important Attributes of Chess Object

Useful Methods and Helper Functions

4. Bitboard Representation

The environment represents the microchess board as a bitboard for efficient computation. In classical 8x8 chess, a bitboard is a 64-bit integer in which each bit corresponds to one square of the board. For the 5x4 microchess board, only the first 20 bits of the integer are used, with each bit representing one square. Many functions in the environment use this representation for fast calculation of moves and game states. You can read more about bitboard representation here.
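The idea can be illustrated with a few lines of bit arithmetic. This is a sketch under an assumed layout, namely that bit index = row * 4 + col with 5 rows and 4 columns; the environment's actual square-to-bit mapping may differ and should be checked in its source. All function names below are hypothetical.

```python
ROWS, COLS = 5, 4  # assumed layout: 5 rows of 4 squares, 20 bits in total


def square_bit(row, col):
    """Return an integer with only the bit for (row, col) set."""
    return 1 << (row * COLS + col)


def set_squares(squares):
    """Build a bitboard with a 1 bit for every (row, col) in `squares`."""
    board = 0
    for row, col in squares:
        board |= square_bit(row, col)
    return board


def occupied_squares(board):
    """List the (row, col) squares whose bits are set in `board`."""
    return [(i // COLS, i % COLS)
            for i in range(ROWS * COLS) if (board >> i) & 1]


def popcount(board):
    """Count the set bits (works on Python 3.9, which lacks int.bit_count)."""
    return bin(board).count("1")
```

With this layout, set_squares([(0, 0), (1, 0)]) gives 17 (bits 0 and 4). Set operations become single integer operations: `a & b` is the intersection of two sets of squares and `a | b` is their union.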

Minimax Search Algorithm

Before we get into the tasks, let us familiarize ourselves with a common search algorithm that is used to play games like chess.

Games similar to chess, where two players compete and the gain of one (+1) corresponds to the loss of the other (-1), are generally called zero-sum games, because the sum of the players’ rewards is zero.

Let R(⋅) denote Player 1's reward. Player 1 aims to maximize R(⋅), while Player 2 tries to maximize its own reward, which effectively means minimizing R(⋅).

Using this idea, we can develop an algorithm that explores all possible moves for both players alternately up to a certain depth, where each level of depth is one turn by a player. The algorithm evaluates the states reached at that depth (or earlier, if the game has ended) and traces the values back up, assuming that each player chooses moves to either maximize or minimize the reward function depending on whose turn it is.

For example, consider Aarti and Bhaskar playing tic-tac-toe. Aarti uses crosses and Bhaskar uses circles. Let the reward function be positive for Aarti and negative for Bhaskar if Aarti wins, and vice versa. A draw gives zero reward to both. Rewards are only assigned at terminal states, not during the game. Suppose the game has progressed to the following stage:

Now Aarti has to make a move and plans to look two levels deep. She has five options at depth one; three of them are shown above. For each of these, Bhaskar has four options at depth two, the last level. At this stage, Aarti estimates the 'goodness' of each resulting state using a function she has constructed. The numbers below the leaf nodes represent her estimated rewards. While the actual game rewards are +1, 0, or -1, Aarti’s evaluation can be any numerical estimate.

Since Bhaskar aims to minimize Aarti’s score, he chooses the move with the least reward for her. This is shown by the blue arrows pointing upwards, carrying Bhaskar’s 'best value' (the worst for Aarti). Aarti then picks the move with the maximum value among her options, indicated by the red arrows going up to the root node. This determines the best move she should make from her current state.

This procedure is known as the Minimax algorithm, commonly used in zero-sum discrete games.
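A depth-limited version of the procedure can be sketched in a few lines. The sketch below is generic and not tied to the Microchess environment: children and evaluate are hypothetical callbacks supplied by the caller, and the small tree with its leaf values is made up for illustration (it is not the tic-tac-toe figure above).

```python
def minimax(state, depth, maximizing, children, evaluate):
    """Return the minimax value of `state`, searching `depth` plies deep.

    children(state) -> list of successor states (empty if the game is over)
    evaluate(state) -> numeric estimate from the maximizing player's view
    """
    successors = children(state)
    if depth == 0 or not successors:
        return evaluate(state)
    values = [minimax(s, depth - 1, not maximizing, children, evaluate)
              for s in successors]
    return max(values) if maximizing else min(values)


# A made-up two-ply tree: three moves for the maximizer, two replies each.
TREE = {"root": ["a", "b", "c"],
        "a": ["a1", "a2"], "b": ["b1", "b2"], "c": ["c1", "c2"]}
LEAF_VALUES = {"a1": 3, "a2": -2, "b1": 1, "b2": 4, "c1": -5, "c2": 6}


def children(state):
    return TREE.get(state, [])


def evaluate(state):
    return LEAF_VALUES.get(state, 0)


def best_move(state, depth):
    """Pick the successor whose value, after the opponent's reply, is highest."""
    return max(children(state),
               key=lambda s: minimax(s, depth - 1, False, children, evaluate))
```

Here the minimizer's best replies give the three moves values of -2, 1, and -5, so the maximizer chooses "b" and the root has value 1.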

Task 1: Playing against Random player (3 Marks)

Design an agent that maximises its wins against an opponent that plays randomly, i.e., when the opponent player selects a move uniformly at random from the set of all legal moves.

You may choose to implement the minimax algorithm with a handcrafted evaluation function to determine the best move. You should limit your search to a maximum depth of 2 (hereafter, depth refers to the number of plies, where one ply is a single turn by one player). Your agent should not take more than 5ms for each turn, on average.

Your score will be based on 100 games played against the random player. The agent's score will be +1 for every win, -1 for every loss, and 0 for every draw. Your agent must score at least 27 points.
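As a quick check of the arithmetic, the scoring rule can be written as a short function. The outcome labels and the function name are hypothetical, and the 60/25/15 split below is an invented example, not a result.

```python
def match_score(results):
    """Sum +1 per win, -1 per loss, and 0 per draw over a list of outcomes."""
    points = {"win": 1, "loss": -1, "draw": 0}
    return sum(points[outcome] for outcome in results)


# Invented example: 60 wins, 25 losses, and 15 draws over 100 games.
example_results = ["win"] * 60 + ["loss"] * 25 + ["draw"] * 15
example_score = match_score(example_results)  # 60 - 25 = 35
```

Because losses subtract from the total, reaching a threshold of 27 requires winning at least 27 more games than are lost; draws do not help.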

You should design and write code for your agent in the move() function of agents/task1_agent.py file. You are free to define any other helper functions or classes as needed. Your agent should follow the BaseAgent template described above.

Task 2: Playing against Rational player (3 Marks)

Now you must design an agent that maximises its wins against an opponent that takes calculated decisions to win. You can use the helper functions described above to extract information about the state of the game.

You may choose to implement the minimax algorithm with a handcrafted evaluation function to determine the best move. You must limit your search to not more than depth 4. Your agent should not take more than 100ms for each turn, on average.
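Since the time limits are stated as per-turn averages, it can help to measure this directly while developing. The helper below is a hypothetical sketch: choose_move stands in for whatever function selects a move, and positions is any list of inputs to call it on. It is not the timing code used for grading.

```python
import time


def average_move_time_ms(choose_move, positions):
    """Return the mean wall-clock time per call to choose_move, in ms."""
    start = time.perf_counter()
    for position in positions:
        choose_move(position)
    elapsed = time.perf_counter() - start
    return 1000.0 * elapsed / len(positions)
```

Timings depend on the machine, so leaving a margin below the stated limit is safer than tuning the search to sit exactly at it.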

Your score will be based on 100 games played against our rational player. The agent's score will be +1 for every win, -1 for every loss, and 0 for every draw. Your agent must score at least 20 points.

As an example of an evaluation score, you can count the white pieces and the black pieces on the board and compare the two counts to get an empirical score for that state. You can also look at threats to pieces, king defense, mobility, etc., and build a score function from them. Here you may need to use the helper functions listed above. There are several more functions in the code file, which you are free to use.
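A weighted piece count is one simple way to start. The sketch below assumes the pieces of each side are available as lists of letters, and it uses the conventional 8x8 chess values (pawn 1, knight 3, bishop 3, rook 5, queen 9) as a starting point; these weights are an assumption, not part of the assignment, and may need tuning for the 5x4 board. The names are hypothetical.

```python
# Conventional 8x8 piece values, used only as an initial guess.
PIECE_VALUES = {"P": 1, "N": 3, "B": 3, "R": 5, "Q": 9, "K": 0}


def material_score(white_pieces, black_pieces):
    """Return White's material minus Black's; positive favours White."""
    white = sum(PIECE_VALUES[p] for p in white_pieces)
    black = sum(PIECE_VALUES[p] for p in black_pieces)
    return white - black
```

For example, material_score(["K", "R", "P"], ["K", "N"]) is 6 - 3 = 3. An agent playing Black would negate this value so that a higher score is always better for the side it controls.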

You should design and write code for your agent in the move() function of agents/task2_agent.py file. You are free to define any other helper functions or classes as needed. Your agent should follow the BaseAgent template described above.

Task 3: Build your Best agent (4 Marks)

Finally, you are free to choose your favourite learning algorithm to train and come up with your best agent that plays Microchess. You may generate episodes of games using the functions described above, if needed. Your search depth should not exceed 5. The average time taken by your agent for each move should be under 200ms during actual game play. Exceeding this time limit may result in a loss of marks.

Your score will be based on 100 games played against the random player. The agent's score will be +1 for every win, -1 for every loss, and 0 for every draw. Your agent must score at least 50 points.

You should design and write code for your agent in the move() function of agents/task3_agent.py file. You are free to define any other helper functions or classes as needed. Your agent should follow the BaseAgent template described above. There is no autograder for Task 3.

You are free to use any form of reinforcement learning (or non-reinforcement-learning) to train your agent for this task. Policy search (covered in class in Week 9) is likely to be the easiest to implement and iterate.
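One very simple form of policy search is random hill climbing over the parameters of the policy, for instance the weights of an evaluation function. The sketch below is illustrative and is not necessarily the variant covered in class. The evaluate callback is hypothetical: in practice it might play a batch of games with the given weights and return the resulting score, whereas the usage note here relies on a made-up toy objective.

```python
import random


def hill_climb(evaluate, weights, iterations=200, step=0.5, seed=0):
    """Perturb the weights at random and keep a change only if it scores higher.

    evaluate(weights) -> number, where higher is better.
    Returns the best weights found and their score.
    """
    rng = random.Random(seed)
    best = list(weights)
    best_score = evaluate(best)
    for _ in range(iterations):
        candidate = [w + rng.gauss(0.0, step) for w in best]
        score = evaluate(candidate)
        if score > best_score:
            best, best_score = candidate, score
    return best, best_score
```

A toy objective such as `lambda w: -sum((x - 1.0) ** 2 for x in w)` is enough to check that the loop runs. When evaluate is based on game outcomes it is noisy, so averaging over enough games per candidate matters more than the number of iterations.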

Report (2 marks)

Unlike previous assignments, you have been given a free hand to design your agent for Microchess. Your report should include the following details for each of the agents you have developed.

Your report should be clear and informative, as it will help the TAs and instructor understand your design choices. The report, along with your code, may be examined to corroborate the results. Insufficient explanation may result in a loss of marks.

Submission

Organize your submission folder as shown below. For example, if your roll number is 22B1831, your submission folder should be named 22B1831_submission and structured as follows:

Note that you must also include a references.txt file if you have referred to any resources while working on this assignment (see the section on Academic Honesty on the course web page).

Tar and Gzip the directory to produce a single compressed file named <your_roll_no>_submission.tar.gz, for example 22B1831_submission.tar.gz. If you are submitting any additional files, including parameters or weights, place them in the extra_files/ directory and ensure that the file size does not exceed 1MB. If you use any extra libraries, record all of them in the requirements.txt file with proper version numbers.
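The archive can be produced with the standard tar utility or, as sketched here, with Python's tarfile module. The function name is hypothetical; it assumes the submission folder already exists and writes the archive next to it, named after the folder.

```python
import tarfile
from pathlib import Path


def make_submission_archive(folder):
    """Create <folder>.tar.gz beside `folder` and return the archive path."""
    folder = Path(folder)
    archive = folder.parent / (folder.name + ".tar.gz")
    with tarfile.open(archive, "w:gz") as tar:
        # arcname keeps the folder name as the top-level entry in the archive.
        tar.add(folder, arcname=folder.name)
    return archive
```

Calling make_submission_archive("22B1831_submission") from the parent directory would produce 22B1831_submission.tar.gz. Extracting the result into a scratch directory before uploading is a cheap way to confirm that the structure is as expected.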

Evaluation

The assignment is worth a total of 12 marks. Task 1 and Task 2 are each worth 3 marks, while Task 3 carries 4 marks. Marks for each task will be awarded based on your agent’s performance, considering the number of wins, losses, draws, and the time taken per move for each task. The remaining 2 marks are allocated for the report.

Deadline and Rules

Your submission is due by 11.59 p.m., Wednesday, November 5. Finish working on your submission well in advance, keeping enough time to test your code, compile the results, and upload to Moodle.

Your submission will not be evaluated (and will be given a score of zero) if it is not uploaded to Moodle by the deadline. Do not send your code to the instructor or TAs through any other channel. Requests to evaluate late submissions will not be entertained.

Your submission will receive a score of zero if your code does not execute using the given python virtual environment. To make sure you have uploaded the right version, download it and check after submitting (but well before the deadline, so you can handle any contingencies before the deadline lapses).

You are expected to comply with the rules laid out in the "Academic Honesty" section on the course web page, failing which you are liable to be reported for academic malpractice.