Learning Positional Features for Annotating Chess Games: A Case Study Matej Guid, Martin Mozina, Jana Krivec, Aleksander Sadikov and Ivan Bratko CG 2008 Faculty of Computer and Information Science University of Ljubljana, Slovenia
Enhanced Knowledge in Chess Programs The improvements of Rybka 3 in evaluation function for chess middlegames: understanding good and bad bishops, much more advanced understanding of kings attacks, importance of defending king by pieces, “space” (previously addresed only indirectly) (Larry Kaufmann, designer of Rybka’s evaluation function, RybkaForum.net) How to “teach” a program to recognise chess concepts? Knowledge is particularly important when annotating chess games…
The Concept of the Bad Bishop Chess experts in general understand the concept of bad bishop. Precise formalisation of this concept is difficult. Traditional definition ( John Watson, Secrets of Modern Chess Strategy, 1999 ) A bishop that is on the same colour of squares as its own pawns is bad: its mobility is restricted by its own pawns, it does not defend the squares in front of these pawns. Moreover, centralisation of these pawns is the main factor in deciding whether the bishop is bad or not.
CRAFTY 's Features for Describing Bishops FeatureDescriptionDefficiency for annotating BLACK_BISHOP_PLUS _PAWNS_ON_COLOR number of own pawns that are on same colour of square as bishop all such pawns count the same, regardless of their position and how badly they restrict the bishop BLACK_BISHOPS _POSITION evaluation of bishop's posi- tion based on predefined values for particular squares such predefined value is not the actual value of bishop's placement in a particular position BLACK_BISHOPS _MOBILITY number of squares that bishop attacks number of attacked squares and actual bishop's mobility are not necessarily the same thing
CRAFTY 's Features for Describing Bishops FeatureDescriptionDefficiency for annotating BLACK_BISHOP_PLUS _PAWNS_ON_COLOR number of own pawns that are on same colour of square as bishop all such pawns count the same, regardless of their position and how badly they restrict the bishop BLACK_BISHOPS _POSITION evaluation of bishop's posi- tion based on predefined values for particular squares such predefined value is not the actual value of bishop's placement in a particular position BLACK_BISHOPS _MOBILITY number of squares that bishop attacks number of attacked squares and actual bishop's mobility are not necessarily the same thing
CRAFTY 's Features for Describing Bishops FeatureDescriptionDefficiency for annotating BLACK_BISHOP_PLUS _PAWNS_ON_COLOR number of own pawns that are on same colour of square as bishop all such pawns count the same, regardless of their position and how badly they restrict the bishop BLACK_BISHOPS _POSITION evaluation of bishop's posi- tion based on predefined values for particular squares such predefined value is not the actual value of bishop's placement in a particular position BLACK_BISHOPS _MOBILITY number of squares that bishop attacks number of attacked squares and actual bishop's mobility are not necessarily the same thing
Static Nature of Positional Features Positional features in evaluation functions are static in their nature Heuristic search enables to fulfil their purpose – contributing to the program finding best moves
Static Nature of Positional Features Positional features in evaluation functions are static in their nature Heuristic search enables to fulfil their purpose – contributing to the program finding best moves It is also desirable for positional features for annotating chess games to be of static nature! Heuristic search is important for instructive annotations. see the paper Automated chess tutor. CG 2006, Turin, Italy, May 29-31, 2006.Automated chess tutor. Is freeing move e6-e5 possible or not?
Expert-Crafted Rules & Introducing Learning Data set: 200 middlegame positions from real chess games: bad: 78 bishops not bad: 122 bishops We randomly selected 100 positions for learning and 100 positions for testing. CRAFTY ’s positional feature values served as attribute values for learning. Chess experts: wGM Jana Krivec FM Matej Guid It is extremely difficult for chess experts to define appropriate rules for determining bad bishops -> knowledge acquisition bottleneck various expert-crafted rules performed rather poorly (only 59% of examples were correctly classified) -> motivation for introducing machine learning Data Set for Machine Learning
Bad Bishops from Static Point of View Our goal: construct a static positional feature BAD_BISHOP Assessing bishops “statically” is counter-intuitive from chess point of view Special guidelines were chosen for determining “statically” bad bishops… The bishop is bad from the static point of view in some position, if 1. its improvement or exchange is desirable 2. the pawn structure limits the bishop’s chances for taking an active part in the game, 3. its mobility in this position is limited or not important for the evaluation. Guidelines for Determining Bad Bishops from Static Point of View
Standard Machine Learning Methods' Performance with CRAFTY 's features only Machine learning methods’ performance on initial dataset MethodClassification accuracy Decision trees (C4.5)73% Logistic regression70% Rule learning (CN2)72% The results were obtained on test data set. The results obtained with CRAFTY ’s positional features only are too inaccurate for commenting purposes… additional information for describing bad bishops is necessary. see the paper Fighting Knowledge Acquisition Bottleneck with Argument Based Machine Learning. European Conference on Artificial Intelligence, Patras, Greece, July 21-25, Available at: Knowledge Acquisition Bottleneck with Argument BasedMachine Learning. Improved Method
First Critical Example Rules obtained by ABML method ABCN2 failed to classify this example as "not bad“. The following question was given to the experts: “Why is the black bishop not bad?“ The experts used their domain knowledge: “The black bishop is not bad, since its mobility is not seriously restricted by the pawns of both players.”
Introducing new attribute into the domain and adding argument to an example Experts’ explanation could not be described with current domain attributes. The argument “ BISHOP=“not bad” because IMPROVED_BISHOP_MOBILITY is high“ was added to the example. A new attribute, IMPROVED_BISHOP_MOBILITY, was included into the domain: number of squares accessible to the bishop, taking into account only own and opponent’s pawn structure
Another Critical Example The ABML-based knowledge elicitation process can be used to induce rules to determine both good (i.e., not bad) and bad bishops. Now the following question was given to the experts: “Why is the black bishop bad?“ The experts used their domain knowledge: “The black bishop is bad, since black pawns block its diagonals.”
Introducing new attribute into the domain and adding argument to an example Experts’ explanation could not be described with current domain attributes. The argument “ BISHOP=“bad” because BLACK_PAWN_BLOCKS_BISHOP_DIAGONAL is high“ was added to the example. A new attribute, BLACK_PAWN_BLOCKS_BISHOP_DIAGONAL, was included into the domain: pawns on the color of the square of the bishop that block the bishop's (front) diagonals weights of such pawns were taken into account (see the “Look-up table” that follows)
Method failed to explain critical example with given argument. Counter example was presented to experts: Experts’ explanation: “The black bishop is not bad, since its mobility is not seriously restricted, taking the pawn structure into account.” Counter example: “bad”, although BLACK_PAWN_BLOCKS_BISHOP _DIAGONAL is high. Counter example "Why is the “green” bishop not bad, compared to the “red” one?" Critical example: “not bad”, BLACK_PAWN_BLOCKS_BISHOP _DIAGONAL is high.
Improving Arguments with Counter Examples The argument given to the critical example was extended to “ BISHOP=“bad” because BLACK_PAWN_BLOCKS_BISHOP_DIAGONAL is high and IMPROVED_BISHOP_MOBILITY is low.” With this argument the method could not find any counter examples anymore. The new rule covering the critical example became: if BAD_PAWNS_BLOCK_BISHOP_DIAGONAL > 16 and IMPROVED BISHOP MOBILITY < 4 then BAD_BISHOP = “bad” class distribution [42,0]
Assesing “Bad” Pawns Experts designed a look-up table (left) with predefined values for pawns that are on color of square of the bishop in order to assign weights to such pawns. BAD_PAWNS_AHEAD = = 42
After the Final Iteration... AttributeDescription BAD_PAWNS pawns on the color of the square of the bishop - weighted according to their squares (bad pawns) BAD_PAWNS_AHEAD bad pawns ahead of the bishop BAD_PAWNS _BLOCK_BISHOP_DIAGONAL bad pawns that block the bishop's (front) diagonals BLOCKED_BAD_PAWNS bad pawns blocked by opponent's pawns or pieces IMPROVED_BISHOP_MOBILITY number of squares accessible to the bishop taking into account only pawns of both opponents The whole process consisted of 8 iterations. 7 arguments were attached to automatically selected critical examples 5 new attributes were included into the domain
Classification Accuracy Through Iterations The accuracies of all methods improved by adding new attributes. ABCN2 (which also used the arguments) outperformed all others. Arguments suggested useful attributes AND lead to more accurate models.
The Final Model Rule #2: if BAD_PAWNS_AHEAD > 18 and IMPROVED_BISHOP_MOBILITY < 3 then BISHOP = “bad” class distribution [46,0]
The Final Model Possible comment: “Black bishop is bad, since black pawns on the same colour of squares ahead of it, and pawns of both opponents restrict its mobility.”
Conclusions In this case study, Argument Based Machine Learning was shown to be a powerful knowledge-elicitation method for defining deep positional patterns. ABML offers the following advantages for knowledge elicitation: explain single example easier for experts to articulate knowledge critical examples expert provides only relevant knowledge arguments constrain learning hypotheses are consistent with expert knowledge