Game-Rule Learning


Three independent agents (the protagonist, the antagonist, and the wannabe) share an instance of our robotic testbed in order to learn the rules of board games. The protagonist and the antagonist are given the rules of the game; the wannabe is not. The protagonist and the antagonist then play a series of physically instantiated games of a particular kind while the wannabe watches.

The wannabe then infers the rules of the game purely from visual input. It then takes the place of one of the earlier players, say the antagonist, and uses the learned rules to drive physically instantiated play. Note that the aim is to learn to play legally, not necessarily well.

Shown below, for Tic-Tac-Toe, Hexapawn, and four variants of Hexapawn, are the initial rules, the robotic play used to learn the rules, the learned rules, and the robotic play from the learned rules.

Initial Rules

Learning Rules

Learned Rules

Playing from Learned Rules

Tic-Tac-Toe

Every cache square for every player in the initial state has some piece of
that player.

A player moves by moving some piece of that player from some cache square for
that player to some empty board square.

A player wins when every square in some row has some piece of that player.
A player wins when every square in some column has some piece of that player.
A player wins when every square in some diagonal has some piece of that
player.

A player draws when no player wins and that player has no move.
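
The Tic-Tac-Toe rules above can be read as a small legality checker. The following is a minimal sketch, not the representation used by the system described here: the 3x3 board size, the player names "X" and "O", the cache size of five pieces per player, and all function names are assumptions made for illustration.

```python
from itertools import product

SIZE = 3  # assumed board size; the rule text does not state it


def initial_state():
    """Empty board, with every piece still in its owner's cache."""
    board = {sq: None for sq in product(range(SIZE), repeat=2)}
    cache = {"X": 5, "O": 5}  # assumed number of cache squares per player
    return board, cache


def legal_moves(board, cache, player):
    """A move takes a piece from the player's cache to any empty board square."""
    if cache[player] == 0:
        return []
    return [sq for sq, piece in board.items() if piece is None]


def apply_move(board, cache, player, square):
    """Return the new (board, cache) after the player places a piece."""
    new_board = dict(board)
    new_cache = dict(cache)
    new_board[square] = player
    new_cache[player] -= 1
    return new_board, new_cache


def lines():
    """All rows, columns, and the two diagonals."""
    rows = [[(r, c) for c in range(SIZE)] for r in range(SIZE)]
    cols = [[(r, c) for r in range(SIZE)] for c in range(SIZE)]
    diags = [[(i, i) for i in range(SIZE)],
             [(i, SIZE - 1 - i) for i in range(SIZE)]]
    return rows + cols + diags


def wins(board, player):
    """A player wins when every square in some line holds that player's piece."""
    return any(all(board[sq] == player for sq in line) for line in lines())


def draws(board, cache, player):
    """A player draws when no player wins and that player has no move."""
    no_winner = not any(wins(board, p) for p in cache)
    return no_winner and not legal_moves(board, cache, player)
```

Because the stated aim is legal rather than strong play, a player driven by such a checker could simply pick any element of `legal_moves`.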
              
Hexapawn

Every square in the close row for every player in the initial state has some
piece of that player.

A player moves by moving some piece of that player from some square to some
empty forward-adjacent square for that player of that square.
A player moves by moving some piece of the opponent of that player from some
forward-diagonal square for that player of some square to some cache square
for that opponent then moving the piece of that player from that square to
that forward-diagonal square.

A player wins when some square in the distant row for that player has some
piece of that player.

A player draws when no player wins and that player has no move.
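
The Hexapawn rules above can likewise be sketched as a move generator. This is an illustrative sketch under stated assumptions, not the system's own representation: the 3x3 board, the player names "W" and "B", the choice of which row is each player's close row, and all function names are hypothetical. The cache is not tracked, so a captured piece is simply removed from the board. The draw rule follows the text above, in which a player with no move draws.

```python
SIZE = 3                          # assumed board size
FORWARD = {"W": 1, "B": -1}       # assumed row direction of "forward"
CLOSE_ROW = {"W": 0, "B": SIZE - 1}
DISTANT_ROW = {"W": SIZE - 1, "B": 0}


def opponent(player):
    return "B" if player == "W" else "W"


def initial_board():
    """Every square in each player's close row holds a piece of that player."""
    board = {(r, c): None for r in range(SIZE) for c in range(SIZE)}
    for player in ("W", "B"):
        for c in range(SIZE):
            board[(CLOSE_ROW[player], c)] = player
    return board


def legal_moves(board, player):
    """Advance to an empty forward-adjacent square, or capture diagonally."""
    moves = []
    for (r, c), piece in board.items():
        if piece != player:
            continue
        nr = r + FORWARD[player]
        if not 0 <= nr < SIZE:
            continue
        if board[(nr, c)] is None:
            moves.append(("advance", (r, c), (nr, c)))
        for nc in (c - 1, c + 1):
            if 0 <= nc < SIZE and board[(nr, nc)] == opponent(player):
                moves.append(("capture", (r, c), (nr, nc)))
    return moves


def apply_move(board, move):
    """Move the piece; a captured piece leaves the board (cache not modelled)."""
    _, src, dst = move
    new_board = dict(board)
    new_board[dst] = new_board[src]
    new_board[src] = None
    return new_board


def wins(board, player):
    """A player wins when some square in its distant row holds its piece."""
    return any(board[(DISTANT_ROW[player], c)] == player for c in range(SIZE))


def draws(board, player):
    """A player draws when no player wins and that player has no move."""
    no_winner = not wins(board, "W") and not wins(board, "B")
    return no_winner and not legal_moves(board, player)
```

Keeping advances and captures as separate tagged moves mirrors the two separate "A player moves by ..." clauses in the rule text.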
