TacticZero experiments use the HOL4 environment and dataset as outlined in the original paper. It is an online policy gradient based algorithm. The architectures are defined in models.TacticZero, with the agent interaction loop being defined in experiments.TacticZero.

Running

To run a TacticZero experiment, from the root directory of the project simply run:

python3 -m experiments.TacticZero.tacticzero_experiment --config-name=tacticzero/{model}

Configuration

Users can specify what architectures to use for the goal, tactic and argument selection modules through the configurations in configs/tacticzero. Here, configs/tacticzero/tactic_zero is the base config, and the other config files are used to define model details for the GNN, Transformer and original Autoencoder used. Pretrained embedding models from Premise Selection experiments can be included by setting pretrain: true and pretrain_ckpt to the path of the pretrained checkpoint.

Data

This experiment assumes data has been processed into a MongoDB database, with the following yaml keys:

# MongoDB database (default is 'hol4')
data_config.data_options.db
# Processed Data for each expression/formula. Default is 'expression_graphs'
data_config.data_options.expression_col 
# Collection mapping each token to an index. Default is 'vocab_col'
data_config.data_options.vocab_col 
# Collection containing the train/val/test split of the goals to prove
data_config.data_options.paper_goals

Environments

Models

Updated: