Causal language modeling can elicit search and reasoning capabilities on logic puzzles

Official implementation of the paper

Installing the environment

The recommended way to run the code is with an Anaconda/Miniconda environment. First, clone the repository:

git clone https://github.com/kulinshah98/llm-reasoning-logic-puzzles.git

Then, create a new Anaconda environment and install the dependencies:

conda env create -f environment.yml -n logic_puzzles

Preparing datasets

The dataset for the Sudoku and Zebra both puzzles is given in Link.

Sudoku dataset

Sudoku-train-data.npy contains around 1.8M suodku puzzles and Sudoku-test-data.npy contains 0.1M sudoku puzzles. Both the dataset only contains puzzles with the unique solutions.
Each example of the dataset contains 325 entries:
- First value: Number of filled cells in that example puzzle
- Rest of 324 values (324 values = 4 values corresponding to each of the cell in $9 \times 9$ puzzle) can be divided in two parts. Each example first contains the information about the cells given in the puzzle and followed by the information about the empty cells of the puzzle.
- The information (4 values) about each of the cell in the puzzle is given in the form of (row of the cell, column of the cell, correct value of the cell, strategy/list of strategy that needs to be applied to get the correct value).
- Strategy id: (0) the cell is given, (2) the cell is filled with Lone single, (3) Hidden single, (4) Naked pair, (5) Naked Triplet, (6) Locked Candidate, (7) XY Wing, (8) Unique Rectangle

Zebra dataset

zebra-train-data.pkl contains around 1.5M puzzles and zebra-test-data.pkl contains around 0.1M puzzles with number of entities (houses) and number of attributes (properties) in the range of 3 to 6.
Each example contains a list with 3 elements.
First element of the list contains the list of the clues/hints for the puzzle in the symbolic language.
- Clues are given in [Clue type] LHS [c/n] [attr1/house_no1] [val1] RHS [c/n] [attr2/house_no2] [val2] CLUE_END.
  - Clue type denotes the relation between attributes (See paper for more details on the cluetypes).
  - [c/n] denotes whether the following information is about attribute or a house. 'c' denotes that it is for the attribute and 'n' denotes that it is for the house numbering.
- Example: "immediate-left LHS n 0 2 RHS c 1 2 CLUE_END'. This means that house number 2 is immediate left of the house whose first attribute is 2.
Second element of the list contains the solution box.
- The first row always contains house numbers.
- From second row, the ith list contains information about (i-1)th attribute.
Third element of the list contains the order in which solver decides values in the solution box. Intuitively, the solver tries to fill the easy-to-decode cells.

Training a model on the sudoku puzzle

To train a new model, set the experiment name in {EXP_NAME} and work directory name in {WORKDIR} in scripts/train.sh file and run the following two commands:

cd sudoku-code
bash scripts/train.sh

To change the hyperparameters of the training, check sudoku-code/train/main.py file.

Training a model on the zebra puzzle

Similarly to the sudoku puzzles, to train a new model, set the experiment name in {EXP_NAME} and work directory name in {WORKDIR} in scripts/train.sh file and run the following two commands:

cd zebra-code
bash scripts/train.sh

To change the hyperparameters of the training, check zebra-code/train/main.py file.

Acknowledgement

This code is inspired from the following repositories.

JAX/Flax implementation of minGPT (Link)
Dokusan Sudoku solver (Link)

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
sudoku-code		sudoku-code
zebra-code		zebra-code
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Causal language modeling can elicit search and reasoning capabilities on logic puzzles

Official implementation of the paper

Installing the environment

Preparing datasets

Sudoku dataset

Zebra dataset

Training a model on the sudoku puzzle

Training a model on the zebra puzzle

Acknowledgement

About

Uh oh!

Releases

Packages

Languages

License

kulinshah98/llm-reasoning-logic-puzzles

Folders and files

Latest commit

History

Repository files navigation

Causal language modeling can elicit search and reasoning capabilities on logic puzzles

Official implementation of the paper

Installing the environment

Preparing datasets

Sudoku dataset

Zebra dataset

Training a model on the sudoku puzzle

Training a model on the zebra puzzle

Acknowledgement

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages