Text2Bricks : Transform Text to Buildable Lego Set

Overview

The Text2Bricks project aims to create buildable LEGO sets in the 3D LEGO representation format (LDRAW) based on natural language input. This involves using a diffusion model to generate a 3D shape from the input and reinforcement learning (RL) to construct the LEGO set based on the generated shape.

Final Projected Pipeline

Natural Language Input: User provides a description of the desired LEGO model.
Diffusion Model: Converts the input into a 3D shape.
Text2Brick Reinforcement Learning Model: Builds the LEGO model by:
- Utilizing a gym environment.
- Leveraging a combination of CNN and GNN for processing.
- Representing the LEGO world as a graph.
Output: A buildable LEGO set in LDRAW format.

Pipeline:
Natural Language Input → Diffusion Model (3D Shape) → Text2Brick RL Model (Gym + CNN + GNN) → Buildable LEGO Set (LDRAW Format)

Step 1: Initial Proof of Concept

Objective

Rebuild MNIST digits in LEGO LDRAW format. This simplified approach focuses on 2D reconstruction (ignoring the z-dimension) to reduce complexity in the initial stages of the project.

Reinforcement Learning Model Pipeline

Observations

Target Image: MNIST digit to rebuild.
Current Build LEGO Shape: Converted to grayscale image at each epoch.
Reward Function:
- Reward = α * brick_validity + β * IoU
  - IoU: Intersection-over-Union between the target image and the current LEGO shape.
  - brick_validity: Boolean indicating whether the brick placement is legal (e.g., no flying bricks).

RL Model Architecture

Model Type: TBD (Possibly Q-Learning).
Components:
- CNN: Processes the target and current build images (using a backbone from a pretrained model).
- GNN: Processes the graph representation of the LEGO world.
- Fusion and Attention Layer: TBD – should we include this?
- Output: Predicts the next LEGO node in the graph (Brick Class) or its x, y coordinates (to be determined).

LDRAW Format

LEGO sets are generated in LDRAW format. For details, see the official specification:
LDRAW File Format Documentation

References

Brick by Brick:
NeurIPS 2021 Paper
Learning to Build by Building Your Own Instructions:
arXiv 2410.01111

Name		Name	Last commit message	Last commit date
Latest commit History 119 Commits
.github/workflows		.github/workflows
images		images
ldraw-brick-example		ldraw-brick-example
testing_notebooks		testing_notebooks
text2brick		text2brick
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
getting_started.ipynb		getting_started.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Text2Bricks : Transform Text to Buildable Lego Set

Overview

Final Projected Pipeline

Step 1: Initial Proof of Concept

Objective

Reinforcement Learning Model Pipeline

Observations

RL Model Architecture

LDRAW Format

References

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

EwenBernard/Text2Bricks

Folders and files

Latest commit

History

Repository files navigation

Text2Bricks : Transform Text to Buildable Lego Set

Overview

Final Projected Pipeline

Step 1: Initial Proof of Concept

Objective

Reinforcement Learning Model Pipeline

Observations

RL Model Architecture

LDRAW Format

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages