Explore In-Context Segmentation via Latent Diffusion Models

AAAI 2025

Requirements

Install torch==2.1.0.
Install pip packages via pip install -r requirements.txt and alpha_clip.
Our model is based on Stable Diffusion, download and put it into datasets/pretrain. Put the checkpoints of alpha_clip into datasets/pretrain/alpha-clip.

Data Preparation

Please download the following datasets: COCO 2014, DAVIS16, VSPW, and PASCAL, which includes PASCAL VOC 2012 and SBD. And then download the meta files. Put them under datasets and rearrange as follows.

datasets
├── pascal
│   ├── JPEGImages
│   ├── SegmentationClassAug
│   └── metas
├── davis16
│   ├── JPEGImages
│   ├── Annotations
│   └── metas
├── vspw
│   ├── images
│   ├── masks
│   └── metas
└── coco20i
    ├── annotations
    │   ├── train2014
    │   └── val2014
    ├── metas
    ├── train2014
    └── val2014

Train

The codes in scripts is launched by accelerate. The saved path is specified by --output_dir defined in args.

# ldis1
accelerate launch --multi_gpu --num_processes [GPUS] scripts/modelf.py --config configs/cfg.py
# ldisn
accelerate launch --multi_gpu --num_processes [GPUS] scripts/modeln.py --config configs/cfg.py --mask_alpha 0.4

Inference

# ldis1
accelerate launch --multi_gpu --num_processes [GPUS] scripts/modelf.py --config configs/cfg.py --only_val 1 --val_dataset pascal --output_dir [the path of ckpt]
# ldisn
accelerate launch --multi_gpu --num_processes [GPUS] scripts/modeln.py --config configs/cfg.py --only_val 1 --val_dataset pascal --output_dir [the path of ckpt] --mask_alpha 0.4

The pretrained models can be found here.

Citation

If you find our work useful, please kindly consider citing our paper:

@article{wang2024explore,
  title={Explore In-Context Segmentation via Latent Diffusion Models},
  author={Wang, Chaoyang and Li, Xiangtai and Ding, Henghui and Qi, Lu and Zhang, Jiangning and Tong, Yunhai and Loy, Chen Change and Yan, Shuicheng},
  journal={arXiv preprint arXiv:2403.09616},
  year={2024}
}

License

MIT license

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
assets		assets
configs		configs
diff		diff
module		module
scripts		scripts
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Explore In-Context Segmentation via Latent Diffusion Models

Requirements

Data Preparation

Train

Inference

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

wang-chaoyang/RefLDMSeg

Folders and files

Latest commit

History

Repository files navigation

Explore In-Context Segmentation via Latent Diffusion Models

Requirements

Data Preparation

Train

Inference

Citation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages