ArXiv | PDF | Model (GSFixer_ckpt) | Bench (DL3DV-Res)
🤗 If you find GSFixer useful, please help ⭐ this repo, which is important to Open-Source projects. Thanks!
TL;DR: using 3D geometric (VGGT) and 2D semantic (DINOv2) features of reference views (sparse input views) to guide the video diffusion process, enabling consistency in restoring artifact novel views.
Below shows some examples:
| 3DGS Artifact Difix3D+ GenFusion GSFixer (Ours) GT |
|
|
|
git clone https://github.com/GVCLab/GSFixer.git
cd GSFixerGSFixer is tested with CUDA 12.1 on H20 & H100.
conda create -n gsfixer python=3.10
conda activate gsfixer
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu121
pip install -r requirements.txt
pip install Reconstruction/submodules/simple-knn
pip install Reconstruction/submodules/diff-gaussian-rasterizationDownload the pretrained models (e.g., GSFixer, CogVideoX-I2V, VGGT...) using HuggingFace.
sh download/download_hf.sh Download Mip-NeRF 360 dataset from their project page.
Download DL3DV-Benchmark dataset using their download script.
Download the DL3DV-Res dataset using HuggingFace:
python download/download_data_hf.pycd Restoration
python inference_gs_restoration_demo.py \
--model_name PATH-to-CogVideoX-5b-I2V \
--transformer_path PATH-to-GSFixer \
--dinov2_ckpt PATH-to-dinov2-with-registers-large \
--vggt_ckpt PATH-to-vggt-model.pt \
--blip_path PATH-to-blip2-opt-2.7b \
--base_folder PATH-to-DL3DV_Res_benchmark \
--ref_folders PATH-to-DL3DV_benchmark \
--num_views 3 \
--images_root ./demo/032dee9fb0a8bc1b90871dc5fe950080d0bcd3caf166447f44e60ca50ac04ec7 \
--outpath ./output_gsfixer_inference_results \
--scene_name ./examples/gsfixer/DL3DV-Res_scene_names.txtRun evaluate.sh using the following script. Please refer to the args to set up parameters.
cd Restoration
bash examples/gsfixer/evaluate.shRun train.py using the following script. Please refer to the args to set up parameters.
cd Reconstruction
# for per-scene results
python train_mipnerf360_sparse_view_recon_per_scene.py
# for all results
python train_mipnerf360_sparse_view_recon_all_scene.pyIncluding but not limited to: CogVideoX, diffusers, VGGT, DINOv2, Difix3D+, GenFusion, 3DGS-Enhancer, ReconX, ViewCrafter, TrajectoryCrafter, ReCamMaster...
If you find the work useful, please consider citing:
@article{yin2025gsfixer,
title={GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors},
author={Yin, Xingyilang and Zhang, Qi and Chang, Jiahao and Feng, Ying and Fan, Qingnan and Yang, Xi and Pun, Chi-Man and Zhang, Huaqi and Cun, Xiaodong},
journal={arXiv preprint arXiv:2508.09667},
year={2025}
}


