Interesting papers
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
- https://arxiv.org/pdf/2411.18613
- https://cat-4d.github.io/
- 4D scene generation, multi-view video diffusion model, deformable 3D Gaussian
pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction
- https://openaccess.thecvf.com//content/CVPR2024/papers/Charatan_pixelSplat_3D_Gaussian_Splats_from_Image_Pairs_for_Scalable_Generalizable_CVPR_2024_paper.pdf
- https://davidcharatan.com/pixelsplat/
LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation
Navigation World Models
MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds

MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
Diorama: Unleashing Zero-shot Single-view 3D Scene Modeling

MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos
- https://arxiv.org/pdf/2412.04463
- https://mega-sam.github.io/
- deep visual SLAM framework
- Check out the website for better examples