2025.09.17 - #49 - ViPE, Maps for Autonomous Driving survey, TeraSim-World #51

@changh95

Interesting papers

ViPE: Video Pose Engine for 3D Geometric Perception

  • https://research.nvidia.com/labs/toronto-ai/vipe/
  • ViPE efficiently estimates camera intrinsics, camera motion, and dense, near-metric depth maps from unconstrained raw videos.
  • It is robust to diverse scenarios, including dynamic selfie videos, cinematic shots, and dashcam footage, and supports various camera models such as pinhole, wide-angle, and 360° panoramas.
  • It runs at 3-5 FPS on a single GPU for standard input resolutions.
  • It is designed to bridge the gap between classical and learning-based approaches: it combines the scalability and precision of a dense Bundle Adjustment (BA) framework, akin to SLAM, with the robustness of modern learned components.
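
To make the three outputs listed above concrete (intrinsics, camera motion, depth), here is a minimal sketch of how they combine to lift one pixel into a 3D world point. It uses only the textbook pinhole model; the wide-angle and 360° cases need a different unprojection. The function names, the argument layout, and the camera-to-world pose convention are assumptions made for illustration, not ViPE's actual API or output format.

```python
from typing import Sequence, Tuple

Vec3 = Tuple[float, float, float]


def unproject_pixel(u: float, v: float, depth: float,
                    fx: float, fy: float, cx: float, cy: float) -> Vec3:
    """Back-project pixel (u, v) into camera coordinates (pinhole model).

    `depth` is assumed to be the z-distance along the optical axis.
    """
    x = (u - cx) / fx * depth
    y = (v - cy) / fy * depth
    return (x, y, depth)


def camera_to_world(p_cam: Sequence[float],
                    rotation: Sequence[Sequence[float]],
                    translation: Sequence[float]) -> Vec3:
    """Apply an assumed camera-to-world pose: p_world = R * p_cam + t."""
    return tuple(
        sum(rotation[i][j] * p_cam[j] for j in range(3)) + translation[i]
        for i in range(3)
    )


# Hypothetical values: a 640x480 image with focal length 500 px.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
p_cam = unproject_pixel(570.0, 240.0, 2.0, fx=500.0, fy=500.0, cx=320.0, cy=240.0)
p_world = camera_to_world(p_cam, identity, (1.0, 0.0, 0.5))
print(p_cam)    # (1.0, 0.0, 2.0)
print(p_world)  # (2.0, 0.0, 2.5)
```

Repeating this for every pixel of every frame yields a point cloud in a shared world frame, which is why per-frame depth that is consistent with the estimated poses and intrinsics matters.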
