2025.09.17 - #49 - ViPE, Maps for Autonomous Driving survey, TeraSim-World

# Interesting papers

## ViPE: Video Pose Engine for 3D Geometric Perception

- https://research.nvidia.com/labs/toronto-ai/vipe/
- ViPE efficiently estimates camera intrinsics, camera motion, and dense, near-metric depth maps from unconstrained raw videos.
- It is robust to diverse scenarios, including dynamic selfie videos, cinematic shots, and dashcam footage, and supports various camera models such as pinhole, wide-angle, and 360° panoramas.
- Runs at 3-5 FPS on a single GPU for standard input resolutions.
- Designed to bridge the gap between classical and learning-based approaches: it combines the scalability and precision of a dense Bundle Adjustment (BA) framework, akin to SLAM, with the robustness of modern learned components.
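
To make the three outputs listed above concrete (intrinsics, camera motion, dense depth), here is a minimal sketch of how they could be combined into a world-space point cloud. It assumes the pinhole model only, z-depth (not ray distance), and a 4x4 camera-to-world pose matrix. The function name `backproject_depth` and its parameters are hypothetical and are not taken from ViPE's code or output format.

```python
import numpy as np


def backproject_depth(depth, fx, fy, cx, cy, cam_to_world=None):
    """Lift a pinhole z-depth map (H, W) to an (H*W, 3) point cloud.

    Hypothetical helper for illustration; not part of ViPE.
    fx, fy, cx, cy are pinhole intrinsics in pixels.
    cam_to_world is an assumed 4x4 camera-to-world transform.
    """
    h, w = depth.shape
    # u indexes columns (x), v indexes rows (y); both have shape (H, W).
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    # Invert the pinhole projection u = fx * x / z + cx, v = fy * y / z + cy.
    x = (u - cx) / fx * depth
    y = (v - cy) / fy * depth
    pts_cam = np.stack([x, y, depth], axis=-1).reshape(-1, 3)
    if cam_to_world is None:
        return pts_cam
    # Apply the rigid transform: p_world = R @ p_cam + t.
    rotation = cam_to_world[:3, :3]
    translation = cam_to_world[:3, 3]
    return pts_cam @ rotation.T + translation


if __name__ == "__main__":
    depth = np.full((2, 3), 2.0)   # toy depth map, 2 m everywhere
    pose = np.eye(4)
    pose[0, 3] = 1.0               # camera shifted 1 m along world x
    points = backproject_depth(depth, fx=2.0, fy=2.0, cx=1.0, cy=0.5,
                               cam_to_world=pose)
    print(points.shape)
```

Accumulating such per-frame clouds across a video only gives a consistent scene if depth scale, intrinsics, and poses agree with each other, which is the kind of consistency a bundle-adjustment stage is meant to enforce. Wide-angle and 360° inputs would need a different unprojection than the pinhole formula used here.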
 
![](https://research.nvidia.com/labs/toronto-ai/vipe/assets/images/method.png)

<img width="1314" height="276" alt="Image" src="https://github.com/user-attachments/assets/0b9c098a-e3b8-44f4-9a2a-92dc5d71b3bb" />

<img width="1314" height="164" alt="Image" src="https://github.com/user-attachments/assets/aabff955-4a95-4820-b470-4396b30f3bf4" />

<img width="1314" height="383" alt="Image" src="https://github.com/user-attachments/assets/bd6bbfdb-f041-4237-b06c-c35cff54712e" />

<img width="1314" height="211" alt="Image" src="https://github.com/user-attachments/assets/9566ee13-f473-4566-811a-710645bb8e4b" />

<img width="1314" height="602" alt="Image" src="https://github.com/user-attachments/assets/6736bdca-3584-4d24-ad88-67c9a5915f3d" />

<img width="1314" height="697" alt="Image" src="https://github.com/user-attachments/assets/233a955d-4e74-4dd8-8cdb-f0addd381e27" />

<img width="1314" height="562" alt="Image" src="https://github.com/user-attachments/assets/7f6f8a19-fbe7-45ba-af2f-2008adc15e54" />
