DiscoverDaily Paper CastStereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors
StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

Update: 2025-12-20
Share

Description

🤗 Upvotes: 33 | cs.CV



Authors:

Guibao Shen, Yihua Du, Wenhang Ge, Jing He, Chirui Chang, Donghao Zhou, Zhen Yang, Luozhou Wang, Xin Tao, Ying-Cong Chen



Title:

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors



Arxiv:

http://arxiv.org/abs/2512.16915v1



Abstract:

The rapid growth of stereoscopic displays, including VR headsets and 3D cinemas, has led to increasing demand for high-quality stereo video content. However, producing 3D videos remains costly and complex, while automatic Monocular-to-Stereo conversion is hindered by the limitations of the multi-stage ``Depth-Warp-Inpaint'' (DWI) pipeline. This paradigm suffers from error propagation, depth ambiguity, and format inconsistency between parallel and converged stereo configurations. To address these challenges, we introduce UniStereo, the first large-scale unified dataset for stereo video conversion, covering both stereo formats to enable fair benchmarking and robust model training. Building upon this dataset, we propose StereoPilot, an efficient feed-forward model that directly synthesizes the target view without relying on explicit depth maps or iterative diffusion sampling. Equipped with a learnable domain switcher and a cycle consistency loss, StereoPilot adapts seamlessly to different stereo formats and achieves improved consistency. Extensive experiments demonstrate that StereoPilot significantly outperforms state-of-the-art methods in both visual fidelity and computational efficiency. Project page: https://hit-perfect.github.io/StereoPilot/.

Comments 
In Channel
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

Jingwen Liang, Gengyu Wang