arXiv Project Page

RT-MPINet

Real-Time View Synthesis with Multiplane Image Network using Multimodal Supervision (RT-MPINet)

We present a real-time multiplane image (MPI) network. Unlike existing MPI-based approaches, which often rely on a separate depth estimation network to guide the prediction of MPI parameters, our method predicts these parameters directly from a single RGB image. To guide the network, we use a multimodal training strategy with joint supervision from view synthesis and depth estimation losses. More details can be found in the paper.
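
To make the idea concrete, the sketch below shows how a stack of MPI planes is typically composited into a pixel value and how a view synthesis term and a depth term could be combined into one training loss. It is a minimal illustration, not the implementation from this repository: the function names, the per-pixel list representation, the L1 losses, and the `depth_weight` value are all assumptions made for this example. See the paper for the actual loss formulation.

```python
def composite_over(values, alphas):
    """Back-to-front 'over' compositing for a single pixel.

    values and alphas hold one entry per MPI plane, ordered from the
    farthest plane to the nearest. Passing per-plane colors yields the
    rendered intensity; passing per-plane depths yields a composited
    depth estimate.
    """
    out = 0.0
    for value, alpha in zip(values, alphas):
        # The nearer plane covers whatever has been accumulated so far.
        out = alpha * value + (1.0 - alpha) * out
    return out


def l1(pred, target):
    """Mean absolute error between two equally sized sequences."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def joint_loss(pred_rgb, target_rgb, pred_depth, target_depth, depth_weight=0.5):
    """Hypothetical joint objective: view synthesis term plus weighted depth term.

    The L1 terms and depth_weight are assumptions for illustration only.
    """
    return l1(pred_rgb, target_rgb) + depth_weight * l1(pred_depth, target_depth)


# Two planes: an opaque far plane and a half-transparent near plane.
intensity = composite_over([1.0, 0.0], [1.0, 0.5])
depth = composite_over([4.0, 1.0], [1.0, 0.5])
```

Because the same compositing routine produces both an image and a depth value from the predicted planes, a single set of MPI parameters can be supervised by both signals at once, which is the intuition behind joint supervision.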

Please visit the Project Page for supplementary materials and the full code.

Acknowledgements

  • We thank the authors of AdaMPI for their implementation of the homography renderer, which is used in this codebase under the ./utils directory.
  • We thank the author of the DeepView renderer template, which is used on our project page.

Citation

If you use our work, please cite it as follows:

@inproceedings{gond2025rtmpi,
  title={Real-Time View Synthesis with Multiplane Image Network using Multimodal Supervision},
  author={Gond, Manu and Shamshirgarha, Mohammadreza and Zerman, Emin and Knorr, Sebastian and Sj{\"o}str{\"o}m, M{\aa}rten},
  booktitle={2025 IEEE 27th International Workshop on Multimedia Signal Processing (MMSP)},
  pages={},
  year={2025},
  organization={IEEE}
}