YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

AuralSAM2

[CVPRF'26] AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting

by Yuyuan Liu, Yuanhong Chen, Chong Wang, Junlin Han, Junde Wu, Can Peng, Jingkun Chen, Yu Tian and Gustavo Carneiro

Installation

please install the dependencies and dataset based on this installation document.

Getting start

please follow this instruction document to reproduce our results.

Citation

please consider citing our work in your publications if it helps your research.

@article{liu2025auralsam2,
  title={AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting},
  author={Liu, Yuyuan and Chen, Yuanhong and Wang, Chong and Han, Junlin and Wu, Junde and Peng, Can and Chen, Jingkun and Tian, Yu and Carneiro, Gustavo},
  journal={arXiv preprint arXiv:2506.01015},
  year={2025}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for yyliu01/AuralSAM2