Publications
[ Conference Papers,
Journal Articles ]
An asterisk (*) beside authors' names indicates equal contributions.
Preprints
Gensheng Pei, Xiruo Jiang, Yazhou Yao, Xiangbo Shu, Fumin Shen, and Byeungwoo Jeon.
Taming SAM3 in the Wild: A Concept Bank for Open-Vocabulary Segmentation.
Preprint, 2026.
[ arXiv ]
Gensheng Pei, Yazhou Yao, Jianbo Jiao, Wenguan Wang, Liqiang Nie, and Jinhui Tang.
Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation.
Preprint, 2024.
[ arXiv ]
Conference Papers
Gensheng Pei, Xiruo Jiang, Xinhao Cai, Tao Chen, Yazhou Yao, and Byeungwoo Jeon.
PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation.
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR),
IEEE/CVF, Denver CO, USA, Jun 3--7, 2026.
[ paper ]
Gensheng Pei, Tao Chen, Yujia Wang, Xinhao Cai, Xiangbo Shu, Tianfei Zhou, and Yazhou Yao.
Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection.
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR),
IEEE/CVF, pp. 24862--24872, Nashville TN, USA, Jun 11--15, 2025.
[ paper ]
Gensheng Pei, Tao Chen, Xiruo Jiang, Huafeng Liu, Zeren Sun, and Yazhou Yao.
VideoMAC: Video Masked Autoencoders Meet ConvNets.
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR),
IEEE/CVF, pp. 22733--22743, Seattle WA, USA, Jun 17--21, 2024.
[ paper ]
Gensheng Pei, Fumin Shen, Yazhou Yao, Guo-Sen Xie, Zhenmin Tang, and Jinhui Tang.
Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation.
In Proceedings of European Conference on Computer Vision (ECCV),
Springer Nature, pp. 596--613, Tel Aviv, Israel, October 23--27, 2022.
[ paper ]
Zhenyu Yang, Gensheng Pei, Tao Chen, Yichao Zhou, Tianfei Zhou, Yazhou Yao, and Fumin Shen.
Efficiency Follows Global-Local Decoupling.
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR),
IEEE/CVF, Denver CO, USA, Jun 3--7, 2026.
[ paper ]
Haowen Gu, Gensheng Pei, Zeren Sun, Mingwu Ren, Xiangbo Shu, Yazhou Yao, and Fumin Shen.
MedFG-VQA: Low-Frequency Memory and Graph Attention for Lightweight Medical VQA.
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR),
IEEE/CVF, Denver CO, USA, Jun 3--7, 2026.
[ paper ]
Jianqiang Xu, Gensheng Pei, Huafeng Liu, and Yazhou Yao.
GSV2X: Geometry-Aware Uncertainty Modeling and Orthogonal Fusion for Robust Roadside Perception.
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR),
IEEE/CVF, Denver CO, USA, Jun 3--7, 2026.
[ paper ]
Xinhao Cai, Gensheng Pei, Zeren Sun, Yazhou Yao, Fumin Shen, and Wenguan Wang.
Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation.
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR),
IEEE/CVF, Denver CO, USA, Jun 3--7, 2026.
[ paper ]
Jianjian Yin, Tao Chen, Yi Chen, Gensheng Pei, Xiangbo Shu, Yazhou Yao, and Fumin Shen.
PCA-Seg: Revisiting Cost Aggregation for Open-Vocabulary Semantic and Part Segmentation.
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR),
IEEE/CVF, Denver CO, USA, Jun 3--7, 2026.
[ paper ]
Xinhao Cai, Liulei Li, Gensheng Pei, Tao Chen, Jinshan Pan, Yazhou Yao, and Wenguan Wang.
Beyond Frequency: Scoring-Driven Debiasing for Object Detection via Blueprint-Prompted Image Synthesis.
In Proceedings of the International Conference on Learning Representations (ICLR),
OpenReview, Rio de Janeiro, Brazil, April 23--27, 2026.
[ paper ]
Zhenyu Yang, Gensheng Pei, Tao Chen, Xia Yuan, Haofeng Zhang, Xiangbo Shu, Yazhou Yao.
Beyond Quadratic: Linear-Time Change Detection with RWKV.
In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI),
AAAI, Singapore, January 20--27, 2026.
[ paper ]
Xinhao Cai, Qiuxia Lai, Gensheng Pei, Xiangbo Shu, Yazhou Yao, and Wenguan Wang.
Cycle-Consistent Learning for Joint Layout-to-Image Generation and Object Detection.
In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV),
IEEE/CVF, pp. 6797--6807, Honolulu HI, USA, October 19--23, 2025.
[ paper ]
Mengmeng Sheng, Zeren Sun, Gensheng Pei, Tao Chen, Haonan Luo, and Yazhou Yao.
Enhancing Robustness in Learning with Noisy Labels: An Asymmetric Co-Training Approach.
In Proceedings of ACM International Conference on Multimedia (ACMMM),
ACM, pp. 4406--4415, Melbourne VIC, Australia, October 28--November 1, 2024.
[ paper ]
Tao Chen, XiRuo Jiang, Gensheng Pei, Zeren Sun, Yucheng Wang, and Yazhou Yao.
Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation.
In Proceedings of European Conference on Computer Vision (ECCV),
Springer Nature, pp. 441--458, Milan, Italy, September 29--October 4, 2024.
[ paper ]
Journal Articles
Jianjian Yin, Xiruo Jiang, Tao Chen, Gensheng Pei, Yazhou Yao, Fumin Shen, and Heng-Tao Shen.
DepMatch: Boosting Semi-supervised Semantic Segmentation by Exploring Depth Difference Knowledge.
IEEE Transactions on Image Processing, 2026.
[ link ]
Zhenyu Yang, Gensheng Pei, Yazhou Yao, Tianfei Zhou, Lizhong Ding, and Fumin Shen.
ChangeTitans: Towards Remote Sensing Change Detection with Neural Memory.
IEEE Transactions on Geoscience and Remote Sensing, vol. 63, pp. 4709714, 2025.
[ link ]
Yin Tang, Rui Chen, Gensheng Pei, and Qiong Wang.
PASS-SAM: Integration of Segment Anything Model for Large-Scale Unsupervised Semantic Segmentation.
Computational Visual Media, vol. 11, pp. 669--674, 2025.
[ link ]
Jianjian Yin, Tao Chen, Gensheng Pei, Yazhou Yao, Liqiang Nie, and Xiansheng Hua.
Semi-Supervised Semantic Segmentation With Multi-Constraint Consistency Learning.
IEEE Transactions on Multimedia, vol. 27, pp. 6449--6461, 2025.
[ link ]
Gensheng Pei, Fumin Shen, Yazhou Yao, Tao Chen, Xian-Sheng Hua, and Heng-Tao Shen.
Hierarchical Graph Pattern Understanding for Zero-Shot Video Object Segmentation.
IEEE Transactions on Image Processing, vol. 32, pp. 5909--5920, 2023.
[ link ]
Yazhou Yao, Tao Chen, Hanbo Bi, Xinhao Cai, Gensheng Pei, Guoye Yang, Zhiyuan Yan, Xian Sun, Xing Xu, and Hai Zhang.
Automated Object Recognition in High-resolution Optical Remote Sensing Imagery.
National Science Review, vol. 10, nwad122, 2023.
[ link ]
Gensheng Pei, Yazhou Yao, Fumin Shen, Dan Huang, Xingguo Huang, and Heng-Tao Shen.
Hierarchical Co-attention Propagation Network for Zero-Shot Video Object Segmentation.
IEEE Transactions on Image Processing, vol. 32, pp. 2348--2359, 2023.
[ link ]
|