Gensheng Pei (裴根生)


Publications


[ Conference Papers, Journal Articles ]

An asterisk (*) beside authors' names indicates equal contributions.


Preprints

  1. Gensheng Pei, Xiruo Jiang, Yazhou Yao, Xiangbo Shu, Fumin Shen, and Byeungwoo Jeon.
    Taming SAM3 in the Wild: A Concept Bank for Open-Vocabulary Segmentation.
    Preprint, 2026.
    [ arXiv ]

  2. Gensheng Pei, Yazhou Yao, Jianbo Jiao, Wenguan Wang, Liqiang Nie, and Jinhui Tang.
    Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation.
    Preprint, 2024.
    [ arXiv ]


Conference Papers

  1. Gensheng Pei, Xiruo Jiang, Xinhao Cai, Tao Chen, Yazhou Yao, and Byeungwoo Jeon.
    PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation.
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE/CVF, Denver CO, USA, Jun 3--7, 2026.
    [ paper ]

  2. Gensheng Pei, Tao Chen, Yujia Wang, Xinhao Cai, Xiangbo Shu, Tianfei Zhou, and Yazhou Yao.
    Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection.
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE/CVF, pp. 24862--24872, Nashville TN, USA, Jun 11--15, 2025.
    [ paper ]

  3. Gensheng Pei, Tao Chen, Xiruo Jiang, Huafeng Liu, Zeren Sun, and Yazhou Yao.
    VideoMAC: Video Masked Autoencoders Meet ConvNets.
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE/CVF, pp. 22733--22743, Seattle WA, USA, Jun 17--21, 2024.
    [ paper ]

  4. Gensheng Pei, Fumin Shen, Yazhou Yao, Guo-Sen Xie, Zhenmin Tang, and Jinhui Tang.
    Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation.
    In Proceedings of European Conference on Computer Vision (ECCV), Springer Nature, pp. 596--613, Tel Aviv, Israel, October 23--27, 2022.
    [ paper ]

  5. Zhenyu Yang, Gensheng Pei, Tao Chen, Yichao Zhou, Tianfei Zhou, Yazhou Yao, and Fumin Shen.
    Efficiency Follows Global-Local Decoupling.
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE/CVF, Denver CO, USA, Jun 3--7, 2026.
    [ paper ]

  6. Haowen Gu, Gensheng Pei, Zeren Sun, Mingwu Ren, Xiangbo Shu, Yazhou Yao, and Fumin Shen.
    MedFG-VQA: Low-Frequency Memory and Graph Attention for Lightweight Medical VQA.
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE/CVF, Denver CO, USA, Jun 3--7, 2026.
    [ paper ]

  7. Jianqiang Xu, Gensheng Pei, Huafeng Liu, and Yazhou Yao.
    GSV2X: Geometry-Aware Uncertainty Modeling and Orthogonal Fusion for Robust Roadside Perception.
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE/CVF, Denver CO, USA, Jun 3--7, 2026.
    [ paper ]

  8. Xinhao Cai, Gensheng Pei, Zeren Sun, Yazhou Yao, Fumin Shen, and Wenguan Wang.
    Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation.
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE/CVF, Denver CO, USA, Jun 3--7, 2026.
    [ paper ]

  9. Jianjian Yin, Tao Chen, Yi Chen, Gensheng Pei, Xiangbo Shu, Yazhou Yao, and Fumin Shen.
    PCA-Seg: Revisiting Cost Aggregation for Open-Vocabulary Semantic and Part Segmentation.
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE/CVF, Denver CO, USA, Jun 3--7, 2026.
    [ paper ]

  10. Xinhao Cai, Liulei Li, Gensheng Pei, Tao Chen, Jinshan Pan, Yazhou Yao, and Wenguan Wang.
    Beyond Frequency: Scoring-Driven Debiasing for Object Detection via Blueprint-Prompted Image Synthesis.
    In Proceedings of the International Conference on Learning Representations (ICLR), OpenReview, Rio de Janeiro, Brazil, April 23--27, 2026.
    [ paper ]

  11. Zhenyu Yang, Gensheng Pei, Tao Chen, Xia Yuan, Haofeng Zhang, Xiangbo Shu, Yazhou Yao.
    Beyond Quadratic: Linear-Time Change Detection with RWKV.
    In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), AAAI, Singapore, January 20--27, 2026.
    [ paper ]

  12. Xinhao Cai, Qiuxia Lai, Gensheng Pei, Xiangbo Shu, Yazhou Yao, and Wenguan Wang.
    Cycle-Consistent Learning for Joint Layout-to-Image Generation and Object Detection.
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), IEEE/CVF, pp. 6797--6807, Honolulu HI, USA, October 19--23, 2025.
    [ paper ]

  13. Mengmeng Sheng, Zeren Sun, Gensheng Pei, Tao Chen, Haonan Luo, and Yazhou Yao.
    Enhancing Robustness in Learning with Noisy Labels: An Asymmetric Co-Training Approach.
    In Proceedings of ACM International Conference on Multimedia (ACMMM), ACM, pp. 4406--4415, Melbourne VIC, Australia, October 28--November 1, 2024.
    [ paper ]

  14. Tao Chen, XiRuo Jiang, Gensheng Pei, Zeren Sun, Yucheng Wang, and Yazhou Yao.
    Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation.
    In Proceedings of European Conference on Computer Vision (ECCV), Springer Nature, pp. 441--458, Milan, Italy, September 29--October 4, 2024.
    [ paper ]


Journal Articles

  1. Jianjian Yin, Xiruo Jiang, Tao Chen, Gensheng Pei, Yazhou Yao, Fumin Shen, and Heng-Tao Shen.
    DepMatch: Boosting Semi-supervised Semantic Segmentation by Exploring Depth Difference Knowledge.
    IEEE Transactions on Image Processing, 2026.
    [ link ]

  2. Zhenyu Yang, Gensheng Pei, Yazhou Yao, Tianfei Zhou, Lizhong Ding, and Fumin Shen.
    ChangeTitans: Towards Remote Sensing Change Detection with Neural Memory.
    IEEE Transactions on Geoscience and Remote Sensing, vol. 63, pp. 4709714, 2025.
    [ link ]

  3. Yin Tang, Rui Chen, Gensheng Pei, and Qiong Wang.
    PASS-SAM: Integration of Segment Anything Model for Large-Scale Unsupervised Semantic Segmentation.
    Computational Visual Media, vol. 11, pp. 669--674, 2025.
    [ link ]

  4. Jianjian Yin, Tao Chen, Gensheng Pei, Yazhou Yao, Liqiang Nie, and Xiansheng Hua.
    Semi-Supervised Semantic Segmentation With Multi-Constraint Consistency Learning.
    IEEE Transactions on Multimedia, vol. 27, pp. 6449--6461, 2025.
    [ link ]

  5. Gensheng Pei, Fumin Shen, Yazhou Yao, Tao Chen, Xian-Sheng Hua, and Heng-Tao Shen.
    Hierarchical Graph Pattern Understanding for Zero-Shot Video Object Segmentation.
    IEEE Transactions on Image Processing, vol. 32, pp. 5909--5920, 2023.
    [ link ]

  6. Yazhou Yao, Tao Chen, Hanbo Bi, Xinhao Cai, Gensheng Pei, Guoye Yang, Zhiyuan Yan, Xian Sun, Xing Xu, and Hai Zhang.
    Automated Object Recognition in High-resolution Optical Remote Sensing Imagery.
    National Science Review, vol. 10, nwad122, 2023.
    [ link ]

  7. Gensheng Pei, Yazhou Yao, Fumin Shen, Dan Huang, Xingguo Huang, and Heng-Tao Shen.
    Hierarchical Co-attention Propagation Network for Zero-Shot Video Object Segmentation.
    IEEE Transactions on Image Processing, vol. 32, pp. 2348--2359, 2023.
    [ link ]