Xiaofeng Zhang  

Ph.D. student, SJTU



Key Laboratory of System Control and Information Processing

Shanghai Jiao Tong University

Email: framebreak@sjtu.edu.cn, WeChat: SemiZxf

When the tide comes in on the Qiantang River, today I finally know that I am myself.
[Google Scholar] [GitHub]


News

  • 9/2025: I have one paper (IPM) accepted by IPM (CAS Zone 1 Top journal).
  • 9/2025: We have one paper (Spatial-R1) accepted by NeurIPS 2025, congratulations to Yifan Shen and Yuanzhe Liu.
  • 8/2025: I have one paper (EAH) accepted by EMNLP 2025 oral🏆, see you in Suzhou.
  • 7/2025: We have one paper (MCA-LLaVA) accepted by ACM MM 2025, congratulations to Qiyan Zhao.
  • 6/2025: We have one paper (pCR Prediction in Breast Cancer) accepted by MICCAI 2025, congratulations to Dingrui Ma.
  • 6/2025: We have one paper (AdaToken-3D, VLM token pruning) accepted by IROS 2025, congratulations to Kai Zhang.
  • 6/2025: We have one paper (Mural inpainting) accepted by ACM TOMM, congratulations to Zishan Xu.
  • 5/2025: We have one paper (Image segmentation) accepted by ICML 2025, congratulations to Jiawei Cao.
  • 4/2025: We have one paper (Image restoration) accepted by IJCAI 2025, congratulations to Jiesong Bai.
  • 1/2025: I have one paper (LLaVA-CAM) accepted by NAACL 2025 oral🏆.
  • 1/2025: We have one paper accepted by ICLR 2025, congratulations to Sinan Fan.
  • 1/2025: I have one paper (Wakeup-Darkness) accepted by ACM TOMM.
  • 12/2024: I have one paper (Simignore) accepted by Neural Networks.
  • 12/2024: We have one paper accepted by TIM, congratulations to Jietao Yang.
  • 12/2024: I have one paper accepted by AAAI 2025.
  • 10/2024: We have one paper accepted by WACV 2025, congratulations to Yingtie Lei.
  • 9/2024: We have one paper accepted by NeurIPS 2024, congratulations to Xiaosong Yuan.
  • 8/2024: I have one paper (DOPRA) accepted by ACM MM 2024.

About Me

I am currently a third-year Ph.D. student at Shanghai Jiao Tong University (SJTU). Prior to that, I received my master's degree from NJUPT. My current research focuses on large vision-language models.

Biography

  • 2022.09 - Present, Ph.D. student at Shanghai Jiao Tong University, supervised by Chaochen Gu and Hao Tang.
  • 2024.01 - 2025.07, Research Intern at Alibaba Cloud (Apsara Lab), supervised by Chen Shen.
  • 2021.09 - 2022.09, Product Manager at China Mobile Communications Group Jiangsu Co., Ltd., Wuxi Branch.
  • 2014.09 - 2021.06, B/M.Eng. at Nanjing University of Posts and Telecommunications.

    Publications

    What Drives Attention Sinks? A Study of Massive Activations and Rotational Positional Encoding in Large Vision-Language Models

    Xiaofeng Zhang, Yuanchao Zhu, Chaochen Gu, Jiawei Cao, Hao Cheng, Kaijie Wu

    Information Processing & Management (CAS Zone 1).


    Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs

    Yifan Shen, Yuanzhe Liu, Jingyuan Zhu, Xu Cao, Xiaofeng Zhang, Yixiao He, Wenming Ye, James Matthew Rehg, Ismini Lourentzou

    NeurIPS, 2025 (project leader).

    [arXiv]

    Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs

    Xiaofeng Zhang*, Yihao Quan*, Chaochen Gu, Chen Shen, Xiaosong Yuan, Shaotian Yan, Jieping Ye

    EMNLP 2025 Oral🏆

    [arXiv] [Code]

    MCA-LLaVA: Manhattan Causal Attention for Reducing Hallucination in Large Vision-Language Models

    Qiyan Zhao*, Xiaofeng Zhang*, Yiheng Li, Yun Xing, Xiaosong Yuan, Feilong Tang, Sinan Fan, Xuhang Chen, Xuyao Zhang, Dahan Wang

    ACM MM 2025 (corresponding author/project leader)

    [arXiv] [Code]

    From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks

    Xiaofeng Zhang, Yihao Quan, Chen Shen, Xiaosong Yuan, Shaotian Yan, Chaochen Gu, Hao Tang, Jieping Ye

    NAACL 2025 Oral🏆

    [arXiv] [Code]

    Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation

    Xiaofeng Zhang, Fanshuo Zeng, Yihao Quan, Zheng Hui, Jiawei Yao

    AAAI, 2025

    [Code]

    Simignore: Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation

    Xiaofeng Zhang, Fanshuo Zeng, Chaochen Gu

    Neural Networks (CAS Zone 1)

    [Code]

    DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer

    Jingfeng Wei*, Xiaofeng Zhang*

    ACM MM, 2024🏆 (corresponding author/project leader)

    [arXiv]

    Improving complex reasoning with dynamic prompt corruption: a soft prompt optimization approach

    Sinan Fan, Liang Xie, Chen Shen, Ge Teng, Xiaosong Yuan, Xiaofeng Zhang, Jieping Ye

    ICLR, 2025.


    Instance-adaptive Zero-shot Chain-of-Thought Prompting

    Xiaosong Yuan, Chen Shen, Shaotian Yan, Xiaofeng Zhang, Liang Xie, Wenxiao Wang, Renchu Guan, Ying Wang, Jieping Ye

    NeurIPS, 2024.

    [arXiv]

    Wakeup-Darkness: When Multimodal Meets Unsupervised Low-light Image Enhancement

    Xiaofeng Zhang, Zishan Xu, Hao Tang, Chaochen Gu, Wei Chen

    ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2025

    [arXiv] [Code]

    Memory augment is All You Need for image restoration

    Xiaofeng Zhang, Chaochen Gu, Shanying Zhu

    IEEE Transactions on Consumer Electronics (CAS Zone 2), 2025.

    [arXiv] [Code]

    MuralDiff: Diffusion for Ancient Murals restoration on Large-scale Pre-training

    Zishan Xu, Xiaofeng Zhang, Wei Chen, Jueting Liu, Tingting Xu, Zehua Wang

    IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI), 2024.

    [arXiv]

    SpA-Former: An Effective and lightweight Transformer for image shadow removal

    Xiaofeng Zhang, Yudi Zhao, Chaochen Gu

    International Joint Conference on Neural Networks (IJCNN), 2023.

    [arXiv] [Code]

    Sienet: Siamese expansion network for image extrapolation

    Xiaofeng Zhang, Feng Chen, Cailing Wang

    IEEE Signal Processing Letters (SPL), 2021.

    [Paper] [Code]

    Awards

  • 2021 National Scholarship

    Services

    Invited Reviewer for:
  • TIP, TETCI, IPM
  • NeurIPS, CVPR, ICLR, AAAI, IJCAI, ACM MM, ACL, EMNLP