Xiaofeng Zhang  

Ph.D. student, SJTU



Key Laboratory of System Control and Information Processing

Shanghai Jiao Tong University

Email: framebreak@sjtu.edu.cn, WeChat: SemiZxf

"When the tide arrives on the Qiantang River, only today do I realize that I am myself."
[Google Scholar] [GitHub]


News

  • 11/2025: We have two papers accepted by AAAI 2026, congratulations to Shuochen Chang and Peng Gao.
  • 9/2025: I have one paper accepted by IPM (CAS Zone 1, Top journal).
  • 9/2025: We have one paper (Spatial-R1) accepted by NeurIPS 2025, congratulations to Yifan Shen and Yuanzhe Liu.
  • 8/2025: I have one paper (EAH) accepted by EMNLP 2025 oral🏆, see you in Suzhou.
  • 7/2025: We have one paper (MCA-LLaVA) accepted by ACM MM 2025, congratulations to Qiyan Zhao.
  • 6/2025: We have one paper (pCR Prediction in Breast Cancer) accepted by MICCAI 2025, congratulations to Dingrui Ma.
  • 6/2025: We have one paper (AdaToken-3D, VLM token pruning) accepted by IROS 2025, congratulations to Kai Zhang.
  • 6/2025: We have one paper (Mural inpainting) accepted by ACM TOMM, congratulations to Zishan Xu.
  • 5/2025: We have one paper (Image segmentation) accepted by ICML 2025, congratulations to Jiawei Cao.
  • 4/2025: We have one paper (Image restoration) accepted by IJCAI 2025, congratulations to Jiesong Bai.
  • 1/2025: I have one paper (LLaVA-CAM) accepted by NAACL 2025 oral🏆.
  • 1/2025: We have one paper accepted by ICLR 2025, congratulations to Sinan Fan.
  • 1/2025: I have one paper (Wakeup-Darkness) accepted by ACM TOMM.
  • 12/2024: I have one paper (Simignore) accepted by Neural Networks.
  • 12/2024: We have one paper accepted by TIM, congratulations to Jietao Yang.
  • 12/2024: I have one paper accepted by AAAI 2025.
  • 10/2024: We have one paper accepted by WACV 2025, congratulations to Yingtie Lei.
  • 09/2024: We have one paper accepted by NeurIPS 2024, congratulations to Xiaosong Yuan.
  • 08/2024: I have one paper (DOPRA) accepted by ACM MM 2024.

About Me

I am currently a third-year Ph.D. student at Shanghai Jiao Tong University (SJTU). Prior to that, I received my master's degree from NJUPT. My current research focuses on large vision-language models.

Biography

  • 2022.09 - Present, Ph.D. student at Shanghai Jiao Tong University, supervised by Chaochen Gu and Hao Tang.
  • 2024.01 - 2025.07, Research Intern at Alibaba Cloud (Apsara Lab), supervised by Chen Shen.
  • 2021.09 - 2022.09, Product Manager at China Mobile Communications Group Jiangsu Co., Ltd., Wuxi Branch.
  • 2014.09 - 2021.06, B/M.Eng. at Nanjing University of Posts and Telecommunications.

Publications

    Remember Me: Bridging the Long‑Range Gap in LVLMs with Three-Step Inference-Only Decay Resilience Strategies

    Peng Gao, Yujian Lee, Xiaofeng Zhang*, Zailong Chen, Hui Zhang

    AAAI 2026, corresponding author

    [Arxiv]

    D3ToM: Decider-Guided Dynamic Token Merging for Accelerating Diffusion MLLMs

    Shuochen Chang, Xiaofeng Zhang*, Qingyang Liu, Li Niu*

    AAAI 2026, project leader

    [Arxiv] [Code]

    What Drives Attention Sinks? A Study of Massive Activations and Rotational Positional Encoding in Large Vision-Language Models

    Xiaofeng Zhang, Yuanchao Zhu, Chaochen Gu, Jiawei Cao, Hao Cheng, Kaijie Wu

    Information Processing & Management (CAS Zone 1).


    Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs

    Xiaofeng Zhang*, Yihao Quan*, Chaochen Gu, Chen Shen, Xiaosong Yuan, Shaotian Yan, Jieping Ye

    EMNLP 2025 Oral🏆

    [Arxiv] [Code]

    MCA-LLaVA: Manhattan Causal Attention for Reducing Hallucination in Large Vision-Language Models

    Qiyan Zhao*, Xiaofeng Zhang*, Yiheng Li, Yun Xing, Xiaosong Yuan, Feilong Tang, Sinan Fan, Xuhang Chen

    ACM MM 2025 (corresponding author/project leader)

    [Arxiv] [Code]

    From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks

    Xiaofeng Zhang, Yihao Quan, Chen Shen, Xiaosong Yuan, Shaotian Yan, Chaochen Gu, Hao Tang, Jieping Ye

    NAACL 2025 Oral🏆

    [arXiv] [Code]

    Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation

    Xiaofeng Zhang, Fanshuo Zeng, Yihao Quan, Zheng Hui, Jiawei Yao

    AAAI, 2025

    [Code]

    Simignore: Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation

    Xiaofeng Zhang, Fanshuo Zeng, Chaochen Gu

    Neural Networks (CAS Zone 1)

    [Code]

    DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer

    Jingfeng Wei*, Xiaofeng Zhang*

    ACM MM, 2024🏆 (corresponding author/project leader)

    [arXiv]

    Improving complex reasoning with dynamic prompt corruption: a soft prompt optimization approach

    Sinan Fan, Liang Xie, Chen Shen, Ge Teng, Xiaosong Yuan, Xiaofeng Zhang, Jieping Ye

    ICLR, 2025.


    Instance-adaptive Zero-shot Chain-of-Thought Prompting

    Xiaosong Yuan, Chen Shen, Shaotian Yan, Xiaofeng Zhang, Liang Xie, Wenxiao Wang, Renchu Guan, Ying Wang, Jieping Ye

    NeurIPS, 2024.

    [arXiv]

    Wakeup-Darkness: When Multimodal Meets Unsupervised Low-light Image Enhancement

    Xiaofeng Zhang, Zishan Xu, Hao Tang, Chaochen Gu, Wei Chen

    ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2025

    [arXiv] [Code]

    Memory augment is All You Need for image restoration

    Xiaofeng Zhang, Chaochen Gu, Shanying Zhu

    IEEE Transactions on Consumer Electronics (CAS Zone 2), 2025.

    [arXiv] [Code]

    SpA-Former: An Effective and lightweight Transformer for image shadow removal

    Xiaofeng Zhang, Yudi Zhao, Chaochen Gu

    International Joint Conference on Neural Networks, (IJCNN), 2023.

    [arXiv] [Code]

    Sienet: Siamese expansion network for image extrapolation

    Xiaofeng Zhang, Feng Chen, Cailing Wang

    IEEE Signal Processing Letters (SPL), 2021.

    [Paper] [Code]

Awards

  • 2021 National Scholarship

Services



    Invited Reviewer for:
  • TPAMI, TIP, TETCI, IPM
  • NeurIPS, CVPR, ICLR, ICCV, AAAI, IJCAI, ACM MM, ACL, EMNLP