Xiaofeng Zhang  

Ph.D student, SJTU



Key Laboratory of System Control and Information Processing

Shang Hai Jiao Tong Unversity

Email: framebreak@sjtu.edu.cn, 微信: SemiZxf

钱塘江上朝信来,今日方知我是我
[Google Scholar] [GitHub]


News

  • 11/2025: I have three paper accepted by ICLR 2026! congratulations!~~ .
  • 11/2025: We have two paper accepted by AAAI 2026 , congratulations to Shuochen Chang and Peng Gao.
  • 9/2025: I have one paper (IPM) accepted by IPM(中科院一区top) .
  • 9/2025: We have one paper (Spatial-R1) accepted by NeurIPS 2025 , congratulations to Yifan Shen and Yuanzhe Liu.
  • 8/2025: I have one paper (EAH) accepted by EMNLP 2025 oral🏆, see you in suzhou.
  • 7/2025: We have one paper (MCA-LLaVA) accepted by ACM MM 2025, congratulations to Qiyan Zhao.
  • 6/2025: We have one paper (pCR Prediction in Breast Cancer) accepted by MICCAI 2025, congratulations to Dingrui Ma.
  • 6/2025: We have one paper (AdaToken-3D, VLM token pruning ) accepted by IROS 2025, congratulations to Kai Zhang.
  • 6/2025: We have one paper (Mural inpainting) accepted by ACM TOMM, congratulations to Zishan Xu.
  • 5/2025: We have one paper (Image segmetation) accepted by ICML 2025, congratulations to Jiawei Cao.
  • 4/2025: We have one paper (Image restoration) accepted by IJCAI 2025, congratulations to Jiesong Bai.
  • 1/2025: I have one paper (LLaVA-CAM) accepted by NAACL 2025 oral🏆.
  • 1/2025: We have one paper accepted by ICLR 2025, congratulations to Sinan Fan.
  • 1/2025: I have one paper (Wakeup-Darkness) accepted by ACM TOMM..
  • 12/2024: I have one paper (Simignore) accepted by Neural Network.
  • 12/2024: We have one paper accepted by TIM, congratulations to Jietao Yang.
  • 12/2024: I have one paper accepted by AAAI 2025.
  • 10/2024: We have one paper accepted by WACV 2025, congratulations to Yingtie Lei.
  • 09/2024: We have one paper accepted by NIPS 2024, congratulations to Xiaosong Yuan.
  • 08/2024: I have one paper (DOPRA) accepted by ACM MM 2024.

About Me

I am currently an third-year Ph.D student at SJTU, Shang Hai Jiao Tong University . Prior to that, I received my master degree in NJUPT. My current research focuses on large vision-language models.

Biography

  • 2022.09 - Present, Ph.D at Shang Hai Jiao Tong Unversity, supervised by Chaochen Gu and Hao Tang.
  • 2024.01 - 2025.7. Research Intern at Alibaba Cloud(飞天实验室), supervised by Chen Shen.
  • 2021.09 - 2022.9. Product manger. in China Mobile Communications Group Jiangsu Co., LTD. Wuxi branch.
  • 2014.09- 2021.06, B/M.Eng. in Nanjing University of Posts and Telecommunications.

    Publications

    Hallucination Begins Where Saliency Drops

    Xiaofeng Zhang, Yuanchao Zhu, Chaochen Gu, Xiaosong Yuan, Qiyan Zhao, Jiawei Cao, Feilong Tang, Sinan Fan, Yaomin Shen, Chen Shen, Hao Tang

    ICLR 2026

    [Openreview] [Code]

    Context Tokens are Anchors: Understanding the Repetition Curse in Diffusion MLLMs from an Information Flow Perspective

    Qiyan Zhao, Xiaofeng Zhang*, Shuochen Chang, Qianyu Chen, Xiaosong Yuan, Xuhang Chen, Luoqi Liu, Jiajun Zhang, Xu-Yao Zhang, Da-Han Wang

    ICLR 2026, corresponding author

    [Openreview]

    Remember Me: Bridging the Long‑Range Gap in LVLMs with Three-Step Inference-Only Decay Resilience Strategies

    Peng Gao, Yujian Lee, Xiaofeng Zhang*, Zailong Chen, Hui Zhang

    AAAI 2026, corresponding author

    [Arxiv]

    D3ToM: Decider-Guided Dynamic Token Merging for Accelerating Diffusion MLLMs

    Shuochen Chang, Xiaofeng Zhang*, Qingyang Liu, Li Niu*

    AAAI 2026, projector leader

    [Arxiv] [Code]

    What Drives Attention Sinks? A Study of Massive Activations and Rotational Positional Encoding in Large Vision-Language Models

    Xiaofeng Zhang, Yuanchao Zhu, Chaochen Gu, Jiawei Cao, Hao Cheng, Kaijie Wu

    Information Processing & Mangement (中科院一区).


    Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs

    Xiaofeng Zhang*, Yihao Quan*, Chaochen Gu, Chen Shen, Xiaosong Yuan, Shaotian Yan, Jieping Ye

    EMNLP 2025 Oral🏆

    [Arxiv] [Code]

    MCA-LLaVA: Manhattan Causal Attention for Reducing Hallucination in Large Vision-Language Models

    Qiyan Zhao*, Xiaofeng Zhang*, Yiheng Li, Yun Xing, Xiaosng Yuan, Feilong Tang, Sinan Fan, Xuhang Chen

    ACM MM 2025 (corresponding author/projector leader)

    [Arxiv] [Code]

    From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks

    Xiaofeng Zhang, Yihao Quan, Chen Shen, Xiaosong Yuan, Shaotian Yan, Chaochen Gu, Hao Tang, Jieping Ye

    NAACL 2025 Oral🏆

    [arXiv] [Code]

    Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation

    Xiaofeng Zhang, Fanshuo Zeng, Yihao Quan, Zheng Hui, Jiawei Yao

    AAAI, 2025

    [Code]

    Simignore:Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation

    Xiaofeng Zhang, Fanshuo Zeng, Chaochen Gu

    Neural Network (中科院一区)

    [Code]

    DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer

    Jingfeng Wei*, Xiaofeng Zhang*

    ACM MM, 2024🏆, (corresponding author/projector leader)

    [arXiv]

    Improving complex reasoning with dynamic prompt corruption: a soft prompt optimization approch

    Sinan Fan, Liang Xie, Chen Shen, Ge Teng, Xiaosong Yuan, Xiaofeng Zhang, Jieping Ye

    ICLR, 2025.


    Instance-adaptive Zero-shot Chain-of-Thought Prompting

    Xiaosong Yuan, Chen Shen, Shaotian Yan, Xiaofeng Zhang, Liang Xie, Wenxiao Wang, Renchu Guan, Ying Wang, Jieping Ye

    NeurIPS, 2024.

    [arXiv]

    Wakeup-Darkness: When Multimodal Meets Unsupervised Low-light Image Enhancement

    Xiaofeng Zhang, Zishan Xu, Hao Tang, Chaochen Gu, Wei Chen

    ACM Transactions on Multimedia Computing, Communications, 2025

    [arXiv] [Code]

    Memory augment is All You Need for image restoration

    Xiaofeng Zhang, Chaochen Gu, Shanying Zhu

    IEEE Transactions on Consumer Electronics (中科院二区), 2025.

    [arXiv] [Code]

    SpA-Former: An Effective and lightweight Transformer for image shadow removal

    Xiaofeng Zhang, Yudi Zhao, Chaochen Gu

    International Joint Conference on Neural Networks, (IJCNN), 2023.

    [arXiv] [Code]

    Sienet: Siamese expansion network for image extrapolation

    Xiaofeng Zhang, Feng Chen, Cailing Wang

    IEEE Signal Processing Letters (SPL), 2021.

    [Paper] [Code]

    Awards

  • 2021 National Scholarship

  • Services



    Invited Reviewer for:
  • TPAMI,TIP, TETCI, IPM
  • NIPS, CVPR, ICLR, ICCV, AAAI, IJCAI, ACM MM, ACL, EMNLP