Xiaofeng Zhang

Ph.D. student, SJTU

Key Laboratory of System Control and Information Processing

Shanghai Jiao Tong University

Email: framebreak@sjtu.edu.cn, WeChat: SemiZxf

When the tide arrives on the Qiantang River, today I finally know that I am myself.
[Google Scholar] [GitHub]

News

11/2025: I have three papers accepted by ICLR 2026! Congratulations!
11/2025: We have two papers accepted by AAAI 2026, congratulations to Shuochen Chang and Peng Gao.
9/2025: I have one paper (IPM) accepted by IPM (CAS Q1 Top journal).
9/2025: We have one paper (Spatial-R1) accepted by NeurIPS 2025, congratulations to Yifan Shen and Yuanzhe Liu.
8/2025: I have one paper (EAH) accepted by EMNLP 2025 oral🏆, see you in Suzhou.
7/2025: We have one paper (MCA-LLaVA) accepted by ACM MM 2025, congratulations to Qiyan Zhao.
6/2025: We have one paper (pCR Prediction in Breast Cancer) accepted by MICCAI 2025, congratulations to Dingrui Ma.
6/2025: We have one paper (AdaToken-3D, VLM token pruning) accepted by IROS 2025, congratulations to Kai Zhang.
6/2025: We have one paper (Mural inpainting) accepted by ACM TOMM, congratulations to Zishan Xu.
5/2025: We have one paper (Image segmentation) accepted by ICML 2025, congratulations to Jiawei Cao.
4/2025: We have one paper (Image restoration) accepted by IJCAI 2025, congratulations to Jiesong Bai.
1/2025: I have one paper (LLaVA-CAM) accepted by NAACL 2025 oral🏆.
1/2025: We have one paper accepted by ICLR 2025, congratulations to Sinan Fan.
1/2025: I have one paper (Wakeup-Darkness) accepted by ACM TOMM.
12/2024: I have one paper (Simignore) accepted by Neural Networks.
12/2024: We have one paper accepted by TIM, congratulations to Jietao Yang.
12/2024: I have one paper accepted by AAAI 2025.
10/2024: We have one paper accepted by WACV 2025, congratulations to Yingtie Lei.
09/2024: We have one paper accepted by NeurIPS 2024, congratulations to Xiaosong Yuan.
08/2024: I have one paper (DOPRA) accepted by ACM MM 2024.

About Me

I am currently a third-year Ph.D. student at Shanghai Jiao Tong University (SJTU). Prior to that, I received my master's degree from NJUPT. My current research focuses on large vision-language models.

Biography

2022.09 - Present, Ph.D. at Shanghai Jiao Tong University, supervised by Chaochen Gu and Hao Tang.

2024.01 - 2025.07, Research Intern at Alibaba Cloud (Apsara Lab), supervised by Chen Shen.

2021.09 - 2022.09, Product Manager at China Mobile Communications Group Jiangsu Co., Ltd., Wuxi Branch.

2014.09 - 2021.06, B/M.Eng. at Nanjing University of Posts and Telecommunications.

Publications

	Hallucination Begins Where Saliency Drops Xiaofeng Zhang, Yuanchao Zhu, Chaochen Gu, Xiaosong Yuan, Qiyan Zhao, Jiawei Cao, Feilong Tang, Sinan Fan, Yaomin Shen, Chen Shen, Hao Tang ICLR 2026 [Openreview] [Code]

	Context Tokens are Anchors: Understanding the Repetition Curse in Diffusion MLLMs from an Information Flow Perspective Qiyan Zhao, Xiaofeng Zhang, Shuochen Chang, Qianyu Chen, Xiaosong Yuan, Xuhang Chen, Luoqi Liu, Jiajun Zhang, Xu-Yao Zhang, Da-Han Wang ICLR 2026, corresponding author* [Openreview]

	Remember Me: Bridging the Long‑Range Gap in LVLMs with Three-Step Inference-Only Decay Resilience Strategies Peng Gao, Yujian Lee, Xiaofeng Zhang, Zailong Chen, Hui Zhang AAAI 2026, corresponding author* [Arxiv]

	D3ToM: Decider-Guided Dynamic Token Merging for Accelerating Diffusion MLLMs Shuochen Chang, Xiaofeng Zhang, Qingyang Liu, Li Niu AAAI 2026, project leader [Arxiv] [Code]

	What Drives Attention Sinks? A Study of Massive Activations and Rotational Positional Encoding in Large Vision-Language Models Xiaofeng Zhang, Yuanchao Zhu, Chaochen Gu, Jiawei Cao, Hao Cheng, Kaijie Wu Information Processing & Management (CAS Q1).

	Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs Xiaofeng Zhang, Yihao Quan, Chaochen Gu, Chen Shen, Xiaosong Yuan, Shaotian Yan, Jieping Ye EMNLP 2025 Oral🏆 [Arxiv] [Code]

	MCA-LLaVA: Manhattan Causal Attention for Reducing Hallucination in Large Vision-Language Models Qiyan Zhao, Xiaofeng Zhang, Yiheng Li, Yun Xing, Xiaosong Yuan, Feilong Tang, Sinan Fan, Xuhang Chen ACM MM 2025 (corresponding author/project leader) [Arxiv] [Code]

	From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks Xiaofeng Zhang, Yihao Quan, Chen Shen, Xiaosong Yuan, Shaotian Yan, Chaochen Gu, Hao Tang, Jieping Ye NAACL 2025 Oral🏆 [arXiv] [Code]

	Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation Xiaofeng Zhang, Fanshuo Zeng, Yihao Quan, Zheng Hui, Jiawei Yao AAAI, 2025 [Code]

	Simignore: Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation Xiaofeng Zhang, Fanshuo Zeng, Chaochen Gu Neural Networks (CAS Q1) [Code]

	DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer Jingfeng Wei, Xiaofeng Zhang ACM MM, 2024🏆, (corresponding author/project leader) [arXiv]

	Improving complex reasoning with dynamic prompt corruption: a soft prompt optimization approach Sinan Fan, Liang Xie, Chen Shen, Ge Teng, Xiaosong Yuan, Xiaofeng Zhang, Jieping Ye ICLR, 2025.

	Instance-adaptive Zero-shot Chain-of-Thought Prompting Xiaosong Yuan, Chen Shen, Shaotian Yan, Xiaofeng Zhang, Liang Xie, Wenxiao Wang, Renchu Guan, Ying Wang, Jieping Ye NeurIPS, 2024. [arXiv]

	Wakeup-Darkness: When Multimodal Meets Unsupervised Low-light Image Enhancement Xiaofeng Zhang, Zishan Xu, Hao Tang, Chaochen Gu, Wei Chen ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2025 [arXiv] [Code]

	Memory augment is All You Need for image restoration Xiaofeng Zhang, Chaochen Gu, Shanying Zhu IEEE Transactions on Consumer Electronics (CAS Q2), 2025. [arXiv] [Code]

	SpA-Former: An Effective and lightweight Transformer for image shadow removal Xiaofeng Zhang, Yudi Zhao, Chaochen Gu International Joint Conference on Neural Networks (IJCNN), 2023. [arXiv] [Code]

	Sienet: Siamese expansion network for image extrapolation Xiaofeng Zhang, Feng Chen, Cailing Wang IEEE Signal Processing Letters (SPL), 2021. [Paper] [Code]

Awards

2021 National Scholarship

Services

Invited Reviewer for:

TPAMI, TIP, TETCI, IPM

NeurIPS, CVPR, ICLR, ICCV, AAAI, IJCAI, ACM MM, ACL, EMNLP