Xiaofeng Zhang

Ph.D. Student, Shanghai Jiao Tong University

Key Laboratory of System Control and Information Processing
Shanghai Jiao Tong University

📧 framebreak@sjtu.edu.cn  |  💬 WeChat: SemiZxf

When the tide comes in on the Qiantang River, today I finally know that I am myself.

📰 News

👤 About Me

I am currently a third-year Ph.D. student at Shanghai Jiao Tong University. Prior to that, I received my Master's degree from Nanjing University of Posts and Telecommunications. My current research focuses on large vision-language models, including hallucination mitigation, position encoding optimization, and efficient multimodal reasoning.

📚 Biography

📄 Publications (First Author)

Hallucination Begins Where Saliency Drops

Xiaofeng Zhang, Yuanchao Zhu, Chaochen Gu, Xiaosong Yuan, Qiyan Zhao, Jiawei Cao, Feilong Tang, Sinan Fan, Yaomin Shen, Chen Shen, Hao Tang

ICLR 2026 Oral 🏆

Context Tokens are Anchors: Understanding the Repetition Curse in Diffusion MLLMs from an Information Flow Perspective

Qiyan Zhao*, Xiaofeng Zhang*✉, Shuochen Chang, Qianyu Chen, Xiaosong Yuan, Xuhang Chen, Luoqi Liu, Jiajun Zhang, Xu-Yao Zhang, Da-Han Wang

ICLR 2026, corresponding author

What Drives Attention Sinks? A Study of Massive Activations and Rotational Positional Encoding in Large Vision-Language Models

Xiaofeng Zhang, Yuanchao Zhu, Chaochen Gu, Jiawei Cao, Hao Cheng, Kaijie Wu

Information Processing & Management (CAS Tier 1), CCF B

Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs

Xiaofeng Zhang*, Yihao Quan*, Chaochen Gu, Chen Shen, Xiaosong Yuan, Shaotian Yan, Jieping Ye

EMNLP 2025 Oral 🏆

From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks

Xiaofeng Zhang, Yihao Quan, Chen Shen, Xiaosong Yuan, Shaotian Yan, Chaochen Gu, Hao Tang, Jieping Ye

NAACL 2025 Oral 🏆

Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation

Xiaofeng Zhang✉, Fanshuo Zeng, Yihao Quan, Zheng Hui, Jiawei Yao

AAAI 2025

SimIgnore: Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation

Xiaofeng Zhang, Fanshuo Zeng, Chaochen Gu

Neural Networks (CAS Tier 1)

Wakeup-Darkness: When Multimodal Meets Unsupervised Low-light Image Enhancement

Xiaofeng Zhang, Zishan Xu, Hao Tang, Chaochen Gu, Wei Chen

ACM Transactions on Multimedia Computing, Communications, and Applications, 2025

Memory Augment is All You Need for Image Restoration

Xiaofeng Zhang, Chaochen Gu, Shanying Zhu

IEEE Transactions on Consumer Electronics (CAS Tier 2), 2026

📄 Publications (Corresponding Author / Project Lead)

Remember Me: Bridging the Long‑Range Gap in LVLMs with Three-Step Inference-Only Decay Resilience Strategies

Peng Gao*, Yujian Lee*, Xiaofeng Zhang*✉, Zailong Chen, Hui Zhang

AAAI 2026, corresponding author

D3ToM: Decider-Guided Dynamic Token Merging for Accelerating Diffusion MLLMs

Shuochen Chang, Xiaofeng Zhang*, Qingyang Liu, Li Niu*

AAAI 2026, project leader

MCA-LLaVA: Manhattan Causal Attention for Reducing Hallucination in Large Vision-Language Models

Qiyan Zhao*, Xiaofeng Zhang*✉, Yiheng Li, Yun Xing, Xiaosong Yuan, Feilong Tang, Sinan Fan, Xuhang Chen

ACM MM 2025 (corresponding author / project leader)

DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer

Jingfeng Wei*, Xiaofeng Zhang*✉

ACM MM 2024 🏆 (corresponding author / project leader)

🏆 Awards

🔍 Services

Invited Reviewer for: