|
* denotes equal contribution, † denotes correspondence. Please see my CV or
Google Scholar for the full list. My representative papers are highlighted.
|
|
LLMs Pre-training and Attention
|
When Attention Sink Emerges in Language Models: An Empirical View
Xiangming Gu,
Tianyu Pang†,
Chao Du,
Qian Liu,
Fengzhuo Zhang,
Cunxiao Du,
Ye Wang†,
Min Lin
International Conference on Learning Representations (ICLR), Singapore, Singapore, 2025. (Spotlight)
Also in Annual Conference on Neural Information Processing Systems Workshop on Attributing Model Behavior at Scale (ATTRIB @ NeurIPS), Vancouver, Canada, 2024. (Oral)
pdf /
code /
video /
long talk /
slides /
poster
|
Why Do LLMs Attend to the First Token?
Federico Barbero*†,
Álvaro Arroyo*,
Xiangming Gu,
Christos Perivolaropoulos,
Michael Bronstein,
Petar Veličković,
Razvan Pascanu
Conference on Language Modeling (COLM), Montreal, Canada, 2025.
pdf
|
SkyLadder: Better and Faster Pretraining via Context Window Scheduling
Tongyao Zhu,
Qian Liu†,
Haonan Wang,
Shiqi Chen,
Xiangming Gu,
Tianyu Pang,
Min-Yen Kan
Annual Conference on Neural Information Processing Systems (NeurIPS), San Diego, USA, 2025.
Also in International Conference on Learning Representations Workshop on Open Science for Foundation Models (SCI-FM @ ICLR), Singapore, Singapore, 2025.
pdf /
code
|
|
Safety/Security of LLMs and Diffusion Models
|
Extracting Alignment Data in Open Models
Federico Barbero†,
Xiangming Gu,
Christopher A. Choquette-Choo,
Chawin Sitawarin,
Matthew Jagielski,
Itay Yona,
Petar Veličković,
Ilia Shumailov,
Jamie Hayes
Technical Report, 2025.
pdf
|
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Xiangming Gu*,
Xiaosen Zheng*,
Tianyu Pang*†,
Chao Du,
Qian Liu,
Ye Wang†,
Jing Jiang†,
Min Lin
International Conference on Machine Learning (ICML), Vienna, Austria, 2024.
Also in International Conference on Learning Representations Workshop on Large Language Model Agents (LLMAgents @ ICLR), Vienna, Austria, 2024.
pdf /
project page /
code /
video /
slides /
ICML poster /
GYSS poster /
WIRED press
|
On Memorization in Diffusion Models
Xiangming Gu,
Chao Du†,
Tianyu Pang†,
Chongxuan Li,
Min Lin,
Ye Wang†
Transactions on Machine Learning Research (TMLR), 2025.
pdf /
code
|
On Calibration of LLM-based Guard Models for Reliable Content Moderation
Hongfu Liu†,
Hengguan Huang,
Xiangming Gu,
Hao Wang,
Ye Wang
International Conference on Learning Representations (ICLR), Singapore, Singapore, 2025.
Also in Annual Conference on Neural Information Processing Systems Safe Generative AI Workshop (SafeGenAI @ NeurIPS), Vancouver, Canada, 2024. (Oral)
pdf /
code
|
|
Speech/Singing and Multimodality
|
Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
Xiangming Gu,
Longshen Ou,
Wei Zeng,
Jianan Zhang,
Nicholas Wong,
Ye Wang†
ACM Transactions on Multimedia Computing Communications and Applications (TOMM), 2024.
pdf /
code /
data
|
Elucidating Gender Fairness in Singing Voice Transcription
Xiangming Gu,
Wei Zeng,
Ye Wang†
ACM International Conference on Multimedia (MM), Ottawa, Canada, 2023.
pdf /
code /
video /
poster
|
MM-ALT: A Multimodal Automatic Lyric Transcription System
Xiangming Gu*,
Longshen Ou*,
Danielle Ong,
Ye Wang†
ACM International Conference on Multimedia (MM), Lisbon, Portugal, 2022. (Oral, Top Paper Award)
pdf /
appendix /
project page /
code /
data /
video /
press
|
Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
Longshen Ou*,
Xiangming Gu*,
Ye Wang†
International Society for Music Information Retrieval Conference (ISMIR), Bengaluru, India, 2022.
pdf /
code
|
Dean's Graduate Research Excellence Award, National University of Singapore, 2024
Research Achievement Award, National University of Singapore, 2025/2022
MM'22 Top Paper Award, Association for Computing Machinery, 2022
President's Graduate Fellowship, National University of Singapore, 2021-2025
Tsinghua's Friend - Zheng Geru Scholarship (Academic Excellence Scholarship), Tsinghua University, 2018
|
Conference reviewer for NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, ACL ARR, MM, IJCAI, AISTATS
Journal reviewer for TPAMI, TOMM, TASLP, RA-L
|
Teaching Assistant, CS4347/CS5647, Sound and Music Computing, Fall 2024
Teaching Assistant, CS6212, Topics in Media, Spring 2024
Teaching Assistant, CS5242, Neural Networks and Deep Learning, Spring 2023
Teaching Assistant, CS3244, Machine Learning, Fall 2022
Teaching Assistant, CS4243, Computer Vision and Pattern Recognition, Spring 2022
|
I love travel, movies, food, etc. I have lived in 🇨🇳🇸🇬🇬🇧, and travelled to 🇹🇭🇫🇮🇵🇹🇧🇪🇺🇸🇭🇰🇲🇾🇨🇦🇦🇪🇦🇹🇯🇵🇭🇺🇨🇿🇮🇹🇻🇦🇭🇷🇫🇷🇨🇭🇩🇪🇳🇱🇰🇷 for holidays/conferences.
|
You've probably seen this website template before, thanks to Jon Barron.
|
|