News
-
[2024.02]: One paper got accepted to ACM Transactions on Multimedia Computing Communications and Applications (TOMM'2024)!
-
[2024.02]: We released Agent Smith, which got posted as "Here Come the AI Worms" on WIRED Magazine!
-
[2023.09]: One paper got accepted to IEEE Transactions on Audio, Speech and Language Processing (TASLP'2023)!
-
[2023.07]: One paper got accepted to ACM International Conference on Multimedia (MM'2023)!
-
[2023.03]: I joined Sea AI lab (SAIL) as a research intern and my research is related to generative models and (multimodal) large language models!
-
[2023.01]: I received the Research Achievement Award from School of Computing, NUS!
-
[2022.12]: One paper got accepted to Transactions on Machine Learning Research (TMLR'2022)!
-
[2022.12]: I passed my Ph.D. Qualifying Examination (PQE) and became a Ph.D. candidate!
-
[2022.09]: One paper got accepted to Advances in Neural Information Processing Systems (NeurIPS'2022)!
-
[2022.07]: One paper got accepted to International Society for Music Information Retrieval Conference (ISMIR'2022)!
-
[2022.06]: One paper got accepted to ACM International Conference on Multimedia (MM'2022) as oral presentation, which also won the Top Paper Award!
-
[2022.05]: One paper got accepted to IEEE Transactions on Image Processing (TIP'2022)!
|
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Xiangming Gu*,
Xiaosen Zheng*,
Tianyu Pang*†,
Chao Du,
Qian Liu,
Ye Wang†,
Jing Jiang†,
Min Lin
International Conference on Learning Representations Workshop on Large Language Model Agents (LLMAgents @ ICLR'2024), Vienna, Austria
pdf /
project page /
code /
press
|
Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
Xiangming Gu,
Longshen Ou,
Wei Zeng,
Jianan Zhang,
Nicholas Wong,
Ye Wang†
ACM Transactions on Multimedia Computing Communications and Applications (TOMM'2024).
pdf /
code /
data
|
Unsupervised Mismatch Localization in Cross-Modal Sequential Data with Application to Mispronunciations Localization
Wei Wei*,
Hengguan Huang*,
Xiangming Gu,
Hao Wang,
Ye Wang†
Transactions on Machine Learning Research (TMLR'2022).
pdf /
code
|
Extrapolative Continuous-time Bayesian Neural Network for Fast Training-free Test-time Adaptation
Hengguan Huang†,
Xiangming Gu,
Hao Wang,
Chang Xiao,
Hongfu Liu,
Ye Wang†
Advances in Neural Information Processing Systems (NeurIPS'2022), New Orleans, USA.
pdf /
code /
video
|
Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
Longshen Ou*,
Xiangming Gu*,
Ye Wang†
International Society for Music Information Retrieval Conference (ISMIR'2022), Bengaluru, India.
pdf /
code
|
MM-ALT: A Multimodal Automatic Lyric Transcription System
Xiangming Gu*,
Longshen Ou*,
Danielle Ong,
Ye Wang†
ACM International Conference on Multimedia (MM'2022). (Oral, Top Paper Award), Lisbon, Portugal.
pdf /
appendix /
project page /
code /
data /
video /
press
|
Research Incentive Award, National University of Singapore, 2023
Research Achievement Award, National University of Singapore, 2022
MM'22 Top Paper Award, Association for Computing Machinery, 2022
MM'22 Student Travel Grant, Association for Computing Machinery, 2022
President's Graduate Fellowship, National University of Singapore, 2021
Visiting Undergraduate Student Scholarship, Tsinghua University, 2020
Zheng Geru Scholarship, Tsinghua University, 2018
|
Conference reviewer for MM 2024, ECCV 2024, IJCAI 2024, ICCV 2023, AISTATS 2021
Journal reviewer for TASLP, RA-L
|
You've probably seen this website template before, thanks to Jon Barron.
Last Updated March 2024.
|
|