Welcome to the Sound and Music Computing Lab at the National University of Singapore! The NUS Sound and Music Computing Lab strives to develop Sound and Music Computing (SMC) technologies, in particular Music Information Retrieval (MIR) technologies, with an emphasis on applications in e-Learning (especially computer-assisted music and language edutainment) and e-Health (especially computer-assisted music-enhanced exercise and therapy).
We seek to harness the synergy of SMC, MIR, mobile computing, and cloud computing technologies to promote healthy lifestyles and to facilitate disease prevention, diagnosis, and treatment in both developed countries and resource-poor developing countries.
The National University of Singapore (NUS) Sound and Music Computing Lab is pursuing research in Sound and Music Computing for Human Health and Potential (SMC4HHP) and has multiple research positions (Research Fellow, Research Assistant, and PhD scholarships) available.
We are currently seeking two full-time Postdoctoral Research Fellows in automatic lyrics generation and automatic singing voice/speech evaluation. [Here is a detailed job description]
We have advised students from a wide range of disciplines and across many education levels. See our alumni here!
Wang, Y., Wei, W., Gu, X., Guan, X., and Wang, Y., (2023). Disentangled Adversarial Domain Adaptation for Phonation Mode Detection in Singing and Speech. IEEE/ACM Transactions on Audio, Speech, and Language Processing (DOI: 10.1109/TASLP.2023.3317568).
Gu, X., Zeng, W., and Wang, Y., (2023, October). Elucidate Gender Fairness in Singing Voice Transcription, in 2023 ACM Multimedia Conference (MM'23).
Liu, H., Shi, M., and Wang, Y., (2023, August). Zero-Shot Automatic Pronunciation Assessment, in the 19th Annual Conference of the International Speech Communication Association (Interspeech 2023).
Zhao, J., Xia, G., and Wang, Y., (2023, August). Q&A: Query-Based Representation Learning for Multi-Track Symbolic Music Re-Arrangement, in Proceedings of the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023). [code] [demo] [colab notebook]
Ou, L., Ma, X., Kan, M. Y., and Wang, Y., (2023, July). Songs Across Borders: Singable and Controllable Neural Lyric Translation, in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023). [code] [demo]
Wang, Y., Wei, W., and Wang, Y., (2023, June). Phonation Mode Detection in Singing: A Singer Adapted Model, in ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023). IEEE.
Dai, S., Ma, X., Wang, Y., and Dannenberg, R. B., (2023). Personalized Popular Music Generation Using Imitation and Structure. Journal of New Music Research, pp. 1-17.
Wei, W.*, Huang, H.*, Gu, X., Wang, H., and Wang, Y., (2022). Unsupervised Mismatch Localization in Cross-Modal Sequential Data with Application to Mispronunciation Localization. Transactions on Machine Learning Research (12/2022).
Wu, X.*, Huang, H.*, Ding, Y., Wang, H., Wang, Y., and Xu, Q., (2023, February). FedNP: Towards Non-IID Federated Learning via Federated Neural Propagation, in Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023). [video]
Huang, H., Gu, X., Wang, H., Xiao, C., Liu, H., and Wang, Y., (2022, December). Extrapolative Continuous-time Bayesian Neural Network for Fast Training-free Test-time Adaptation, in Proceedings of Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022 (NeurIPS 2022). [video]
Ou, L.*, Gu, X.*, and Wang, Y., (2022, December). Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription, in Proceedings of the 23rd International Society for Music Information Retrieval Conference (ISMIR 2022).
Ma, X., Liu, X., Zhang, B., and Wang, Y., (2022, December). Robust Melody Track Identification in Symbolic Music, in Proceedings of the 23rd International Society for Music Information Retrieval Conference (ISMIR 2022).
Zhao, J., Xia, G., and Wang, Y., (2022, December). Beat Transformer: Demixed Beat and Downbeat Tracking with Dilated Self-Attention, in Proceedings of the 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). [code] [colab notebook] [video]
Zhao, J., Xia, G., and Wang, Y., (2022, December). Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Generation, in Proceedings of the 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). [code] [demo] [video]
Gu, X.*, Ou, L.*, Ong, D., and Wang, Y., (2022, October). MM-ALT: A Multimodal Automatic Lyric Transcription System, in 2022 ACM Multimedia Conference (MM’22). (Top Paper Award) [demo]
Ma, X., Wang, Y., and Wang, Y., (2022, October). Content-Based User Preference Modeling in Music Generation, in 2022 ACM Multimedia Conference (MM’22). [demo 1] [demo 2]
Ou, L., Guo, Z., Benetos, E., Han, J., and Wang, Y., (2022, May). Exploring Transformer’s Potential on Automatic Piano Transcription, in ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022).
Ma, X., Wang, Y., Kan, M. Y., and Lee, W. S., (2021, October). AI-Lyricist: Generating Music and Vocabulary Constrained Lyrics, in 2021 ACM Multimedia Conference (MM’21). [lyrics demo] [synthesis demo]
Huang, H., Liu, H., Wang, H., Xiao, C., and Wang, Y., (2021, July). STRODE: Stochastic Boundary Ordinary Differential Equation, in International Conference on Machine Learning (ICML 2021). [code] [slides]
[Song Intelligibility Data] Ibrahim, K. M., Grunberg, D., Agres, K., Gupta, C., and Wang, Y., (2017, October). Intelligibility of Sung Lyrics: A Pilot Study, in Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR 2017), Suzhou, China, pp. 686-693. [data]
[LyricFind Corpus] Ellis, R. J., Xing, Z., Fang, J., and Wang, Y., (2015, October). Quantifying Lexical Novelty in Song Lyrics, in Proceedings of the 16th International Society for Music Information Retrieval Conference (ISMIR 2015), Málaga, Spain, pp. 694-700. [data]
[NUS-48E Sung and Spoken Lyrics Corpus] Duan, Z., Fang, H., Li, B., Sim, K. C., and Wang, Y., (2013, October). The NUS Sung and Spoken Lyrics Corpus: A Quantitative Comparison of Singing and Speech, in 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA 2013), pp. 1-9. IEEE. [data]
[2022.12] SLIONS-Kids: AI-empowered language learning mobile application. [video]
[2022.11] "Sound and Music Computing for Human Health and Potential" (SMC4HHP) Theme Seminar & Concert Event. [videos]
[2022.06] Our lab member Yuchen Wang won the Outstanding Computing Project Prize from the NUS School of Computing. [NUS News]
[2021.11] APSIPA Distinguished Lecture 1: Neuroscience-Inspired Sound and Music Computing (SMC) for Bilingualism and Human Potential – Wang Ye [video]
[2021.11] NUS Sound and Music Computing Lab Showcase at ISMIR 2021 [video]
[2021.11] Special Session on MIR for Human Health and Potential at ISMIR 2021 [video]
[2021.08] Wang, Y., Keynote Speech at Computing Research Week Aug 2021, “Music & Wearable Computing for Health and Learning: a Decade-long Exploration on a Neuroscience-inspired Interdisciplinary Approach”, National University of Singapore [slides] [video]
[2019.04] NUS Computing Music Concert [video]
[2018.08] Sound & Music Computing Concert [video]
Address: 11 Computing Drive, Singapore 117416
Tel: (65) 6516 2980
Fax: (65) 6779 4580
Office: AS6 #04-08
Lab Director: A/Prof. Ye Wang