Welcome to the Sound and Music Computing Lab at the National University of Singapore! The NUS Sound and Music Computing Lab strives to develop Sound and Music Computing (SMC) technologies, in particular Music Information Retrieval (MIR) technologies, with an emphasis on applications in e-Learning (especially computer-assisted music and language edutainment) and e-Health (especially computer-assisted music-enhanced exercise and therapy).
We seek to harness the synergy of SMC, MIR, mobile computing, and cloud computing technologies to promote healthy lifestyles and to facilitate disease prevention, diagnosis, and treatment in both developed countries and resource-poor developing countries.
We have advised students from a wide range of disciplines and across many education levels. See our alumni here!
L. Ou, J. Zhao, Z. Wang, G. Xia, Q. Liang, T. Hopkins, and Y. Wang, “Unifying Symbolic Music Arrangement: Track-Aware Reconstruction and Structured Tokenization,” in Proceedings of the 39th Annual Conference on Neural Information Processing Systems (NeurIPS 2025). 2025.
Q. Liang, X. Ma, T. Hopkins, and Y. Wang, “LivePoem: Improving the Learning Experience of Classical Chinese Poetry with AI-Generated Musical Storyboards,” in Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence (IJCAI 2025). ijcai.org, 2025. [Demo 1 (reciting)] [Demo 1 (singing)] [Demo 2 (reciting)] [Demo 2 (singing)]
J. Zhao, X. Wang, and Y. Wang, “Prosody-Adaptable Audio Codecs for Zero-Shot Voice Conversion via In-Context Learning,” in Proceedings of the 26th Annual Conference of the International Speech Communication Association (Interspeech 2025). ISCA, 2025.
H. Liu, H. Huang, H. Wang, X. Gu, and Y. Wang, “On Calibration of LLM-based Guard Models for Reliable Content Moderation,” in Proceedings of the 13th International Conference on Learning Representations (ICLR 2025). OpenReview.net, 2025.
X. Gu, T. Pang, C. Du, Q. Liu, F. Zhang, C. Du, Y. Wang, and M. Lin, “When Attention Sink Emerges in Language Models: An Empirical View,” in Proceedings of the 13th International Conference on Learning Representations (ICLR 2025). OpenReview.net, 2025.
X. Gu, C. Du, T. Pang, C. Li, M. Lin, and Y. Wang, “On Memorization in Diffusion Models,” Trans. Mach. Learn. Res. (TMLR), vol. 2025.
L. Ou, Y. Takahashi, and Y. Wang, “Lead Instrument Detection from Multitrack Music,” in Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2025). IEEE, 2025.
J. Zhao, C. Low, and Y. Wang, “SPSinger: Multi-Singer Singing Voice Synthesis with Short Reference Prompt,” in Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2025). IEEE, 2025.
J. Zhao, G. Xia, Z. Wang, and Y. Wang, “Structured Multi-Track Accompaniment Arrangement via Style Prior Modelling,” in Proceedings of the 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024). 2024. [demo] [code]
H. Liu, H. Huang, and Y. Wang, “Advancing Test-Time Adaptation in Wild Acoustic Test Settings,” in Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024). Association for Computational Linguistics, 2024, pp. 7138-7155.
X. Ma, V. Sharma, M. Y. Kan, W. S. Lee, and Y. Wang, “KeYric: Unsupervised Keywords Extraction and Expansion from Music for Coherent Lyric Generation,” ACM Trans. Multim. Comput. Commun. Appl. (TOMM), vol. 21, no. 1, pp. 1–28, 2024.
X. Wang, M. Shi, and Y. Wang, “Pitch-Aware RNN-T for Mandarin Chinese Mispronunciation Detection and Diagnosis,” in Proceedings of the 25th Annual Conference of the International Speech Communication Association (Interspeech 2024). ISCA, 2024, pp. 292-296.
J. Zhao, L. Q. Chetwin, and Y. Wang, “SinTechSVS: A Singing Technique Controllable Singing Voice Synthesis System,” IEEE ACM Trans. Audio Speech Lang. Process. (TASLP), vol. 32, pp. 2641–2653, 2024.
W. Zeng, X. He, and Y. Wang, “End-to-End Real-World Polyphonic Piano Audio-to-Score Transcription with Hierarchical Decoding,” in Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence (IJCAI 2024). ijcai.org, 2024, pp. 7788-7795.
X. Gu, L. Ou, W. Zeng, J. Zhang, N. Wong, and Y. Wang, “Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing,” ACM Trans. Multim. Comput. Commun. Appl. (TOMM), vol. 20, No. 7, pp. 1551-6857, 2024.
Q. Liang and Y. Wang, “Drawlody: Sketch-Based Melody Creation with Enhanced Usability and Interpretability,” IEEE Trans. Multim. (TMM), vol. 26, pp. 7074-7088, 2024.
[Song Intelligibility Data] K. M. Ibrahim, D. Grunberg, K. Agres, C. Gupta, and Y. Wang, “Intelligibility of Sung Lyrics: A Pilot Study,” in Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR 2017). 2017, pp. 686–693. [data]
[LyricFind Corpus] R. J. Ellis, Z. Xing, J. Fang, and Y. Wang, “Quantifying Lexical Novelty in Song Lyrics,” in Proceedings of the 16th International Society for Music Information Retrieval Conference (ISMIR 2015). 2015, pp. 694–700. [data]
[NUS-48E Sung and Spoken Lyrics Corpus] Z. Duan, H. Fang, B. Li, K. C. Sim, and Y. Wang, “The NUS Sung and Spoken Lyrics Corpus: A Quantitative Comparison of Singing and Speech,” in Proceedings of the 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2013). IEEE, 2013, pp. 1–9. [data]
[2025.11] Sound and Music Computing for Human Health and Potential (SMC4HHP) Theme Seminar & Concert Event. [event page]
[2025.05] CS4347/5647 Lecture Recording on Text-to-Speech (TTS) and Singing Voice Synthesis (SVS). [video]
[2025.05] CS4347/5647 Lecture Recording on Automatic Music Generation (AMG). [video]
[2025.05] CS4347/5647 Lecture Recording on Automatic Music Transcription (AMT). [video]
[2025.05] CS4347/5647 Lecture Recording on Automatic Speech Recognition (ASR). [video]
[2025.04] Speech and Music AI: Current Research Frontier Workshop at NUS. [event page]
[2023.09] Music Recommender System Workshop at NUS. [event page]
Addr: 11 Computing Dr, SG, 117416
Tel: (65) 6516 2980
Fax: (65) 6779 4580
Office: AS6 #04-08
Lab Director: A/Prof. Ye Wang