558 Handong-ro Buk-gu, Pohang Gyeongbuk 37554 Republic of Korea
Copyright (c) Handong Global University. All Rights Reserved.
- Handong Global University’s Deep Learning Lab Developed the World's First Emotion-Expressing Song Voice Synthesis AI 'MuseSVS'
(Photo 1) Structure of MuseSVS
The Deep Learning Research Lab of Handong Global University (President Dosoung Choi) has developed the world's first emotion-expressing multi-speaker song voice synthesis AI called 'MuseSVS'.
Singing Voice Synthesis (SVS) is an AI technology that synthesizes singing voices from musical scores. While previous AI systems could only synthesize voices singing the lyrics, there were no AIs capable of expressing emotions through voice synthesis until now. The recently developed MuseSVS by Handong Global University takes musical scores and additional inputs, including the speaker's emotional types and intensities, to produce singing voices with varying tones, vibrato, and volume. Notably, the statistical pitch predictor and context-based note duration predictor, applied for the first time in this singing voice synthesis AI, effectively generate subtle changes in pitch and rhythm required for expressing emotions. MuseSVS has been published in the prestigious IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), the world's top journal in the field of voice processing.
(Photo 2) The pitch variation of the song voice synthesized by MuseSVS
(Stronger vibrato is synthesized as sadness intensifies).
Professor In Jung Kim, the guiding professor at Handong Global University's Deep Learning Research Lab, stated, "When people sing the same song, they express emotions through subtle changes in pitch, rhythm, and volume. MuseSVS effectively learns these fine variations from the data." He further explained, "MuseSVS follows the macro flow of pitch and rhythm according to the musical score, while synthesizing micro-level variations differently based on the specified emotion type and intensity." The development of MuseSVS was led by Sungjae Kim (Handong Global University Ph.D. candidate), with the participation of Handong Global University students Yewon Kim (Master's candidate) and Jewoo Jeon (undergraduate student).