This page lists some speech related research at Microsoft, conducted by the team led by Xu Tan. The research topics cover text to speech, singing voice synthesis, music generation, automatic speech recognition, etc. Some research are open-sourced via NeuralSpeech and Muzic.
We are hiring researchers on audio/video generation and LLMs at Microsoft. Please contact xuta@microsoft.com if you have interests.