About me

Hello! I am Yujia Xiao, a third-year PhD student in the DSP & Speech Technology Laboratory (DSP-STL) at The Chinese University of Hong Kong (CUHK), under the supervision of Prof. Tan Lee. Prior to this, I worked as an applied scientist at Microsoft from 2018 to 2022. I earned both my M.S. and B.S. degrees from South China University of Technology. My current research focuses on long-form audio and speech generation as well as multimodal agents. If you are interested in my work, feel free to contact me!

News

  • πŸ₯‚ Mar 4, 2025: PodAgent is released. Given the topic to be discussed, PodAgent will simulate human behavior to create podcast-like audio presented as a talk show, featuring one host and several guests. The show will include diverse and insightful viewpoints, delivered in appropriate voices, along with structured sound effects and background music to enrich the listening experience.

Experience

  • πŸ’» 2023.07 - 2024.03: Research Intern at Microsoft (TTS Algorithm Team)
  • πŸ’Ό 2018.05 - 2022.07: Applied Scientist at Microsoft (TTS Algorithm Team)
  • πŸ’» 2016.08 - 2018.04: Research Intern at Microsoft Research Asia (Speech Group & IEG)
  • πŸ’» 2014.07 - 2015.08: Research Intern at Microsoft Research Asia (Speech Group & IEG)

Selected Publications

Awards

  • 🌟 2021.12 [Microsoft Hacathon] Executive Challenge - Hack for Consumer Business Growth - 2nd Place
  • 🌟 2020.09 [Microsoft Hacathon] Honorable Mention
  • 🌟 2019.09 [Microsoft Hacathon] Hackathon Challenge - Hack for Big Ideas - 2nd Place
  • πŸ₯‡ 2016 National Scholarship for Postgraduates
  • πŸ₯‡ 2013 National Scholarship
  • πŸ₯‡ 2012 National Scholarship

Teaching & Services