About me

Hello! I am Yujia Xiao, a fourth-year PhD student in the DSP & Speech Technology Laboratory (DSP-STL) at The Chinese University of Hong Kong (CUHK), under the supervision of Prof. Tan Lee. Prior to this, I worked as an applied scientist at Microsoft from 2018 to 2022. I earned both my M.S. and B.S. degrees from South China University of Technology. My current research focuses on long-form audio and speech generation as well as multimodal agents.

😊 I plan to graduate in 2026 and am actively seeking new opportunities in academic or industry research positions. If you are interested in my work, feel free to contact me!

News

  • 🌟 Oct 2, 2025: PodEval is released. PodEval is a comprehensive toolkit for podcast evaluation across multiple dimensions including audio, speech, and text using both objective metrics and subjective evaluation methods.
  • 🌟 May 16, 2025: PodAgent is accepted by ACL 2025 Findings.
  • 🌟 Mar 4, 2025: PodAgent is released. Given the topic to be discussed, PodAgent will simulate human behavior to create podcast-like audio presented as a talk show, featuring one host and several guests. The show will include diverse and insightful viewpoints, delivered in appropriate voices, along with structured sound effects and background music to enrich the listening experience.

Experience

  • πŸ’Ό 2018.05 - 2022.07: Applied Scientist at Microsoft (TTS Algorithm Team)
  • πŸ’» 2016.08 - 2018.04: Research Intern at Microsoft Research Asia (Speech Group & IEG)

Selected Publications

Awards

  • 🌟 2021.12 [Microsoft Hacathon] Executive Challenge - Hack for Consumer Business Growth - 2nd Place
  • 🌟 2020.09 [Microsoft Hacathon] Honorable Mention
  • 🌟 2019.09 [Microsoft Hacathon] Hackathon Challenge - Hack for Big Ideas - 2nd Place
  • πŸ₯‡ 2016 National Scholarship for Postgraduates
  • πŸ₯‡ 2013 National Scholarship
  • πŸ₯‡ 2012 National Scholarship

Teaching & Services

  • πŸ§‘β€πŸ«οΈ Teaching Assistant (CUHK) of UGEB1408-ENGG1920 Artificial Intelligence in Action
  • πŸ§‘β€πŸ«οΈ Teaching Assistant (CUHK) of ELEG2310B: Principles of Communication Systems
  • πŸ“‘ Invited Reviewer of ICASSP 2025-2026 / IJCNN 2025

πŸŽΆπŸŽ™οΈπŸ’š

I love music, enjoy singing, and play the guzheng (amateur Level 10). I’m also into podcasts, interviews, stand-up comedy, badminton, and have a strong interest in mental health, with certifications in QPR Gatekeeping and MHFA Standard Course. If we share similar interests, let’s connect and explore them together!