About me

Ziqian Ning is a master student in the Audio, Speech and Language Processing Laboratory at Northwestern Polytechnical University (ASLP@NWPU), Xi’an, China, supervised by Prof. Lei Xie . He is currently performing research at Netease Fuxi AI Lab. His research interests include voice conversion, text-to-speech and audio/music generation.

Internships

  • 2024.03 - 2024.09, Azure Speech, Microsoft, China.
  • 2022.06 - 2024.03, Fuxi AI Lab, Netease, China.
  • 2021.07 - 2021.09, TEG, Tencent, China.

Publications

Singing Voice Generation

Voice Conversion (VC)

Streaming Voice Conversion

Speaker Anonymization

Text to Speech

Project Experience

  • Singing Voice Conversion Challenge 2023
    • Propose a VITS-based singing voice conversion model that leverages Whisper bottleneck features as linguistic information and uses PBTC module extracts multi-scale F0 to better capture the pitch variation. The results of the official competition measurements demonstrate that our system achieves human-level naturalness, ranking first and second in Task 1 and Task 2, respectively. Demo
  • Online Text-to-speech synthesis system
    • Develop a text-to-speech system to provide high availability and scalability for online services. Models are encapsulated in separate microservices that are managed using Kubernetes. Kafka is used for inter-model messaging, and the use of message queue makes it possible to parallelize a large number of microservice replicas.

Patents

  • CN115910083A Real-time voice conversion method, device, electronic equipment and medium.
  • CN116013336A Voice conversion method, device, electronic equipment and storage medium.
  • CN116364099A Voice conversion method, device, electronic apparatus, storage medium, and program product.
  • CN118136033A Method, device, electronic equipment and storage medium for converting drama voice.