I am a first-year master’s student at Zhejiang University. I previously worked as a research intern in the Natural Language Computing Group at MSRA in Beijing, doing LLM and speech research.
I graduated from the Department of Software Engineering at Jilin University (吉林大学软件学院) with a bachelor’s degree and am now pursuing a master’s degree at Zhejiang University (浙江大学软件学院), advised by Zhou Zhao (赵洲). I also collaborate closely with Zhou Long (周龙) and ShuJie Liu (刘树杰) from Microsoft Research Asia.
My research interests include speech synthesis, generative models, and LLMs. I have published papers at top international AI conferences such as ACL 2024, ICLR 2024, and ICASSP 2024.
🔥 News
- 2024.06: We propose ControlSpeech on arXiv.
- 2024.05: I was selected as a reviewer for NeurIPS 2024.
- 2024.05: MobileSpeech was accepted to the ACL 2024 main conference (top conference in NLP)!
- 2024.04: I joined Alibaba, DAMO Academy, Tongyi Lab as a research intern.
- 2024.03: I was selected as a reviewer for ECCV 2024.
- 2024.02: We propose Language-Codec, a SOTA codec model, on arXiv.
- 2024.01: I was selected as a reviewer for ACM MM 2024.
- 2024.01: MobileSpeech has been successfully deployed in the Magic6 series of Honor mobile phones!
- 2024.01: Mega-TTS 2 (co-author) was accepted to ICLR 2024 (top conference in machine learning)!
- 2023.12: TextrolSpeech was accepted to ICASSP 2024 (top conference in speech)!
- 2023.11: One paper (co-author) was accepted by IEEE Transactions on Computers (CCF-A).
- 2023.11: Mega-TTS has been successfully deployed in products at ByteDance!
- 2023.08: I was selected as a reviewer for EMNLP 2023.
- 2023.03: 🎉🎉 I joined the Natural Language Computing Group at Microsoft Research Asia (MSRA) as a research intern!
- 2022.11: I joined Ping An Technology Company as a junior speech algorithm engineer in Shanghai!
- 2022.10: I received an offer for postgraduate study at the School of Software, Zhejiang University.
- 2021.11: I joined Tsinghua Shenzhen International Graduate School as a remote intern.
- 2021.10: 🎉🎉 I won the National Scholarship (Top 1%) in my second year of undergraduate study!
📝 Publications
🎙 Controllable and Zero-shot Text-to-Speech, Codec Representation
![sym](images\textrolspeech1.jpg)
TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models
Authors: Shengpeng Ji, Jialong Zuo, Minghui Fang, Ziyue Jiang, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao
- Audio samples are available on this website
- Code is available in this
![sym](images\mobilespeech.png)
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech
Authors: Shengpeng Ji*, Ziyue Jiang*, Hanting Wang, Jialong Zuo, Zhou Zhao
- Audio samples are available on this website
![sym](images\controlspeech.png)
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec
Authors: Shengpeng Ji, Jialong Zuo, Minghui Fang, Siqi Zheng, Qian Chen, Wen Wang, Ziyue Jiang, Hai Huang, Xize Cheng, Rongjie Huang, Zhou Zhao
![sym](images\languagecodec.png)
![sym](images\VSTTS.png)
VS-TTS: Controllable Voice Stylization for Text-to-Speech with Natural Language Prompts
Authors: Jialong Zuo*, Xize Cheng*, Shengpeng Ji*, Ziyue Jiang, Minghui Fang, Zhiqing Hong, Rongjie Huang, Zehan Wang, Tao Jin, Zhou Zhao
- Audio samples are available on this website
![sym](images\watermark.png)
DiscreteWM: Speech Watermarking with Discrete Representations
Authors: Ziyue Jiang*, Shengpeng Ji*, Yi Ren, Zhenhui Ye, Rongjie Huang, Jinglin Liu, Chen Zhang, Tianyu Pang, Chao Du, Hongcheng Zhu, Zhou Zhao
- Audio samples are available on this website
🎖 Honors and Awards
- 2023.06 Outstanding graduate of Jilin University (Top 5%)
- 2023.06 First-class scholarship of Jilin University (Top 0.25%, 1/392)
- 2022.10 Second-class scholarship of Jilin University
- 2021.10 National Scholarship (Undergraduate) (Top 1.27%, 5/392)
- 2020.10 Third-class scholarship of Jilin University
📖 Education
- 2023.09 - 2026.03, Master, Software Engineering, Zhejiang University.
- 2019.09 - 2023.06, Undergraduate, Software Engineering, Jilin University.
💻 Internships
- 2024.04 - now, Alibaba, DAMO Academy, Tongyi Lab, Hangzhou YunGu Area.
- 2023.03 - 2023.08, MSRA, Natural Language Computing Group, Beijing Haidian Area.
- 2022.11 - 2023.03, Ping An Technology Company, Shanghai Pudong Area.
- 2021.11 - 2022.05, Tsinghua Shenzhen International Graduate School, Remote.