default search action
Lei He 0005
Person information
- affiliation: Microsoft China, Speech and Language Group, Beijing, China
Other persons with the same name
- Lei He — disambiguation page
- Lei He 0001 — University of California, Los Angeles, Department of Electrical Engineering, CA, USA (and 1 more)
- Lei He 0002 — Hefei University of Technology, School of Computer and Information, China
- Lei He 0003 — Central South University, School of Business, Changsha, China
- Lei He 0004 — Chinese Academy of Sciences, National Laboratory of Pattern Recognition, Beijing, China
- Lei He 0006 — Chengdu University of Information Technology, School of Software Engineering, Chengdu, China (and 1 more)
- Lei He 0007 — Library of Congress, Washington, DC, USA (and 3 more)
- Lei He 0008 — China National Digital Switching System Engineering and Technology Research Center (NDSC), Zhengzhou, China
- Lei He 0009 — National University of Defense Technology, College of Information System and Management, Changsha, China
- Lei He 0010 — Hunan University of Science and Technology, School of Information and Electrical Engineering, Xiangtan, China (and 1 more)
- Lei He 0011 — Tongji University School of Medicine, Tongji Hospital, Department of Spine Surgery, Shanghai, China (and 1 more)
- Lei He 0012 — Xidian University, School of Computer Science and Technology, Xi'an, China (and 1 more)
- Lei He 0013 — Sichuan University, School of Mechanical Engineering, Chengdu, China
- Lei He 0014 — Southeast University, School of Civil Engineering, Nanjing, China (and 1 more)
- Lei He 0015 — Northwestern Polytechnical University, School of Marine Science and Technology, Xi'an, China
- Lei He 0016 — Chinese Academy of Science, Institute of Computing Technology, State Key Laboratory of Computer Architecture, Beijing, China
- Lei He 0017 — Jilin University, State Key Laboratory of Automotive Simulation and Control, Changchun, China (and 1 more)
- Lei He 0018 — China University of Petroleum -Beijing, National Engineering Laboratory for Pipeline Safety, Beijing, China
- Lei He 0019 — Shanghai Jiao Tong University, Department of Automation, Key Laboratory of System Control and Information Processing, Shanghai, China
- Lei He 0020 — Toshiba (China) Research and Development Center, Beijing China
- Lei He 0021 — University of Zurich, Department of Comparative Linguistics, Phonetics Laboratory, Zurich, Switzerland
Other persons with a similar name
- Chun Lei He — Concordia University, Montreal, Canada
- Hong-Lei He
- Xiang-lei He
- Yue-Lei He
- He Lei
- He-lei Wu
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j3]Xu Tan, Jiawei Chen, Haohe Liu, Jian Cong, Chen Zhang, Yanqing Liu, Xi Wang, Yichong Leng, Yuanhao Yi, Lei He, Sheng Zhao, Tao Qin, Frank K. Soong, Tie-Yan Liu:
NaturalSpeech: End-to-End Text-to-Speech Synthesis With Human-Level Quality. IEEE Trans. Pattern Anal. Mach. Intell. 46(6): 4234-4245 (2024) - [c51]Xueyuan Chen, Xi Wang, Shaofei Zhang, Lei He, Zhiyong Wu, Xixin Wu, Helen Meng:
Stylespeech: Self-Supervised Style Enhancing with VQ-VAE-Based Pre-Training for Expressive Audiobook Speech Synthesis. ICASSP 2024: 12316-12320 - [c50]Yichong Leng, Zhifang Guo, Kai Shen, Zeqian Ju, Xu Tan, Eric Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiangyang Li, Sheng Zhao, Tao Qin, Jiang Bian:
PromptTTS 2: Describing and Generating Voices with Text Prompt. ICLR 2024 - [c49]Kai Shen, Zeqian Ju, Xu Tan, Eric Liu, Yichong Leng, Lei He, Tao Qin, Sheng Zhao, Jiang Bian:
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers. ICLR 2024 - [c48]Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Eric Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiangyang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao:
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. ICML 2024 - [c47]Yujia Xiao, Xi Wang, Xu Tan, Lei He, Xinfa Zhu, Sheng Zhao, Tan Lee:
Contrastive Context-Speech Pretraining for Expressive Text-to-Speech Synthesis. ACM Multimedia 2024: 2099-2107 - [c46]Xinfa Zhu, Wenjie Tian, Xinsheng Wang, Lei He, Yujia Xiao, Xi Wang, Xu Tan, Sheng Zhao, Lei Xie:
UniStyle: Unified Style Modeling for Speaking Style Captioning and Stylistic Speech Synthesis. ACM Multimedia 2024: 7513-7522 - [i41]Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao:
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. CoRR abs/2403.03100 (2024) - [i40]Leying Zhang, Yao Qian, Long Zhou, Shujie Liu, Dongmei Wang, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Lei He, Sheng Zhao, Michael Zeng:
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations. CoRR abs/2404.06690 (2024) - [i39]Ziqian Ning, Shuai Wang, Yuepeng Jiang, Jixun Yao, Lei He, Shifeng Pan, Jie Ding, Lei Xie:
Drop the beat! Freestyler for Accompaniment Conditioned Rapping Voice Generation. CoRR abs/2408.15474 (2024) - 2023
- [c45]Yihan Wu, Junliang Guo, Xu Tan, Chen Zhang, Bohan Li, Ruihua Song, Lei He, Sheng Zhao, Arul Menezes, Jiang Bian:
VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing. AAAI 2023: 13772-13779 - [c44]Zhihang Xu, Shaofei Zhang, Xi Wang, Jiajun Zhang, Wenning Wei, Lei He, Sheng Zhao:
MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023. Blizzard Challenge 2023 - [c43]Yan Deng, Long Zhou, Yuanhao Yi, Shujie Liu, Lei He:
Prosody-Aware Speecht5 for Expressive Neural TTS. ICASSP 2023: 1-5 - [c42]Kun Wei, Long Zhou, Ziqiang Zhang, Liping Chen, Shujie Liu, Lei He, Jinyu Li, Furu Wei:
Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation. ICASSP 2023: 1-5 - [c41]Chen Zhang, Shubham Bansal, Aakash Lakhera, Jinzhu Li, Gang Wang, Sandeepkumar Satpal, Sheng Zhao, Lei He:
LeanSpeech: The Microsoft Lightweight Speech Synthesis System for Limmits Challenge 2023. ICASSP 2023: 1-2 - [c40]Brendan Walsh, Mark Hamilton, Greg Newby, Xi Wang, Serena Ruan, Sheng Zhao, Lei He, Shaofei Zhang, Eric Dettinger, William T. Freeman, Markus Weimer:
Large-Scale Automatic Audiobook Creation. INTERSPEECH 2023: 3675-3676 - [c39]Yujia Xiao, Shaofei Zhang, Xi Wang, Xu Tan, Lei He, Sheng Zhao, Frank K. Soong, Tan Lee:
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading. INTERSPEECH 2023: 4883-4887 - [c38]Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao:
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models. NeurIPS 2023 - [i38]Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei:
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers. CoRR abs/2301.02111 (2023) - [i37]Ruiqing Xue, Yanqing Liu, Lei He, Xu Tan, Linquan Liu, Edward Lin, Sheng Zhao:
FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model. CoRR abs/2303.02939 (2023) - [i36]Ziqiang Zhang, Long Zhou, Chengyi Wang, Sanyuan Chen, Yu Wu, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei:
Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling. CoRR abs/2303.03926 (2023) - [i35]Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao:
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models. CoRR abs/2304.00830 (2023) - [i34]Kai Shen, Zeqian Ju, Xu Tan, Yanqing Liu, Yichong Leng, Lei He, Tao Qin, Sheng Zhao, Jiang Bian:
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers. CoRR abs/2304.09116 (2023) - [i33]Yujia Xiao, Shaofei Zhang, Xi Wang, Xu Tan, Lei He, Sheng Zhao, Frank K. Soong, Tan Lee:
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading. CoRR abs/2307.00782 (2023) - [i32]Yichong Leng, Zhifang Guo, Kai Shen, Xu Tan, Zeqian Ju, Yanqing Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiang-Yang Li, Sheng Zhao, Tao Qin, Jiang Bian:
PromptTTS 2: Describing and Generating Voices with Text Prompt. CoRR abs/2309.02285 (2023) - [i31]Zhihang Xu, Shaofei Zhang, Xi Wang, Jiajun Zhang, Wenning Wei, Lei He, Sheng Zhao:
MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023. CoRR abs/2309.02743 (2023) - [i30]Brendan Walsh, Mark Hamilton, Greg Newby, Xi Wang, Serena Ruan, Sheng Zhao, Lei He, Shaofei Zhang, Eric Dettinger, William T. Freeman, Markus Weimer:
Large-Scale Automatic Audiobook Creation. CoRR abs/2309.03926 (2023) - [i29]Xueyuan Chen, Xi Wang, Shaofei Zhang, Lei He, Zhiyong Wu, Xixin Wu, Helen Meng:
StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis. CoRR abs/2312.12181 (2023) - 2022
- [c37]Fengpeng Yue, Yan Deng, Lei He, Tom Ko, Yu Zhang:
Exploring Machine Speech Chain For Domain Adaptation. ICASSP 2022: 6757-6761 - [c36]Yujia Xiao, Xi Wang, Lei He, Frank K. Soong:
Improving Fastspeech TTS with Efficient Self-Attention and Compact Feed-Forward Network. ICASSP 2022: 7472-7476 - [c35]Yuanhao Yi, Lei He, Shifeng Pan, Xi Wang, Yujia Xiao:
Prosodyspeech: Towards Advanced Prosody Model for Neural Text-to-Speech. ICASSP 2022: 7582-7586 - [c34]Zehua Chen, Xu Tan, Ke Wang, Shifeng Pan, Danilo P. Mandic, Lei He, Sheng Zhao:
Infergrad: Improving Diffusion Models for Vocoder by Considering Inference in Training. ICASSP 2022: 8432-8436 - [c33]Mutian He, Jingzhou Yang, Lei He, Frank K. Soong:
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge. INTERSPEECH 2022: 441-445 - [c32]Yanqing Liu, Ruiqing Xue, Lei He, Xu Tan, Sheng Zhao:
DelightfulTTS 2: End-to-End Speech Synthesis with Adversarial Vector-Quantized Auto-Encoders. INTERSPEECH 2022: 1581-1585 - [c31]Yuanhao Yi, Lei He, Shifeng Pan, Xi Wang, Yuchao Zhang:
SoftSpeech: Unsupervised Duration Model in FastSpeech 2. INTERSPEECH 2022: 1606-1610 - [c30]Yihan Wu, Xu Tan, Bohan Li, Lei He, Sheng Zhao, Ruihua Song, Tao Qin, Tie-Yan Liu:
AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios. INTERSPEECH 2022: 2568-2572 - [c29]Yihan Wu, Xi Wang, Shaofei Zhang, Lei He, Ruihua Song, Jian-Yun Nie:
Self-supervised Context-aware Style Representation for Expressive Speech Synthesis. INTERSPEECH 2022: 5503-5507 - [c28]Yichong Leng, Zehua Chen, Junliang Guo, Haohe Liu, Jiawei Chen, Xu Tan, Danilo P. Mandic, Lei He, Xiangyang Li, Tao Qin, Sheng Zhao, Tie-Yan Liu:
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis. NeurIPS 2022 - [i28]Zehua Chen, Xu Tan, Ke Wang, Shifeng Pan, Danilo P. Mandic, Lei He, Sheng Zhao:
InferGrad: Improving Diffusion Models for Vocoder by Considering Inference in Training. CoRR abs/2202.03751 (2022) - [i27]Yihan Wu, Xu Tan, Bohan Li, Lei He, Sheng Zhao, Ruihua Song, Tao Qin, Tie-Yan Liu:
AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios. CoRR abs/2204.00436 (2022) - [i26]Xu Tan, Jiawei Chen, Haohe Liu, Jian Cong, Chen Zhang, Yanqing Liu, Xi Wang, Yichong Leng, Yuanhao Yi, Lei He, Frank K. Soong, Tao Qin, Sheng Zhao, Tie-Yan Liu:
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality. CoRR abs/2205.04421 (2022) - [i25]Yichong Leng, Zehua Chen, Junliang Guo, Haohe Liu, Jiawei Chen, Xu Tan, Danilo P. Mandic, Lei He, Xiang-Yang Li, Tao Qin, Sheng Zhao, Tie-Yan Liu:
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis. CoRR abs/2205.14807 (2022) - [i24]Yihan Wu, Xi Wang, Shaofei Zhang, Lei He, Ruihua Song, Jian-Yun Nie:
Self-supervised Context-aware Style Representation for Expressive Speech Synthesis. CoRR abs/2206.12559 (2022) - [i23]Yanqing Liu, Ruiqing Xue, Lei He, Xu Tan, Sheng Zhao:
DelightfulTTS 2: End-to-End Speech Synthesis with Adversarial Vector-Quantized Auto-Encoders. CoRR abs/2207.04646 (2022) - [i22]Kun Wei, Long Zhou, Ziqiang Zhang, Liping Chen, Shujie Liu, Lei He, Jinyu Li, Furu Wei:
Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation. CoRR abs/2210.17027 (2022) - [i21]Yihan Wu, Junliang Guo, Xu Tan, Chen Zhang, Bohan Li, Ruihua Song, Lei He, Sheng Zhao, Arul Menezes, Jiang Bian:
VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing. CoRR abs/2211.16934 (2022) - [i20]Zehua Chen, Yihan Wu, Yichong Leng, Jiawei Chen, Haohe Liu, Xu Tan, Yang Cui, Ke Wang, Lei He, Sheng Zhao, Jiang Bian, Danilo P. Mandic:
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech. CoRR abs/2212.14518 (2022) - 2021
- [j2]Liumeng Xue, Shifeng Pan, Lei He, Lei Xie, Frank K. Soong:
Cycle consistent network for end-to-end style transfer TTS training. Neural Networks 140: 223-236 (2021) - [c27]Rui Zhao, Jian Xue, Jinyu Li, Wenning Wei, Lei He, Yifan Gong:
On Addressing Practical Challenges for RNN-Transducer. ASRU 2021: 526-533 - [c26]Yanqing Liu, Zhihang Xu, Gang Wang, Kuan Chen, Bohan Li, Xu Tan, Jinzhu Li, Lei He, Sheng Zhao:
DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021. Blizzard Challenge 2021 - [c25]Liping Chen, Yan Deng, Xi Wang, Frank K. Soong, Lei He:
Speech Bert Embedding for Improving Prosody in Neural TTS. ICASSP 2021: 6563-6567 - [c24]Yan Deng, Rui Zhao, Zhong Meng, Xie Chen, Bing Liu, Jinyu Li, Yifan Gong, Lei He:
Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS. Interspeech 2021: 751-755 - [c23]Shifeng Pan, Lei He:
Cross-Speaker Style Transfer with Prosody Bottleneck in Neural Speech Synthesis. Interspeech 2021: 4678-4682 - [c22]Haohan Guo, Shaofei Zhang, Frank K. Soong, Lei He, Lei Xie:
Conversational End-to-End TTS for Voice Agents. SLT 2021: 403-409 - [i19]Mutian He, Jingzhou Yang, Lei He, Frank K. Soong:
Multilingual Byte2Speech Models for Scalable Low-resource Speech Synthesis. CoRR abs/2103.03541 (2021) - [i18]Fengpeng Yue, Yan Deng, Lei He, Tom Ko:
Exploring Machine Speech Chain for Domain Adaptation and Few-Shot Speaker Adaptation. CoRR abs/2104.03815 (2021) - [i17]Rui Zhao, Jian Xue, Jinyu Li, Wenning Wei, Lei He, Yifan Gong:
On Addressing Practical Challenges for RNN-Transducer. CoRR abs/2105.00858 (2021) - [i16]Liping Chen, Yan Deng, Xi Wang, Frank K. Soong, Lei He:
Speech BERT Embedding For Improving Prosody in Neural TTS. CoRR abs/2106.04312 (2021) - [i15]Shifeng Pan, Lei He:
Cross-speaker Style Transfer with Prosody Bottleneck in Neural Speech Synthesis. CoRR abs/2107.12562 (2021) - [i14]Mutian He, Jingzhou Yang, Lei He, Frank K. Soong:
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge. CoRR abs/2110.09698 (2021) - [i13]Yanqing Liu, Zhihang Xu, Gang Wang, Kuan Chen, Bohan Li, Xu Tan, Jinzhu Li, Lei He, Sheng Zhao:
DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021. CoRR abs/2110.12612 (2021) - 2020
- [c21]Yujia Xiao, Lei He, Huaiping Ming, Frank K. Soong:
Improving Prosody with Linguistic and Bert Derived Features in Multi-Speaker Based Mandarin Chinese Neural TTS. ICASSP 2020: 6704-6708 - [c20]Yan Huang, Lei He, Wenning Wei, William Gale, Jinyu Li, Yifan Gong:
Using Personalized Speech Synthesis and Neural Language Generator for Rapid Speaker Adaptation. ICASSP 2020: 7399-7403 - [c19]Eva Sharma, Guoli Ye, Wenning Wei, Rui Zhao, Yao Tian, Jian Wu, Lei He, Ed Lin, Yifan Gong:
Adaptation of RNN Transducer with Text-To-Speech Technology for Keyword Spotting. ICASSP 2020: 7484-7488 - [c18]Yan Huang, Jinyu Li, Lei He, Wenning Wei, William Gale, Yifan Gong:
Rapid RNN-T Adaptation Using Personalized Speech Synthesis and Neural Language Generator. INTERSPEECH 2020: 1256-1260 - [c17]Yang Cui, Xi Wang, Lei He, Frank K. Soong:
An Efficient Subband Linear Prediction for LPCNet-Based Neural Synthesis. INTERSPEECH 2020: 3555-3559 - [c16]Jinyu Li, Rui Zhao, Zhong Meng, Yanqing Liu, Wenning Wei, Sarangarajan Parthasarathy, Vadim Mazalov, Zhenghao Wang, Lei He, Sheng Zhao, Yifan Gong:
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability. INTERSPEECH 2020: 3590-3594 - [c15]Liping Chen, Kong-Aik Lee, Lei He, Frank K. Soong:
On Early-stop Clustering for Speaker Diarization. Odyssey 2020: 110-116 - [i12]Haohan Guo, Shaofei Zhang, Frank K. Soong, Lei He, Lei Xie:
Conversational End-to-End TTS for Voice Agent. CoRR abs/2005.10438 (2020) - [i11]Jinyu Li, Rui Zhao, Zhong Meng, Yanqing Liu, Wenning Wei, Sarangarajan Parthasarathy, Vadim Mazalov, Zhenghao Wang, Lei He, Sheng Zhao, Yifan Gong:
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability. CoRR abs/2007.15188 (2020) - [i10]Xi Wang, Huaiping Ming, Lei He, Frank K. Soong:
s-Transformer: Segment-Transformer for Robust Neural Speech Synthesis. CoRR abs/2011.08480 (2020)
2010 – 2019
- 2019
- [c14]Yajie Zhang, Shifeng Pan, Lei He, Zhen-Hua Ling:
Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis. ICASSP 2019: 6945-6949 - [c13]Yibin Zheng, Xi Wang, Lei He, Shifeng Pan, Frank K. Soong, Zhengqi Wen, Jianhua Tao:
Forward-Backward Decoding for Regularizing End-to-End TTS. INTERSPEECH 2019: 1283-1287 - [c12]Haohan Guo, Frank K. Soong, Lei He, Lei Xie:
A New GAN-Based End-to-End TTS Training Algorithm. INTERSPEECH 2019: 1288-1292 - [c11]Mutian He, Yan Deng, Lei He:
Robust Sequence-to-Sequence Acoustic Modeling with Stepwise Monotonic Attention for Neural TTS. INTERSPEECH 2019: 1293-1297 - [c10]Haohan Guo, Frank K. Soong, Lei He, Lei Xie:
Exploiting Syntactic Features in a Parsed Tree to Improve End-to-End TTS. INTERSPEECH 2019: 4460-4464 - [i9]Huaiping Ming, Lei He, Haohan Guo, Frank K. Soong:
Feature reinforcement with word embedding and parsing information in neural TTS. CoRR abs/1901.00707 (2019) - [i8]Haohan Guo, Frank K. Soong, Lei He, Lei Xie:
Exploiting Syntactic Features in a Parsed Tree to Improve End-to-End TTS. CoRR abs/1904.04764 (2019) - [i7]Haohan Guo, Frank K. Soong, Lei He, Lei Xie:
A New GAN-based End-to-End TTS Training Algorithm. CoRR abs/1904.04775 (2019) - [i6]Mutian He, Yan Deng, Lei He:
Robust Sequence-to-Sequence Acoustic Modeling with Stepwise Monotonic Attention for Neural TTS. CoRR abs/1906.00672 (2019) - [i5]Yibin Zheng, Xi Wang, Lei He, Shifeng Pan, Frank K. Soong, Zhengqi Wen, Jianhua Tao:
Forward-Backward Decoding for Regularizing End-to-End TTS. CoRR abs/1907.09006 (2019) - 2018
- [c9]Yang Cui, Xi Wang, Lei He, Frank K. Soong:
A New Glottal Neural Vocoder for Speech Synthesis. INTERSPEECH 2018: 2017-2021 - [c8]Feng-Long Xie, Frank K. Soong, Xi Wang, Lei He, Haifeng Li:
Frame Selection in SI-DNN Phonetic Space with WaveNet Vocoder for Voice Conversion without Parallel Training Data. ISCSLP 2018: 56-60 - [i4]Yajie Zhang, Shifeng Pan, Lei He, Zhen-Hua Ling:
Learning latent representations for style control and transfer in end-to-end speech synthesis. CoRR abs/1812.04342 (2018) - [i3]Yan Deng, Lei He, Frank K. Soong:
Modeling Multi-speaker Latent Space to Improve Neural TTS: Quick Enrolling New Speaker and Enhancing Premium Voice. CoRR abs/1812.05253 (2018) - 2016
- [j1]Xiang Yin, Ming Lei, Yao Qian, Frank K. Soong, Lei He, Zhen-Hua Ling, Li-Rong Dai:
Modeling F0 trajectories in hierarchically structured deep neural networks. Speech Commun. 76: 82-92 (2016) - [c7]Yuchen Fan, Yao Qian, Frank K. Soong, Lei He:
Unsupervised speaker adaptation for DNN-based TTS synthesis. ICASSP 2016: 5135-5139 - [c6]Yuchen Fan, Yao Qian, Frank K. Soong, Lei He:
Speaker and language factorization in DNN-based TTS synthesis. ICASSP 2016: 5540-5544 - [c5]Peilu Wang, Yao Qian, Frank K. Soong, Lei He, Hai Zhao:
Learning Distributed Word Representations For Bidirectional LSTM Recurrent Neural Network. HLT-NAACL 2016: 527-533 - 2015
- [c4]Yuchen Fan, Yao Qian, Frank K. Soong, Lei He:
Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis. ICASSP 2015: 4475-4479 - [c3]Peilu Wang, Yao Qian, Frank K. Soong, Lei He, Hai Zhao:
Word embedding for recurrent neural network based TTS synthesis. ICASSP 2015: 4879-4883 - [c2]Yuchen Fan, Yao Qian, Frank K. Soong, Lei He:
Sequence generation error (SGE) minimization based deep neural networks training for text-to-speech synthesis. INTERSPEECH 2015: 864-868 - [i2]Peilu Wang, Yao Qian, Frank K. Soong, Lei He, Hai Zhao:
Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Recurrent Neural Network. CoRR abs/1510.06168 (2015) - [i1]Peilu Wang, Yao Qian, Frank K. Soong, Lei He, Hai Zhao:
A Unified Tagging Solution: Bidirectional LSTM Recurrent Neural Network with Word Embedding. CoRR abs/1511.00215 (2015) - 2014
- [c1]Xiang Yin, Ming Lei, Yao Qian, Frank K. Soong, Lei He, Zhen-Hua Ling, Li-Rong Dai:
Modeling DCT parameterized F0 trajectory at intonation phrase level with DNN or decision tree. INTERSPEECH 2014: 2273-2277
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-04 20:10 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint