


default search action
Zhen-Hua Ling
Person information
Other persons with a similar name
SPARQL queries 
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [c228]Jiaxuan Liu, Zhaoci Liu, Yajun Hu, Yingying Gao, Shilei Zhang, Zhenhua Ling:
DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles. COLING 2025: 5265-5272 - [i110]Zhengyan Sheng, Zhihao Du, Heng Lu, Shiliang Zhang, Zhen-Hua Ling:
Unispeaker: A Unified Approach for Multimodality-driven Speaker Generation. CoRR abs/2501.06394 (2025) - [i109]Shi-Qi Yan, Zhen-Hua Ling:
RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation. CoRR abs/2501.13726 (2025) - 2024
- [j52]Bing Yin, Shi Yin
, Cong Liu, Yanyong Zhang, Changfeng Xi, Baocai Yin
, Zhenhua Ling:
Dynamic facial expression recognition with pseudo-label guided multi-modal pre-training. IET Comput. Vis. 18(1): 33-45 (2024) - [j51]Rui-Chen Zheng
, Yang Ai
, Zhen-Hua Ling
:
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1430-1444 (2024) - [j50]Yang Ai
, Zhen-Hua Ling
:
Low-Latency Neural Speech Phase Prediction Based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2283-2296 (2024) - [j49]Yang Ai
, Xiao-Hang Jiang, Ye-Xin Lu
, Hui-Peng Du, Zhen-Hua Ling
:
APCodec: A Neural Audio Codec With Parallel Amplitude and Phase Spectrum Encoding and Decoding. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3256-3269 (2024) - [j48]Zhaoci Liu
, Liping Chen
, Ya-Jun Hu, Zhen-Hua Ling
, Jia Pan
:
PE-Wav2vec: A Prosody-Enhanced Speech Model for Self-Supervised Prosody Learning in TTS. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4199-4210 (2024) - [j47]Jun-Yu Ma
, Jia-Chen Gu
, Zhen-Hua Ling
, Quan Liu, Cong Liu, Guoping Hu:
Syntax-Augmented Hierarchical Interactive Encoder for Zero-Shot Cross-Lingual Information Extraction. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4795-4809 (2024) - [c227]Qian Wang, Jia-Chen Gu, Zhen-Hua Ling:
X-ACE: Explainable and Multi-factor Audio Captioning Evaluation. ACL (Findings) 2024: 12273-12287 - [c226]Pengyu Cheng, Zhenhua Ling, Meng Meng, Yujun Wang:
Disentangling Speaker Representations from Intuitive Prosodic Features for Speaker-Adaptative and Prosody-Controllable Speech Synthesis. APSIPA 2024: 1-6 - [c225]Yanjun Li, Xiangyu Zhao, Zhengpeng Zha, Zhenhua Ling:
ET-SSM: Linear-Time Encrypted Traffic Classification Method Based On Structured State Space Model. APSIPA 2024: 1-6 - [c224]Xiangyu Zhao, Yanjun Li, Zhengpeng Zha, Zhenhua Ling:
MGVul: a Multi-Granularity Detection Framework for Software Vulnerability. APSIPA 2024: 1-6 - [c223]Bolei He, Nuo Chen, Xinran He, Lingyong Yan, Zhenkai Wei, Jinchang Luo, Zhen-Hua Ling:
Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation. EMNLP (Findings) 2024: 10371-10393 - [c222]Jia-Chen Gu, Hao-Xiang Xu, Jun-Yu Ma, Pan Lu, Zhen-Hua Ling, Kai-Wei Chang, Nanyun Peng:
Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue. EMNLP 2024: 16801-16819 - [c221]Qing-Tian Xu, Jie Zhang, Zhen-Hua Ling:
An End-to-End EEG Channel Selection Method with Residual Gumbel Softmax for Brain-Assisted Speech Enhancement. ICASSP 2024: 10131-10135 - [c220]Shihao Chen, Liping Chen, Jie Zhang, Kong-Aik Lee, Zhenhua Ling, Lirong Dai:
Adversarial Speech for Voice Privacy Protection from Personalized Speech Generation. ICASSP 2024: 11411-11415 - [c219]Kangdi Mei, Zhaoci Liu, Hui-Peng Du, Hengyu Li, Yang Ai, Liping Chen, Zhenhua Ling:
Considering Temporal Connection between Turns for Conversational Speech Synthesis. ICASSP 2024: 11426-11430 - [c218]Qian Wang, Jia-Chen Gu, Zhen-Hua Ling:
Multiscale Matching Driven by Cross-Modal Similarity Consistency for Audio-Text Retrieval. ICASSP 2024: 11581-11585 - [c217]Liping Chen, Kong Aik Lee, Wu Guo, Zhen-Hua Ling:
Modeling Pseudo-Speaker Uncertainty in Voice Anonymization. ICASSP 2024: 11601-11605 - [c216]Jun-Yu Ma, Zhen-Hua Ling, Ningyu Zhang, Jia-Chen Gu:
Neighboring Perturbations of Knowledge Editing on Large Language Models. ICML 2024 - [c215]Rui Feng, Yu-Ang Chen, Yin-Long Liu, Jia-Hong Yuan, Zhen-Hua Ling:
Wav2Nas: An Exploratory Approach to Nasalance Estimation in Speech. ISCSLP 2024: 1-5 - [c214]Rui Feng, Yin-Long Liu, Zhen-Hua Ling, Jia-Hong Yuan:
Wav2f0: Exploring the Potential of Wav2vec 2.0 for Speech Fundamental Frequency Extraction. ISCSLP 2024: 169-173 - [c213]Yu-Fei Shi, Yang Ai, Ye-Xin Lu, Hui-Peng Du, Zhen-Hua Ling:
SAMOS: A Neural MOS Prediction Model Leveraging Semantic Representations and Acoustic Features. ISCSLP 2024: 199-203 - [c212]Yubang Zhang, Jie Zhang, Zhenhua Ling:
The NERCSLIP-USTC System for Track2 of the First Chinese Auditory Attention Decoding Challenge. ISCSLP 2024: 319-323 - [c211]Yin-Long Liu, Rui Feng, Jia-Hong Yuan, Zhen-Hua Ling:
Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection. ISCSLP 2024: 486-490 - [c210]Hui-Peng Du, Yang Ai, Rui-Chen Zheng, Zhen-Hua Ling:
APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm. ISCSLP 2024: 676-680 - [c209]Rui-Chen Zheng
, Yang Ai
, Zhen-Hua Ling
:
Speech Reconstruction from Silent Lip and Tongue Articulation by Diffusion Models and Text-Guided Pseudo Target Generation. ACM Multimedia 2024: 6559-6568 - [c208]Chang Liu, Zhen-Hua Ling, Ya-Jun Hu:
Language-Independent Prosody-Enhanced Speech Representations For Multilingual Speech Synthesis. SLT 2024: 482-488 - [c207]Xiao-Hang Jiang, Yang Ai, Rui-Chen Zheng, Hui-Peng Du, Ye-Xin Lu, Zhen-Hua Ling:
MDCTCodec: A Lightweight MDCT-Based Neural Audio Codec Towards High Sampling Rate and Low Bitrate Scenarios. SLT 2024: 540-547 - [c206]Fei Liu, Yang Ai, Hui-Peng Du, Ye-Xin Lu, Rui-Chen Zheng, Zhen-Hua Ling:
Stage-Wise and Prior-Aware Neural Speech Phase Prediction. SLT 2024: 638-644 - [c205]Yu-Fei Shi, Yang Ai, Ye-Xin Lu, Hui-Peng Du, Zhen-Hua Ling:
Pitch-and-Spectrum-Aware Singing Quality Assessment with Bias Correction and Model Fusion. SLT 2024: 811-817 - [c204]Chenyang Guo, Liping Chen, Zhuhai Li, Kong Aik Lee, Zhen-Hua Ling, Wu Guo:
On The Generation and Removal of Speaker Adversarial Perturbation For Voice-Privacy Protection. SLT 2024: 1179-1184 - [e2]Yanmin Qian, Qin Jin, Zhijian Ou, Zhenhua Ling, Zhiyong Wu, Ya Li, Lei Xie, Jianhua Tao:
14th IEEE International Symposium on Chinese Spoken Language Processing, ISCSLP 2024, Beijing, China, November 7-10, 2024. IEEE 2024, ISBN 979-8-3315-1682-6 [contents] - [i108]Jia-Chen Gu, Hao-Xiang Xu, Jun-Yu Ma, Pan Lu, Zhen-Hua Ling, Kai-Wei Chang, Nanyun Peng:
Model Editing Can Hurt General Abilities of Large Language Models. CoRR abs/2401.04700 (2024) - [i107]Ye-Xin Lu, Yang Ai, Hui-Peng Du, Zhen-Hua Ling:
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction. CoRR abs/2401.06387 (2024) - [i106]Shihao Chen, Liping Chen, Jie Zhang, Kong-Aik Lee, Zhenhua Ling, Lirong Dai:
Adversarial speech for voice privacy protection from Personalized Speech generation. CoRR abs/2401.11857 (2024) - [i105]Shi-Qi Yan, Jia-Chen Gu, Yun Zhu, Zhen-Hua Ling:
Corrective Retrieval Augmented Generation. CoRR abs/2401.15884 (2024) - [i104]Jun-Yu Ma, Jia-Chen Gu, Ningyu Zhang, Zhen-Hua Ling:
Neighboring Perturbations of Knowledge Editing on Large Language Models. CoRR abs/2401.17623 (2024) - [i103]Yang Ai, Xiao-Hang Jiang, Ye-Xin Lu, Hui-Peng Du, Zhen-Hua Ling:
APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding. CoRR abs/2402.10533 (2024) - [i102]Qian Wang, Jia-Chen Gu, Zhen-Hua Ling:
Multiscale Matching Driven by Cross-Modal Similarity Consistency for Audio-Text Retrieval. CoRR abs/2403.10146 (2024) - [i101]Yang Ai, Zhen-Hua Ling:
Low-Latency Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks. CoRR abs/2403.17378 (2024) - [i100]Zhengyan Sheng, Yang Ai, Li-Juan Liu, Jia Pan, Zhen-Hua Ling:
Voice Attribute Editing with Text Prompt. CoRR abs/2404.08857 (2024) - [i99]Jun-Yu Ma, Hong Wang, Hao-Xiang Xu, Zhen-Hua Ling, Jia-Chen Gu:
Perturbation-Restrained Sequential Model Editing. CoRR abs/2405.16821 (2024) - [i98]Hui-Peng Du, Ye-Xin Lu, Yang Ai, Zhen-Hua Ling:
BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation. CoRR abs/2406.02162 (2024) - [i97]Ye-Xin Lu, Yang Ai, Zheng-Yan Sheng, Zhen-Hua Ling:
Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control. CoRR abs/2406.02250 (2024) - [i96]Rui Wang, Liping Chen, Kong-Aik Lee, Zhen-Hua Ling:
Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding. CoRR abs/2406.08200 (2024) - [i95]Hengyu Li, Kangdi Mei, Zhaoci Liu, Yang Ai, Liping Chen, Jie Zhang, Zhenhua Ling:
Refining Self-Supervised Learnt Speech Representation using Brain Activations. CoRR abs/2406.08266 (2024) - [i94]Keying Zuo, Qingtian Xu, Jie Zhang, Zhenhua Ling:
Geometry-Constrained EEG Channel Selection for Brain-Assisted Speech Enhancement. CoRR abs/2409.12520 (2024) - [i93]Fei Liu, Yang Ai, Hui-Peng Du, Ye-Xin Lu, Rui-Chen Zheng, Zhen-Hua Ling:
Stage-Wise and Prior-Aware Neural Speech Phase Prediction. CoRR abs/2410.04990 (2024) - [i92]Bolei He, Nuo Chen, Xinran He, Lingyong Yan, Zhenkai Wei, Jinchang Luo, Zhen-Hua Ling:
Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation. CoRR abs/2410.05801 (2024) - [i91]Hui-Peng Du, Yang Ai, Rui-Chen Zheng, Zhen-Hua Ling:
APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm. CoRR abs/2410.22807 (2024) - [i90]Xiao-Hang Jiang, Yang Ai, Rui-Chen Zheng, Hui-Peng Du, Ye-Xin Lu, Zhen-Hua Ling:
MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios. CoRR abs/2411.00464 (2024) - [i89]Yu-Fei Shi, Yang Ai, Ye-Xin Lu, Hui-Peng Du, Zhen-Hua Ling:
Pitch-and-Spectrum-Aware Singing Quality Assessment with Bias Correction and Model Fusion. CoRR abs/2411.11123 (2024) - [i88]Yu-Fei Shi, Yang Ai, Ye-Xin Lu, Hui-Peng Du, Zhen-Hua Ling:
SAMOS: A Neural MOS Prediction Model Leveraging Semantic Representations and Acoustic Features. CoRR abs/2411.11232 (2024) - [i87]Xiao-Hang Jiang, Hui-Peng Du, Yang Ai, Ye-Xin Lu, Zhen-Hua Ling:
ESTVocoder: An Excitation-Spectral-Transformed Neural Vocoder Conditioned on Mel Spectrogram. CoRR abs/2411.11258 (2024) - [i86]Jiaxuan Liu, Zhaoci Liu, Yajun Hu, Yingying Gao, Shilei Zhang, Zhenhua Ling:
DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles. CoRR abs/2412.03388 (2024) - [i85]Yin-Long Liu, Rui Feng, Jia-Hong Yuan, Zhen-Hua Ling:
Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection. CoRR abs/2412.06259 (2024) - [i84]Chenyang Guo, Liping Chen, Zhuhai Li, Kong Aik Lee, Zhen-Hua Ling, Wu Guo:
On the Generation and Removal of Speaker Adversarial Perturbation for Voice-Privacy Protection. CoRR abs/2412.09195 (2024) - 2023
- [j46]Yang Ai
, Ye-Xin Lu
, Zhen-Hua Ling
:
Long-Frame-Shift Neural Speech Phase Prediction With Spectral Continuity Enhancement and Interpolation Error Compensation. IEEE Signal Process. Lett. 30: 1097-1101 (2023) - [j45]Zhiqiang Guo
, Zhenhua Ling
:
Exploring the Topics of Audio Words for Detecting Alzheimer's Disease From Spontaneous Speech. IEEE Signal Process. Lett. 30: 1727-1731 (2023) - [j44]Yu-Ping Ruan
, Zhen-Hua Ling
:
Emotion-Regularized Conditional Variational Autoencoder for Emotional Response Generation. IEEE Trans. Affect. Comput. 14(1): 842-848 (2023) - [j43]Yang Ai
, Zhen-Hua Ling
:
APNet: An All-Frame-Level Neural Vocoder Incorporating Direct Prediction of Amplitude and Phase Spectra. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2145-2157 (2023) - [j42]Chang Liu
, Zhen-Hua Ling
, Ling-Hui Chen
:
Pronunciation Dictionary-Free Multilingual Speech Synthesis Using Learned Phonetic Representations. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3706-3716 (2023) - [c203]Beiduo Chen, Shaohan Huang, Zihan Zhang, Wu Guo, Zhenhua Ling, Haizhen Huang
, Furu Wei, Weiwei Deng, Qi Zhang:
Pre-training Language Model as a Multi-perspective Course Learner. ACL (Findings) 2023: 114-128 - [c202]Jia-Chen Gu, Zhenhua Ling, Quan Liu, Cong Liu, Guoping Hu:
GIFT: Graph-Induced Fine-Tuning for Multi-Party Conversation Understanding. ACL (1) 2023: 11645-11658 - [c201]Haochen Wu, Zhuhai Li, Luzhen Xu, Zhentao Zhang, Wenting Zhao, Bin Gu, Yang Ai, Yexin Lu, Jie Zhang, Zhenhua Ling, Wu Guo:
The USTC-NERCSLIP System for the Track 1.2 of Audio Deepfake Detection (ADD 2023) Challenge. DADA@IJCAI 2023: 119-124 - [c200]Yue Chen, Tian-Wei He, Hongbin Zhou, Jia-Chen Gu, Heng Lu, Zhen-Hua Ling:
Symbolization, Prompt, and Classification: A Framework for Implicit Speaker Identification in Novels. EMNLP (Findings) 2023: 3455-3467 - [c199]Chao-Hong Tan, Jia-Chen Gu, Zhen-Hua Ling:
Is ChatGPT a Good Multi-Party Conversation Solver? EMNLP (Findings) 2023: 4905-4915 - [c198]Jia-Chen Gu, Chao-Hong Tan, Caiyuan Chu, Zhen-Hua Ling, Chongyang Tao, Quan Liu, Cong Liu:
MADNet: Maximizing Addressee Deduction Expectation for Multi-Party Conversation Generation. EMNLP 2023: 7681-7692 - [c197]Yang Ai, Zhen-Hua Ling:
Neural Speech Phase Prediction Based on Parallel Estimation Architecture and Anti-Wrapping Losses. ICASSP 2023: 1-5 - [c196]Kangdi Mei, Xinyun Ding, Yinlong Liu, Zhiqiang Guo, Feiyang Xu, Xin Li, Tuya Naren, Jiahong Yuan, Zhenhua Ling:
The Ustc System for Adress-m Challenge. ICASSP 2023: 1-2 - [c195]Zhengyan Sheng
, Yang Ai, Zhen-Hua Ling:
Zero-Shot Personalized Lip-To-Speech Synthesis with Face Image Based Voice Control. ICASSP 2023: 1-5 - [c194]Jing-Xuan Zhang, Genshun Wan, Zhen-Hua Ling, Jia Pan, Jianqing Gao, Cong Liu:
Self-Supervised Audio-Visual Speech Representations Learning by Multimodal Self-Distillation. ICASSP 2023: 1-5 - [c193]Rui-Chen Zheng, Yang Ai, Zhen-Hua Ling:
Speech Reconstruction from Silent Tongue and Lip Articulation by Pseudo Target Generation and Domain Adversarial Training. ICASSP 2023: 1-5 - [c192]Zhaoci Liu, Zhen-Hua Ling, Ya-Jun Hu, Jia Pan, Jin-Wei Wang, Yun-Di Wu:
Speech Synthesis with Self-Supervisedly Learnt Prosodic Representations. INTERSPEECH 2023: 7-11 - [c191]Rui-Chen Zheng, Yang Ai, Zhen-Hua Ling:
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation. INTERSPEECH 2023: 844-848 - [c190]Jie Zhang, Qing-Tian Xu, Qiu-Shi Zhu, Zhen-Hua Ling:
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions. INTERSPEECH 2023: 3117-3121 - [c189]Ye-Xin Lu, Yang Ai, Zhen-Hua Ling:
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra. INTERSPEECH 2023: 3834-3838 - [c188]Zhengyan Sheng
, Yang Ai
, Yan-Nian Chen
, Zhen-Hua Ling
:
Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment. ACM Multimedia 2023: 8443-8452 - [c187]Jun-Yu Ma, Jia-Chen Gu, Jiajun Qi, Zhenhua Ling, Quan Liu, Xiaoyi Zhao:
USTC-NELSLIP at SemEval-2023 Task 2: Statistical Construction and Dual Adaptation of Gazetteer for Multilingual Complex NER. SemEval@ACL 2023: 651-659 - [i83]Ye-Xin Lu, Yang Ai, Zhen-Hua Ling:
Source-Filter-Based Generative Adversarial Neural Vocoder for High Fidelity Speech Synthesis. CoRR abs/2304.13270 (2023) - [i82]Jun-Yu Ma, Jia-Chen Gu, Jiajun Qi, Zhen-Hua Ling, Quan Liu, Xiaoyi Zhao:
USTC-NELSLIP at SemEval-2023 Task 2: Statistical Construction and Dual Adaptation of Gazetteer for Multilingual Complex NER. CoRR abs/2305.02517 (2023) - [i81]Beiduo Chen
, Shaohan Huang, Zihan Zhang, Wu Guo, Zhenhua Ling, Haizhen Huang
, Furu Wei, Weiwei Deng, Qi Zhang:
Pre-training Language Model as a Multi-perspective Course Learner. CoRR abs/2305.03981 (2023) - [i80]Yang Ai, Zhen-Hua Ling:
APNet: An All-Frame-Level Neural Vocoder Incorporating Direct Prediction of Amplitude and Phase Spectra. CoRR abs/2305.07952 (2023) - [i79]Jia-Chen Gu, Zhen-Hua Ling, Quan Liu, Cong Liu, Guoping Hu:
GIFT: Graph-Induced Fine-Tuning for Multi-Party Conversation Understanding. CoRR abs/2305.09360 (2023) - [i78]Jie Zhang, Qing-Tian Xu, Qiu-Shi Zhu, Zhen-Hua Ling:
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions. CoRR abs/2305.09994 (2023) - [i77]Chao-Hong Tan, Jia-Chen Gu, Zhen-Hua Ling:
DiffuSIA: A Spiral Interaction Architecture for Encoder-Decoder Text Diffusion. CoRR abs/2305.11517 (2023) - [i76]Jun-Yu Ma, Jia-Chen Gu, Zhen-Hua Ling, Quan Liu, Cong Liu, Guoping Hu:
SHINE: Syntax-augmented Hierarchical Interactive Encoder for Zero-shot Cross-lingual Information Extraction. CoRR abs/2305.12389 (2023) - [i75]Jia-Chen Gu, Chao-Hong Tan, Caiyuan Chu, Zhen-Hua Ling, Chongyang Tao, Quan Liu, Cong Liu, Guoping Hu:
MADNet: Maximizing Addressee Deduction Expectation for Multi-Party Conversation Generation. CoRR abs/2305.12733 (2023) - [i74]Zhengyan Sheng, Yang Ai, Zhen-Hua Ling:
Zero-shot personalized lip-to-speech synthesis with face image based voice control. CoRR abs/2305.14359 (2023) - [i73]Rui-Chen Zheng, Yang Ai, Zhen-Hua Ling:
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation. CoRR abs/2305.14933 (2023) - [i72]Yang Ai, Ye-Xin Lu, Zhen-Hua Ling:
Long-frame-shift Neural Speech Phase Prediction with Spectral Continuity Enhancement and Interpolation Error Compensation. CoRR abs/2308.08850 (2023) - [i71]Ye-Xin Lu, Yang Ai, Zhen-Hua Ling:
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement. CoRR abs/2308.08926 (2023) - [i70]Zhengyan Sheng, Yang Ai, Yan-Nian Chen, Zhen-Hua Ling:
Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment. CoRR abs/2309.09470 (2023) - [i69]Rui-Chen Zheng, Yang Ai, Zhen-Hua Ling:
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement. CoRR abs/2309.10455 (2023) - [i68]Jun-Yu Ma, Jia-Chen Gu, Zhen-Hua Ling, Quan Liu, Cong Liu:
Untying the Reversal Curse via Bidirectional Language Model Editing. CoRR abs/2310.10322 (2023) - [i67]Chao-Hong Tan, Jia-Chen Gu, Zhen-Hua Ling:
Is ChatGPT a Good Multi-Party Conversation Solver? CoRR abs/2310.16301 (2023) - [i66]Hongjie Zhang, Yi Liu, Lu Dong, Yifei Huang, Zhen-Hua Ling, Yali Wang, Limin Wang, Yu Qiao:
MoVQA: A Benchmark of Versatile Question-Answering for Long-Form Movie Understanding. CoRR abs/2312.04817 (2023) - 2022
- [j41]Yang Ai
, Zhen-Hua Ling
, Wei-Lu Wu, Ang Li:
Denoising-and-Dereverberation Hierarchical Neural Vocoder for Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2036-2048 (2022) - [c186]Chao-Hong Tan, Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling, Can Xu, Huang Hu, Xiubo Geng, Daxin Jiang:
TegTok: Augmenting Text Generation via Task-specific and Open-world Knowledge. ACL (Findings) 2022: 1597-1609 - [c185]Jia-Chen Gu, Chao-Hong Tan, Chongyang Tao, Zhen-Hua Ling, Huang Hu, Xiubo Geng, Daxin Jiang:
HeterMPC: A Heterogeneous Graph Neural Network for Response Generation in Multi-Party Conversations. ACL (1) 2022: 5086-5097 - [c184]Tianda Li, Jia-Chen Gu, Zhen-Hua Ling, Quan Liu:
Conversation- and Tree-Structure Losses for Dialogue Disentanglement. DialDoc@ACL 2022: 54-64 - [c183]Kangdi Mei, Zhiqiang Guo, Zhaoci Liu, Lijuan Liu, Xin Li, Zhenhua Ling:
Detecting Alzheimer's Disease Based on Acoustic Features Extracted from Pre-trained Models. CICAI (3) 2022: 272-283 - [c182]Jun-Yu Ma, Beiduo Chen
, Jia-Chen Gu, Zhenhua Ling, Wu Guo, Quan Liu, Zhigang Chen, Cong Liu:
Wider & Closer: Mixture of Short-channel Distillers for Zero-shot Cross-lingual Named Entity Recognition. EMNLP 2022: 5171-5183 - [c181]Lu Dong, Zhiqiang Guo, Chao-Hong Tan, Ya-Jun Hu, Yuan Jiang, Zhen-Hua Ling:
Neural Grapheme-To-Phoneme Conversion with Pre-Trained Grapheme Models. ICASSP 2022: 6202-6206 - [c180]Zhengyan Sheng
, Zhiqiang Guo, Xin Li, Yunxia Li, Zhenhua Ling:
Dementia Detection by Fusing Speech and Eye-Tracking Representation. ICASSP 2022: 6457-6461 - [c179]Yan-Nian Chen, Li-Juan Liu, Ya-Jun Hu, Yuan Jiang, Zhen-Hua Ling:
Improving Recognition-Synthesis Based any-to-one Voice Conversion with Cyclic Training. ICASSP 2022: 7007-7011 - [c178]Ning-Qian Wu, Zhaoci Liu, Zhen-Hua Ling:
Discourse-Level Prosody Modeling with a Variational Autoencoder for Non-Autoregressive Expressive Speech Synthesis. ICASSP 2022: 7592-7596 - [c177]Cheng Gong, Longbiao Wang, Zhenhua Ling, Ju Zhang, Jianwu Dang:
Using Multiple Reference Audios and Style Embedding Constraints for Speech Synthesis. ICASSP 2022: 7912-7916 - [c176]Pengyu Cheng, Zhen-Hua Ling:
Speaker Adaption with Intuitive Prosodic Features for Statistical Parametric Speech Synthesis. ICDSP 2022: 187-193 - [c175]Chao-Hong Tan, Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Zhen-Hua Ling:
PoNet: Pooling Network for Efficient Token Mixing in Long Sequences. ICLR 2022 - [c174]Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling:
Who Says What to Whom: A Survey of Multi-Party Conversations. IJCAI 2022: 5486-5493 - [c173]Yukun Peng, Zhenhua Ling:
Decoupled Pronunciation and Prosody Modeling in Meta-Learning-based Multilingual Speech Synthesis. INTERSPEECH 2022: 4257-4261 - [c172]Chang Liu, Zhen-Hua Ling, Ling-Hui Chen:
Pronunciation Dictionary-Free Multilingual Speech Synthesis by Combining Unsupervised and Supervised Phonetic Representations. INTERSPEECH 2022: 4282-4286 - [c171]Zhaoci Liu, Ning-Qian Wu, Yajie Zhang, Zhenhua Ling:
Integrating Discrete Word-Level Style Variations into Non-Autoregressive Acoustic Models for Speech Synthesis. INTERSPEECH 2022: 5508-5512 - [c170]Beiduo Chen
, Jun-Yu Ma, Jiajun Qi, Wu Guo, Zhen-Hua Ling, Quan Liu:
USTC-NELSLIP at SemEval-2022 Task 11: Gazetteer-Adapted Integration Network for Multilingual Complex Named Entity Recognition. SemEval@NAACL 2022: 1613-1622 - [i65]Lu Dong, Zhiqiang Guo, Chao-Hong Tan, Ya-Jun Hu, Yuan Jiang, Zhen-Hua Ling:
Neural Grapheme-to-Phoneme Conversion with Pre-trained Grapheme Models. CoRR abs/2201.10716 (2022) - [i64]Pengyu Cheng, Zhen-Hua Ling:
Speaker Adaption with Intuitive Prosodic Features for Statistical Parametric Speech Synthesis. CoRR abs/2203.00951 (2022) - [i63]Beiduo Chen, Jun-Yu Ma, Jiajun Qi, Wu Guo, Zhen-Hua Ling, Quan Liu:
USTC-NELSLIP at SemEval-2022 Task 11: Gazetteer-Adapted Integration Network for Multilingual Complex Named Entity Recognition. CoRR abs/2203.03216 (2022) - [i62]Lu Dong, Zhenhua Ling, Qiang Ling, Zefeng Lai:
Cognitive Diagnosis with Explicit Student Vector Estimation and Unsupervised Question Matrix Learning. CoRR abs/2203.03722 (2022) - [i61]Jia-Chen Gu, Chao-Hong Tan, Chongyang Tao, Zhen-Hua Ling, Huang Hu, Xiubo Geng, Daxin Jiang:
HeterMPC: A Heterogeneous Graph Neural Network for Response Generation in Multi-Party Conversations. CoRR abs/2203.08500 (2022) - [i60]Chao-Hong Tan, Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling, Can Xu, Huang Hu, Xiubo Geng, Daxin Jiang:
TegTok: Augmenting Text Generation via Task-specific and Open-world Knowledge. CoRR abs/2203.08517 (2022) - [i59]Chang Liu, Zhen-Hua Ling, Ling-Hui Chen:
Pronunciation Dictionary-Free Multilingual Speech Synthesis by Combining Unsupervised and Supervised Phonetic Representations. CoRR abs/2206.00951 (2022) - [i58]Yang Ai, Zhen-Hua Ling:
Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses. CoRR abs/2211.15974 (2022) - [i57]Jing-Xuan Zhang, Genshun Wan, Zhen-Hua Ling, Jia Pan, Jianqing Gao, Cong Liu:
Self-Supervised Audio-Visual Speech Representations Learning By Multimodal Self-Distillation. CoRR abs/2212.02782 (2022) - [i56]Jun-Yu Ma, Beiduo Chen, Jia-Chen Gu, Zhen-Hua Ling, Wu Guo, Quan Liu, Zhigang Chen, Cong Liu:
WIDER & CLOSER: Mixture of Short-channel Distillers for Zero-shot Cross-lingual Named Entity Recognition. CoRR abs/2212.03506 (2022) - 2021
- [j40]Runze Wang
, Zhen-Hua Ling
, Jing-Bo Zhou, Yu Hu:
A Multiple-Integration Encoder for Multi-Turn Text-to-SQL Semantic Parsing. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1503-1513 (2021) - [j39]Yajie Zhang, Zhen-Hua Ling
:
Extracting and Predicting Word-Level Style Variations for Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1582-1593 (2021) - [j38]Jia-Chen Gu
, Tianda Li, Zhen-Hua Ling
, Quan Liu, Zhiming Su, Yu-Ping Ruan
, Xiaodan Zhu
:
Deep Contextualized Utterance Representations for Response Selection and Dialogue Analysis. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2443-2455 (2021) - [j37]Xiao Zhou
, Zhen-Hua Ling
, Li-Rong Dai:
UnitNet: A Sequence-to-Sequence Acoustic Model for Concatenative Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2643-2655 (2021) - [j36]Yi-Yang Ding, Hao-Jian Lin, Li-Juan Liu, Zhen-Hua Ling
, Yu Hu:
Robustness of Speech Spoofing Detectors Against Adversarial Post-Processing of Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3415-3426 (2021) - [c169]Runze Wang, Zhen-Hua Ling, Jingbo Zhou, Yu Hu:
Tracking Interaction States for Multi-Turn Text-to-SQL Semantic Parsing. AAAI 2021: 13979-13987 - [c168]Jing-Xuan Zhang, Korin Richmond, Zhen-Hua Ling, Lirong Dai:
TaLNet: Voice Reconstruction from Tongue and Lip Articulation with Transfer Learning from Text-to-Speech Synthesis. AAAI 2021: 14402-14410 - [c167]Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling, Can Xu, Xiubo Geng, Daxin Jiang:
MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding. ACL/IJCNLP (1) 2021: 3682-3692 - [c166]Xin Fang, Zhen-Hua Ling, Lei Sun, Shutong Niu, Jun Du, Cong Liu, Zhi-Chao Sheng:
A Deep Analysis of Speech Separation Guided Diarization Under Realistic Conditions. APSIPA ASC 2021: 667-671 - [c165]Zhen-Hua Ling, Xiao Zhou, Simon King:
The Blizzard Challenge 2021. Blizzard Challenge 2021 - [c164]Qin Yang, Feiyang Xu, Zhenhua Ling, Xin Li, Yunxia Li, Decheng Fang:
Selecting and Analyzing Speech Features for the Screening of Mild Cognitive Impairment. EMBC 2021: 1906-1910 - [c163]Jia-Chen Gu, Zhen-Hua Ling, Yu Wu, Quan Liu, Zhigang Chen, Xiaodan Zhu:
Detecting Speaker Personas from Conversational Texts. EMNLP (1) 2021: 1126-1136 - [c162]Shiming Wang, Zhenhua Ling, Ruibo Fu, Jiangyan Yi, Jianhua Tao:
Patnet : A Phoneme-Level Autoregressive Transformer Network for Speech Synthesis. ICASSP 2021: 5684-5688 - [c161]Cheng Gong, Longbiao Wang, Zhenhua Ling, Shaotong Guo, Ju Zhang, Jianwu Dang:
Improving Naturalness and Controllability of Sequence-to-Sequence Speech Synthesis by Learning Local Prosody Representations. ICASSP 2021: 5724-5728 - [c160]Tianda Li, Jia-Chen Gu, Hui Liu, Quan Liu, Zhen-Hua Ling, Zhiming Su, Xiaodan Zhu:
Have You Made a Decision? Where? A Pilot Study on Interpretability of Polarity Analysis Based on Advising Problem. ICASSP 2021: 6928-6932 - [c159]Zhaoci Liu, Zhiqiang Guo, Zhenhua Ling, Yunxia Li:
Detecting Alzheimer's Disease from Speech Using Neural Networks with Bottleneck Features and Data Augmentation. ICASSP 2021: 7323-7327 - [c158]Rui Yang, Runze Wang, Zhen-Hua Ling:
Graph Attention and Interaction Network With Multi-Task Learning for Fact Verification. ICASSP 2021: 7838-7842 - [c157]Xin Fang, Haijia Du, Tian Gao, Liang Zou
, Zhenhua Ling:
Voice spoofing detection with raw waveform based on Dual Path Res2net. ICCSE 2021: 160-165 - [c156]Yajie Zhang, Zhen-Hua Ling:
Learning Deep and Wide Contextual Representations Using BERT for Statistical Parametric Speech Synthesis. ICDSP 2021: 146-150 - [c155]Yi-Yang Ding, Li-Juan Liu, Yu Hu, Zhen-Hua Ling:
Adversarial Voice Conversion Against Neural Spoofing Detectors. Interspeech 2021: 816-820 - [c154]Yue Chen, Zhen-Hua Ling, Qing-Feng Liu:
A Neural-Network-Based Approach to Identifying Speakers in Novels. Interspeech 2021: 4114-4118 - [c153]Xiao Zhou, Zhen-Hua Ling, Li-Rong Dai:
UnitNet-Based Hybrid Speech Synthesis. Interspeech 2021: 4119-4123 - [c152]Chang Liu, Yang Ai, Zhenhua Ling:
Phase Spectrum Recovery for Enhancing Low-Quality Speech Captured by Laser Microphones. ISCSLP 2021: 1-5 - [c151]Boyuan Zheng, Xiaoyu Yang, Yu-Ping Ruan, Zhen-Hua Ling, Quan Liu, Si Wei, Xiaodan Zhu:
SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning. SemEval@ACL/IJCNLP 2021: 37-50 - [c150]Jia-Chen Gu, Hui Liu, Zhen-Hua Ling, Quan Liu, Zhigang Chen, Xiaodan Zhu:
Partner Matters! An Empirical Study on Fusing Personas for Personalized Response Selection in Retrieval-Based Chatbots. SIGIR 2021: 565-574 - [c149]Yang Ai, Haoyu Li, Xin Wang
, Junichi Yamagishi, Zhen-Hua Ling:
Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust Waveform Generation. SLT 2021: 477-484 - [c148]Min Lu, Bin Zhou, Zhiyong Bu, Kecheng Zhang, Zhen-Hua Ling:
Compressed Network in Network Models for Traffic Classification. WCNC 2021: 1-6 - [i55]Yu-Ping Ruan, Zhen-Hua Ling:
Emotion-Regularized Conditional Variational Autoencoder for Emotional Response Generation. CoRR abs/2104.08857 (2021) - [i54]Jia-Chen Gu, Hui Liu, Zhen-Hua Ling, Quan Liu, Zhigang Chen, Xiaodan Zhu:
Partner Matters! An Empirical Study on Fusing Personas for Personalized Response Selection in Retrieval-Based Chatbots. CoRR abs/2105.09050 (2021) - [i53]Boyuan Zheng, Xiaoyu Yang, Yu-Ping Ruan, Zhen-Hua Ling, Quan Liu, Si Wei, Xiaodan Zhu:
SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning. CoRR abs/2105.14879 (2021) - [i52]Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling, Can Xu, Xiubo Geng, Daxin Jiang:
MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding. CoRR abs/2106.01541 (2021) - [i51]Jia-Chen Gu, Zhen-Hua Ling, Yu Wu, Quan Liu, Zhigang Chen, Xiaodan Zhu:
Detecting Speaker Personas from Conversational Texts. CoRR abs/2109.01330 (2021) - [i50]Chao-Hong Tan, Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Zhen-Hua Ling:
PoNet: Pooling Network for Efficient Token Mixing in Long Sequences. CoRR abs/2110.02442 (2021) - [i49]Cheng Gong, Longbiao Wang, Zhenhua Ling, Ju Zhang, Jianwu Dang:
Using multiple reference audios and style embedding constraints for speech synthesis. CoRR abs/2110.04451 (2021) - 2020
- [j35]Zhiyong Bu, Bin Zhou, Pengyu Cheng, Kecheng Zhang
, Zhen-Hua Ling
:
Encrypted Network Traffic Classification Using Deep and Parallel Network-in-Network Models. IEEE Access 8: 132950-132959 (2020) - [j34]Yu-Ping Ruan
, Zhen-Hua Ling
, Xiaodan Zhu, Quan Liu, Jia-Chen Gu:
Generating diverse conversation responses by creating and ranking multiple candidates. Comput. Speech Lang. 62: 101071 (2020) - [j33]Xin Wang
, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee
, Lauri Juvela
, Paavo Alku
, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao
, Hsin-Min Wang
, Sébastien Le Maguer
, Markus Becker, Zhen-Hua Ling:
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech. Comput. Speech Lang. 64: 101114 (2020) - [j32]Xin Fang
, Tian Gao
, Liang Zou
, Zhen-Hua Ling
:
Bidirectional Attention for Text-Dependent Speaker Verification. Sensors 20(23): 6784 (2020) - [j31]Xiao Zhou, Zhen-Hua Ling, Li-Rong Dai:
Learning and Modeling Unit Embeddings Using Deep Neural Networks for Unit-Selection-Based Mandarin Speech Synthesis. ACM Trans. Asian Low Resour. Lang. Inf. Process. 19(3): 38:1-38:14 (2020) - [j30]Yu-Ping Ruan
, Zhen-Hua Ling, Xiaodan Zhu:
Condition-Transforming Variational Autoencoder for Generating Diverse Short Text Conversations. ACM Trans. Asian Low Resour. Lang. Inf. Process. 19(6): 79:1-79:13 (2020) - [j29]Jia-Chen Gu
, Zhen-Hua Ling
, Quan Liu:
Utterance-to-Utterance Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots. IEEE ACM Trans. Audio Speech Lang. Process. 28: 369-379 (2020) - [j28]Jing-Xuan Zhang
, Zhen-Hua Ling
, Li-Rong Dai:
Non-Parallel Sequence-to-Sequence Voice Conversion With Disentangled Linguistic and Speaker Representations. IEEE ACM Trans. Audio Speech Lang. Process. 28: 540-552 (2020) - [j27]Yang Ai
, Zhen-Hua Ling
:
A Neural Vocoder With Hierarchical Generation of Amplitude and Phase Spectra for Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 28: 839-851 (2020) - [c147]Yi-Yang Ding, Jing-Xuan Zhang, Li-Juan Liu, Yuan Jiang, Yu Hu, Zhen-Hua Ling:
Adversarial Post-Processing of Voice Conversion against Spoofing Detection. APSIPA 2020: 556-560 - [c146]Qiuchen Huang, Yang Ai, Zhenhua Ling:
Online Speaker Adaptation for WaveNet-based Neural Vocoders. APSIPA 2020: 815-820 - [c145]Yi Zhao, Wen-Chin Huang, Xiaohai Tian, Junichi Yamagishi, Rohan Kumar Das, Tomi Kinnunen, Zhen-Hua Ling, Tomoki Toda:
Voice Conversion Challenge 2020 -- Intra-lingual semi-parallel and cross-lingual voice conversion --. Blizzard Challenge / Voice Conversion Challenge 2020 - [c144]Rohan Kumar Das, Tomi Kinnunen, Wen-Chin Huang, Zhen-Hua Ling, Junichi Yamagishi, Yi Zhao, Xiaohai Tian, Tomoki Toda:
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions. Blizzard Challenge / Voice Conversion Challenge 2020 - [c143]Li-Juan Liu, Yan-Nian Chen, Jing-Xuan Zhang, Yuan Jiang, Ya-Jun Hu, Zhen-Hua Ling, Li-Rong Dai:
Non-Parallel Voice Conversion with Autoregressive Conversion Model and Duration Adjustment. Blizzard Challenge / Voice Conversion Challenge 2020 - [c142]Jing-Xuan Zhang, Li-Juan Liu, Yan-Nian Chen, Ya-Jun Hu, Yuan Jiang, Zhen-Hua Ling, Li-Rong Dai:
Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer. Blizzard Challenge / Voice Conversion Challenge 2020 - [c141]Xiao Zhou, Zhen-Hua Ling, Simon King:
The Blizzard Challenge 2020. Blizzard Challenge / Voice Conversion Challenge 2020 - [c140]Jia-Chen Gu, Tianda Li, Quan Liu, Zhen-Hua Ling, Zhiming Su, Si Wei, Xiaodan Zhu:
Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots. CIKM 2020: 2041-2044 - [c139]Zhiqiang Guo, Zhaoci Liu, Zhenhua Ling, Shijin Wang, Lingjing Jin, Yunxia Li:
Text Classification by Contrastive Learning and Cross-lingual Data Augmentation for Alzheimer's Disease Detection. COLING 2020: 6161-6171 - [c138]Jia-Chen Gu, Zhen-Hua Ling, Quan Liu, Zhigang Chen, Xiaodan Zhu:
Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots. EMNLP (Findings) 2020: 1412-1422 - [c137]Lei Zhang, Runze Wang, Jingbo Zhou, Jingsong Yu, Zhenhua Ling, Hui Xiong:
Joint Intent Detection and Entity Linking on Spatial Domain Queries. EMNLP (Findings) 2020: 4937-4947 - [c136]Ning-Qian Wu, Zhen-Hua Ling:
WaveFFJORD: FFJORD-Based Vocoder for Statistical Parametric Speech Synthesis. ICASSP 2020: 7214-7218 - [c135]Xiao Zhou, Zhen-Hua Ling, Li-Rong Dai:
Extracting Unit Embeddings Using Sequence-To-Sequence Acoustic Models for Unit Selection Speech Synthesis. ICASSP 2020: 7659-7663 - [c134]Feiyang Xu, Yue Ding, Zhenhua Ling, Xin Li, Yunxia Li, Shijin Wang:
DCDT: A Digital Clock Drawing Test System for Cognitive Impairment Screening. ICDE 2020: 1762-1765 - [c133]Yang Ai, Zhen-Hua Ling:
Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical Neural Vocoders. INTERSPEECH 2020: 190-194 - [c132]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning. INTERSPEECH 2020: 771-775 - [c131]Fenglin Ding, Wu Guo, Bin Gu, Zhen-Hua Ling, Jun Du:
Unsupervised Regularization-Based Adaptive Training for Speech Recognition. INTERSPEECH 2020: 996-1000 - [c130]Fenglin Ding, Wu Guo, Bin Gu, Zhen-Hua Ling, Jun Du:
Adaptive Speaker Normalization for CTC-Based Speech Recognition. INTERSPEECH 2020: 1266-1270 - [c129]Bin Gu, Wu Guo, Fenglin Ding, Zhen-Hua Ling, Jun Du:
An Adaptive X-Vector Model for Text-Independent Speaker Verification. INTERSPEECH 2020: 1506-1510 - [c128]Yang Ai, Xin Wang
, Junichi Yamagishi, Zhen-Hua Ling:
Reverberation Modeling for Source-Filter-Based Neural Vocoder. INTERSPEECH 2020: 3560-3564 - [e1]Junichi Yamagishi, Zhenhua Ling, Rohan Kumar Das, Simon King, Tomi Kinnunen, Tomoki Toda, Wen-Chin Huang, Xiao Zhou, Xiaohai Tian, Yi Zhao:
Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, Shanghai, China, October 30, 2020. ISCA 2020 [contents] - [i48]Yu-Ping Ruan, Zhen-Hua Ling, Jia-Chen Gu, Quan Liu:
Fine-Tuning BERT for Schema-Guided Zero-Shot Dialogue State Tracking. CoRR abs/2002.00181 (2020) - [i47]Jia-Chen Gu, Tianda Li, Quan Liu, Xiaodan Zhu, Zhen-Hua Ling, Yu-Ping Ruan:
Pre-Trained and Attention-Based Neural Networks for Building Noetic Task-Oriented Dialogue Systems. CoRR abs/2004.01940 (2020) - [i46]Jia-Chen Gu, Tianda Li, Quan Liu, Xiaodan Zhu, Zhen-Hua Ling, Zhiming Su, Si Wei:
Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots. CoRR abs/2004.03588 (2020) - [i45]Tianda Li, Jia-Chen Gu, Xiaodan Zhu, Quan Liu, Zhen-Hua Ling, Zhiming Su, Si Wei:
DialBERT: A Hierarchical Pre-Trained Model for Conversation Disentanglement. CoRR abs/2004.03760 (2020) - [i44]Yang Ai, Zhen-Hua Ling:
Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical Neural Vocoders. CoRR abs/2004.07832 (2020) - [i43]Jia-Chen Gu, Zhen-Hua Ling, Quan Liu, Si Wei, Xiaodan Zhu:
Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots. CoRR abs/2004.14550 (2020) - [i42]Yang Ai, Xin Wang, Junichi Yamagishi, Zhen-Hua Ling:
Reverberation Modeling for Source-Filter-based Neural Vocoder. CoRR abs/2005.07379 (2020) - [i41]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning. CoRR abs/2008.02371 (2020) - [i40]Yi Zhao, Wen-Chin Huang, Xiaohai Tian, Junichi Yamagishi, Rohan Kumar Das, Tomi Kinnunen, Zhen-Hua Ling, Tomoki Toda:
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion. CoRR abs/2008.12527 (2020) - [i39]Jing-Xuan Zhang, Li-Juan Liu, Yan-Nian Chen, Ya-Jun Hu, Yuan Jiang, Zhen-Hua Ling, Li-Rong Dai:
Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer. CoRR abs/2009.01475 (2020) - [i38]Rohan Kumar Das, Tomi Kinnunen, Wen-Chin Huang, Zhen-Hua Ling, Junichi Yamagishi, Yi Zhao, Xiaohai Tian, Tomoki Toda:
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions. CoRR abs/2009.03554 (2020) - [i37]Yang Ai, Haoyu Li, Xin Wang, Junichi Yamagishi, Zhen-Hua Ling:
Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust Waveform Generation. CoRR abs/2011.03955 (2020) - [i36]Runze Wang, Zhen-Hua Ling, Jingbo Zhou, Yu Hu:
Tracking Interaction States for Multi-Turn Text-to-SQL Semantic Parsing. CoRR abs/2012.04995 (2020) - [i35]Chao-Hong Tan, Xiaoyu Yang, Zi'ou Zheng, Tianda Li, Yufei Feng, Jia-Chen Gu, Quan Liu, Dan Liu, Zhen-Hua Ling, Xiaodan Zhu:
Learning to Retrieve Entity-Aware Knowledge and Generate Responses with Copy Mechanism for Task-Oriented Dialogue Systems. CoRR abs/2012.11937 (2020)
2010 – 2019
- 2019
- [j26]Runze Wang, Zhen-Hua Ling
, Yu Hu:
Knowledge Base Question Answering With Attentive Pooling for Question Representation. IEEE Access 7: 46773-46784 (2019) - [j25]Jing-Xuan Zhang
, Zhen-Hua Ling
, Li-Juan Liu, Yuan Jiang, Li-Rong Dai:
Sequence-to-Sequence Acoustic Modeling for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 27(3): 631-644 (2019) - [c127]Zhi-Xiu Ye, Zhen-Hua Ling:
Multi-Level Matching and Aggregation Network for Few-Shot Relation Classification. ACL (1) 2019: 2872-2881 - [c126]Zhaoci Liu, Zhiqiang Guo, Zhenhua Ling, Shijin Wang, Lingjing Jin, Yunxia Li:
Dementia Detection by Analyzing Spontaneous Mandarin Speech. APSIPA 2019: 289-296 - [c125]Peng-Fei Wu, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Hong-Chuan Wu, Lirong Dai:
End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training. APSIPA 2019: 623-627 - [c124]Rui Yang, Zhen-Hua Ling:
Linguistic Steganography by Sampling-based Language Generation. APSIPA 2019: 1014-1019 - [c123]Yuan Jiang, Ya-Jun Hu, Li-Juan Liu, Hong-Chuan Wu, Zhi-Kun Wang, Yang Ai, Zhen-Hua Ling, Li-Rong Dai:
The USTC System for Blizzard Challenge 2019. Blizzard Challenge 2019 - [c122]Jia-Chen Gu, Zhen-Hua Ling, Quan Liu:
Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots. CIKM 2019: 2321-2324 - [c121]Jia-Chen Gu, Zhen-Hua Ling, Xiaodan Zhu, Quan Liu:
Dually Interactive Matching Network for Personalized Response Selection in Retrieval-Based Chatbots. EMNLP/IJCNLP (1) 2019: 1845-1854 - [c120]Xin Fang, Liang Zou
, Jin Li, Lei Sun, Zhen-Hua Ling:
Channel Adversarial Training for Cross-channel Text-independent Speaker Recognition. ICASSP 2019: 6221-6225 - [c119]Jing-Xuan Zhang, Zhen-Hua Ling, Yuan Jiang, Li-Juan Liu, Chen Liang, Li-Rong Dai:
Improving Sequence-to-sequence Voice Conversion by Adding Text-supervision. ICASSP 2019: 6785-6789 - [c118]Yajie Zhang, Shifeng Pan, Lei He, Zhen-Hua Ling:
Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis. ICASSP 2019: 6945-6949 - [c117]Yang Ai, Jing-Xuan Zhang, Liang Chen, Zhen-Hua Ling:
Dnn-based Spectral Enhancement for Neural Waveform Generators with Low-bit Quantization. ICASSP 2019: 7025-7029 - [c116]Yu-Ping Ruan, Zhen-Hua Ling, Quan Liu, Zhigang Chen, Nitin Indurkhya:
Condition-transforming Variational Autoencoder for Conversation Response Generation. ICASSP 2019: 7215-7219 - [c115]Chaohong Tan, Zhenhua Ling:
Multi-Classification Model for Spoken Language Understanding. ICMI 2019: 526-530 - [c114]Jia-Xiang Chen, Zhen-Hua Ling, Li-Rong Dai:
A Chinese Dataset for Identifying Speakers in Novels. INTERSPEECH 2019: 1561-1565 - [c113]Yuan-Hao Yi, Yang Ai, Zhen-Hua Ling, Li-Rong Dai:
Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling. INTERSPEECH 2019: 2593-2597 - [c112]Zhi Chen, Wu Guo, Li-Rong Dai, Zhen-Hua Ling, Jun Du:
Neural Text Clustering with Document-Level Attention Based on Dynamic Soft Labels. INTERSPEECH 2019: 4225-4229 - [c111]Zhi-Xiu Ye, Zhen-Hua Ling:
Distant Supervision Relation Extraction with Intra-Bag and Inter-Bag Attentions. NAACL-HLT (1) 2019: 2810-2819 - [i34]Jia-Chen Gu, Zhen-Hua Ling, Quan Liu:
Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots. CoRR abs/1901.01824 (2019) - [i33]Yu-Ping Ruan, Zhen-Hua Ling, Quan Liu, Jia-Chen Gu, Xiaodan Zhu:
Promoting Diversity for End-to-End Conversation Response Generation. CoRR abs/1901.09444 (2019) - [i32]Xin Fang, Liang Zou, Jin Li, Lei Sun, Zhen-Hua Ling:
Channel adversarial training for cross-channel text-independent speaker recognition. CoRR abs/1902.09074 (2019) - [i31]Zhi-Xiu Ye, Zhen-Hua Ling:
Distant Supervision Relation Extraction with Intra-Bag and Inter-Bag Attentions. CoRR abs/1904.00143 (2019) - [i30]Yu-Ping Ruan, Xiaodan Zhu, Zhen-Hua Ling, Zhan Shi, Quan Liu, Si Wei:
Exploring Unsupervised Pretraining and Sentence Structure Modelling for Winograd Schema Challenge. CoRR abs/1904.09705 (2019) - [i29]Yu-Ping Ruan, Zhen-Hua Ling, Quan Liu, Zhigang Chen, Nitin Indurkhya:
Condition-Transforming Variational AutoEncoder for Conversation Response Generation. CoRR abs/1904.10610 (2019) - [i28]Zhi-Xiu Ye, Zhen-Hua Ling:
Multi-Level Matching and Aggregation Network for Few-Shot Relation Classification. CoRR abs/1906.06678 (2019) - [i27]Yuan-Hao Yi, Yang Ai, Zhen-Hua Ling, Li-Rong Dai:
Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling. CoRR abs/1906.08977 (2019) - [i26]Yang Ai, Zhen-Hua Ling:
A Neural Vocoder with Hierarchical Generation of Amplitude and Phase Spectra for Statistical Parametric Speech Synthesis. CoRR abs/1906.09573 (2019) - [i25]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations. CoRR abs/1906.10508 (2019) - [i24]Peng-Fei Wu, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Hong-Chuan Wu, Li-Rong Dai:
End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training. CoRR abs/1906.10859 (2019) - [i23]Jia-Chen Gu, Zhen-Hua Ling, Xiaodan Zhu, Quan Liu:
Dually Interactive Matching Network for Personalized Response Selection in Retrieval-Based Chatbots. CoRR abs/1908.05859 (2019) - [i22]Zhi-Xiu Ye, Qian Chen, Wen Wang, Zhen-Hua Ling:
Align, Mask and Select: A Simple Method for Incorporating Commonsense Knowledge into Language Representation Models. CoRR abs/1908.06725 (2019) - [i21]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling:
The ASVspoof 2019 database. CoRR abs/1911.01601 (2019) - [i20]Jia-Chen Gu, Zhen-Hua Ling, Quan Liu:
Utterance-to-Utterance Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots. CoRR abs/1911.06940 (2019) - 2018
- [j24]Zheng-Chen Liu, Zhen-Hua Ling, Li-Rong Dai:
Articulatory-to-acoustic conversion using BLSTM-RNNs with augmented input representation. Speech Commun. 99: 161-172 (2018) - [j23]Zheng-Chen Liu, Zhen-Hua Ling
, Li-Rong Dai:
Statistical Parametric Speech Synthesis Using Generalized Distillation Framework. IEEE Signal Process. Lett. 25(5): 695-699 (2018) - [j22]Yu-Ping Ruan
, Qian Chen, Zhen-Hua Ling
:
A Sequential Neural Encoder With Latent Structured Description for Modeling Sentences. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 231-242 (2018) - [j21]Ya-Jun Hu
, Zhen-Hua Ling
:
Extracting Spectral Features Using Deep Autoencoders With Binary Distributed Hidden Units for Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 26(4): 713-724 (2018) - [j20]Zhen-Hua Ling
, Yang Ai
, Yu Gu, Li-Rong Dai:
Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension. IEEE ACM Trans. Audio Speech Lang. Process. 26(5): 883-894 (2018) - [j19]Junhua Liu, Zhen-Hua Ling
, Si Wei, Guoping Hu, Li-Rong Dai:
Improving the Decoding Efficiency of Deep Neural Network Acoustic Models by Cluster-Based Senone Selection. J. Signal Process. Syst. 90(7): 999-1011 (2018) - [j18]Zhen-Hua Ling, Zhi-Ping Zhou:
Unit Selection Speech Synthesis Using Frame-Sized Speech Segments and Neural Network Based Acoustic Models. J. Signal Process. Syst. 90(7): 1053-1062 (2018) - [c110]Zhi-Xiu Ye, Zhen-Hua Ling:
Hybrid semi-Markov CRF for Neural Sequence Labeling. ACL (2) 2018: 235-240 - [c109]Qian Chen, Xiaodan Zhu, Zhen-Hua Ling, Diana Inkpen, Si Wei:
Neural Natural Language Inference Models Enhanced with External Knowledge. ACL (1) 2018: 2406-2417 - [c108]Bing Yin, Jun Du, Lei Sun, Xueyang Zhang, Shan He, Zhenhua Ling, Guoping Hu, Wu Guo:
An Analysis of Speaker Diarization Fusion Methods For The First DIHARD Challenge. APSIPA 2018: 1473-1477 - [c107]Yuan Jiang, Xiao Zhou, Chuang Ding, Ya-Jun Hu, Zhen-Hua Ling, Li-Rong Dai:
The USTC System for Blizzard Challenge 2018. Blizzard Challenge 2018 - [c106]Jia-Chen Gu, Zhen-Hua Ling, Nitin Indurkhya:
A Study on Improving End-to-End Neural Coreference Resolution. CCL 2018: 159-169 - [c105]Qian Chen, Zhen-Hua Ling, Xiaodan Zhu:
Enhancing Sentence Embedding with Generalized Pooling. COLING 2018: 1815-1826 - [c104]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Forward Attention in Sequence- To-Sequence Acoustic Modeling for Speech Synthesis. ICASSP 2018: 4789-4793 - [c103]Yang Ai, Hong-Chuan Wu, Zhen-Hua Ling:
Samplernn-Based Neural Vocoder for Statistical Parametric Speech Synthesis. ICASSP 2018: 5659-5663 - [c102]Peixin Chen, Wu Guo, Lirong Dai, Zhenhua Ling:
Pseudo-Supervised Approach for Text Clustering Based on Consensus Analysis. ICASSP 2018: 6184-6188 - [c101]Li-Juan Liu, Zhen-Hua Ling, Yuan Jiang, Ming Zhou, Li-Rong Dai:
WaveNet Vocoder with Limited Training Data for Voice Conversion. INTERSPEECH 2018: 1983-1987 - [c100]Xiao Zhou, Zhen-Hua Ling, Zhi-Ping Zhou, Li-Rong Dai:
Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis. INTERSPEECH 2018: 2509-2513 - [c99]Yi-Yang Ding, Ya-Jun Hu, Zhen-Hua Ling:
GTDNN-Based Voice Conversion Using DAEs with Binary Distributed Hidden Units. ISCSLP 2018: 1-5 - [c98]Tomi Kinnunen, Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Zhen-Hua Ling:
A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment. Odyssey 2018: 187-194 - [c97]Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Tomi Kinnunen, Zhen-Hua Ling:
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods. Odyssey 2018: 195-202 - [i19]Zhen-Hua Ling, Yang Ai, Yu Gu, Li-Rong Dai:
Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension. CoRR abs/1801.07910 (2018) - [i18]Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Tomi Kinnunen, Zhen-Hua Ling:
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods. CoRR abs/1804.04262 (2018) - [i17]Tomi Kinnunen, Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Zhen-Hua Ling:
A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment. CoRR abs/1804.08438 (2018) - [i16]Zhi-Xiu Ye, Zhen-Hua Ling:
Hybrid semi-Markov CRF for Neural Sequence Labeling. CoRR abs/1805.03838 (2018) - [i15]Qian Chen, Zhen-Hua Ling, Xiaodan Zhu:
Enhancing Sentence Embedding with Generalized Pooling. CoRR abs/1806.09828 (2018) - [i14]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis. CoRR abs/1807.06736 (2018) - [i13]Jing-Xuan Zhang, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Li-Rong Dai:
Sequence-to-Sequence Acoustic Modeling for Voice Conversion. CoRR abs/1810.06865 (2018) - [i12]Jing-Xuan Zhang, Zhen-Hua Ling, Yuan Jiang, Li-Juan Liu, Chen Liang, Li-Rong Dai:
Improving Sequence-to-Sequence Acoustic Modeling by Adding Text-Supervision. CoRR abs/1811.08111 (2018) - [i11]Jia-Chen Gu, Zhen-Hua Ling, Yu-Ping Ruan, Quan Liu:
Building Sequential Inference Models for End-to-End Response Selection. CoRR abs/1812.00686 (2018) - [i10]Yajie Zhang, Shifeng Pan, Lei He, Zhen-Hua Ling:
Learning latent representations for style control and transfer in end-to-end speech synthesis. CoRR abs/1812.04342 (2018) - 2017
- [c96]Quan Liu, Hui Jiang, Zhen-Hua Ling, Xiaodan Zhu, Si Wei, Yu Hu:
Combing Context and Commonsense Knowledge Through Neural Networks for Solving Winograd Schema Problems. AAAI Spring Symposia 2017 - [c95]Qian Chen, Xiaodan Zhu, Zhen-Hua Ling, Si Wei, Hui Jiang, Diana Inkpen:
Enhanced LSTM for Natural Language Inference. ACL (1) 2017: 1657-1668 - [c94]Shumin An, Zhenhua Ling, Lirong Dai:
Emotional statistical parametric speech synthesis using LSTM-RNNs. APSIPA 2017: 1613-1616 - [c93]Ya-Jun Hu, Li-Juan Liu, Chuang Ding, Zhen-Hua Ling, Li-Rong Dai:
The USTC system for blizzard machine learning challenge 2017-ES2. ASRU 2017: 650-656 - [c92]Li-Juan Liu, Chuang Ding, Ya-Jun Hu, Zhen-Hua Ling, Yuan Jiang, Ming Zhou, Si Wei:
The iFLYTEK system for blizzard machine learning challenge 2017-ES1. ASRU 2017: 657-664 - [c91]Ya-Jun Hu, Chuang Ding, Li-Juan Liu, Zhen-Hua Ling, Li-Rong Dai:
The USTC System for Blizzard Challenge 2017. Blizzard Challenge 2017 - [c90]Runze Wang, Chen-Di Zhan, Zhen-Hua Ling:
Question Answering with Character-Level LSTM Encoders and Model-Based Data Augmentation. CCL 2017: 295-305 - [c89]Ya-Jun Hu, Zhen-Hua Ling, Li-Rong Dai:
Extracting structural spectral features using what-where auto-encoders for statistical parametric speech synthesis. ICASSP 2017: 4915-4919 - [c88]Quan Liu, Hui Jiang, Andrew Evdokimov, Zhen-Hua Ling, Xiaodan Zhu, Si Wei, Yu Hu:
Cause-Effect Knowledge Acquisition and Neural Association Model for Solving A Set of Winograd Schema Problems. IJCAI 2017: 2344-2350 - [c87]Yu Gu, Zhen-Hua Ling:
Waveform Modeling Using Stacked Dilated Convolutional Neural Networks for Speech Bandwidth Extension. INTERSPEECH 2017: 1123-1127 - [c86]Qian Chen, Xiaodan Zhu, Zhen-Hua Ling, Si Wei, Hui Jiang, Diana Inkpen:
Recurrent Neural Network-Based Sentence Encoder with Gated Attention for Natural Language Inference. RepEval@EMNLP 2017: 36-40 - [i9]Qian Chen, Xiaodan Zhu, Zhen-Hua Ling, Si Wei, Hui Jiang, Diana Inkpen:
Recurrent Neural Network-Based Sentence Encoder with Gated Attention for Natural Language Inference. CoRR abs/1708.01353 (2017) - [i8]Qian Chen, Xiaodan Zhu, Zhen-Hua Ling, Diana Inkpen, Si Wei:
Natural Language Inference with External Knowledge. CoRR abs/1711.04289 (2017) - [i7]Yu-Ping Ruan, Qian Chen, Zhen-Hua Ling:
A Sequential Neural Encoder with Latent Structured Description for Modeling Sentences. CoRR abs/1711.05433 (2017) - 2016
- [j17]Xin Wang
, Zhen-Hua Ling, Li-Rong Dai:
Concept-to-Speech generation with knowledge sharing for acoustic modelling and utterance filtering. Comput. Speech Lang. 38: 46-67 (2016) - [j16]Xiang Yin
, Ming Lei, Yao Qian, Frank K. Soong, Lei He, Zhen-Hua Ling, Li-Rong Dai:
Modeling F0 trajectories in hierarchically structured deep neural networks. Speech Commun. 76: 82-92 (2016) - [j15]Ya-Jun Hu, Zhen-Hua Ling:
DBN-based Spectral Feature Representation for Statistical Parametric Speech Synthesis. IEEE Signal Process. Lett. 23(3): 321-325 (2016) - [j14]Zhizheng Wu, Phillip L. De Leon
, Cenk Demiroglu, Ali Khodabakhsh
, Simon King, Zhen-Hua Ling, Daisuke Saito, Bryan Stewart, Tomoki Toda
, Mirjam Wester, Junichi Yamagishi:
Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance. IEEE ACM Trans. Audio Speech Lang. Process. 24(4): 768-783 (2016) - [c85]Ling-Hui Chen, Yuan Jiang, Ming Zhou, Zhen-Hua Ling, Li-Rong Dai:
The USTC System for Blizzard Challenge 2016. Blizzard Challenge 2016 - [c84]Yu-Ping Ruan, Zhen-Hua Ling, Yu Hu:
Exploring Semantic Representation in Brain Activity Using Word Embeddings. EMNLP 2016: 669-679 - [c83]Xiang Yin, Zhen-Hua Ling, Ya-Jun Hu, Li-Rong Dai:
Modeling spectral envelopes using deep conditional restricted Boltzmann machines for statistical parametric speech synthesis. ICASSP 2016: 5125-5129 - [c82]Xin Wang
, Minghui Dong, Zhen-Hua Ling:
A full training framework of cross-stream dependence modelling for HMM-based singing voice synthesis. ICASSP 2016: 5165-5169 - [c81]Ya-Jun Hu, Zhen-Hua Ling, Li-Rong Dai:
Deep belief network-based post-filtering for statistical parametric speech synthesis. ICASSP 2016: 5510-5514 - [c80]Zhen-Hua Ling, Xiao-Hui Sun, Li-Rong Dai, Yu Hu:
Modulation spectrum compensation for HMM-based speech synthesis using line spectral pairs. ICASSP 2016: 5595-5599 - [c79]Qian Chen, Xiaodan Zhu, Zhen-Hua Ling, Si Wei, Hui Jiang:
Distraction-Based Neural Networks for Modeling Document. IJCAI 2016: 2754-2760 - [c78]Yu Gu, Zhen-Hua Ling, Li-Rong Dai:
Speech Bandwidth Extension Using Bottleneck Features and Deep Recurrent Neural Networks. INTERSPEECH 2016: 297-301 - [c77]Zheng-Chen Liu, Zhen-Hua Ling, Li-Rong Dai:
Articulatory-to-Acoustic Conversion with Cascaded Prediction of Spectral and Excitation Features Using Neural Networks. INTERSPEECH 2016: 1502-1506 - [c76]Ling-Hui Chen, Li-Juan Liu, Zhen-Hua Ling, Yuan Jiang, Li-Rong Dai:
The USTC System for Voice Conversion Challenge 2016: Neural Network Based Approaches for Spectrum, Aperiodicity and F0 Conversion. INTERSPEECH 2016: 1642-1646 - [c75]Junhua Liu, Zhen-Hua Ling, Si Wei, Guoping Hu, Li-Rong Dai:
Cluster-based senone selection for the efficient calculation of deep neural network acoustic models. ISCSLP 2016: 1-5 - [c74]Zhi-Ping Zhou, Zhen-Hua Ling:
DNN-based unit selection using frame-sized speech segments. ISCSLP 2016: 1-5 - [c73]Quan Liu, Wu Guo, Zhen-Hua Ling, Hui Jiang, Yu Hu:
Intra-Topic Variability Normalization based on Linear Projection for Topic Classification. HLT-NAACL 2016: 441-446 - [i6]Quan Liu, Zhen-Hua Ling, Hui Jiang, Yu Hu:
Part-of-Speech Relevance Weights for Learning Word Embeddings. CoRR abs/1603.07695 (2016) - [i5]Quan Liu, Hui Jiang, Zhen-Hua Ling, Si Wei, Yu Hu:
Probabilistic Reasoning via Deep Learning: Neural Association Models. CoRR abs/1603.07704 (2016) - [i4]Qian Chen, Xiaodan Zhu, Zhen-Hua Ling, Si Wei, Hui Jiang:
Enhancing and Combining Sequential and Tree LSTM for Natural Language Inference. CoRR abs/1609.06038 (2016) - [i3]Qian Chen, Xiaodan Zhu, Zhen-Hua Ling, Si Wei, Hui Jiang:
Distraction-Based Neural Networks for Document Summarization. CoRR abs/1610.08462 (2016) - [i2]Quan Liu, Hui Jiang, Zhen-Hua Ling, Xiaodan Zhu, Si Wei, Yu Hu:
Combing Context and Commonsense Knowledge Through Neural Networks for Solving Winograd Schema Problems. CoRR abs/1611.04146 (2016) - 2015
- [j13]Ming-Qi Cai, Zhen-Hua Ling, Li-Rong Dai:
Statistical parametric speech synthesis using a hidden trajectory model. Speech Commun. 72: 149-159 (2015) - [j12]Zhen-Hua Ling, Shiyin Kang, Heiga Zen
, Andrew W. Senior, Mike Schuster, Xiaojun Qian, Helen M. Meng, Li Deng:
Deep Learning for Acoustic Modeling in Parametric Speech Generation: A systematic review of existing techniques and future trends. IEEE Signal Process. Mag. 32(3): 35-52 (2015) - [j11]Ling-Hui Chen, Tuomo Raitio, Cassia Valentini-Botinhao, Zhen-Hua Ling, Junichi Yamagishi:
A Deep Generative Architecture for Postfiltering in Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 23(11): 2003-2014 (2015) - [c72]Quan Liu, Hui Jiang, Si Wei, Zhen-Hua Ling, Yu Hu:
Learning Semantic Word Embeddings based on Ordinal Knowledge Constraints. ACL (1) 2015: 1501-1511 - [c71]Ling-Hui Chen, Zhen-Hua Ling, Xian-Jun Xia, Yuan Jiang, Yi-Qing Zu, Run-Qiang Yan:
The USTC System for Blizzard Challenge 2015. Blizzard Challenge 2015 - [c70]Zheng-Chen Liu, Zhen-Hua Ling, Li-Rong Dai:
LIP movement generation using restricted Boltzmann machines for visual speech synthesis. ChinaSIP 2015: 606-610 - [c69]Li-Juan Liu, Ling-Hui Chen, Zhen-Hua Ling, Li-Rong Dai:
Spectral conversion using deep neural networks trained with multi-source speakers. ICASSP 2015: 4849-4853 - [c68]Yu Gu, Zhen-Hua Ling:
Restoring high frequency spectral envelopes using neural networks for speech bandwidth extension. IJCNN 2015: 1-8 - [c67]Qian Chen, Zhen-Hua Ling, Chen-Yu Yang, Li-Rong Dai:
Automatic phrase boundary labeling of speech synthesis database using context-dependent HMMs and n-gram prior distributions. INTERSPEECH 2015: 1581-1585 - [i1]Quan Liu, Wu Guo, Zhen-Hua Ling:
Integrate Document Ranking Information into Confidence Measure Calculation for Spoken Term Detection. CoRR abs/1509.01899 (2015) - 2014
- [j10]Chen-Yu Yang, Zhen-Hua Ling, Li-Rong Dai:
Unsupervised Prosodic Labeling of Speech Synthesis Databases Using Context-Dependent HMMs. IEICE Trans. Inf. Syst. 97-D(6): 1449-1460 (2014) - [j9]Xian-Jun Xia, Zhen-Hua Ling, Yuan Jiang, Li-Rong Dai:
HMM-based unit selection speech synthesis using log likelihood ratios derived from perceptual data. Speech Commun. 63: 27-37 (2014) - [j8]Ling-Hui Chen, Zhen-Hua Ling, Li-Juan Liu, Li-Rong Dai:
Voice conversion using deep neural networks with layer-wise generative training. IEEE ACM Trans. Audio Speech Lang. Process. 22(12): 1859-1872 (2014) - [c66]Ling-Hui Chen, Zhen-Hua Ling, Yi-Qing Zu, Run-Qiang Yan, Yuan Jiang, Xian-Jun Xia, Ying Wang:
The USTC System for Blizzard Challenge 2014. Blizzard Challenge 2014 - [c65]Xiang Yin, Zhen-Hua Ling, Li-Rong Dai:
Spectral modeling using neural autoregressive distribution estimators for statistical parametric speech synthesis. ICASSP 2014: 3824-3828 - [c64]Li-Juan Liu, Ling-Hui Chen, Zhen-Hua Ling, Li-Rong Dai:
Using bidirectional associative memories for joint spectral envelope modeling in voice conversion. ICASSP 2014: 7884-7888 - [c63]Ming-Qi Cai, Zhen-Hua Ling, Li-Rong Dai:
Formant-controlled speech synthesis using hidden trajectory model. INTERSPEECH 2014: 1529-1533 - [c62]Ling-Hui Chen, Tuomo Raitio, Cassia Valentini-Botinhao, Junichi Yamagishi, Zhen-Hua Ling:
DNN-based stochastic postfilter for HMM-based speech synthesis. INTERSPEECH 2014: 1954-1958 - [c61]Xiang Yin, Ming Lei, Yao Qian, Frank K. Soong, Lei He, Zhen-Hua Ling, Li-Rong Dai:
Modeling DCT parameterized F0 trajectory at intonation phrase level with DNN or decision tree. INTERSPEECH 2014: 2273-2277 - [c60]Ling-Hui Chen, Zhen-Hua Ling, Li-Rong Dai:
Voice conversion using generative trained deep neural networks with multiple frame spectral envelopes. INTERSPEECH 2014: 2313-2317 - [c59]Xin Wang, Zhen-Hua Ling, Li-Rong Dai:
Concept-to-speech generation by integrating syntagmatic features into HMM-based speech synthesis. INTERSPEECH 2014: 2942-2946 - [c58]Yu-Sheng Sun, Zhen-Hua Ling, Xiang Yin, Li-Rong Dai:
Integrating global variance of log power spectrum derived from LSPs into MGE training for HMM-based parametric speech synthesis. ISCSLP 2014: 201-205 - [c57]Li Gao, Zhen-Hua Ling, Ling-Hui Chen, Li-Rong Dai:
Improving F0 prediction using bidirectional associative memories and syllable-level F0 features for HMM-based Mandarin speech synthesis. ISCSLP 2014: 275-279 - 2013
- [j7]Zhen-Hua Ling, Korin Richmond
, Junichi Yamagishi:
Articulatory Control of HMM-Based Parametric Speech Synthesis Using Feature-Space-Switched Multiple Regression. IEEE Trans. Speech Audio Process. 21(1): 205-217 (2013) - [j6]Zhen-Hua Ling, Li Deng, Dong Yu:
Modeling Spectral Envelopes Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis. IEEE Trans. Speech Audio Process. 21(10): 2129-2139 (2013) - [c56]Ling-Hui Chen, Zhen-Hua Ling, Yuan Jiang, Yang Song, Xian-Jun Xia, Yi-Qing Zu, Run-Qiang Yan, Li-Rong Dai:
The USTC System for Blizzard Challenge 2013. Blizzard Challenge 2013 - [c55]Chen-Yu Yang, Zhen-Hua Ling, Li-Rong Dai:
Unsupervised prosodic phrase boundary labeling of Mandarin speech synthesis database using context-dependent HMM. ICASSP 2013: 6875-6879 - [c54]Zhen-Hua Ling, Li Deng, Dong Yu:
Modeling spectral envelopes using restricted Boltzmann machines for statistical parametric speech synthesis. ICASSP 2013: 7825-7829 - [c53]Korin Richmond, Zhen-Hua Ling, Junichi Yamagishi, Benigno Uria:
On the evaluation of inversion mapping performance in the acoustic domain. INTERSPEECH 2013: 1012-1016 - [c52]Ling-Hui Chen, Zhen-Hua Ling, Yan Song, Li-Rong Dai:
Joint spectral distribution modeling using restricted boltzmann machines for voice conversion. INTERSPEECH 2013: 3052-3056 - [c51]Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Thierry Dutoit:
Mage - reactive articulatory feature control of HMM-based parametric speech synthesis. SSW 2013: 207-211 - [c50]Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Thierry Dutoit:
Mage - HMM-based speech synthesis reactively controlled by the articulators. SSW 2013: 243 - 2012
- [j5]Zhen-Hua Ling, Li-Rong Dai:
Minimum Kullback-Leibler Divergence Parameter Generation for HMM-Based Speech Synthesis. IEEE Trans. Speech Audio Process. 20(5): 1492-1502 (2012) - [c49]Zhen-Hua Ling, Xian-Jun Xia, Yang Song, Chen-Yu Yang, Ling-Hui Chen, Li-Rong Dai:
The USTC System for Blizzard Challenge 2012. Blizzard Challenge 2012 - [c48]Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi:
Vowel Creation by Articulatory Control in HMM-based Parametric Speech Synthesis. INTERSPEECH 2012: 991-994 - [c47]Xiang Yin, Zhen-Hua Ling, Ming Lei, Li-Rong Dai:
Considering Global Variance of the Log Power Spectrum Derived from Mel-Cepstrum in HMM-based Parametric Speech Synthesis. INTERSPEECH 2012: 1147-1150 - [c46]Xin Wang
, Zhen-Hua Ling, Li-Rong Dai:
Cross-stream dependency modeling using continuous F0 model for HMM-based speech synthesis. ISCSLP 2012: 84-87 - [c45]Xian-Jun Xia, Zhen-Hua Ling, Chen-Yu Yang, Li-Rong Dai:
Improved unit selection speech synthesis method utilizing subjective evaluation results on synthetic speech. ISCSLP 2012: 160-164 - 2011
- [c44]Ling-Hui Chen, Chen-Yu Yang, Zhen-Hua Ling, Yuan Jiang, Li-Rong Dai, Yu Hu, Ren-Hua Wang:
The USTC System for Blizzard Challenge 2011. Blizzard Challenge 2011 - [c43]Ming Lei, Zhen-Hua Ling, Li-Rong Dai:
Preserve ordering property of generated LSPS for minimum generation error training in HMM-based speech synthesis. ICASSP 2011: 4712-4715 - [c42]Ling-Hui Chen, Zhen-Hua Ling, Li-Rong Dai:
Non-parallel training for voice conversion based on FT-GMM. ICASSP 2011: 5116-5119 - [c41]Heng Lu, Zhen-Hua Ling, Li-Rong Dai, Ren-Hua Wang:
Building HMM based unit-selection speech synthesis system using synthetic speech naturalness evaluation score. ICASSP 2011: 5352-5355 - [c40]Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi:
Feature-Space Transform Tying in Unified Acoustic-Articulatory Modelling for Articulatory Control of HMM-Based Speech Synthesis. INTERSPEECH 2011: 117-120 - [c39]Ling-Hui Chen, Yoshihiko Nankaku, Heiga Zen, Keiichi Tokuda, Zhen-Hua Ling, Li-Rong Dai:
Estimation of Window Coefficients for Dynamic Feature Extraction for HMM-Based Speech Synthesis. INTERSPEECH 2011: 1801-1804 - [c38]Ming Lei, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Li-Rong Dai:
Formant-Controlled HMM-Based Speech Synthesis. INTERSPEECH 2011: 2777-2780 - 2010
- [j4]Heng Lu, Zhen-Hua Ling, Li-Rong Dai, Ren-Hua Wang:
Cross-Validation and Minimum Generation Error based Decision Tree Pruning for HMM-based Speech Synthesis. Int. J. Comput. Linguistics Chin. Lang. Process. 15(1) (2010) - [j3]Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi:
An Analysis of HMM-based prediction of articulatory movements. Speech Commun. 52(10): 834-846 (2010) - [c37]Yuan Jiang, Zhen-Hua Ling, Ming Lei, Cheng-Cheng Wang, Heng Lu, Yu Hu, Li-Rong Dai, Ren-Hua Wang:
The USTC System for Blizzard Challenge 2010. Blizzard Challenge 2010 - [c36]Ming Lei, Zhen-Hua Ling, Li-Rong Dai:
Minimum generation error training with weighted Euclidean distance on LSP for HMM-based speech synthesis. ICASSP 2010: 4230-4233 - [c35]Heng Lu, Zhen-Hua Ling, Si Wei, Li-Rong Dai, Ren-Hua Wang:
Automatic error detection for unit selection speech synthesis using log likelihood ratio based SVM classifier. INTERSPEECH 2010: 162-165 - [c34]Zhen-Hua Ling, Yu Hu, Li-Rong Dai:
Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis. INTERSPEECH 2010: 825-828 - [c33]Ming Lei, Yi-Jian Wu, Frank K. Soong, Zhen-Hua Ling, Li-Rong Dai:
A hierarchical F0 modeling method for HMM-based speech synthesis. INTERSPEECH 2010: 2170-2173 - [c32]Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi:
HMM-based text-to-articulatory-movement prediction and analysis of critical articulators. INTERSPEECH 2010: 2194-2197 - [c31]Tian-Yi Zhao, Zhen-Hua Ling, Ming Lei, Li-Rong Dai, Qingfeng Liu:
Minimum generation error training for HMM-based prediction of articulatory movements. ISCSLP 2010: 99-102 - [c30]Zhen-Hua Ling, Zhiguo Wang, Li-Rong Dai:
Statistical modeling of syllable-level F0 features for HMM-based unit selection speech synthesis. ISCSLP 2010: 144-147 - [c29]Ling-Hui Chen, Zhen-Hua Ling, Wu Guo, Li-Rong Dai:
GMM-based voice conversion with explicit modelling on feature transform. ISCSLP 2010: 364-368 - [c28]Chen-Yu Yang, Zhen-Hua Ling, Heng Lu, Wu Guo, Li-Rong Dai:
Automatic phrase boundary labeling for Mandarin TTS corpus using context-dependent HMM. ISCSLP 2010: 374-377
2000 – 2009
- 2009
- [j2]Zhen-Hua Ling, Korin Richmond
, Junichi Yamagishi, Ren-Hua Wang:
Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis. IEEE Trans. Speech Audio Process. 17(6): 1171-1185 (2009) - [j1]Junichi Yamagishi, Takashi Nose, Heiga Zen
, Zhen-Hua Ling, Tomoki Toda
, Keiichi Tokuda, Simon King, Steve Renals:
Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis. IEEE Trans. Speech Audio Process. 17(6): 1208-1230 (2009) - [c27]Heng Lu, Zhen-Hua Ling, Ming Lei, Cheng-Cheng Wang, Huan-huan Zhao, Ling-Hui Chen, Yu Hu, Li-Rong Dai, Ren-Hua Wang:
The USTC System for Blizzard Challenge 2009. Blizzard Challenge 2009 - [c26]Cheng-Cheng Wang, Zhen-Hua Ling, Li-Rong Dai:
Asynchronous F0 and spectrum modeling for HMM-based speech synthesis. INTERSPEECH 2009: 404-407 - 2008
- [c25]Zhen-Hua Ling, Heng Lu, Guoping Hu, Li-Rong Dai, Ren-Hua Wang:
The USTC System for Blizzard Challenge 2008. Blizzard Challenge 2008 - [c24]Zhen-Hua Ling, Ren-Hua Wang:
Minimum unit selection error training for HMM-based unit selection speech synthesis system. ICASSP 2008: 3949-3952 - [c23]Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang, Li-Rong Dai:
Minumum generation error linear regression based model adaptation for HMM-based speech synthesis. ICASSP 2008: 3953-3956 - [c22]Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang, Li-Rong Dai:
Minimum generation error criterion considering global/local variance for HMM-based speech synthesis. ICASSP 2008: 4621-4624 - [c21]Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi, Ren-Hua Wang:
Articulatory control of HMM-based parametric speech synthesis driven by phonetic knowledge. INTERSPEECH 2008: 573-576 - [c20]Junichi Yamagishi, Zhen-Hua Ling, Simon King:
Robustness of HMM-based speech synthesis. INTERSPEECH 2008: 581-584 - [c19]Zhen-Hua Ling, Wei Zhang, Ren-Hua Wang:
Cross-Stream Dependency Modeling for HMM-Based Speech Synthesis. ISCSLP 2008: 5-8 - [c18]Cheng-Cheng Wang, Zhen-Hua Ling, Bu-Fan Zhang, Li-Rong Dai:
Multi-Layer F0 Modeling for HMM-Based Speech Synthesis. ISCSLP 2008: 129-132 - [c17]Heng Lu, Zhen-Hua Ling, Si Wei, Yu Hu, Li-Rong Dai, Ren-Hua Wang:
Heteronym Verification for Mandarin Speech Synthesis. ISCSLP 2008: 137-140 - [c16]Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang:
Model Adaptation for HMM-Based Speech Synthesis under Minimum Generation Error Criterion. ISM 2008: 539-544 - 2007
- [c15]Zhen-Hua Ling, Long Qin, Heng Lu, Yu Gao, Li-Rong Dai, Ren-Hua Wang, Yuan Jiang, Zhi-Wei Zhao, Jin-Hui Yang, Jie Chen, Guo-Ping Hu:
The USTC and iflytek speech synthesis systems for Blizzard Challenge 2007. Blizzard Challenge 2007 - [c14]Zhen-Hua Ling, Ren-Hua Wang:
HMM-Based Hierarchical Unit Selection Combining Kullback-Leibler Divergence with Likelihood Criterion. ICASSP (4) 2007: 1245-1248 - 2006
- [c13]Zhen-Hua Ling, Yi-Jian Wu, Yu-Ping Wang, Long Qin, Ren-Hua Wang:
USTC System for Blizzard Challenge 2006 an Improved HMM-based Speech Synthesis Method. Blizzard Challenge 2006 - [c12]Zhen-Hua Ling, Ren-Hua Wang:
HMM-based unit selection using frame sized speech segments. INTERSPEECH 2006 - [c11]Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang:
Improving the performance of HMM-based voice conversion using context clustering decision tree and appropriate regression matrix format. INTERSPEECH 2006 - [c10]Long Qin, Zhen-Hua Ling, Yi-Jian Wu, Bu-Fan Zhang, Ren-Hua Wang:
HMM-Based Emotional Speech Synthesis Using Average Emotion Model. ISCSLP (Selected Papers) 2006: 233-240 - [c9]Bu-Fan Zhang, Zhenhua Ling, Long Qin, Ren-Hua Wang:
Applying SFC Model for Chinese Expressive Speech Synthesis. ISCSLP 2006 - 2005
- [c8]Yu-Ping Wang, Zhen-Hua Ling, Ren-Hua Wang:
Emotional Speech Synthesis Based on Improved Codebook Mapping Voice Conversion. ACII 2005: 374-381 - [c7]Zhen-Hua Ling, Yu Hu, Ren-Hua Wang:
A Novel Source Analysis Method by Matching Spectral Characters of LF Model with STRAIGHT Spectrum. ACII 2005: 441-448 - [c6]Long Qin, Gao Peng Chen, Zhen-Hua Ling, Li-Rong Dai:
An Improved Spectral and Prosodic Transformation Method in STRAIGHT-based Voice Conversion. ICASSP (1) 2005: 21-24 - 2004
- [c5]Zixiang Wang, Ren-Hua Wang, Zhiwei Shuang, Zhen-Hua Ling:
A novel voice conversion system based on codebook mapping with phoneme-tied weighting. INTERSPEECH 2004: 1197-1200 - [c4]Zhen-Hua Ling, Yu Hu, Zhiwei Shuang, Ren-Hua Wang:
Compression of speech database by feature separation and pattern clustering using STRAIGHT. INTERSPEECH 2004: 1201-1204 - [c3]Zhen-Hua Ling, Yu-Ping Wang, Yu Hu, Ren-Hua Wang:
Modeling glottal effect on the spectral envelop of STRAIGHT using mixture of Gaussians. ISCSLP 2004: 73-76 - 2002
- [c2]Zhiwei Shuang, Yu Hu, Zhen-Hua Ling, Ren-Hua Wang:
A miniature Chinese TTS system based on tailored corpus. INTERSPEECH 2002: 2389-2392 - [c1]Zhen-Hua Ling, Yu Hu, Zhiwei Shuang, Ren-Hua Wang:
Decision tree based unit pre-selection in Mandarin Chinese synthesis. ISCSLP 2002
Coauthor Index
aka: Ling-Hui Chen
aka: Yexin Lu
aka: Ren-Hua Wang

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-02-28 22:00 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint