default search action
Bin Ma 0001
Person information
- affiliation: Alibaba Group, Speech Lab, Singapore
- affiliation: Nanyang Technological University, School of Computer Science and Engineering, Singapore
- affiliation (since 2004): Institute for Infocomm Research, A*STAR, Singapore
- affiliation (PhD 2000): University of Hong Kong, Hong Kong
Other persons with the same name
- Bin Ma — disambiguation page
- Bin Ma 0002 — University of Waterloo, School of Computer Science, ON, Canada (and 2 more)
- Bin Ma 0003 — Qilu University of Technology, School of Cyber Security, Jinan, China (and 3 more)
- Bin Ma 0004 — Beihang University, School of Computer Science and Engineering, State Key Laboratory of Virtual Reality Technology and Systems, Beijing, China
- Bin Ma 0005 — Chongqing University of Posts and Telecommunications, Key Laboratory of Computer Network and Communications Technology, China (and 1 more)
- Bin Ma 0006 — Singapore Institute of Manufacturing Technology, Singapore
- Bin Ma 0007 — Shenyang Jianzhu University, Faculty of Information and Control Engineering, China
- Bin Ma 0008 — Beihang University, School of Transportation Science and Engineering, Beijing, China
- Bin Ma 0009 — Central South University of Forestry & Technology, College of Computer Science, Changsha, China
- Bin Ma 0010 — HiSilicon Technologies Company Ltd., Shanghai, China (and 1 more)
- Bin Ma 0011 — Soochow University, School of Computer Science and Technology, Suzhou City, China (and 1 more)
- Bin Ma 0012 — Chinese Academy and Sciences, Institute of Automation, Beijing, China
- Bin Ma 0013 — University of Manitoba, Department of Mechanical and Industrial Engineering, Winnipeg, Canada
- Bin Ma 0014 — Beijing University of Aeronautics and Astronautics, Robotics Institute, Beijing, China
- Bin Ma 0015 — University of Maryland, Department of Geography, College Park, MD, USA
- Bin Ma 0016 — Xidian University, Xi'an, School of Life Sciences and Technology, China
- Bin Ma 0017 — ABB Corporate Research Center, Västerås, Sweden
- Bin Ma 0018 — Xi'an University of Technology, State Key Laboratory Base of Eco-hydraulic Engineering in Arid Area, China
- Bin Ma 0019 — Chinese Academy of Sciences, Institute of Information Engineering, Beijing, China
- Bin Ma 0020 — Chongqing Industrial and Commercial University, Mechanical Engineering College, China
- Bin Ma 0021 — China Ship Development and Design Center, Wuhan, China
- Bin Ma 0022 — Chinese Academy of Sciences, National Astronomical Observatories, Beijing, China
- Bin Ma 0023 — Chinese Academy of Sciences, Shenzhen Institutes of Advanced Technology, China
- Bin Ma 0024 — CuraCloud, Seattle, WA, USA
- Bin Ma 0025 — University of Southern California, Department of Electrical and Computer Engineering, Los Angeles, CA, USA
- Bin Ma 0026 — State Grid Henan Electric Power Company Research Institute, China
- Bin Ma 0027 — Tianjin University, School of Architecture, China (and 1 more)
- Bin Ma 0028 — Meituan Inc., Beijing, China
- Bin Ma 0029 — Beijing MedPeer Information Technology Co. Ltd., China
- Bin Ma 0030 — Shanghai University, School of Mechatronic Engineering and Automation, China
- Bin Ma 0031 — Dalian Jiaotong University, School of Computer and Communication Engineering, China
- Bin Ma 0032 — Xi'an Jiaotong University, Institute of Artificial Intelligence and Robotics, China
Other persons with a similar name
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j28]Yukun Ma, Chong Zhang, Qian Chen, Wen Wang, Bin Ma:
Tuning Large Language Model for Speech Recognition With Mixed-Scale Re-Tokenization. IEEE Signal Process. Lett. 31: 1740-1744 (2024) - [c229]Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma:
SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance. ICASSP 2024: 326-330 - [c228]Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Jia Qi Yip, Dianwen Ng, Bin Ma:
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation. ICASSP 2024: 10356-10360 - [c227]Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
Are Soft Prompts Good Zero-Shot Learners for Speech Recognition? ICASSP 2024: 10366-10370 - [c226]Kun Zhou, Berrak Sisman, Carlos Busso, Bin Ma, Haizhou Li:
Mixed-EVC: Mixed Emotion Synthesis and Control in Voice Conversion. Odyssey 2024: 180-186 - [i32]Kun Zhou, Shengkui Zhao, Yukun Ma, Chong Zhang, Hao Wang, Dianwen Ng, Chongjia Ni, Trung Hieu Nguyen, Jia Qi Yip, Bin Ma:
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis. CoRR abs/2406.02009 (2024) - [i31]Jia Qi Yip, Shengkui Zhao, Dianwen Ng, Eng Siong Chng, Bin Ma:
Towards Audio Codec-based Speech Separation. CoRR abs/2406.12434 (2024) - [i30]Keyu An, Qian Chen, Chong Deng, Zhihao Du, Changfeng Gao, Zhifu Gao, Yue Gu, Ting He, Hangrui Hu, Kai Hu, Shengpeng Ji, Yabin Li, Zerui Li, Heng Lu, Haoneng Luo, Xiang Lv, Bin Ma, Ziyang Ma, Chongjia Ni, Changhe Song, Jiaqi Shi, Xian Shi, Hao Wang, Wen Wang, Yuxuan Wang, Zhangyu Xiao, Zhijie Yan, Yexin Yang, Bin Zhang, Qinglin Zhang, Shiliang Zhang, Nan Zhao, Siqi Zheng:
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs. CoRR abs/2407.04051 (2024) - [i29]Kun Zhou, You Zhang, Shengkui Zhao, Hao Wang, Zexu Pan, Dianwen Ng, Chong Zhang, Chongjia Ni, Yukun Ma, Trung Hieu Nguyen, Jia Qi Yip, Bin Ma:
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions. CoRR abs/2409.16681 (2024) - 2023
- [c225]Jia Qi Yip, Dianwen Ng, Bin Ma, Chng Eng Siong:
Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures. APSIPA ASC 2023: 2002-2007 - [c224]Yukun Ma, Trung Hieu Nguyen, Jinjie Ni, Wen Wang, Qian Chen, Chong Zhang, Bin Ma:
Auxiliary Pooling Layer For Spoken Language Understanding. ICASSP 2023: 1-5 - [c223]Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma:
De'hubert: Disentangling Noise in a Self-Supervised Model for Robust Speech Recognition. ICASSP 2023: 1-5 - [c222]Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Eng Siong Chng, Bin Ma:
Contrastive Speech Mixup for Low-Resource Keyword Spotting. ICASSP 2023: 1-5 - [c221]Jinjie Ni, Yukun Ma, Wen Wang, Qian Chen, Dianwen Ng, Han Lei, Trung Hieu Nguyen, Chong Zhang, Bin Ma, Erik Cambria:
Adaptive Knowledge Distillation Between Text and Speech Pre-Trained Models. ICASSP 2023: 1-5 - [c220]Shengkui Zhao, Bin Ma:
D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network Using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement. ICASSP 2023: 1-5 - [c219]Shengkui Zhao, Bin Ma:
MossFormer: Pushing the Performance Limit of Monaural Speech Separation Using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions. ICASSP 2023: 1-5 - [c218]Zhao Yang, Dianwen Ng, Chong Zhang, Xiao Fu, Rui Jiang, Wei Xi, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma, Jizhong Zhao:
Dual Acoustic Linguistic Self-supervised Representation Learning for Cross-Domain Speech Recognition. INTERSPEECH 2023: 72-76 - [c217]Dianwen Ng, Yang Xiao, Jia Qi Yip, Zhao Yang, Biao Tian, Qiang Fu, Eng Siong Chng, Bin Ma:
Small Footprint Multi-channel Network for Keyword Spotting with Centroid Based Awareness. INTERSPEECH 2023: 296-300 - [c216]Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Qian Chen, Wen Wang, Eng Siong Chng, Bin Ma:
Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition. INTERSPEECH 2023: 1319-1323 - [c215]Jia Qi Yip, Duc-Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention. INTERSPEECH 2023: 1938-1942 - [c214]Zhao Yang, Dianwen Ng, Xizhe Li, Chong Zhang, Rui Jiang, Wei Xi, Yukun Ma, Chongjia Ni, Jizhong Zhao, Bin Ma, Eng Siong Chng:
Dual-Memory Multi-Modal Learning for Continual Spoken Keyword Spotting with Confidence Selection and Diversity Enhancement. INTERSPEECH 2023: 3774-3778 - [c213]Zhao Yang, Dianwen Ng, Chong Zhang, Rui Jiang, Wei Xi, Yukun Ma, Chongjia Ni, Jizhong Zhao, Bin Ma, Eng Siong Chng:
A Unified Recognition and Correction Model under Noisy and Accent Speech Conditions. INTERSPEECH 2023: 4953-4957 - [i28]Shengkui Zhao, Bin Ma:
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions. CoRR abs/2302.11824 (2023) - [i27]Shengkui Zhao, Bin Ma:
D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement. CoRR abs/2302.11832 (2023) - [i26]Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma:
deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition. CoRR abs/2302.14597 (2023) - [i25]Jinjie Ni, Yukun Ma, Wen Wang, Qian Chen, Dianwen Ng, Han Lei, Trung Hieu Nguyen, Chong Zhang, Bin Ma, Erik Cambria:
Adaptive Knowledge Distillation between Text and Speech Pre-trained Models. CoRR abs/2303.03600 (2023) - [i24]Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Eng Siong Chng, Bin Ma:
Contrastive Speech Mixup for Low-resource Keyword Spotting. CoRR abs/2305.01170 (2023) - [i23]Jia Qi Yip, Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention. CoRR abs/2305.12121 (2023) - [i22]Jia Qi Yip, Dianwen Ng, Bin Ma, Chng Eng Siong:
Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures. CoRR abs/2309.07458 (2023) - [i21]Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
Are Soft Prompts Good Zero-shot Learners for Speech Recognition? CoRR abs/2309.09413 (2023) - [i20]Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma:
SPGM: Prioritizing Local Features for enhanced speech separation performance. CoRR abs/2309.12608 (2023) - [i19]Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Jia Qi Yip, Dianwen Ng, Bin Ma:
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation. CoRR abs/2312.11825 (2023) - 2022
- [c212]Karn N. Watcharasupat, Thi Ngoc Tho Nguyen, Woon-Seng Gan, Shengkui Zhao, Bin Ma:
End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression. ICASSP 2022: 656-660 - [c211]Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
M2Met: The Icassp 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. ICASSP 2022: 6167-6171 - [c210]Yukun Ma, Trung Hieu Nguyen, Bin Ma:
CPT: Cross-Modal Prefix-Tuning for Speech-To-Text Translation. ICASSP 2022: 6217-6221 - [c209]Yukun Ma, Bin Ma:
Multimodal Sentiment Analysis on Unaligned Sequences Via Holographic Embedding. ICASSP 2022: 8547-8551 - [c208]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160 - [c207]Shengkui Zhao, Bin Ma, Karn N. Watcharasupat, Woon-Seng Gan:
FRCRN: Boosting Feature Representation Using Frequency Recurrence for Monaural Speech Enhancement. ICASSP 2022: 9281-9285 - [c206]Mingyuan Cheng, Xinru Liao, Quan Liu, Bin Ma, Jian Xu, Bo Zheng:
Learning Disentangled Representations for Counterfactual Regression via Mutual Information Minimization. SIGIR 2022: 1802-1806 - [i18]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022) - [i17]Mingyuan Cheng, Xinru Liao, Quan Liu, Bin Ma, Jian Xu, Bo Zheng:
Learning Disentangled Representations for Counterfactual Regression via Mutual Information Minimization. CoRR abs/2206.01022 (2022) - [i16]Shengkui Zhao, Bin Ma, Karn N. Watcharasupat, Woon-Seng Gan:
FRCRN: Boosting Feature Representation using Frequency Recurrence for Monaural Speech Enhancement. CoRR abs/2206.07293 (2022) - [i15]Dianwen Ng, Jia Qi Yip, Tanmay Surana, Zhao Yang, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma:
I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization. CoRR abs/2209.06360 (2022) - [i14]Lei Wang, Rong Tong, Cheung Chi Leung, Sunil Sivadas, Chongjia Ni, Bin Ma:
Cloud-based Automatic Speech Recognition Systems for Southeast Asian Languages. CoRR abs/2210.03580 (2022) - 2021
- [c205]Zongtao Liu, Bin Ma, Quan Liu, Jian Xu, Bo Zheng:
Heterogeneous Graph Neural Networks for Large-Scale Bid Keyword Matching. CIKM 2021: 3976-3985 - [c204]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
A Unified Speaker Adaptation Approach for ASR. EMNLP (1) 2021: 9339-9349 - [c203]Shengkui Zhao, Hao Wang, Trung Hieu Nguyen, Bin Ma:
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram. ICASSP 2021: 5969-5973 - [c202]Shengkui Zhao, Trung Hieu Nguyen, Bin Ma:
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses. ICASSP 2021: 6648-6652 - [c201]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Preventing Early Endpointing for Online Automatic Speech Recognition. ICASSP 2021: 6813-6817 - [c200]Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. ISCSLP 2021: 1-5 - [i13]Shengkui Zhao, Hao Wang, Trung Hieu Nguyen, Bin Ma:
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram. CoRR abs/2102.01991 (2021) - [i12]Shengkui Zhao, Trung Hieu Nguyen, Bin Ma:
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses. CoRR abs/2102.01993 (2021) - [i11]Karn N. Watcharasupat, Thi Ngoc Tho Nguyen, Woon-Seng Gan, Shengkui Zhao, Bin Ma:
End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression. CoRR abs/2110.00745 (2021) - [i10]Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. CoRR abs/2110.07393 (2021) - [i9]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
A Unified Speaker Adaptation Approach for ASR. CoRR abs/2110.08545 (2021) - [i8]Zongtao Liu, Bin Ma, Quan Liu, Jian Xu, Bo Zheng:
Heterogeneous Graph Neural Networks for Large-Scale Bid Keyword Matching. CoRR abs/2111.00926 (2021) - 2020
- [j27]Yougen Yuan, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Bin Ma:
Fast Query-by-Example Speech Search Using Attention-Based Deep Binary Embeddings. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1988-2000 (2020) - [c199]Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent Language Modeling Architecture for End-To-End ASR. ICASSP 2020: 7059-7063 - [c198]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Speech Transformer with Speaker Aware Persistent Memory. INTERSPEECH 2020: 1261-1265 - [c197]Shengkui Zhao, Trung Hieu Nguyen, Hao Wang, Bin Ma:
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion. INTERSPEECH 2020: 2927-2931 - [c196]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Universal Speech Transformer. INTERSPEECH 2020: 5021-5025 - [c195]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Cross Attention with Monotonic Alignment for Speech Transformer. INTERSPEECH 2020: 5031-5035 - [i7]Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. CoRR abs/2005.10407 (2020) - [i6]Shengkui Zhao, Trung Hieu Nguyen, Hao Wang, Bin Ma:
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion. CoRR abs/2010.08136 (2020)
2010 – 2019
- 2019
- [j26]Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma:
Query-by-Example Speech Search Using Recurrent Neural Acoustic Word Embeddings With Temporal Context. IEEE Access 7: 67656-67665 (2019) - [c194]Shiliang Zhang, Ming Lei, Bin Ma, Lei Xie:
Robust Audio-visual Speech Recognition Using Bimodal Dfsmn with Multi-condition Training and Dropout Regularization. ICASSP 2019: 6570-6574 - [c193]Shengkui Zhao, Trung Hieu Nguyen, Hao Wang, Bin Ma:
Fast Learning for Non-Parallel Many-to-Many Voice Conversion with Residual Star Generative Adversarial Networks. INTERSPEECH 2019: 689-693 - [c192]Shengkui Zhao, Chongjia Ni, Rong Tong, Bin Ma:
Multi-Task Multi-Network Joint-Learning of Deep Residual Networks and Cycle-Consistency Generative Adversarial Networks for Robust Speech Recognition. INTERSPEECH 2019: 1238-1242 - [c191]Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. INTERSPEECH 2019: 2160-2164 - [c190]Shiliang Zhang, Yuan Liu, Ming Lei, Bin Ma, Lei Xie:
Towards Language-Universal Mandarin-English Speech Recognition. INTERSPEECH 2019: 2170-2174 - [i5]Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. CoRR abs/1904.03802 (2019) - [i4]Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent language modeling architecture for end-to-end ASR. CoRR abs/1912.00863 (2019) - 2018
- [c189]Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li:
Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search. INTERSPEECH 2018: 97-101 - [c188]Nguyen Bach, Hongjie Chen, Kai Fan, Cheung-Chi Leung, Bo Li, Chongjia Ni, Rong Tong, Pei Zhang, Boxing Chen, Bin Ma, Fei Huang:
Alibaba Speech Translation Systems for IWSLT 2018. IWSLT 2018: 136-141 - [i3]Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li:
Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search. CoRR abs/1806.03621 (2018) - 2017
- [j25]Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Multitask Feature Learning for Low-Resource Query-by-Example Spoken Term Detection. IEEE J. Sel. Top. Signal Process. 11(8): 1329-1339 (2017) - [j24]Chang Huai You, Bin Ma:
Spectral-domain speech enhancement for speech recognition. Speech Commun. 94: 30-41 (2017) - [j23]Hongjie Chen, Lei Xie, Cheung-Chi Leung, Xiaoming Lu, Bin Ma, Haizhou Li:
Modeling Latent Topics and Temporal Distance for Story Segmentation of Broadcast News. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 108-119 (2017) - [c187]Nancy F. Chen, Boon Pang Lim, Van Hai Do, Van Tung Pham, Chongjia Ni, Haihua Xu, Mark Hasegawa-Johnson, Wenda Chen, Xiong Xiao, Sunil Sivadas, Eng Siong Chng, Bin Ma, Haizhou Li:
Low-resource spoken keyword search strategies in georgian inspired by distinctive feature theory. APSIPA 2017: 1322-1327 - [c186]Tin Lay Nwe, Tran Huy Dat, Bin Ma:
Convolutional neural network with multi-task learning scheme for acoustic scene classification. APSIPA 2017: 1347-1350 - [c185]Hanwu Sun, Kong-Aik Lee, Trung Hieu Nguyen, Bin Ma, Haizhou Li:
I2R-NUS submission to oriental language recognition AP16-OL7 challenge. APSIPA 2017: 1574-1578 - [c184]Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Multilingual bottle-neck feature learning from untranscribed speech. ASRU 2017: 727-733 - [c183]Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li:
Extracting bottleneck features and word-like pairs from untranscribed speech for feature representation. ASRU 2017: 734-739 - [c182]Rong Tong, Lei Wang, Bin Ma:
Transfer learning for children's speech recognition. IALP 2017: 36-39 - [c181]Nana Hou, Xiaohai Tian, Eng Siong Chng, Bin Ma, Haizhou Li:
Improving air traffic control speech intelligibility by reducing speaking rate effectively. IALP 2017: 197-200 - [c180]Liping Chen, Kong-Aik Lee, Bin Ma, Long Ma, Haizhou Li, Li-Rong Dai:
Adaptation of PLDA for multi-source text-independent speaker verification. ICASSP 2017: 5380-5384 - [c179]Chang Huai You, Bin Ma, Chongjia Ni:
Modification on LSA speech enhancement for speech recognition. ICASSP 2017: 5475-5479 - [c178]Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li:
Pairwise learning using multi-lingual bottleneck features for low-resource query-by-example spoken term detection. ICASSP 2017: 5645-5649 - [c177]Chongjia Ni, Cheung-Chi Leung, Lei Wang, Nancy F. Chen, Bin Ma:
Efficient methods to train multilingual bottleneck feature extractors for low resource keyword search. ICASSP 2017: 5650-5654 - [c176]Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, Chunlei Zhang, Andreas Nautsch, Themos Stafylakis, Gang Liu, Mickaël Rouvier, Wei Rao, Federico Alegre, J. Ma, Man-Wai Mak, Achintya Kumar Sarkar, Héctor Delgado, Rahim Saeidi, Hagai Aronowitz, Aleksandr Sizov, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Bin Ma, Ville Vestman, Md. Sahidullah, M. Halonen, Anssi Kanervisto, Gaël Le Lan, Fahimeh Bahmaninezhad, Sergey Isadskiy, Christian Rathgeb, Christoph Busch, Georgios Tzimiropoulos, Q. Qian, Z. Wang, Q. Zhao, T. Wang, H. Li, J. Xue, S. Zhu, R. Jin, T. Zhao, Pierre-Michel Bousquet, Moez Ajili, Waad Ben Kheder, Driss Matrouf, Zhi Hao Lim, Chenglin Xu, Haihua Xu, Xiong Xiao, Eng Siong Chng, Benoit G. B. Fauve, Kaavya Sriskandaraja, Vidhyasaharan Sethu, W. W. Lin, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Massimiliano Todisco, Nicholas W. D. Evans, Haizhou Li, John H. L. Hansen, Jean-François Bonastre, Eliathamby Ambikairajah:
The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016. INTERSPEECH 2017: 1328-1332 - [c175]Rong Tong, Nancy F. Chen, Bin Ma:
Multi-Task Learning for Mispronunciation Detection on Singapore Children's Mandarin Speech. INTERSPEECH 2017: 2193-2197 - [c174]Tin Lay Nwe, Tran Huy Dat, Wen Zheng Terence Ng, Bin Ma:
An Integrated Solution for Snoring Sound Classification Using Bhattacharyya Distance Based GMM Supervectors with SVM, Feature Selection with Random Forest and Spectrogram with CNN. INTERSPEECH 2017: 3467-3471 - [c173]Kai Chen, Tongxin Li, Bin Ma, Peng Wang, XiaoFeng Wang, Peiyuan Zong:
Filtering for Malice Through the Data Ocean: Large-Scale PHA Install Detection at the Communication Service Provider Level. RAID 2017: 167-191 - 2016
- [j22]Nancy F. Chen, Darren Wee, Rong Tong, Bin Ma, Haizhou Li:
Large-scale characterization of non-native Mandarin Chinese spoken by speakers of European origin: Analysis on iCALL. Speech Commun. 84: 46-56 (2016) - [j21]Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Exploration of Local Variability in Text-Independent Speaker Verification. J. Signal Process. Syst. 82(2): 217-228 (2016) - [c172]Maofan Yin, Sunil Sivadas, Kai Yu, Bin Ma:
Discriminatively trained joint speaker and environment representations for adaptation of deep neural network acoustic models. ICASSP 2016: 5065-5069 - [c171]Liping Chen, Kong-Aik Lee, Eng Siong Chng, Bin Ma, Haizhou Li, Li-Rong Dai:
Content-aware local variability vector for speaker verification with short utterance. ICASSP 2016: 5485-5489 - [c170]Chongjia Ni, Cheung-Chi Leung, Lei Wang, Haibo Liu, Feng Rao, Li Lu, Nancy F. Chen, Bin Ma, Haizhou Li:
Cross-lingual deep neural network based submodular unbiased data selection for low-resource keyword search. ICASSP 2016: 6015-6019 - [c169]Haihua Xu, Jingyong Hou, Xiong Xiao, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Van Hai Do, Hang Lv, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li:
Approximate search of audio queries by using DTW with phone time boundary and data augmentation. ICASSP 2016: 6030-6034 - [c168]Nancy F. Chen, Van Tung Pham, Haihua Xu, Xiong Xiao, Van Hai Do, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Chin-Hui Lee, Eng Siong Chng, Bin Ma, Haizhou Li:
Exemplar-inspired strategies for low-resource spoken keyword search in Swahili. ICASSP 2016: 6040-6044 - [c167]Guangsen Wang, Kong-Aik Lee, Trung Hieu Nguyen, Hanwu Sun, Bin Ma:
Joint Speaker and Lexical Modeling for Short-Term Characterization of Speaker. INTERSPEECH 2016: 415-419 - [c166]Yougen Yuan, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Learning Neural Network Representations Using Cross-Lingual Bottleneck Features with Word-Pair Information. INTERSPEECH 2016: 788-792 - [c165]Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection. INTERSPEECH 2016: 923-927 - [c164]Nancy F. Chen, Rong Tong, Darren Wee, Pei Xuan Lee, Bin Ma, Haizhou Li:
SingaKids-Mandarin: Speech Corpus of Singaporean Children Speaking Mandarin Chinese. INTERSPEECH 2016: 1545-1549 - [c163]Rong Tong, Nancy F. Chen, Bin Ma, Haizhou Li:
Context Aware Mispronunciation Detection for Mandarin Pronunciation Training. INTERSPEECH 2016: 3112-3116 - [c162]Kong-Aik Lee, Haizhou Li, Li Deng, Ville Hautamäki, Wei Rao, Xiong Xiao, Anthony Larcher, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Aleksandr Sizov, Jianshu Chen, Ivan Kukanov, Amir Hossein Poorjam, Trung Ngo Trong, Chenglin Xu, Haihua Xu, Bin Ma, Eng Siong Chng, Sylvain Meignier:
The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS. INTERSPEECH 2016: 3211-3215 - [c161]Chongjia Ni, Lei Wang, Cheung-Chi Leung, Feng Rao, Li Lu, Bin Ma, Haizhou Li:
Rapid Update of Multilingual Deep Neural Network for Low-Resource Keyword Search. INTERSPEECH 2016: 3698-3702 - [c160]Cheung-Chi Leung, Lei Wang, Haihua Xu, Jingyong Hou, Van Tung Pham, Hang Lv, Lei Xie, Xiong Xiao, Chongjia Ni, Bin Ma, Eng Siong Chng, Haizhou Li:
Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis. INTERSPEECH 2016: 3703-3707 - [c159]Lei Wang, Chongjia Ni, Cheung-Chi Leung, Changhuai You, Lei Xie, Haihua Xu, Xiong Xiao, Tin Lay Nwe, Eng Siong Chng, Bin Ma, Haizhou Li:
The NNI Vietnamese Speech Recognition System for MediaEval 2016. MediaEval 2016 - [c158]Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Kong-Aik Lee, Bin Ma, Haizhou Li:
I2R Submission to the 2015 NIST Language Recognition I-vector Challenge. Odyssey 2016: 311-318 - [i2]Kong-Aik Lee, Ville Hautamäki, Anthony Larcher, Wei Rao, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Aleksandr Sizov, Ivan Kukanov, Amir Hossein Poorjam, Trung Ngo Trong, Xiong Xiao, Chenglin Xu, Haihua Xu, Bin Ma, Haizhou Li, Sylvain Meignier:
Fantastic 4 system for NIST 2015 Language Recognition Evaluation. CoRR abs/1602.01929 (2016) - [i1]Zhenzhou Wu, Sunil Sivadas, Yong Kiam Tan, Bin Ma, Rick Siow Mong Goh:
Multi-Modal Hybrid Deep Neural Network for Speech Enhancement. CoRR abs/1606.04750 (2016) - 2015
- [j20]Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Acoustic Segment Modeling with Spectral Clustering Methods. IEEE ACM Trans. Audio Speech Lang. Process. 23(2): 264-277 (2015) - [c157]Hanwu Sun, Kong-Aik Lee, Bin Ma:
A new study of GMM-SVM system for text-dependent speaker recognition. ICASSP 2015: 4195-4199 - [c156]Chongjia Ni, Lei Wang, Haibo Liu, Cheung-Chi Leung, Li Lu, Bin Ma:
Submodular data selection with acoustic and phonetic features for automatic speech recognition. ICASSP 2015: 4629-4633 - [c155]Chongjia Ni, Cheung-Chi Leung, Lei Wang, Nancy F. Chen, Bin Ma:
Unsupervised data selection and word-morph mixed language model for tamil low-resource keyword search. ICASSP 2015: 4714-4718 - [c154]Haihua Xu, Peng Yang, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei Wang, Su Jun Leow, Bin Ma, Engsiong Chng, Haizhou Li:
Language independent query-by-example spoken term detection using N-best phone sequences and partial matching. ICASSP 2015: 5191-5195 - [c153]Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Channel adaptation of plda for text-independent speaker verification. ICASSP 2015: 5251-5255 - [c152]Rong Tong, Nancy F. Chen, Boon Pang Lim, Bin Ma, Haizhou Li:
Tokenizing fundamental frequency variation for Mandarin tone error detection. ICASSP 2015: 5361-5365 - [c151]Nancy F. Chen, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Van Tung Pham, Haihua Xu, Xiong Xiao, Tze Siong Lau, Su Jun Leow, Boon Pang Lim, Cheung-Chi Leung, Lei Wang, Chin-Hui Lee, Alvina Goh, Engsiong Chng, Bin Ma, Haizhou Li:
Low-resource keyword search strategies for tamil. ICASSP 2015: 5366-5370 - [c150]Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Phone-centric local variability vector for text-constrained speaker verification. INTERSPEECH 2015: 229-233 - [c149]Nancy F. Chen, Rong Tong, Darren Wee, Pei Xuan Lee, Bin Ma, Haizhou Li:
iCALL corpus: Mandarin Chinese spoken by non-native speakers of European descent. INTERSPEECH 2015: 324-328 - [c148]Pengfei Liu, Shoaib Jameel, Wai Lam, Bin Ma, Helen M. Meng:
Topic modeling for conference analytics. INTERSPEECH 2015: 707-711 - [c147]Rong Tong, Nancy F. Chen, Bin Ma, Haizhou Li:
Goodness of tone (GOT) for non-native Mandarin tone recognition. INTERSPEECH 2015: 801-805 - [c146]Kong-Aik Lee, Guangsen Wang, Kam Pheng Ng, Hanwu Sun, Trung Hieu Nguyen, Ngoc Thuy Huong Thai, Bin Ma, Haizhou Li:
The reddots platform for mobile crowd-sourcing of speech data. INTERSPEECH 2015: 2603-2604 - [c145]Shakti Rath, Sunil Sivadas, Bin Ma:
Joint environment and speaker normalization using factored front-end CMLLR. INTERSPEECH 2015: 2844-2848 - [c144]Kong-Aik Lee, Anthony Larcher, Guangsen Wang, Patrick Kenny, Niko Brümmer, David A. van Leeuwen, Hagai Aronowitz, Marcel Kockmann, Carlos Vaquero, Bin Ma, Haizhou Li, Themos Stafylakis, Md. Jahangir Alam, Albert Swart, Javier Perez:
The reddots data collection for speaker recognition. INTERSPEECH 2015: 2996-3000 - [c143]Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Parallel inference of dirichlet process Gaussian mixture models for unsupervised acoustic modeling: a feasibility study. INTERSPEECH 2015: 3189-3193 - [c142]Sunil Sivadas, Zhenzhou Wu, Bin Ma:
Investigation of parametric rectified linear units for noise robust speech recognition. INTERSPEECH 2015: 3234-3238 - [c141]Hoang Gia Ngo, Nancy F. Chen, Binh Minh Nguyen, Bin Ma, Haizhou Li:
Phonology-augmented statistical transliteration for low-resource languages. INTERSPEECH 2015: 3670-3674 - [c140]Tin Lay Nwe, Qianli Xu, Cuntai Guan, Bin Ma:
Stress level detection using double-layer subband filter. INTERSPEECH 2015: 3695-3699 - [c139]Jingyong Hou, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Haihua Xu, Hang Lv, Lei Xie, Zhonghua Fu, Chongjia Ni, Xiong Xiao, Hongjie Chen, Shaofei Zhang, Sining Sun, Yougen Yuan, Pengcheng Li, Tin Lay Nwe, Sunil Sivadas, Bin Ma, Engsiong Chng, Haizhou Li:
The NNI Query-by-Example System for MediaEval 2015. MediaEval 2015 - [c138]Wenda Chen, Nancy F. Chen, Boon Pang Lim, Bin Ma:
Corpus-based pronunciation variation rule analysis for singapore English. SLaTE 2015: 35-40 - 2014
- [j19]Anthony Larcher, Kong-Aik Lee, Bin Ma, Haizhou Li:
Text-dependent speaker verification: Classifiers, databases and RSR2015. Speech Commun. 60: 56-77 (2014) - [c137]Anthony Larcher, Kong-Aik Lee, Bin Ma, Haizhou Li:
Modelling the alternative hypothesis for text-dependent speaker verification. ICASSP 2014: 734-738 - [c136]Anthony Larcher, Kong-Aik Lee, Bin Ma, Haizhou Li:
Imposture classification for text-dependent speaker verification. ICASSP 2014: 739-743 - [c135]Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Minimum divergence estimation of speaker prior in multi-session PLDA scoring. ICASSP 2014: 4007-4011 - [c134]Nancy F. Chen, Sunil Sivadas, Boon Pang Lim, Hoang Gia Ngo, Haihua Xu, Van Tung Pham, Bin Ma, Haizhou Li:
Strategies for Vietnamese keyword search. ICASSP 2014: 4121-4125 - [c133]Rong Tong, Boon Pang Lim, Nancy F. Chen, Bin Ma, Haizhou Li:
Subspace Gaussian mixture model for computer-assisted language learning. ICASSP 2014: 5347-5351 - [c132]Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma:
On the use of Bhattacharyya based GMM distance and neural net features for identification of cognitive load levels. INTERSPEECH 2014: 736-740 - [c131]Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
A graph-based Gaussian component clustering approach to unsupervised acoustic modeling. INTERSPEECH 2014: 875-879 - [c130]Hanwu Sun, Bin Ma:
The NIST SRE summed channel speaker recognition system. INTERSPEECH 2014: 1111-1114 - [c129]Anthony Larcher, Kong-Aik Lee, Pablo Luis Sordo Martinez, Trung Hieu Nguyen, Bin Ma, Haizhou Li:
Extended RSR2015 for text-dependent speaker verification over VHF channel. INTERSPEECH 2014: 1322-1326 - [c128]Hoang Gia Ngo, Nancy F. Chen, Sunil Sivadas, Bin Ma, Haizhou Li:
A minimal-resource transliteration framework for vietnamese. INTERSPEECH 2014: 1410-1414 - [c127]Pei Xuan Lee, Darren Wee, Hilary Si Yin Toh, Boon Pang Lim, Nancy F. Chen, Bin Ma:
A whispered Mandarin corpus for speech technology applications. INTERSPEECH 2014: 1598-1602 - [c126]Peng Yang, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Intrinsic spectral analysis based on temporal context features for query-by-example spoken term detection. INTERSPEECH 2014: 1722-1726 - [c125]Rong Tong, Bin Ma, Haizhou Li:
Virtual example for phonotactic language recognition. INTERSPEECH 2014: 3017-3021 - [c124]Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Local variability vector for text-independent speaker verification. ISCSLP 2014: 54-58 - [c123]Chongjia Ni, Nancy F. Chen, Bin Ma:
Multiple time-span feature fusion for deep neural network modeling. ISCSLP 2014: 138-142 - [c122]Peng Yang, Haihua Xu, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei Wang, Su Jun Leow, Bin Ma, Chng Eng Siong, Haizhou Li:
The NNI Query-by-Example System for MediaEval 2014. MediaEval 2014 - [c121]Kong Aik Lee, Bin Ma, Haizhou Li, Liping Chen, Wu Guo, Li-Rong Dai:
Local Variability Modeling for Text-Independent Speaker Verification. Odyssey 2014: 54-59 - [c120]Changhuai You, Kong Aik Lee, Bin Ma, Haizhou Li:
Text-Dependent Speaker Verification System in VHF Communication Channel. Odyssey 2014: 216-223 - [c119]Bin Ma:
How We Found These Vulnerabilities in Android Applications. SecureComm (2) 2014: 399-406 - [e3]Haizhou Li, Helen M. Meng, Bin Ma, Engsiong Chng, Lei Xie:
15th Annual Conference of the International Speech Communication Association, INTERSPEECH 2014, Singapore, September 14-18, 2014. ISCA 2014 [contents] - 2013
- [j18]Haizhou Li, Bin Ma, Kong-Aik Lee:
Spoken Language Recognition: From Fundamentals to Practice. Proc. IEEE 101(5): 1136-1159 (2013) - [j17]Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
Shifted-Delta MLP Features for Spoken Language Recognition. IEEE Signal Process. Lett. 20(1): 15-18 (2013) - [j16]Ville Hautamäki, Tomi Kinnunen, Filip Sedlak, Kong-Aik Lee, Bin Ma, Haizhou Li:
Sparse Classifier Fusion for Speaker Verification. IEEE Trans. Speech Audio Process. 21(8): 1622-1631 (2013) - [j15]Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Spoken Language Recognition With Prosodic Features. IEEE Trans. Speech Audio Process. 21(9): 1841-1853 (2013) - [c118]Xiaoming Lu, Lei Xie, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Broadcast News Story Segmentation Using Manifold Learning on Latent Topic Distributions. ACL (2) 2013: 190-195 - [c117]Chien-Lin Huang, Chiori Hori, Hideki Kashioka, Bin Ma:
Speaker clustering using vector representation with long-term feature for lecture speech recognition. ICASSP 2013: 3532-3536 - [c116]Chien-Lin Huang, Chiori Hori, Hideki Kashioka, Bin Ma:
Joint analysis of vocal tract length and temporal information for robust speech recognition. ICASSP 2013: 7432-7436 - [c115]Anthony Larcher, Kong-Aik Lee, Bin Ma, Haizhou Li:
Phonetically-constrained PLDA modeling for text-dependent speaker verification with multiple short utterances. ICASSP 2013: 7673-7677 - [c114]Chang Huai You, Haizhou Li, Bin Ma, Kong-Aik Lee:
A study on GMM-SVM with adaptive relevance factor and its comparison with i-vector and JFA for speaker recognition. ICASSP 2013: 7683-7687 - [c113]Hanwu Sun, Kong-Aik Lee, Bin Ma:
Anti-model KL-SVM-NAP system for NIST SRE 2012 evaluation. ICASSP 2013: 7688-7692 - [c112]Nancy F. Chen, Bin Ma, Haizhou Li:
Minimal-resource phonetic language models to summarize untranscribed speech. ICASSP 2013: 8357-8361 - [c111]Xiaoming Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Broadcast news story segmentation using latent topics on data manifold. ICASSP 2013: 8465-8469 - [c110]Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection. ICASSP 2013: 8545-8549 - [c109]Rahim Saeidi, Kong-Aik Lee, Tomi Kinnunen, Tawfik Hasan, Benoit G. B. Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo Luis Sordo Martinez, Jia Min Karen Kua, Changhuai You, Hanwu Sun, Anthony Larcher, Padmanabhan Rajan, Ville Hautamäki, Cemal Hanilçi, Billy Braithwaite, Rosa González Hautamäki, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, Navid Shokouhi, Driss Matrouf, Laurent El Shafey, Pejman Mowlaee, Julien Epps, Tharmarajah Thiruvaran, David A. van Leeuwen, Bin Ma, Haizhou Li, John H. L. Hansen, Jean-François Bonastre, Sébastien Marcel, John S. D. Mason, Eliathamby Ambikairajah:
I4u submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification. INTERSPEECH 2013: 1986-1990 - [c108]Hanwu Sun, Bin Ma:
Improved unsupervised NAP training dataset design for speaker recognition. INTERSPEECH 2013: 1991-1995 - [c107]Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Unsupervised mining of acoustic subword units with segment-level Gaussian posteriorgrams. INTERSPEECH 2013: 2297-2301 - [c106]Nancy F. Chen, Vivaek Shivakumar, Mahesh Harikumar, Bin Ma, Haizhou Li:
Large-scale characterization of Mandarin pronunciation errors made by native speakers of European languages. INTERSPEECH 2013: 2370-2374 - [c105]Kong-Aik Lee, Anthony Larcher, Chang Huai You, Bin Ma, Haizhou Li:
Multi-session PLDA scoring of i-vector for partially open-set speaker detection. INTERSPEECH 2013: 3651-3655 - 2012
- [j14]Xiaoxuan Wang, Lei Xie, Mimi Lu, Bin Ma, Engsiong Chng, Haizhou Li:
Broadcast News Story Segmentation Using Conditional Random Fields and Multimodal Features. IEICE Trans. Inf. Syst. 95-D(5): 1206-1215 (2012) - [j13]Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Discriminative feature extraction for speech recognition using continuous output codes. Pattern Recognit. Lett. 33(13): 1703-1709 (2012) - [j12]Tin Lay Nwe, Hanwu Sun, Bin Ma, Haizhou Li:
Speaker Clustering and Cluster Purification Methods for RT07 and RT09 Evaluation Meeting Data. IEEE Trans. Speech Audio Process. 20(2): 461-473 (2012) - [c104]Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Acoustic TextTiling for story segmentation of spoken documents. ICASSP 2012: 5121-5124 - [c103]Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
An acoustic segment modeling approach to query-by-example spoken term detection. ICASSP 2012: 5157-5160 - [c102]Hanwu Sun, Bin Ma:
Unsupervised NAP Training Data Design for Speaker Recognition. INTERSPEECH 2012: 1099-1102 - [c101]Anthony Larcher, Kong-Aik Lee, Bin Ma, Haizhou Li:
RSR2015: Database for Text-Dependent Speaker Verification using Multiple Pass-Phrases. INTERSPEECH 2012: 1580-1583 - [c100]Ye Jiang, Kong-Aik Lee, Zhenmin Tang, Bin Ma, Anthony Larcher, Haizhou Li:
PLDA Modeling in I-Vector and Supervector Space for Speaker Verification. INTERSPEECH 2012: 1680-1683 - [c99]Changhuai You, Haizhou Li, Bin Ma, Kong-Aik Lee:
Effect of Relevance Factor of Maximum a posteriori Adaptation for GMM-SVM in Speaker and Language Recognition. INTERSPEECH 2012: 2065-2068 - [c98]Chien-Lin Huang, Chiori Hori, Hideki Kashioka, Bin Ma:
Ensemble Classifiers Using Unsupervised Data Selection for Speaker Recognition. INTERSPEECH 2012: 2666-2669 - [c97]Cheung-Chi Leung, Bin Ma, Haizhou Li:
Phonotactic spoken language recognition: Using diversely adapted acoustic models in parallel phone recognizers. ISCSLP 2012: 108-111 - [c96]Brian Mak, Bin Ma:
Welcome message from the technical program chairs. ISCSLP 2012 - [c95]Ville Hautamäki, Kong-Aik Lee, Anthony Larcher, Tomi Kinnunen, Bin Ma, Haizhou Li:
Variational Bayes logistic regression as regularized fusion for NIST SRE 2010. Odyssey 2012: 268-274 - [c94]Chang Huai You, Haizhou Li, Eliathamby Ambikairajah, Kong-Aik Lee, Bin Ma:
Bhattacharyya-based GMM-SVM system with adaptive relevance factor for pair language recognition. Odyssey 2012: 338-345 - [e2]Haizhou Li, Bin Ma, Kong-Aik Lee:
Odyssey 2012: The Speaker and Language Recognition Workshop, Singapore, June 25-28, 2012. ISCA 2012 [contents] - 2011
- [j11]Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Error Corrective Fusion of Classifier Scores for Spoken Language Recognition. IEICE Trans. Inf. Syst. 94-D(12): 2503-2512 (2011) - [j10]Donglai Zhu, Bin Ma, Haizhou Li:
Speaker Verification With Feature-Space MAPLR Parameters. IEEE Trans. Speech Audio Process. 19(3): 505-515 (2011) - [c93]Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
Score fusion and calibration in multiple language detectors with large performance variation. ICASSP 2011: 4404-4407 - [c92]Eryu Wang, Kong-Aik Lee, Bin Ma, Haizhou Li, Wu Guo, Li-Rong Dai:
Factored covariance modeling for text-independent speaker verification. ICASSP 2011: 4856-4859 - [c91]Chien-Lin Huang, Bin Ma, Haizhou Li, Chung-Hsien Wu:
Speech Indexing Using Semantic Context Inference. INTERSPEECH 2011: 717-720 - [c90]Rong Tong, Bin Ma, Haizhou Li, Chng Eng Siong:
Target-Aware Lattice Rescoring for Dialect Recognition. INTERSPEECH 2011: 733-736 - [c89]Mimi Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Probabilistic Latent Semantic Analysis for Broadcast News Story Segmentation. INTERSPEECH 2011: 1109-1112 - [c88]Hanwu Sun, Bin Ma:
Study of Overlapped Speech Detection for NIST SRE Summed Channel Speaker Recognition. INTERSPEECH 2011: 2345-2348 - [c87]Chien-Lin Huang, Bin Ma:
Maximum Entropy Based Data Selection for Speaker Recognition. INTERSPEECH 2011: 2713-2716 - [c86]Ville Hautamäki, Kong-Aik Lee, Tomi Kinnunen, Bin Ma, Haizhou Li:
Regularized Logistic Regression Fusion for Speaker Verification. INTERSPEECH 2011: 2745-2748 - [c85]Kong-Aik Lee, Anthony Larcher, Helen Thai, Bin Ma, Haizhou Li:
Joint Application of Speech and Speaker Recognition for Automation and Security in Smart Home. INTERSPEECH 2011: 3317-3318 - 2010
- [j9]Haizhou Li, Bin Ma:
TechWare: Speaker and Spoken Language Recognition Resources [Best of the Web]. IEEE Signal Process. Mag. 27(6): 139-142 (2010) - [c84]Minghui Dong, Paul Y. Chan, Ling Cen, Bin Ma, Haizhou Li:
I2R Text-to-Speech System for Blizzard Challenge 2010. Blizzard Challenge 2010 - [c83]Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Error corrective classifier fusion for spoken Language Recognition. ICASSP 2010: 1994-1997 - [c82]Hanwu Sun, Bin Ma, Swe Zin Kalayar Khine, Haizhou Li:
Speaker diarization system for RT07 and RT09 meeting room audio. ICASSP 2010: 4982-4985 - [c81]Donglai Zhu, Bin Ma, Haizhou Li:
Soft margin estimation of Gaussian mixture model parameters for spoken language recognition. ICASSP 2010: 4990-4993 - [c80]Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
Prosodic attribute model for spoken language identification. ICASSP 2010: 5022-5025 - [c79]Shuanhu Bai, Chien-Lin Huang, Bin Ma, Haizhou Li:
Semi-supervised learning of language model using unsupervised topic model. ICASSP 2010: 5386-5389 - [c78]Tin Lay Nwe, Minghui Dong, Paul Y. Chan, Xi Wang, Bin Ma, Haizhou Li:
Voice conversion: From spoken vowels to singing vowels. ICME 2010: 1421-1426 - [c77]Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Framewise Phone Classification Using Weighted Fuzzy Classification Rules. ICPR 2010: 4186-4189 - [c76]Hanwu Sun, Bin Ma, Chien-Lin Huang, Trung Hieu Nguyen, Haizhou Li:
The IIR NIST SRE 2008 and 2010 summed channel speaker recognition systems. INTERSPEECH 2010: 366-369 - [c75]Chien-Lin Huang, Hanwu Sun, Bin Ma, Haizhou Li:
Speaker characterization using long-term and temporal information. INTERSPEECH 2010: 370-373 - [c74]Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng:
Selecting phonotactic features for language recognition. INTERSPEECH 2010: 737-740 - [c73]Eryu Wang, Kong-Aik Lee, Bin Ma, Haizhou Li, Wu Guo, Li-Rong Dai:
The estimation and kernel metric of spectral correlation for text-independent speaker verification. INTERSPEECH 2010: 1065-1068 - [c72]Xiaoxuan Wang, Lei Xie, Bin Ma, Engsiong Chng, Haizhou Li:
Phoneme lattice based texttiling towards multilingual story segmentation. INTERSPEECH 2010: 1305-1308 - [c71]Donglai Zhu, Bin Ma, Kong-Aik Lee, Cheung-Chi Leung, Haizhou Li:
MAP estimation of subspace transform for speaker recognition. INTERSPEECH 2010: 1465-1468 - [c70]Ville Hautamäki, Tomi Kinnunen, Mohaddeseh Nosratighods, Kong-Aik Lee, Bin Ma, Haizhou Li:
Approaching human listener accuracy with modern speaker verification. INTERSPEECH 2010: 1473-1476 - [c69]Tin Lay Nwe, Hanwu Sun, Bin Ma, Haizhou Li:
Speaker diarization in meeting audio for single distant microphone. INTERSPEECH 2010: 1505-1508 - [c68]Raymond W. M. Ng, Cheung-Chi Leung, Ville Hautamäki, Tan Lee, Bin Ma, Haizhou Li:
Towards long-range prosodic attribute modeling for language recognition. INTERSPEECH 2010: 1792-1795 - [c67]Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
A discriminative performance metric for GMM-UBM speaker identification. INTERSPEECH 2010: 2114-2117 - [c66]Yanhua Long, Li-Rong Dai, Bin Ma, Wu Guo:
Effects of the phonological relevance in speaker verification. INTERSPEECH 2010: 2130-2133 - [c65]Cheung-Chi Leung, Donglai Zhu, Kong-Aik Lee, Bin Ma, Haizhou Li:
Incorporating MAP estimation and covariance transform for SVM based speaker recognition. INTERSPEECH 2010: 2318-2321 - [c64]Sirinoot Boonsuk, Donglai Zhu, Bin Ma, Atiwong Suchato, Proadpran Punyabukkana, Nattanun Thatphithakkul, Chai Wutiwiwatchai:
A study of term weighting in phonotactic approach to spoken language recognition. INTERSPEECH 2010: 2714-2717 - [c63]Eryu Wang, Wu Guo, Li-Rong Dai, Kong-Aik Lee, Bin Ma, Haizhou Li:
Factor analysis based spatial correlation modeling for speaker verification. ISCSLP 2010: 166-170 - [c62]Shuanhu Bai, Cheung-Chi Leung, Chien-Lin Huang, Bin Ma, Haizhou Li:
Building topic mixture language models using the document soft classification notion of topic models. ISCSLP 2010: 229-232 - [c61]Yanhua Long, Li-Rong Dai, Eryu Wang, Bin Ma, Wu Guo:
Non-negative matrix factorization based discriminative features for speaker verification. ISCSLP 2010: 291-295 - [c60]Hanwu Sun, Bin Ma, Haizhou Li:
Frame selection of interview channel for NIST speaker recognition evaluation. ISCSLP 2010: 305-308 - [c59]Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
Detection target dependent score calibration for language recognition. Odyssey 2010: 18 - [c58]Cheung-Chi Leung, Bin Ma, Haizhou Li:
Parallel Acoustic Model Adaptation for Improving Phonotactic Language Recognition. Odyssey 2010: 41 - [c57]Sethserey Sam, Laurent Besacier, Eric Castelli, Bin Ma, Cheung-Chi Leung, Haizhou Li:
Autonomous acoustic model adaptation for multilingual meeting transcription involving high- and low-resourced languages. SLTU 2010: 116-121
2000 – 2009
- 2009
- [j8]Chien-Lin Huang, Haizhou Li, Bin Ma:
Speaker Characterization using Average Filtering and Two Space Fusions. Int. J. Asian Lang. Process. 19(3): 85-94 (2009) - [j7]Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Analysis and Selection of Prosodic Features for Asian Language Recognition. Int. J. Asian Lang. Process. 19(4): 139-152 (2009) - [j6]Rong Tong, Bin Ma, Haizhou Li, Chng Eng Siong:
A Target-Oriented Phonotactic Front-End for Spoken Language Recognition. IEEE Trans. Speech Audio Process. 17(7): 1335-1347 (2009) - [c56]Minghui Dong, Ling Cen, Paul Y. Chan, Dongyan Huang, Donglai Zhu, Bin Ma, Haizhou Li:
I2R Text-to-Speech System for Blizzard Challenge 2009. Blizzard Challenge 2009 - [c55]Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Analysis and Selection of Prosodic Features for Language Identification. IALP 2009: 123-128 - [c54]Cheung-Chi Leung, Rong Tong, Bin Ma, Haizhou Li:
A Lattice-Based Phonotactic Language Recognition System with CMLLR Adaptation and Its Implementation Issues. IALP 2009: 285-288 - [c53]Donglai Zhu, Bin Ma, Haizhou Li:
Joint map adaptation of feature transformation and Gaussian Mixture Model for speaker recognition. ICASSP 2009: 4045-4048 - [c52]Haizhou Li, Bin Ma, Kong-Aik Lee, Hanwu Sun, Donglai Zhu, Khe Chai Sim, Changhuai You, Rong Tong, Ismo Kärkkäinen, Chien-Lin Huang, Vladimir Pervouchine, Wu Guo, Yijie Li, Li-Rong Dai, Mohaddeseh Nosratighods, Tharmarajah Thiruvaran, Julien Epps, Eliathamby Ambikairajah, Chng Eng Siong, Tanja Schultz, Qin Jin:
The I4U system in NIST 2008 speaker recognition evaluation. ICASSP 2009: 4201-4204 - [c51]Yanhua Long, Bin Ma, Haizhou Li, Wu Guo, Chng Eng Siong, Li-Rong Dai:
Exploiting prosodic information for Speaker Recognition. ICASSP 2009: 4225-4228 - [c50]Mohaddeseh Nosratighods, Tharmarajah Thiruvaran, Julien Epps, Eliathamby Ambikairajah, Bin Ma, Haizhou Li:
Evaluation of a fused FM and cepstral-based speaker recognition system on the NIST 2008 SRE. ICASSP 2009: 4233-4236 - [c49]Hanwu Sun, Bin Ma, Haizhou Li:
Cross-validation of multiple language recognition systems using pseudo keys. ICASSP 2009: 4353-4356 - [c48]Bin Ma, Donglai Zhu, Haizhou Li:
Acoustic segment modeling for speaker recognition. ICME 2009: 1668-1671 - [c47]Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng, Kong-Aik Lee:
Target-aware language models for spoken language recognition. INTERSPEECH 2009: 200-203 - [c46]Hanwu Sun, Tin Lay Nwe, Bin Ma, Haizhou Li:
Speaker diarization for meeting room audio. INTERSPEECH 2009: 900-903 - [c45]Donglai Zhu, Bin Ma, Haizhou Li:
Large margin estimation of Gaussian mixture model parameters with extended baum-welch for spoken language recognition. INTERSPEECH 2009: 2179-2182 - [c44]Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Discriminative feature transformation using output coding for speech recognition. INTERSPEECH 2009: 2979-2982 - [c43]Shuanhu Bai, Chien-Lin Huang, Yeow-Kee Tan, Bin Ma:
Language models learning for domain-specific natural language user interaction. ROBIO 2009: 2480-2485 - 2008
- [j5]Donglai Zhu, Haizhou Li, Bin Ma, Chin-Hui Lee:
Optimizing the Performance of Spoken Language Recognition With Discriminative Training. IEEE Trans. Speech Audio Process. 16(8): 1642-1653 (2008) - [c42]Minghui Dong, Donglai Zhu, Bin Ma, Haizhou Li:
I2R's Submission to Blizzard Challenge 2008. Blizzard Challenge 2008 - [c41]Donglai Zhu, Haizhou Li, Bin Ma, Chin-Hui Lee:
Discriminative learning for optimizing detection performance in spoken language recognition. ICASSP 2008: 4161-4164 - [c40]Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng:
Target-oriented phone tokenizers for spoken language recognition. ICASSP 2008: 4221-4224 - [c39]Chien-Lin Huang, Chung-Hsien Wu, Haizhou Li, Chia-Hsin Hsieh, Bin Ma:
Unsupervised pronunciation grammar growing using knowledge-based and data-driven approaches. ICME 2008: 1097-1100 - [c38]Omid Dehzangi, Bin Ma, Chng Eng Siong, Haizhou Li:
Fuzzy rule selection using Iterative Rule Learning for speech data classification. ICPR 2008: 1-4 - [c37]Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng:
Target-oriented phone selection from universal phone set for spoken language recognition. INTERSPEECH 2008: 715-718 - [c36]Donglai Zhu, Bin Ma, Haizhou Li:
Using MAP estimation of feature transformation for speaker recognition. INTERSPEECH 2008: 849-852 - [c35]Chien-Lin Huang, Bin Ma, Chung-Hsien Wu, Brian Mak, Haizhou Li:
Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions. INTERSPEECH 2008: 1897-1900 - [c34]Omid Dehzangi, Bin Ma, Chng Eng Siong, Haizhou Li:
Discriminative Output Coding Features for Speech Recognition. ISCSLP 2008: 89-92 - [c33]Hanwu Sun, Bin Ma, Haizhou Li:
Using Pseudo-Key for Language Recognition System Design. ISCSLP 2008: 173-176 - [c32]Chang Huai You, Kong-Aik Lee, Bin Ma, Haizhou Li:
Self-Organized Clustering for Feature Mapping in Language Recognition. ISCSLP 2008: 177-180 - [c31]Hanwu Sun, Bin Ma, Haizhou Li:
An Efficient Feature Selection Method for Speaker Recognition. ISCSLP 2008: 181-184 - [c30]Haizhou Li, Bin Ma, Kong-Aik Lee, Khe Chai Sim, Hanwu Sun, Rong Tong, Donglai Zhu, Changhuai You:
NIST 2007 Language Recognition Evaluation: From the Perspective of IIR. PACLIC 2008: 46-57 - 2007
- [j4]Haizhou Li, Bin Ma, Chin-Hui Lee:
A Vector Space Modeling Approach to Spoken Language Identification. IEEE Trans. Speech Audio Process. 15(1): 271-284 (2007) - [j3]Bin Ma, Haizhou Li, Rong Tong:
Spoken Language Recognition Using Ensemble Classifiers. IEEE Trans. Speech Audio Process. 15(7): 2053-2062 (2007) - [c29]Chin-Wei Eugene Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Chng Eng Siong, Haizhou Li, Susanto Rahardja:
Speaker Diarization Using Direction of Arrival Estimate and Acoustic Feature Information: The I2R-NTU Submission for the NIST RT 2007 Evaluation. CLEAR 2007: 484-496 - [c28]Donglai Zhu, Bin Ma, Haizhou Li, Qiang Huo:
A Generalized Feature Transformation Approach for Channel Robust Speaker Verification. ICASSP (4) 2007: 61-64 - [c27]Bin Ma, Helen M. Meng, Man-Wai Mak:
Effects of Device Mismatch, Language Mismatch and Environmental Mismatch on Speaker Verification. ICASSP (4) 2007: 301-304 - [c26]Rong Tong, Haizhou Li, Bin Ma, Engsiong Chng, Siu-Yeung Cho:
Spoken Language Recognition with Relevance Feedback. ICASSP (4) 2007: 861-864 - [c25]Bin Ma, Rong Tong, Haizhou Li:
Discriminative Vector for Spoken Language Recognition. ICASSP (4) 2007: 1001-1004 - [c24]Chin-Wei Eugene Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Engsiong Chng, Haizhou Li, Susanto Rahardja:
Using direction of arrival estimate and acoustic feature information in speaker diarization. INTERSPEECH 2007: 2149-2152 - 2006
- [j2]Bin Ma, Haizhou Li:
A Comparative Study of Four Language Identification Systems. Int. J. Comput. Linguistics Chin. Lang. Process. 11(2) (2006) - [c23]Rong Tong, Bin Ma, Donglai Zhu, Haizhou Li, Engsiong Chng:
Integrating Acoustic, Prosodic and Phonotactic Features for Spoken Language Identification. ICASSP (1) 2006: 205-208 - [c22]Bin Ma, Donglai Zhu, Rong Tong:
Chinese Dialect Identification Using Tone Features Based on Pitch Flux. ICASSP (1) 2006: 1029-1032 - [c21]Haizhou Li, Bin Ma, Rong Tong:
Vector-based spoken language recognition using output coding. INTERSPEECH 2006 - [c20]Bin Ma, Donglai Zhu, Rong Tong, Haizhou Li:
Speaker cluster based GMM tokenization for speaker recognition. INTERSPEECH 2006 - [c19]Kong-Aik Lee, Hanwu Sun, Rong Tong, Bin Ma, Minghui Dong, Changhuai You, Donglai Zhu, Chin-Wei Eugene Koh, Lei Wang, Tomi Kinnunen, Chng Eng Siong, Haizhou Li:
The IIR Submission to CSLP 2006 Speaker Recognition Evaluation. ISCSLP (Selected Papers) 2006: 494-505 - [c18]Rong Tong, Bin Ma, Kong-Aik Lee, Changhuai You, Donglai Zhu, Tomi Kinnunen, Hanwu Sun, Minghui Dong, Chng Eng Siong, Haizhou Li:
Fusion of Acoustic and Tokenization Features for Speaker Recognition. ISCSLP (Selected Papers) 2006: 566-577 - [c17]Donglai Zhu, Rong Tong, Bin Ma, Haizhou Li:
Minimum Classification Error Based Optimal Linear Combination for Spoken Language Identification. ISCSLP 2006 - [c16]Jinyu Li, Sibel Yaman, Chin-Hui Lee, Bin Ma, Rong Tong, Donglai Zhu, Haizhou Li:
Language Recognition Based on Score Distribution Feature Vectors and Discriminative Classifier Fusion. Odyssey 2006: 1-5 - [e1]Qiang Huo, Bin Ma, Chng Eng Siong, Haizhou Li:
Chinese Spoken Language Processing, 5th International Symposium, ISCSLP 2006, Singapore, December 13-16, 2006, Selected Papers. Lecture Notes in Computer Science 4274, Springer 2006, ISBN 3-540-49665-3 [contents] - 2005
- [c15]Haizhou Li, Bin Ma:
A Phonotactic Language Model for Spoken Language Identification. ACL 2005: 515-522 - [c14]Boon Pang Lim, Haizhou Li, Bin Ma:
Using Local & Global Phonotactic Features in Chinese Dialect Identification. ICASSP (1) 2005: 577-580 - [c13]Bin Ma, Haizhou Li, Chin-Hui Lee:
An acoustic segment modeling approach to automatic language identification. INTERSPEECH 2005: 2829-2832 - [c12]Sheng Gao, Bin Ma, Haizhou Li, Chin-Hui Lee:
A text categorization approach to automatic language identification. INTERSPEECH 2005: 2837-2840 - [c11]Bin Ma, Haizhou Li:
A phonotactic-semantic paradigm for automatic spoken document classification. SIGIR 2005: 369-376 - 2004
- [c10]Bin Ma, Helen Meng:
English-Chinese bilingual text-independent speaker verification. ICASSP (5) 2004: 293-296 - [c9]Chun Wai Lau, Bin Ma, Helen Mei-Ling Meng, Yiu Sang Moon, Yeung Yam:
Fuzzy logic decision fusion in a multimodal biometric system. INTERSPEECH 2004: 261-264 - 2002
- [c8]Bin Ma, Cuntai Guan, Haizhou Li, Chin-Hui Lee:
Multilingual speech recognition with language identification. INTERSPEECH 2002: 505-508 - [c7]Bin Ma, Cuntai Guan, Haizhou Li:
Likelihood probability mismatch analysis and normalization in multilingual speech applications. ISCSLP 2002 - [c6]Bin Ma, Qiang Huo:
A comparative study of several incremental adaptation algorithms for speaker adaptation. ISCSLP 2002 - 2001
- [j1]Qiang Huo, Bin Ma:
Online adaptive learning of continuous-density hidden Markov models based on multiple-stream prior evolution and posterior pooling. IEEE Trans. Speech Audio Process. 9(4): 388-398 (2001) - 2000
- [b1]Bin Ma:
A study on acoustic modeling and adaptation in HMM-based speech recognition. University of Hong Kong, 2000 - [c5]Qiang Hue, Nathan Smith, Bin Ma:
Efficient ML training of CDHMM parameters based on prior evolution, posterior intervention and feedback. ICASSP 2000: 1001-1004 - [c4]Qiang Huo, Bin Ma:
Robust speech recognition based on off-line elicitation of multiple priors and on-line adaptive prior fusion. INTERSPEECH 2000: 480-483 - [c3]Bin Ma, Qiang Huo:
Benchmark Results of Triphone-based Acoustic Modeling on HKU96 and HKU99 Putonghua Corpora. ISCSLP 2000
1990 – 1999
- 1999
- [c2]Qiang Huo, Bin Ma:
Irrelevant variability normalization in learning HMM state tying from data based on phonetic decision-tree. ICASSP 1999: 577-580 - [c1]Qiang Huo, Bin Ma:
On-line adaptive learning of CDHMM parameters based on multiple-stream prior evolution and posterior pooling. EUROSPEECH 1999: 2721-2724
Coauthor Index
aka: Kong Aik Lee
aka: Changhuai You
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-08 01:30 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint