APSIPA 2019: Lanzhou, China
- 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019, Lanzhou, China, November 18-21, 2019. IEEE 2019, ISBN 978-1-7281-3248-8
- Guo-Chih Hong, Chung-Nan Lee, Ming-Feng Lee:
Dynamic Threshold for DDoS Mitigation in SDN Environment. 1-7
- Po-Chiang Lin:
Large-Scale and High-Dimensional Cell Outage Detection in 5G Self-Organizing Networks. 8-12
- Kouji Hirata, Takuji Tachibana:
Implementation of multiple routing configurations on software-defined networks with P4. 13-16
- Koki Shimizu, Yuya Kumai, Kimiko Motonaka, Tomotaka Kimura, Kouji Hirata:
Evaluation of countermeasure against future malware evolution with deterministic modeling. 17-21
- Wen-Ping Lai, Kuan-Chun Chiu:
NUMAP: NUMA-aware Multi-core Pinning and Pairing for Network Slicing at the 5G Mobile Edge. 22-27
- Shaojie Yang, Wenbo Chen, Shanxi Li, Qingxiang Xu:
Approach using Transforming Structural Data into Image for Detection of Malicious MS-DOC Files based on Deep Learning Models. 28-32
- Kavin Kamaraj, Behnam Dezfouli, Yuhong Liu:
Edge Mining on IoT Devices Using Anomaly Detection. 33-40
- Fang Feng, Qingquan Lv, Mingsong Wang, Xuhui Yang, Qingguo Zhou, Rui Zhou:
A Hybrid Feature Selection Algorithm Applied to High-dimensional Imbalanced Small-sample Data Classification. 41-46
- Yikang Lin, Peng Zhang:
Blockchain-based Complete Self-tallying E-voting Protocol. 47-52
- Licheng Xiao, Hairong Wang, Nam Ling:
Image Compression with Deeper Learned Transformer. 53-57
- Lin Zhang, Yuhong Liu:
Modeling the Views of WeChat Articles by Branching Processes. 58-63
- Congcong Wang, Pengyu Liu, Kebin Jia, Siwei Chen:
Lightweight models for weather identification. 64-68
- Yu-Min Huang, Huan-Hsin Tseng, Jen-Tzung Chien:
Stochastic Fusion for Multi-stream Neural Network in Video Classification. 69-74
- Jie Cao, Yinping Qiu, Dongliang Chang, Xiaoxu Li, Zhanyu Ma:
Dynamic Attention Loss for Small-Sample Image Classification. 75-79
- Xiaoxu Li, Jijie Wu, Dongliang Chang, Weifeng Huang, Zhanyu Ma, Jie Cao:
Mixed Attention Mechanism for Small-Sample Fine-grained Image Classification. 80-85
- Jie Cao, Yaofeng Zhou, Hong Yu, Xiaoxu Li, Dan Wang, Zhanyu Ma:
A Loss With Mixed Penalty for Speech Enhancement Generative Adversarial Network. 86-90
- Xiaoxu Li, Liyun Yu, Jie Cao, Dongliang Chang, Zhanyu Ma, Nian Liu:
Small-Sample Image Classification Method of Combining Prototype and Margin Learning. 91-95
- Muhammad Hasnain, Muhammad Fermi Pasha, Chern Hong Lim, Imran Ghani:
Recurrent Neural Network for Web Services Performance Forecasting, Ranking and Regression Testing. 96-105
- Tuan Vu Ho, Masato Akagi:
Non-parallel Voice Conversion with Controllable Speaker Individuality using Variational Autoencoder. 106-111
- Berrak Sisman, Karthika Vijayan, Minghui Dong, Haizhou Li:
SINGAN: Singing Voice Conversion with Generative Adversarial Networks. 112-118
- Gaku Kotani, Hitoshi Suda, Daisuke Saito, Nobuaki Minematsu:
Experimental investigation on the efficacy of Affine-DTW in the quality of voice conversion. 119-124
- Jinsen Hu, Chunyan Yu, Faqian Guan:
Non-parallel Many-to-many Singing Voice Conversion by Adversarial Learning. 125-132
- Thuan Van Ngo, Rieko Kubo, Masato Akagi:
Evaluation of the Lombard effect model on synthesizing Lombard speech in varying noise level environments with limited data. 133-137
- Hiroki Murakami, Sunao Hara, Masanobu Abe:
DNN-based Voice Conversion with Auxiliary Phonemic Information to Improve Intelligibility of Glossectomy Patients' Speech. 138-142
- Kento Matsumoto, Sunao Hara, Masanobu Abe:
Speech-like Emotional Sound Generator by WaveNet. 143-147
- Shunsuke Goto, Daisuke Saito, Nobuaki Minematsu:
DNN-based Statistical Parametric Speech Synthesis Incorporating Non-negative Matrix Factorization. 148-153
- Masanori Morise, Genta Miyashita:
Efficient quantization of vocoded speech parameters without degradation. 154-158
- Xiaoxue Gao, Xiaohai Tian, Rohan Kumar Das, Yi Zhou, Haizhou Li:
Speaker-independent Spectral Mapping for Speech-to-Singing Conversion. 159-164
- Chuxiong Zhang, Sheng Zhang, Haibing Zhong:
A Prosodic Mandarin Text-to-Speech System Based on Tacotron. 165-169
- Jiangyan Yi, Jianhua Tao:
Distilling Knowledge for Distant Speech Recognition via Parallel Data. 170-175
- Jiangyan Yi, Jianhua Tao:
Batch Normalization based Unsupervised Speaker Adaptation for Acoustic Models. 176-180
- Ming Liu, Yujun Wang, Zhaoyu Yan, Jing Wang, Xiang Xie:
Robust Speech Recognition based on Multi-Objective Learning with GRU Network. 181-185
- Hiroshi Sato, Takafumi Moriya, Yusuke Shinohara, Ryo Masumura, Takaaki Fukutomi, Kiyoaki Matsui, Takanori Ashihara, Yoshikazu Yamaguchi, Yushi Aono:
Revisiting Dynamic Adjustment of Language Model Scaling Factor for Automatic Speech Recognition. 186-191
- Shuji Komeiji, Toshihisa Tanaka:
A Language Model-Based Design of Reduced Phoneme Set for Acoustic Model. 192-197
- Thi-Ly Vu, Zhiping Zeng, Haihua Xu, Eng Siong Chng:
Audio Codec Simulation based Data Augmentation for Telephony Speech Recognition. 198-203
- Wataru Nakamura, Yosuke Kaga, Masakazu Fujio, Kenta Takahashi:
Security and Efficiency of Biometric Template Protection for Identification. 210-217
- Daiki Izumoto, Yasushi Yamazaki:
Security enhancement for touch panel based user authentication on smartphones. 218-223
- Tetsushi Ohki, Vishu Gupta, Masakatsu Nishigaki:
Efficient Spoofing Attack Detection against Unknown Sample using End-to-End Anomaly Detection. 224-230
- Keisuke Takano, Hironobu Takano:
Eye-blink based Personal Authentication Using Time-series Directional Features and Waveform Features. 231-235
- Shion Tagawa, Hironobu Takano:
Personal Authentication with Eye Movement Features During PIN Input. 236-240
- Yi-Chun Lin, Yusei Suzuki, Hiroya Kawai, Koichi Ito, Hwann-Tzong Chen, Takafumi Aoki:
Attribute Estimation Using Multi-CNNs from Hand Images. 241-244
- Hyewon Song, Beom Kwon, Seongmin Lee, Sanghoon Lee:
Dictionary based Compression Type Classification using a CNN Architecture. 245-248
- Zhihao Du, Xueliang Zhang, Jiqing Han:
Investigation of Monaural Front-End Processing for Robust Speech Recognition Without Retraining or Joint-Training. 249-254
- Akira Tamamori, Tomoko Matsui:
A sequential prediction method of quasi-periodicity based on Gaussian process state space model. 255-261
- Ming-Hsiang Su, Chung-Hsien Wu, Po-Chen Shih:
Automatic Ontology Population Using Deep Learning for Triple Extraction. 262-267
- Karan Makhija, Thi-Nga Ho, Eng Siong Chng:
Transfer Learning for Punctuation Prediction. 268-273
- Xin Tang, Jun Du, Li Chai, Yannan Wang, Qing Wang, Chin-Hui Lee:
A LSTM-Based Joint Progressive Learning Framework for Simultaneous Speech Dereverberation and Denoising. 274-278
- Ryo Tanabe, Takashi Endo, Yuki Nikaido, Kenji Ichige, Nguyen Phong, Yohei Kawaguchi, Koichi Hamada:
Location-Independent Multi-Channel Acoustic Scene Classification Using Blind Dereverberation, Blind Source Separation, and Model Ensemble. 279-283
- Lantian Li, Zhiyuan Tang, Ying Shi, Dong Wang:
Phonetic-Attention Scoring for Deep Speaker Features in Speaker Verification. 284-288
- Zhaoci Liu, Zhiqiang Guo, Zhenhua Ling, Shijin Wang, Lingjing Jin, Yunxia Li:
Dementia Detection by Analyzing Spontaneous Mandarin Speech. 289-296
- Hao Li, Xueliang Zhang, Guanglai Gao:
Dynamic-attention based Encoder-decoder model for Speaker Extraction with Anchor speech. 297-301
- Jennifer Santoso, Takeshi Yamada, Shoji Makino:
Classification of causes of speech recognition errors using attention-based bidirectional long short-term memory and modulation spectrum. 302-306
- Jiahong Zhao, Christian Ritz:
Semi-Coprime Microphone Arrays for Estimating Direction of Arrival of Speech Sources. 308-313
- Junyi Peng, Rongzhi Gu, Yuexian Zou, Wenwu Wang:
Speaker-discriminative Embedding Learning via Affinity Matrix for Short Utterance Speaker Verification. 314-319
- Yao Du, Zhiyong Wu, Shiyin Kang, Dan Su, Dong Yu, Helen Meng:
Prosodic Structure Prediction using Deep Self-attention Neural Network. 320-324
- Rongzhi Gu, Junyi Peng, Yuexian Zou, Dong Yu:
Alleviate Cross-chunk Permutation through Chunk-level Speaker Embedding for Blind Speech Separation. 325-331
- Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari:
Acceleration of rank-constrained spatial covariance matrix estimation for blind speech extraction. 332-338
- Hsiao-Tzu Hung, Chung-Yang Wang, Yi-Hsuan Yang, Hsin-Min Wang:
Improving Automatic Jazz Melody Generation by Transfer Learning Techniques. 339-346
- Yupeng Shi, Nengheng Zheng, Yuyong Kang, Weicong Rong:
Speech Loss Compensation by Generative Adversarial Networks. 347-351
- Keisuke Nishijima, Ken'ichi Furuya:
Snoring sound classification using multiclass classifier under actual environments. 352-356
- Meng Liang, Zhong-Hua Fu, Xiang Zhao, Jinglei Zhou, Haikun Wang:
Nonlinear Echo Cancellation Based on Polyphase Filter Bank. 357-362
- Yunqi Cai, Dong Wang:
Question Mark Prediction By Bert. 363-367
- Li Li, Jianwu Dang, Yangping Wang, Song Wang, Zhenhai Zhang:
Part-Based Bilinear CNN For Person Re-Identification. 368-374
- Christoph M. Wilk, Shigeki Sagayama:
Polyphonic Voicing Optimization for Automatic Music Completion. 375-382
- Bo-Cheng Jiang, Chung-Nan Lee:
Online Layered Multiple Object Tracking Using Residual-Residual Networks. 383-390
- Biao Yue, Yangping Wang, Yongzhi Min, Zhenhai Zhang, Wenrun Wang, Jiu Yong:
Rail Surface Defect Recognition Method Based on AdaBoost Multi-classifier Combination. 391-396
- Qiuxian Zhang, Jiangyan Yi, Jianhua Tao, Mingliang Gu, Yong Ma:
Focal Loss for End-to-end Short Utterances Chinese Dialect Identification. 397-401
- Daisuke Saito, So Suzuki, Nobuaki Minematsu:
Speech representation based on tensor factor analysis and its application to speaker recognition and language identification. 402-406
- Jacob Lambert, Eijiro Takeuchi, Kazuya Takeda:
Optimizing Learned Object Detection on Point Clouds from 3D Lidars Through Range and Sparsity Information. 407-413
- Huan-Yu Chen, Yun-Shao Lin, Chi-Chun Lee:
Through the Eyes of Viewers: A Comment-Enhanced Media Content Representation for TED Talks Impression Recognition. 414-418
- Yuanfang Zhao, Yunli Chen:
End-to-end autonomous driving based on the convolution neural network model. 419-423
- Ping-Rong Chen, Hsueh-Ming Hang, Sheng-Wei Chan, Jing-Jhih Lin:
DSNet: An Efficient CNN for Road Scene Segmentation. 424-432
- Miao Zhao, Rongjin Li, Shijiang Yan, Zheng Li, Hao Lu, Shipeng Xia, Qingyang Hong, Lin Li:
Phone-Aware Multi-task Learning and Length Expanding for Short-Duration Language Recognition. 433-437
- Zhibo Rao, Mingyi He, Zhidong Zhu, Yuchao Dai, Renjie He:
SDBF-Net: Semantic and Disparity Bidirectional Fusion Network for 3D Semantic Detection on Incidental Satellite Images. 438-444
- Zhiyong Chen, Zongze Ren, Shugong Xu:
A Study on Angular Based Embedding Learning for Text-independent Speaker Verification. 445-449
- Zheng Li, Hao Lu, Jianfeng Zhou, Lin Li, Qingyang Hong:
Speaker Embedding Extraction with Multi-feature Integration Structure. 450-454
- Zhuozheng Wang, Meng Zhang, Wei Liu:
An Effective Road Extraction Method from Remote Sensing Images Based on Self-Adaptive Threshold Function. 455-460
- Jiayao Wu, Zhiyuan Tang, Dong Wang:
Structure Growth for Small-Footprint Speech Recognition. 461-465
- Na Li, Yongfei Zhang, Yun Zhang, C.-C. Jay Kuo:
On Energy Compaction of 2D Saab Image Transforms. 466-475
- Hangjing Zhang, Yuejiang Li, Yang Hu, Yan Chen, H. Vicky Zhao:
Measuring the Hazard of Malicious Nodes in Information Diffusion over Social Networks. 476-481
- Benliu Qiu, Yuejiang Li, Yan Chen, H. Vicky Zhao:
Controlling Information Diffusion with Irrational Users. 482-485
- Hong Hu, Yuejiang Li, H. Vicky Zhao, Yan Chen:
Modeling Multi-source Information Diffusion: A Graphical Evolutionary Game Approach. 486-492
- Zheming Yang, Wen Ji:
A Universal Intelligence Measurement Method Based on Meta-analysis. 493-498
- Qinyuan Ye, Yuejiang Li, Yan Chen, H. Vicky Zhao:
Modeling Content Interaction in Information Diffusion with Pre-trained Sentence Embedding. 499-507
- Fu-Sheng Tsai, Yi-Ming Weng, Chip-Jin Ng, Chi-Chun Lee:
Pain versus Affect? An Investigation in the Relationship between Observed Emotional States and Self-Reported Pain. 508-512
- Yuxuan Xi, Pengcheng Li, Yan Song, Yiheng Jiang, Lirong Dai:
Speaker to Emotion: Domain Adaptation for Speech Emotion Recognition with Residual Adapters. 513-518
- Bagus Tris Atmaja, Kiyoaki Shirai, Masato Akagi:
Speech Emotion Recognition Using Speech Feature and Word Embedding. 519-523
- Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi:
Dimensional Emotion Recognition from Speech Using Modulation Spectral Features and Recurrent Neural Networks. 524-528
- Lu Yi, Man-Wai Mak:
Adversarial Data Augmentation Network for Speech Emotion Recognition. 529-534
- Xueyi Wang, Lantian Li, Dong Wang:
VAE-based Domain Adaptation for Speaker Verification. 535-539
- Xingliang Cheng, Mingxing Xu, Thomas Fang Zheng:
Replay detection using CQT-based modified group delay feature and ResNeWt network in ASVspoof 2019. 540-545
- Zhimin Feng, Qiqi Tong, Yanhua Long, Shuang Wei, Chunxia Yang, Qiaozheng Zhang:
SHNU Anti-spoofing Systems for ASVspoof 2019 Challenge. 548-552
- Bin Gu, Wu Guo, Yao Liu, Jian Sun:
Clustering-Based Score Normalization for Speaker Verification. 553-557
- Zongze Ren, Zhiyong Chen, Shugong Xu:
Triplet Based Embedding Distance and Similarity Learning for Text-independent Speaker Verification. 558-562
- Wei-Cheng Liao, Jian-Jiun Ding:
Automatic Handwriting Verification and Suspect Identification for Chinese Characters Using Space and Frequency Domain Features. 563-571
- Dongkwon Jin, Kyungsun Lim, Chang-Su Kim:
Robust Change Detection in High Resolution Satellite Images with Geometric Distortions. 572-577
- Zhibo Rao, Mingyi He, Yuchao Dai, Zhidong Zhu, Bo Li, Renjie He:
MSDC-Net: Multi-Scale Dense and Contextual Networks for Stereo Matching. 578-583
- Zheng Cheng, Ping Han, Binbin Han, Jiahui Sun:
Classification of Polarimetric SAR Image based on Improved Fuzzy Clustering. 584-589
- Nien-Hsin Chou, Li-Chung Chuang, Ming-Sui Lee:
Intensity-aware GAN for Single Image Reflection Removal. 590-594
- Aamir Naveed Abbasi, Mingyi He:
CNN with ICA-PCA-DCT Joint Preprocessing for Hyperspectral Image Classification. 595-600
- Man Wang, Fangkun Qi, Hongwu Yang, Jingwen Sun:
Dongxiang speech synthesis based on statistical parameter method. 601-607
- Daichi Kondo, Masanori Morise:
Human-in-the-loop speech-design system and its evaluation. 608-612
- Masanori Morise, Takuro Shono:
High-quality waveform generator from fundamental frequency, spectral envelope, and band aperiodicity. 613-617
- Hyeonjoo Kang, Young-Sun Joo, Inseon Jang, Chunghyun Ahn, Hong-Goo Kang:
A Study on Acoustic Parameter Selection Strategies to Improve Deep Learning-Based Speech Synthesis. 618-622
- Peng-Fei Wu, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Hong-Chuan Wu, Lirong Dai:
End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training. 623-627
- Jingwen Sun, Gang Zhou, Hongwu Yang, Man Wang:
End-to-end Tibetan Ando dialect speech recognition based on hybrid CTC/attention architecture. 628-632
- Sining Sun, Shuran Zhou, Mei-Yuh Hwang, Lei Xie, Qin Li, Xin Lei:
Multiple fixed beamformers with a spacial Wiener-form postfilter for far-field speech recognition. 633-637
- Zhaoyi Liu, Yuexian Zou:
Teacher-Student BLSTM Mask Model for Robust Acoustic Beamforming. 638-643
- Fanchang Meng, Shouye Peng, Guohui Zhang:
Using Convolution and Sequence-discriminative Training to Improving Children Speech Recognition. 644-649
- Yahui Shan, Min Liu, Qingran Zhan, Shixuan Du, Jing Wang, Xiang Xie:
Speech Recognition Based on Deep Tensor Neural Network and Multifactor Feature. 650-654
- Ryo Masumura, Yusuke Ijima, Satoshi Kobashikawa, Takanobu Oba, Yushi Aono:
Can We Simulate Generative Process of Acoustic Modeling Data? Towards Data Restoration for Acoustic Modeling. 655-661
- Cunhang Fan, Bin Liu, Jianhua Tao, Jiangyan Yi, Zhengqi Wen, Ye Bai:
Noise Prior Knowledge Learning for Speech Enhancement via Gated Convolutional Generative Adversarial Network. 662-666
- Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Domain Adversarial Training for Speech Enhancement. 667-672
- Fuqiang Ye, Yu Tsao, Fei Chen:
Subjective Feedback-based Neural Network Pruning for Speech Enhancement. 673-677
- Tassadaq Hussain, Yu Tsao, Hsin-Min Wang, Jia-Ching Wang, Sabato Marco Siniscalchi, Wen-Hung Liao:
Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement. 678-683
- Xupeng Jia, Dongmei Li:
Speech Enhancement Based on Deep Mixture of Distinguishing Experts. 684-688
- Jingjun Liang, Shizhe Chen, Qin Jin:
Semi-supervised Multimodal Emotion Recognition with Improved Wasserstein GANs. 695-703
- Yijun Yuan, Jinwei Wan, Bo Chen:
Robust Attack on Deep Learning based Radar HRRP Target Recognition. 704-707
- Xiaoyong Lu, Yanqin Li, Haizhen An, Tao Pan, Renjun Li, Yanbin Hu, Aibao Zhou, Hongwu Yang:
Development of a Chinese Depressed Speech Corpus Based on The Disturbed Effect of Self-Processing. 718-722
- Junichiro Yoshimoto, Jumpei Ozaki, Kohta Mizutani, Takashi Nakano, Kazushi Ikeda, Takayuki Yamashita:
Statistical analysis on characteristic whisker movements observed in reward processing. 723-726
- Mikiko Konda, Takatomi Kubo, Naruki Morimura, Kazushi Ikeda:
Interaction Analysis in Hunting Behavior of Finless Porpoises. 727-730
- Yuichi Sakumura, Katsuyuki Kunida:
Extraction of Biomolecular Signals Controlling Complex Behavior of Biological Cells. 731-735
- Yaming Hu, Shun Nakamura, Tsuyoshi Yamanaka, Toshihisa Tanaka:
Physiological signals responses to normal and abnormal brake events in simulated autonomous car. 736-740
- Boning Li, Xuyang Zhao, Qibin Zhao, Toshihisa Tanaka, Jianting Cao:
A One-Dimensional Convolutional Neural Network Model for Automated Localization of Epileptic Foci. 741-744
- Zhichao Zhang, Maokang Luo, Ke Deng, Tao Yu:
Cohen's class time-frequency representation in linear canonical domains: definition and properties. 745-752
- Xiaolong Chen, Qiaowen Jiang, Ningyuan Su, Baoxin Chen, Jian Guan:
LFM Signal Detection and Estimation Based on Deep Convolutional Neural Network. 753-758
- Yannan Sun, Bingzhao Li:
Nonuniform fast linear canonical transform. 759-764
- Bing Deng, Qingshun Huang, Lin Zhang:
Digital implementation of Hilbert Transform in the LCT domain associated with FIR filter. 770-773
- Juan Zhao, Xia Bai:
Adaptive Matching Pursuit Method Based on Auxiliary Residual for Sparse Signal Recovery. 774-778
- Navid Tafaghodi Khajavi, Anthony Kuh:
Decomposition of Covariance Matrix Using Cascade of Trees. 779-783
- Junho Jo, Jae Woong Soh, Nam Ik Cho:
Handwritten Text Segmentation in Scribbled Document via Unsupervised Domain Adaptation. 784-790
- Chunyao Fang, Kebin Jia, Pengyu Liu, Liang Zhang:
Research on Cloud Recognition Technology Based on Transfer Learning. 791-796
- Muwei Jian, Ruihong Wang, Hui Yu, Junyu Dong, Yujuan Wang, Yilong Yin, Kin-Man Lam:
Saliency Detection via Robust Seed Selection of Foreground and Background Priors. 797-801
- Yuma Kinoshita, Kouki Seo, Hitoshi Kiya:
A Hue Correction Scheme Based on Constant-Hue Plane for Color Image Enhancement. 802-806
- S. K. Felix Yu, Zi-Xin Xu, Yuk-Hee Chan, Daniel Pak-Kong Lun:
A spatial domain secret image embedding technique with image authentication feature. 807-813
- Jing-Ming Guo, Sankarasrinivasan Seshathiri:
Reconstruction of Multitone BTC Images using Conditional Generative Adversarial Nets. 814-817
- Yuanjun Zhao, Roberto Togneri, Victor Sreeram:
Data augmentation and post selection for improved replay attack detection. 818-821
- Bin Liu, Shuai Nie, Wenju Liu, Hui Zhang, Xiangang Li, Changliang Li:
Deep Segment Attentive Embedding for Duration Robust Speaker Verification. 822-826
- Qian-Bei Hong, Chung-Hsien Wu, Ming-Hsiang Su, Hsin-Min Wang:
Sequential Speaker Embedding and Transfer Learning for Text-Independent Speaker Identification. 827-832
- Ryoya Yaguchi, Sayaka Shiota, Nobutaka Ono, Hitoshi Kiya:
Improving replay attack detection by combination of spatial and spectral features. 833-837
- Yitong Liu, Rohan Kumar Das, Haizhou Li:
Multi-band Spectral Entropy Information for Detection of Replay Attacks. 838-843
- Jingyi Xu, Junfeng Hou, Yan Song, Wu Guo, Lirong Dai:
Knowledge Distillation from Multilingual and Monolingual Teachers for End-to-End Multilingual Speech Recognition. 844-849
- Rui Na, Junfeng Hou, Wu Guo, Yan Song, Lirong Dai:
Learning Adaptive Downsampling Encoding for Online End-to-End Speech Recognition. 850-854
- Yueh-Ting Lee, Xuan-Bo Chen, Hung-Shin Lee, Jyh-Shing Roger Jang, Hsin-Min Wang:
Multi-task Learning for Acoustic Modeling Using Articulatory Attributes. 855-861
- Yuuki Tachioka:
Hypothesis Correction Based on Semi-character Recurrent Neural Network for End-to-end Speech Recognition. 862-867
- Haoxin Ma, Ye Bai, Jiangyan Yi, Jianhua Tao:
Hypersphere Embedding and Additive Margin for Query-by-example Keyword Spotting. 868-872
- Nan Zhou, Jun Du, Yanhui Tu, Tian Gao, Chin-Hui Lee:
A Speech Enhancement Neural Network Architecture with SNR-Progressive Multi-Target Learning for Robust Speech Recognition. 873-877
- Jing Yuan, Changchun Bao:
CycleGAN-based speech enhancement for the unpaired training data. 878-883
- Rui Cheng, Changchun Bao:
Phase Unwrapping Based Speech Enhancement. 884-889
- Dujuan Wang, Changchun Bao:
End-to-End Speech Enhancement Using Fully Convolutional Networks with Skip Connections. 890-895
- Jingdong Li, Hui Zhang, Xueliang Zhang, Changliang Li:
Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks. 896-900
- Yao Zhou, Changchun Bao, Rui Cheng:
GSC Based Speech Enhancement with Generative Adversarial Network. 901-906
- Hideki Kawahara, Ken-Ichi Sakakibara, Eri Haneishi, Kaori Hagiwara:
Real-time and interactive tools for vocal training based on an analytic signal with a cosine series envelope. 907-910
- Hosana Kamiyama, Atsushi Ando, Ryo Masumura, Satoshi Kobashikawa, Yushi Aono:
Likability Estimation of Call-center Agents by Suppressing Annotator Variability. 911-916
- Hosana Kamiyama, Atsushi Ando, Ryo Masumura, Satoshi Kobashikawa, Yushi Aono:
Urgent Voicemail Detection Focused on Long-term Temporal Variation. 917-921
- Liangqi Liu, Zhiyong Wu, Runnan Li, Jia Jia, Helen Meng:
Learning Contextual Representation with Convolution Bank and Multi-head Self-attention for Speech Emphasis Detection. 922-926
- Xiaoqun Dong, Xueqin Zhao:
Effect of Relative Frequency of Lexical Meanings on Accessing Lexical Ambiguities: Evidence from the Coordinator 'and'. 927-932
- Naoki Umeno, Masaru Yamashita, Hiroyuki Takada, Shoichi Matsunaga:
Training Data Expansion for Classification between Normal and Abnormal Lung Sounds. 935-938
- Xinjie Shi, Tianqi Wang, Lan Wang, Hanjun Liu, Nan Yan:
Hybrid Convolutional Recurrent Neural Networks Outperform CNN and RNN in Task-state EEG Detection for Parkinson's Disease. 939-944
- Jinfeng Huang, Bin Zhao, Jianwu Dang, Minbo Chen:
Investigation of speech-planning mechanism based on eye movement and EEG. 945-950
- Takeshi D. Itoh, Takatomi Kubo, Kiyoka Ikeda, Yuki Maruno, Yoshiharu Ikutani, Hideaki Hata, Kenichi Matsumoto, Kazushi Ikeda:
Towards Generation of Visual Attention Map for Source Code. 951-954
- Xiaokong Miao, Meng Sun, Xiongwei Zhang:
Voice Conversion by Dual-Domain Bidirectional Long Short-Term Memory Networks with Temporal Attention. 955-959
- Siwei Chen, Kebin Jia, Pengyu Liu, Xunping Huang:
Taxi Drivers' Smoking Behavior Detection in Traffic Monitoring Video. 968-973
- Jiaqi Feng, Shuai Li, Yunfeng Sui, Lingtong Meng, Ce Zhu:
Integrating Action-aware Features for Saliency Prediction via Weakly Supervised Learning. 974-979
- Hochang Rhee, Nam Ik Cho:
Efficient and Robust Pseudo-Labeling for Unsupervised Domain Adaptation. 980-985
- Wei Gao:
A Multi-Objective Optimization Perspective for Joint Consideration of Video Coding Quality. 986-991
- Yuyang Liu, Hongwei Guo, Ce Zhu, Yipeng Liu:
Spherical Position Dependent Rate-Distortion Optimization for 360-degree Video Coding. 992-996
- Qiang Fang:
Is average RMSE appropriate for evaluating acoustic-to-articulatory inversion? 997-1003
- Wenwei Dong, Yanlu Xie:
Normalization of GOP for Chinese Mispronunciation Detection. 1004-1008
- Tomohiro Tanaka, Ryo Masumura, Takafumi Moriya, Takanobu Oba, Yushi Aono:
Disfluency Detection Based on Speech-Aware Token-by-Token Sequence Labeling with BLSTM-CRFs and Attention Mechanisms. 1009-1013
- Rui Yang, Zhen-Hua Ling:
Linguistic Steganography by Sampling-based Language Generation. 1014-1019
- Wenwei Dong, Yanlu Xie, Binghuai Lin:
Unsupervised Pronunciation Fluency Scoring by infoGan. 1020-1023
- Zhenye Gan, Yi Jiao, Hongwu Yang, Gaungying Zhao, Zhimeng Song:
Study on the Tones Biases of Mandarin Speaker in Amdo Tibetan Areas Based on Statistics. 1024-1028
- Leilan Zhang, Qiang Zhou:
Automatically Annotate TV Series Subtitles for Dialogue Corpus Construction. 1029-1035
- Leilan Zhang, Qiang Zhou:
Topic Segmentation for Dialogue Stream. 1036-1043
- Huang-Cheng Chou, Yi-Wen Liu, Chi-Chun Lee:
Joint Learning of Conversational Temporal Dynamics and Acoustic Features for Speech Deception Detection in Dialog Games. 1044-1050
- Kengo Ohta, Ryota Nishimura, Norihide Kitaoka:
Type of Response Selection utilizing User Utterance Word Sequence, LSTM and Multi-task Learning for Chat-like Spoken Dialog Systems. 1051-1055
- Aijun Li, Gan Huang, Zhiqiang Li:
Prosodic Cues in the Interpretation of Echo Questions in Chinese Spoken Dialogues. 1056-1061
- Julan Xie, Fanghao Cheng, Zishu He, Huiyong Li:
A DOA Estimation Method of coherent and uncorrelated sources based on Nested Arrays. 1062-1065
- Julan Xie, Fanghao Cheng, Zishu He, Huiyong Li:
A DOA Estimation Method in the presence of unknown mutual coupling based on Nested Arrays. 1066-1071
- Feiran Yang, Jun Yang, Felix Albu:
An Alternative Solution to the Dynamically Regularized RLS Algorithm. 1072-1075
- Xinqi Huang, Yingsong Li, Felix Albu:
A Norm Constraint Lorentzian Algorithm Under Alpha-stable Measurement Noise. 1076-1079
- Liyun Xu:
Random Signal Estimation by Ergodicity associated with Linear Canonical Transform. 1080-1083
- Shanpeng Zhao, Shaoxiang Zhao, Youpeng Zhang, Zhengjie Xu:
Study on Pre-warning Model of Railway Signal System with Fuzzy Analytic Hierarchy Process. 1084-1090
- Yongzhi Min, Jie Hu:
Calibration of Position and Orientation between Cameras without Common Field of View Using Cooperative Target. 1100-1104
- Shu-Feng Duan, Ligu Zhu, Yujing Shi, Lei Zhang, Bo Hui:
Frequency Decomposition Model of Popularity Evolution in Online Social Media. 1105-1111
- Yanyan Wang, Yingsong Li, Lu Shen, Yuriy V. Zakharov:
Acoustic-Domain Self-Interference Cancellation for Full-Duplex Underwater Acoustic Communication Systems. 1112-1116
- Zhicheng Guo, Jianwu Dang, Yangping Wang, Jing Jin:
Background Modeling Algorithm for Multi-feature Fusion. 1117-1121
- Yi-Fan Chen, Amey Kiran Patel, Chia-Ping Chen:
Image Haze Removal By Adaptive CycleGAN. 1122-1127
- Jinxiang Liang, Jianwu Dang, Yangping Wang, Jingyu Yang, Zhenhai Zhang:
Remote Sensing Image Scene Classification Based on SURF Feature and Deep Learning. 1128-1133
- ShaoQuan Wang, DeYong Gao, Yangping Wang, Song Wang:
An Improved Retinex low-illumination image enhancement algorithm. 1134-1139
- Peiya Li, Zhenhui Situ:
Encrypted JPEG image retrieval using histograms of transformed coefficients. 1140-1144
- Changmeng Peng, Luting Cai, Zhizhong Fu, Xiaofeng Li:
CNN-based bit-depth enhancement by the suppression of false contour and color distortion. 1145-1151
- Lixin Pan, Sheng Li, Longbiao Wang, Jianwu Dang:
Effective Training End-to-End ASR systems for Low-resource Lhasa Dialect of Tibetan Language. 1152-1156
- Tianjiao Xu, Hui Zhang, Xueliang Zhang:
Joint Training ResCNN-based Voice Activity Detection with Speech Enhancement. 1157-1162
- Haruka Tanji, Kazunori Kojima, Hiroaki Nanjo, Shi-wook Lee, Yoshiaki Itoh:
A Rescoring Method Using Web Search and Word Vectors for Spoken Term Detection. 1163-1167
- Nguyen Binh Thien, Yukoh Wakabayashi, Takahiro Fukumori, Takanobu Nishiura:
Derivative of instantaneous frequency for voice activity detection using phase-based approach. 1168-1172
- Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Bin Liu:
Voice Activity Detection Based on Time-Delay Neural Networks. 1173-1178
- Wei-Cheng Lin, Yu Tsao, Fei Chen, Hsin-Min Wang:
Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement. 1179-1184
- Tianjiao Xu, Hao Li, Hui Zhang, Xueliang Zhang:
Improve Data Utilization with Two-stage Learning in CNN-LSTM-based Voice Activity Detection. 1185-1189
- Karthika Vijayan, K. Sri Rama Murty, Haizhou Li:
Allpass Modeling of Phase Spectrum of Speech Signals for Formant Tracking. 1190-1196
- Minghao Guo, Cai Rui, Wei Wang, Binghuai Lin, Jinsong Zhang, Yanlu Xie:
A Study on Mispronunciation Detection Based on Fine-grained Speech Attribute. 1197-1201
- Ze-Yu Zou, Yun-Xia Liu, Wen-Na Zhang, Yuehui Chen, Yun-Li Zang, Yang Yang, Bonnie Ngai-Fong Law:
Robust Camera Model Identification Based on Richer Convolutional Feature Network. 1202-1207
- Suradej Duangpummet, Jessada Karnjana, Waree Kongprawechnon, Masashi Unoki:
A Robust Method for Blindly Estimating Speech Transmission Index using Convolutional Neural Network with Temporal Amplitude Envelope. 1208-1214
- Dan He, Yubin Zhong:
Compressing Speech Recognition Networks with MLP via Tensor-Train Decomposition. 1215-1219
- Wenxia Lu, Lijun Zhang, Jie Chen, Jingdong Chen:
Generalized Combined Nonlinear Adaptive Filters for Nonlinear Acoustic Echo Cancellation. 1220-1225
- Yao Du, Zhiyong Wu, Shiyin Kang, Dan Su, Dong Yu, Helen Meng:
Automatic Prosodic Structure Labeling using DNN-BGRU-CRF Hybrid Neural Network. 1234-1238
- Feng Li, Kaizhi Qian, Mark Hasegawa-Johnson, Masato Akagi:
Monaural Singing Voice Separation Using Fusion-Net with Time-Frequency Masking. 1239-1243
- Qing Zhou, Yong Ma, Benyan Luo, Mingliang Gu, Zude Zhu:
Identification of Alzheimer's Disease Patients Based on Oral Speech Features. 1244-1249
- Yang Yi, Kuan-Yu Chen, Hung-Yan Gu:
Mixture of CNN Experts from Multiple Acoustic Feature Domain for Music Genre Classification. 1250-1255
- Lu Huang, Gaofeng Cheng, Pengyuan Zhang, Yi Yang, Shumin Xu, Jiasong Sun:
Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation. 1256-1261
- Jian Sun, Wu Guo, Bin Gu, Yao Liu:
Bidirectional Temporal Convolution with Self-Attention Network for CTC-Based Acoustic Modeling. 1262-1266
- Kun Zhang, Zhiyong Wu, Jia Jia, Helen M. Meng, Binheng Song:
Query-by-Example Spoken Term Detection using Attentive Pooling Networks. 1267-1272
- Maitreya Patel, Mihir Parmar, Savan Doshi, Nirmesh J. Shah, Hemant A. Patil:
Novel Adaptive Generative Adversarial Network for Voice Conversion. 1273-1281
- Yi Zhou, Xiaohai Tian, Rohan Kumar Das, Haizhou Li:
Many-to-many Cross-lingual Voice Conversion with a Jointly Trained Speaker Embedding Network. 1282-1287
- Ping Gao, Cheng-You You, Tai-Shih Chi:
A Multi-Scale Fully Convolutional Network for Singing Melody Extraction. 1288-1293
- Guanyu Li, Lisai Luo, Chunwei Gong, Shiliang Lv:
End-to-end Tibetan Speech Synthesis Based on Phones and Semi-syllables. 1294-1297
- Neelesh Nursiah, KokSheik Wong, Minoru Kuribayashi:
Reversible Data Hiding in PDF Document Exploiting Prefix Zeros in Glyph Coordinates. 1298-1302
- Jianyuan Wu, Zheng Wang, Hui Zeng, Xiangui Kang:
Multiple-Operation Image Anti-Forensics with WGAN-GP Framework. 1303-1307
- Duo Ma, Guanyu Li, Haihua Xu, Eng Siong Chng:
Improving code-switching speech recognition with data augmentation and system combination. 1308-1312
- Jisheng Bai, Chen Chen, Jianfeng Chen:
A Multi-feature Fusion Based Method For Urban Sound Tagging. 1313-1317
- Taiki Izumi, Shingo Uenohara, Ken'ichi Furuya, Yuuki Tachioka:
Activation Driven Synchronized Joint Diagonalization for Underdetermined Sound Source Separation. 1318-1322
- Weiqing Wang, Haiwei Wu, Ming Li:
Deep Neural Networks with Batch Speaker Normalization for Intoxicated Speech Detection. 1323-1327
- Liang He, Xianhong Chen, Can Xu, Jia Liu:
Subtraction-Positive Similarity Learning. 1328-1332
- Zhuozheng Wang, Yingjie Dong, Wei Liu:
A Novel Effective Dimensionality Reduction Algorithm for Water Chiller Fault Data. 1333-1341
- Ying Chen, Wentao Xiao, Jie Cui, Hanyu Xu:
Speech Prosody and Eye Movements in Processing Discourse Information: A Preliminary Study in Mandarin Chinese. 1342-1346
- Guan-Bo Wang, Wei-Qiang Zhang:
An RNN and CRNN Based Approach to Robust Voice Activity Detection. 1347-1350
- Linna Zhou, Derui Liao:
Study of Chinese Text Steganography using Typos. 1351-1357
- Kosuke Fukumori, Toshihisa Tanaka:
A Simple Gaussian Kernel Classifier with Automated Hyperparameter Tuning. 1358-1363
- Senmao Wang, Pan Zhou, Wei Chen, Jia Jia, Lei Xie:
Exploring RNN-Transducer for Chinese speech recognition. 1364-1369
- Jiu Yong, Yangping Wang, Xiaomei Lei, Fang Yong, Zhenhai Zhang:
Long-term 3D Registration Method Based on LCT Tracking and Improved ORB Detection. 1370-1379
- Masahiro Tsumori, Shinichiro Nagai, Ryosuke Harakawa, Toru Sasaki, Masahiro Iwahashi:
Restoration of Minute Light Emissions Observed by Streak Camera Based on N-CUP Method. 1380-1384
- Kheng Hui Ng, Yiqi Tew, Mum Wai Yip:
A Prefatory Study on Data Channelling Mechanism towards Industry 4.0. 1385-1390
- Weiwei Shan, Shogo Muramatsu, Akira Oshima, Hiroyoshi Yamada:
Successive Stripe Artifact Removal Based on Robust PCA for Millimeter Wave Automotive Radar Image. 1391-1394
- Thittaporn Ganokratanaa, Supavadee Aramvith, Nicu Sebe:
Anomaly Event Detection Using Generative Adversarial Network for Surveillance Videos. 1395-1399
- Tien-Hong Lo, Berlin Chen:
Semi-supervised Training of Acoustic Models Leveraging Knowledge Transferred from Out-of-Domain Data. 1400-1404
- Toranosuke Tanio, Kouya Takeda, Jaehoon Yu, Masanori Hashimoto:
Training Data Reduction using Support Vectors for Neural Networks. 1405-1410 - Shota Fukui, Jaehoon Yu, Masanori Hashimoto:
Distilling Knowledge for Non-Neural Networks. 1411-1416 - Meng Meng, Go Tanaka:
Proposal of Minimization Problem Based Lightness Modification Method Considering Visual Characteristics of Protanopia and Deuteranopia. 1417-1422 - Hiroshi Tsutsui, Kentaro Yamada, Akihiro Sudou, Yoshikazu Miyanaga:
An Evaluation of Stack Light Indicator Color Detection System Using Web Cameras for Automatic Production Lines. 1423-1426 - Dingli Luo, Songlin Du, Takeshi Ikenaga:
Multi-Task and Multi-Level Detection Neural Network Based Real-Time 3D Pose Estimation. 1427-1434 - Yu Wang, Xueting Li, Yun Zhu, Feilong He:
A Fast Inter-view Mode Selection Algorithm Based on Video Array Processor. 1435-1442 - Junyong Deng, Haoyue Wu, Rui Shan, Yiwen Fu, Xinchuang Liu, Ping Wang:
NPFONoC: A Low-loss, Non-blocking, Scalable Passive Optical Interconnect Network-on-Chip Architecture. 1443-1448 - Xiaoyan Xie, Xiang Lei, Jinna Zhou, Yun Zhu, Lin Jiang:
A Reconfigurable Implementation of Motion Compensation in HEVC. 1449-1454 - Bowen Zhang, Huaxi Gu, Ruiqi Guo:
SCRA: A Hybrid Deterministic Routing Algorithm for Aging-Resilient Network-on-Chip. 1455-1458 - Ryota Sugimoto, Osamu Takyu:
Access Decision based on Secure Capacity for prevention to CSI Impersonation of Untrusted Relay. 1459-1462 - Akinori Kamio, Fumihito Sasamori, Shiro Handa, Osamu Takyu, Mai Ohta, Takeo Fujii:
Recognition and Countermeasure to Hidden Terminal Problem by Packet Analysis in Wireless LAN. 1463-1467 - Shunsuke Tsuchida, Takumi Takahashi, Shinsuke Ibi, Seiichi Sampei:
Machine Learning-Aided Indoor Positioning Based on Unified Fingerprints of Wi-Fi and BLE. 1468-1472 - Kazunori Hayashi, Ayano Nakai-Kasai, Ryo Hayakawa:
An Overloaded SC-CP IoT Signal Detection Method via Sparse Complex Discrete-Valued Vector Reconstruction. 1473-1478 - Jumpei Kawakami, Hendrik Lumbantoruan, Koichi Adachi:
NOMA Based UAV Relay Communication Protocol in Cellular Network. 1479-1484 - Changyan Zheng, Jibin Yang, Xiongwei Zhang, Meng Sun, Kun Yao:
Improving the Spectra Recovering of Bone-Conducted Speech via Structural SIMilarity Loss Function. 1485-1490 - Ke-Xin He, Wei-Qiang Zhang, Jia Liu, Yao Liu:
Dilated-Gated Convolutional Neural Network with A New Loss Function on Sound Event Detection. 1491-1495 - Bolun Wang, Zhong-Hua Fu, Hao Wu:
Augmented Strategy For Polyphonic Sound Event Detection. 1496-1500 - Rui Wang, Mou Wang, Xiao-Lei Zhang, Susanto Rahardja:
Domain Adaptation Neural Network for Acoustic Scene Classification in Mismatched Conditions. 1501-1505 - Ziye Yang, Xiao-Lei Zhang:
Boosting Spatial Information for Deep Learning Based Multichannel Speaker-Independent Speech Separation In Reverberant Environments. 1506-1510 - Mou Wang, Rui Wang, Xiao-Lei Zhang, Susanto Rahardja:
Hybrid Constant-Q Transform Based CNN Ensemble for Acoustic Scene Classification. 1511-1516 - Jiakang Li, Meng Sun, Xiongwei Zhang:
Multi-task learning of deep neural networks for joint automatic speaker verification and spoofing detection. 1517-1522 - Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Hideki Banno, Masanori Morise, Toshio Irino:
Frequency domain variant of Velvet noise and its application to acoustic measurements. 1523-1532 - Beth Jelfs, Christopher Gilliam:
Fast & Efficient Delay Estimation Using Local All-Pass & Kalman Filters. 1533-1539 - Qian Ren, Zhenhai Zhang:
Dynamic Adjustment of Railway Emergency Plan Based on Utility Risk Entropy. 1540-1544 - Madhu R. Kamble, Aditya Krishna Sai Pulikonda, Maddala Venkata Siva Krishna, Ankur T. Patil, Rajul Acharya, Hemant A. Patil:
Speech Demodulation-based Techniques for Replay and Presentation Attack Detection. 1545-1550 - Huachao Lu, Zhijin Zhao:
Spectrum Sensing Algorithm Based on LSTM and Its Implementation of Multiple USRP. 1551-1555 - Woojae Kim, Jaekyung Kim, Sanghoon Lee:
Quality of Experience using Deep Convolutional Neural Networks and future trends. 1556-1559 - Yibo Du, Kebin Jia, Chang Liu:
Stereo Matching and Image Inpainting Based on Binocular Camera. 1560-1564 - Daichi Kitahara, Swathi Ananda, Akira Hirabayashi:
Optimization-Based Fundus Image Decomposition for Diagnosis Support of Diabetic Retinopathy. 1565-1572 - Ming-Ze Wang, Shuai Wan, Hao Gong, Yuanfang Yu, Yang Liu:
An Integrated CNN-based Post Processing Filter For Intra Frame in Versatile Video Coding. 1573-1577 - Hongwei Zhang, Liuai Wu, Yanchun Yang:
Parameter-free Image Segmentation Based on Extreme Learning Machine. 1578-1581 - Swathi Ananda, Daichi Kitahara, Akira Hirabayashi, K. R. Udaya Kumar Reddy:
Automatic Fundus Image Segmentation for Diabetic Retinopathy Diagnosis by Multiple Modified U-Nets and SegNets. 1582-1588 - Haesoo Chung, Yoonsik Kim, Junho Jo, Sang-Hoon Lee, Nam Ik Cho:
Kernel Prediction Network for Detail-Preserving High Dynamic Range Imaging. 1589-1594 - Minoru Kuribayashi, Nobuo Funabiki:
Efficient Decentralized Tracing Protocol for Fingerprinting System with Index Table. 1595-1601 - Xiang Feng, Qun Song, Qingfang Guo, Duo Liu, Zhanfeng Zhao, Yi-an Zhao:
Hand Gesture Recognition with Ensemble Time-Frequency Signatures Using Enhanced Deep Convolutional Neural Network. 1602-1605 - Amna Qureshi, David Megías:
Blockchain-based P2P multimedia content distribution using collusion-resistant fingerprinting. 1606-1615 - Ponlawat Chophuk, Kanjana Pattanaworapan, Kosin Chamnongthai:
Consideration of a Selecting Frame of Finger-Spelled Words from Backhand View. 1621-1624 - Yiheng Jiang, Yan Song, Jie Yan, Lirong Dai, Ian McLoughlin:
Triplet-Center Loss Based Deep Embedding Learning Method for Speaker Verification. 1625-1629 - Rohan Kumar Das, Jichen Yang, Haizhou Li:
Speaker Clustering with Penalty Distance for Speaker Verification with Multi-Speaker Speech. 1630-1635 - Can Xu, Xianhong Chen, Liang He, Jia Liu:
Geometric Discriminant Analysis for I-vector Based Speaker Verification. 1636-1640 - Jianfeng Zhou, Tao Jiang, Qingyang Hong, Lin Li:
Extraction of Noise-Robust Speaker Embedding Based on Generative Adversarial Networks. 1641-1645 - Haiwei Wu, Weicheng Cai, Ming Li, Ji Gao, Shanshan Zhang, Zhiqiang Lyu, Shen Huang:
DKU-Tencent Submission to Oriental Language Recognition AP18-OLR Challenge. 1646-1651 - Xu Xiang, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu:
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition. 1652-1656 - Tianxiang Ma, Bo Peng, Wei Wang, Jing Dong:
Any-to-one Face Reenactment Based on Conditional Generative Adversarial Network. 1657-1664 - Naoki Hamasaki, Kazuaki Nakamura, Naoko Nitta, Noboru Babaguchi:
Discrimination between Handwritten and Computer-Generated Texts using a Distribution of Patch-Wise Font Features. 1665-1671 - Jeongwoo Lim, Naoko Nitta, Kazuaki Nakamura, Noboru Babaguchi:
Generating Spoofing Tweets considering Points of Interest of Target User. 1672-1678 - Yuki Hirose, Kazuaki Nakamura, Naoko Nitta, Noboru Babaguchi:
Anonymization of Gait Silhouette Video by Perturbing Its Phase and Shape Components. 1679-1685 - Ngoc-Dung T. Tieu, Huy H. Nguyen, Fuming Fang, Junichi Yamagishi, Isao Echizen:
An RGB Gait Anonymization Model for Low-Quality Silhouettes. 1686-1693 - Hiroki Tanji, Takahiro Murakami, Hiroyuki Kamata:
A Generalization of Laplace Nonnegative Matrix Factorization and Its Multichannel Extension. 1694-1699 - Yuanlei Qi, Feiran Yang, Jun Yang:
A Late Reverberation Power Spectral Density Aware Approach to Speech Dereverberation Based on Deep Neural Networks. 1700-1703 - Khanh T. K. Nguyen, Hien M. Nguyen:
A Comparison Study of GRAPPA and Generalized Series Methods for parallel MRI at high acceleration factor. 1704-1709 - Tomonori Maeda, Kiyoshi Nishikawa:
Consideration on application of the concept of Saak transform to convolutional neural networks. 1710-1716 - Wanlu Shi, Yingsong Li, Felix Albu:
A Norm Penalized Noise-free Maximum Correntropy Criterion Algorithm. 1717-1720 - Kazunori Hayashi, Kaede Shiohara, Tetsuya Sasaki:
Differentiable Programming based Step Size Optimization for LMS and NLMS Algorithms. 1721-1727 - Qiushi Li, Zilong Shao, Shunquan Tan, Jishen Zeng, Bin Li:
Non-structured Pruning for Deep-learning based Steganalytic Frameworks. 1735-1739 - Wen-Na Zhang, Yun-Xia Liu, Ze-Yu Zou, Yun-Li Zang, Yang Yang, Bonnie Ngai-Fong Law:
Effective Source Camera Identification based on MSEPLL Denoising Applied to Small Image Patches. 1740-1744 - MaungMaung AprilPyone, Yuma Kinoshita, Hitoshi Kiya:
Filtering Adversarial Noise with Double Quantization. 1745-1749 - Kenta Iida, Hitoshi Kiya:
Image Identification of Grayscale-Based JPEG Images for Privacy-Preserving Photo Sharing Services. 1750-1755 - Warit Sirichotedumrong, Yuma Kinoshita, Hitoshi Kiya:
Privacy-Preserving Deep Neural Networks Using Pixel-Based Image Encryption Without Common Security Keys. 1756-1761 - Koi Yee Ng, Simying Ong, KokSheik Wong:
Delving into the Methods of Coverless Image Steganography. 1763-1772 - Haiwei Wu, Jiantao Zhou, Yuanman Li:
Image Reconstruction from Local Descriptors Using Conditional Adversarial Networks. 1773-1779 - Juqiang Chen, Xuliang He:
Computational perception of information foci produced by Chinese English learners and American English speakers. 1780-1785 - Jie Hou, Yu Chen, Yutong Xing, Jianwu Dang:
Acoustic Attributes of Citation Tones in Standard Chinese Produced by Prelingually Deaf Adults. 1786-1790 - Linxuan Wei, Wenwei Dong, Binghuai Lin, Jinsong Zhang:
Multi-Task Based Mispronunciation Detection of Children Speech Using Multi-Lingual Information. 1791-1794 - Bin Li, Yihan Guan, Si Chen:
Sounds of Personality: Inference from Voices by Non-Native Speakers. 1795-1799 - Xi Chen, Si Chen:
Acquisition and Interpretation of Mandarin Speech Prosody by Native Speakers and Cantonese Learners. 1800-1809 - Yiran Ding, Yanlu Xie, Jinsong Zhang:
Acquisition of L2 Mandarin Rhythm By Russian and Japanese Learners. 1810-1814 - Rong Han, Ming Wu, Kexun Chi, Lan Yin, Hongling Sun, Jun Yang:
A min-max optimization algorithm for global active acoustic radiation control. 1815-1818 - Kenta Iwai, Takanobu Nishiura:
Audio Integrated Active Noise Control System with Auto Gain Controller. 1819-1823 - Kyosuke Nakagawa, Chuang Shi, Yoshinobu Kajikawa:
Beam Steering of Portable Parametric Array Loudspeaker. 1824-1827 - Chuang Shi, Nan Jiang, Rong Xie, Huiyong Li:
A Simulation Investigation of Modified FxLMS Algorithms for Feedforward Active Noise Control. 1833-1837 - Meixia Fu, Songlin Sun, Kaili Ni, Xiaoying Hou:
Mobile Robot Object Recognition in The Internet of Things based on Fog Computing. 1838-1842 - Haohui Jia, Na Chen, Takeshi Higashino, Minoru Okada:
Joint Sparse Channel Estimation in Downlink NOMA System. 1843-1846 - Chengbo Liu, Na Chen, Yafei Hou, Minoru Okada:
Time-Domain Signal Recovery for OFDM System in the Industrial Environment. 1847-1851 - Mau-Luen Tham, Amjad Iqbal, Yoong Choon Chang:
Deep Reinforcement Learning for Resource Allocation in 5G Communications. 1852-1855 - Ying Loong Lee, Donghong Qin:
A Survey on Applications of Deep Reinforcement Learning in Resource Management for 5G Heterogeneous Networks. 1856-1862 - Yukoh Wakabayashi, Nobutaka Ono:
Griffin-Lim phase reconstruction using short-time Fourier transform with zero-padded frame analysis. 1863-1867 - Naoki Makishima, Norihiro Takamune, Hiroshi Saruwatari, Daichi Kitamura, Yu Takahashi, Kazunobu Kondo:
Robust Demixing Filter Update Algorithm Based on Microphone-wise Coordinate Descent for Independent Deeply Learned Matrix Analysis. 1868-1873 - Masakazu Une, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Shoji Makino:
Evaluation of Multichannel Hearing Aid System by Rank-Constrained Spatial Covariance Matrix Estimation. 1874-1879 - Ningning Pan, Jingdong Chen, Biing-Hwang Fred Juang:
Comparative Study of Deep Learning Based and Traditional Single-Channel Noise-Reduction Algorithms. 1880-1884 - Zhi-Wei Tan, Anh H. T. Nguyen, Andy W. H. Khong:
An Efficient Dilated Convolutional Neural Network for UAV Noise Reduction at Low Input SNR. 1885-1892 - Soky Kak, Sheng Li, Tatsuya Kawahara, Sopheap Seng:
Multi-lingual Transformer Training for Khmer Automatic Speech Recognition. 1893-1896 - Zhaodi Qi, Yong Ma, Mingliang Gu:
A Study on Low-resource Language Identification. 1897-1902 - Sardar Parhat, Gao Ting, Mijit Ablimit, Askar Hamdulla:
A morpheme sequence and convolutional neural network based Kazakh text classification. 1903-1906 - Jiawei Yu, Jinsong Zhang:
Zero-resource Language Recognition. 1907-1911 - Qingran Zhan, Petr Motlícek, Shixuan Du, Yahui Shan, Sifan Ma, Xiang Xie:
Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features. 1912-1916 - Zhiyuan Tang, Dong Wang, Liming Song:
AP19-OLR Challenge: Three Tasks and Their Baselines. 1917-1921 - Yi-Hsuan Hsu, Jiun-In Guo:
A Real-time and Online Multiple-Type Object Tracking Method with Deep Features. 1922-1928 - Phuong Le Thi, Tuan Pham, Jia-Ching Wang:
Convolutional Attention Model for Retinal Edema Segmentation. 1929-1932 - Kai-Wen Liang, Yu-Hao Tseng, Pao-Chi Chang:
Parallel Capsule Neural Networks for Sound Event Detection. 1933-1936 - Duc-Quang Vu, Thi-Thu-Trang Phung, Chien-Yao Wang, Jia-Ching Wang:
Age and Gender Recognition Using Multi-task CNN. 1937-1941 - Leong Chee Him, Yu Yang Poh, Lee Wah Pheng:
IoT-based Predictive Maintenance for Smart Manufacturing Systems. 1942-1944 - Seongmin Lee, Woojae Kim, Sewoong Ahn, Jaekyung Kim, Sanghoon Lee:
Physical parameter prediction by embedding human perceptual parameter for 3D garment modeling. 1945-1949 - Zifei Jiang, Zhen Li, Wei Li, Xueqing Li, Jingliang Peng:
Generic Video-Based Motion Capture Data Retrieval. 1950-1957 - Lulu Guo, Huihui Bai, Yao Zhao:
A Lightweight and Robust Face Recognition Network on Noisy Condition. 1964-1969 - Junheum Park, Chul Lee, Chang-Su Kim:
Deep Learning Approach to Video Frame Rate Up-Conversion Using Bilateral Motion Estimation. 1970-1975 - Chia-Hung Yeh, Min-Hui Lin, Wei-Chieh Lu:
3D Reconstruction using HDR-based SLAM. 1976-1980 - Henry Clifton, Alanna Vial, Andrew Miller, Christian Ritz, Matthew Field, Lois Holloway, Montserrat Ros, Martin Carolan, David Stirling:
Using Machine Learning Applied to Radiomic Image Features for Segmenting Tumour Structures. 1981-1988 - Ricky Sutopo, Ting Yau Teo, Joanne Mun-Yee Lim, KokSheik Wong:
Computational Intelligence-based Real-time Lane Departure Warning System Using Gabor Features. 1989-1992 - Ng Chung Hou, Lim Wern Han, Mei Kuan Lim:
Optimising Search Operations with Swarm Intelligence. 1993-1997 - JunYi Lim, Md Istiaque Al Jobayer, Vishnu Monn Baskaran, Joanne Mun-Yee Lim, KokSheik Wong, John See:
Gun Detection in Surveillance Videos using Deep Neural Networks. 1998-2002 - Mahamat Moussa, Chern Hong Lim:
Interpreting Abnormality of a Complex Static Scene using Generative Adversarial Network. 2003-2007 - Tetsuya Asakawa, Masaki Aono:
Median based Multi-label Prediction by Inflating Emotions with Dyads for Visual Sentiment Analysis. 2008-2014 - Yupeng Li, Yuxiao Wang, Yongfeng Jiang, Liang Zhang:
Action Recognition using Convolutional Neural Networks with Joint Supervision. 2015-2020 - Zhenqi Fu, Yan Yang, Feng Shao, Xinghao Ding:
A Study of Perceptual Quality Assessment for Stereoscopic Image Retargeting. 2021-2024 - Yifan Zhao, Jingchun Cheng, Wei Zhou, Chunxi Zhang, Xiong Pan:
Infrared Pedestrian Detection with Converted Temperature Map. 2025-2031 - Binbin Han, Ping Han, Zheng Cheng:
A Fast and Accurate Cluster Center Initialization Algorithm for PolSAR Superpixel Segmentation. 2032-2037 - Jian Gong, Yameng Yu, William Bellamy, Feng Wang, Xiaoli Ji, Zhenzhen Yang:
Comparing Native Chinese Listeners' Speech Reception Thresholds for Mandarin and English Consonants. 2038-2041 - Jiajing Zhang, Ying Chen, Jie Cui:
Prosodic Realization of Focus in English by Bidialectal Mandarin Speakers. 2042-2047 - Yating Cao, Hua Chen:
World Englishes and Prosody: Evidence from the Successful Public Speakers. 2048-2052 - Jiangbo Zhang, Aijun Li, Na Zhi:
An Experimental Study on English Majors Weak Form Productions of Prepositions. 2054-2063 - Yixin Zhang, Jinsong Zhang:
Oral Motor Exercises For CSL Learners to Master Productions of Retroflex And Non-Retroflex Consonants. 2064-2069 - Yi Liu, Bairong Zhuang, Zhiyu Li, Takahiro Shinozaki:
Cross-Domain Speaker Recognition using Cycle-Consistent Adversarial Networks. 2070-2074