{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,5,9]],"date-time":"2024-05-09T08:27:34Z","timestamp":1715243254904},"reference-count":13,"publisher":"Institute of Electronics, Information and Communications Engineers (IEICE)","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IEICE Trans. Inf. & Syst."],"published-print":{"date-parts":[[2016]]},"DOI":"10.1587\/transinf.2015edl8168","type":"journal-article","created":{"date-parts":[[2016,1,31]],"date-time":"2016-01-31T22:13:09Z","timestamp":1454278389000},"page":"550-553","source":"Crossref","is-referenced-by-count":27,"title":["DNN-Based Voice Activity Detection with Multi-Task Learning"],"prefix":"10.1587","volume":"E99.D","author":[{"given":"Tae Gyoon","family":"KANG","sequence":"first","affiliation":[{"name":"Department of Electrical and Computer Engineering and the Institute of New Media and Communications, Seoul National University"}]},{"given":"Nam Soo","family":"KIM","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering and the Institute of New Media and Communications, Seoul National University"}]}],"member":"532","reference":[{"key":"1","doi-asserted-by":"crossref","unstructured":"[1] J.W. Shin, J.-H. Chang, and N.S. Kim, \u201cVoice activity detection based on statistical models and machine learning approaches," Comput. Speech Lang., vol.24, no.3, pp.515-530, July 2010.","DOI":"10.1016\/j.csl.2009.02.003"},{"key":"2","unstructured":"[2] D. Enqing, L. Guizhong, Z. Yatong, and Z. Xiaodi, \u201cApplying support vector machines to voice activity detection," Proc. Int. Conf. Signal Process., Beijing, China, vol.2, pp.1124-1127, Aug. 2002."},{"key":"3","doi-asserted-by":"crossref","unstructured":"[3] J. Wu and X.-L. Zhang, \u201cEfficient multiple kernel support vector machine based voice activity detection," IEEE Signal Process. Lett., vol.18, no.8, pp.466-469, Aug. 2011.","DOI":"10.1109\/LSP.2011.2159374"},{"key":"4","doi-asserted-by":"crossref","unstructured":"[4] D. Ying, Y. Yan, J. Dang, and F.K. Soong, \u201cVoice activity detection based on an unsupervised learning framework," IEEE Trans. Audio, Speech, Lang. Process., vol.19, no.8, pp.2624-2633, Nov. 2011.","DOI":"10.1109\/TASL.2011.2125953"},{"key":"5","doi-asserted-by":"crossref","unstructured":"[5] X.-L. Zhang and J. Wu, \u201cDeep belief networks based voice activity detection," IEEE Trans. Audio, Speech, Lang. Process., vol.21, no.4, pp.697-710, April 2013.","DOI":"10.1109\/TASL.2012.2229986"},{"key":"6","unstructured":"[6] R.A. Caruana, \u201cMultitask connectionist learning," Proc. 1993 Connectionist Models Summer School, pp.372-379, 1993."},{"key":"7","doi-asserted-by":"crossref","unstructured":"[7] Y. Bengio, \u201cPractical recommendations for gradient-based training of deep architectures," in Neural Networks: Tricks of the Trade, eds. G. Montavon, G.B. Orr, and K.-R. M\u00fcller, pp.437-478, Springer-Verlag, Berlin, 2012.","DOI":"10.1007\/978-3-642-35289-8_26"},{"key":"8","unstructured":"[8] P. Bell and S. Renals, \u201cRegularization of context-dependent deep neural networks with context-independent multi-task training," Proc. IEEE Int. Conf. Acoust., Speech, and Signal Process., Brisbane, Australia, pp.4290-4294, April 2015."},{"key":"9","unstructured":"[9] R. Giri, M.L. Seltzer, J. Droppo, and D. Yu, \u201cImproving speech recognition in reverberation using a room-aware deep neural network and multi-task learning," Proc. IEEE Int. Conf. Acoust., Speech, and Signal Process., Brisbane, Australia, pp.5014-5018, April 2015."},{"key":"10","unstructured":"[10] D. Chen and B.K.-W. Mak, \u201cMultitask learning of deep neural networks for low-resource speech recognition," IEEE\/ACM Trans. Audio, Speech, Lang. Process., vol.23, no.7, pp.1172-1183, July 2015."},{"key":"11","doi-asserted-by":"crossref","unstructured":"[11] G.E. Hinton, S. Osindero, and Y.-W. Teh, \u201cA fast learning algorithm for deep belief nets," Neural Comput., vol.18, no.7, pp.1527-1554, July 2006.","DOI":"10.1162\/neco.2006.18.7.1527"},{"key":"12","doi-asserted-by":"crossref","unstructured":"[12] D. Pearce and H.-G. Hirsch, \u201cThe Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," Proc. ISCA ITRW ASR, pp.29-32, Paris, France, Sept. 2000.","DOI":"10.21437\/ICSLP.2000-743"},{"key":"13","doi-asserted-by":"crossref","unstructured":"[13] J. Bergstra, O. Breuleux, F. Bastien, P. Lamblin, R. Pascanu, G. Desjardins, J. Turian, D. Warde-Farley, and Y. Bengio, \u201cTheano: A CPU and GPU math compiler in Python," Proc. Scientific Comput. with Python Conf. (SciPy), pp.3-9, Austin, Texas, July 2010.","DOI":"10.25080\/Majora-92bf1922-003"}],"container-title":["IEICE Transactions on Information and Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.jstage.jst.go.jp\/article\/transinf\/E99.D\/2\/E99.D_2015EDL8168\/_pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,16]],"date-time":"2023-08-16T17:54:20Z","timestamp":1692208460000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.jstage.jst.go.jp\/article\/transinf\/E99.D\/2\/E99.D_2015EDL8168\/_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016]]},"references-count":13,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2016]]}},"URL":"https:\/\/doi.org\/10.1587\/transinf.2015edl8168","relation":{},"ISSN":["0916-8532","1745-1361"],"issn-type":[{"value":"0916-8532","type":"print"},{"value":"1745-1361","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016]]}}}