{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,7,9]],"date-time":"2022-07-09T14:40:36Z","timestamp":1657377636390},"reference-count":30,"publisher":"Institute of Electronics, Information and Communications Engineers (IEICE)","issue":"10","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IEICE Trans. Inf. & Syst."],"published-print":{"date-parts":[[2016]]},"DOI":"10.1587\/transinf.2016slp0012","type":"journal-article","created":{"date-parts":[[2016,9,30]],"date-time":"2016-09-30T22:23:38Z","timestamp":1475274218000},"page":"2518-2527","source":"Crossref","is-referenced-by-count":0,"title":["Re-Ranking Approach of Spoken Term Detection Using Conditional Random Fields-Based Triphone Detection"],"prefix":"10.1587","volume":"E99.D","author":[{"given":"Naoki","family":"SAWADA","sequence":"first","affiliation":[{"name":"Integrated Graduate School of Medicine, Engineering, and Agricultural Sciences, University of Yamanashi"}]},{"given":"Hiromitsu","family":"NISHIZAKI","sequence":"additional","affiliation":[{"name":"Integrated Graduate School of Medicine, Engineering, and Agricultural Sciences, University of Yamanashi"}]}],"member":"532","reference":[{"key":"1","unstructured":"[1] \u201cThe spoken term detection (STD) 2006 evaluation plan,\u201d 2006. http:\/\/www.itl.nist.gov\/iad\/mig\/tests\/std\/2006\/docs\/std06-evalplan-v10.pdf."},{"key":"2","unstructured":"[2] D. Vergyri, I. Shafran, A. Stolcke, R.R. Gadde, M. Akbacak, B. Roark, and W. Wang, \u201cThe SRI\/OGI 2006 spoken term detection system,\u201d Proc. of INTERSPEECH 2007, pp.2393-2396, 2007."},{"key":"3","unstructured":"[3] S. Meng, J. Shao, R.P. Yu, J. Liu, and F. Seide, \u201cAddressing the out-of-vocabulary problem for large-scale chinese spoken term detection,\u201d Proc. of INTERSPEECH 2008, pp.2146-2149, 2008."},{"key":"4","doi-asserted-by":"crossref","unstructured":"[4] D. Can and M. Saraclar, \u201cLattice indexing for spoken term detection,\u201d IEEE Trans. on Audio, Speech and Language Processing, vol.19, no.8, pp.2338-2347, 2011.","DOI":"10.1109\/TASL.2011.2134087"},{"key":"5","doi-asserted-by":"crossref","unstructured":"[5] S. Natori, Y. Furuya, H. Nishizaki, and Y. Sekiguchi, \u201cSpoken Term Detection Using Phoneme Transition Network from Multiple Speech Recognizers' Outputs,\u201d J. Information Processing, vol.21, no.2, pp.176-185, 2013.","DOI":"10.2197\/ipsjjip.21.176"},{"key":"6","unstructured":"[6] T. Akiba, H. Nishizaki, K. Aikawa, T. Kawahara, and T. Matsui, \u201cOverview of the IR for Spoken Documents Task in NTCIR-9 workshop,\u201d Proc. of NTCIR-9, pp.223-235, 2011."},{"key":"7","doi-asserted-by":"crossref","unstructured":"[7] S. Natori, Y. Furuya, H. Nishizak, and Y. Sekiguchi, \u201cEntropy-based False Detection Filtering in Spoken Term Detection Tasks,\u201d Proc. of APSIPA ASC 2013, pp.1-7, 2013.","DOI":"10.1109\/APSIPA.2013.6694352"},{"key":"8","doi-asserted-by":"crossref","unstructured":"[8] Y. Itoh, H. Nishizaki, X. Hu, H. Nanjo, T. Akiba, T. Kawahara, S. Nakagawa, T. Matsui, Y. Yamashita, and K. Aikawa, \u201cConstructing japanese test collections for spoken term detection,\u201d Proc. of INTERSPEECH 2010, pp.677-680, 2010.","DOI":"10.21437\/Interspeech.2010-258"},{"key":"9","unstructured":"[9] T. Akiba, H. Nishizaki, K. Aikawa, X. Hu, Y. Itoh, T. Kawahara, S. Nakagawa, H. Nanjo, and Y. Yamanashita, \u201cOverview of the NTCIR-10 SpokenDoc-2 Task,\u201d Proc of NTCIR-10, pp.573-587, 2013."},{"key":"10","doi-asserted-by":"crossref","unstructured":"[10] U.V. Chaudhari and M. Picheny, \u201cImproved vocabulary independent search with approximate match based on conditional random fields,\u201d Proc. of ASRU 2009, pp.416-420, 2009.","DOI":"10.1109\/ASRU.2009.5373323"},{"key":"11","doi-asserted-by":"crossref","unstructured":"[11] U.V. Chaudhari and M. Picheny, \u201cMatching criteria for vocabulary-independent search,\u201d IEEE Trans. on Audio, Speech and Language Processing, vol.20, no.5, pp.1633-1643, 2012.","DOI":"10.1109\/TASL.2012.2186805"},{"key":"12","unstructured":"[12] A. Gunawardana, M. Mahajan, A. Acero, and J.C. Platt, \u201cHidden conditional random fields for phone classification,\u201d Proc. of INTERSPEECH 2008, pp.1117-1120, 2005."},{"key":"13","doi-asserted-by":"crossref","unstructured":"[13] R. Prabhavalkar, K. Livescu, E. Fosler-Lussier, and J. Keshet, \u201cDiscriminative articulatory models for spoken term detection in low-resource conversational settings,\u201d Proc. of ICASSP 2013, pp.8287-8291, 2013.","DOI":"10.1109\/ICASSP.2013.6639281"},{"key":"14","unstructured":"[14] D. Wang, S. King, J. Frankel, and P. Bell, \u201cTerm-dependent confidence for out-of-vocabulary term detection,\u201d Proc. of INTERSPEECH 2009, pp.2139-2142, 2009."},{"key":"15","doi-asserted-by":"crossref","unstructured":"[15] J. Tejedor, A. Echeverria, and D. Wang, \u201cAn evolutionary confidence measurement for spoken term detection,\u201d Proc. of International Workshop on Content-Based Multimedia Indexing (CBMI 2011), pp.151-156, 2011.","DOI":"10.1109\/CBMI.2011.5972537"},{"key":"16","unstructured":"[16] T. wei Tu, H. yi Lee, and L. shan Lee, \u201cImproved spoken term detection using support vector machines with acoustic and context features from pseudo-relevance feedback,\u201d Proc. of ASRU 2011, pp.383-388, 2011."},{"key":"17","doi-asserted-by":"crossref","unstructured":"[17] N. Sawada, S. Natori, and H. Nishizaki, \u201cRe-Ranking of Spoken Term Detections Using CRF-based Triphone Detection Models,\u201d Proc. of APSIPA ASC 2014, pp.1-4, 2014.","DOI":"10.1109\/APSIPA.2014.7041550"},{"key":"18","unstructured":"[18] T. Akiba, H. Nishizaki, H. Nanjo, and G.J.F. Jones, \u201cOverview of the NTCIR-11 SpokenQuery&Doc Task,\u201d Proc. of NTCIR-11, pp.350-364, 2014."},{"key":"19","unstructured":"[19] H. Nishizaki, N. Sawada, S. Natori, K. Domoto, and T. Utsuro, \u201cCombination of DTW-based and CRF-based Spoken Term Detection on the NTCIR-11 SpokenQuery&Doc SQ-STD Subtask,\u201d Proc. of NTCIR-11, pp.402-408, 2014."},{"key":"20","doi-asserted-by":"crossref","unstructured":"[20] J.G. Fiscus, \u201cA Post-processing System to Yield Reduced Word Error Rates: Recognizer Output Voting Error Reduction (ROVER),\u201d Proc. of ASRU'97, pp.347-354, 1997.","DOI":"10.1109\/ASRU.1997.659110"},{"key":"21","doi-asserted-by":"crossref","unstructured":"[21] M. Akbacak, L. Burget, W. Wang, and J. van Hout, \u201cRich system combination for keyword spotting in noisy and acoustically heterogeneous audio streams,\u201d Proc. of ICASSP 2013, pp.8267-8271, 2013.","DOI":"10.1109\/ICASSP.2013.6639277"},{"key":"22","doi-asserted-by":"crossref","unstructured":"[22] B. Mak and E. Barnard, \u201cPhone clustering using the Bhattacharyya distance,\u201d Proc. of ICSLP'96, pp.2005-2008, 1996.","DOI":"10.1109\/ICSLP.1996.607191"},{"key":"23","unstructured":"[23] J.D. Lafferty, A. McCallum, and F.C.N. Pereira, \u201cConditional random fields: Probabilistic models for segmenting and labeling sequence data,\u201d Proc. of ICML '01, pp.282-289, 2001."},{"key":"24","doi-asserted-by":"crossref","unstructured":"[24] K. Nongmeikapam, T. Shangkhunem, N.M. Chanu, L.N. Singh, B. Salam, and S. Bandyopadhyay, \u201cCrf based name entity recognition (ner) in manipuri: A highly agglutinative indian language,\u201d Proc. of the 2nd National Conference on Emerging Trends and Applications in Computer Science (NCETACS), pp.1-6, 2011.","DOI":"10.1109\/NCETACS.2011.5751390"},{"key":"25","doi-asserted-by":"crossref","unstructured":"[25] F. Sha and F. Pereira, \u201cShallow parsing with conditional random fields,\u201d Proc. of NAACL'03, pp.134-141, 2003.","DOI":"10.3115\/1073445.1073473"},{"key":"26","doi-asserted-by":"crossref","unstructured":"[26] Y. Liu, A. Stolcke, E. Shriberg, and M. Harper, \u201cUsing conditional random fields for sentence boundary detection in speech,\u201d Proc. of ACL'05, pp.451-458, 2005.","DOI":"10.3115\/1219840.1219896"},{"key":"27","unstructured":"[27] C. Parada, M. Dredze, D. Filimonov, and F. Jelinek, \u201cContextual information improves oov detection in speech,\u201d Proc. of HLT-NAACL 2010, pp.216-224, 2010."},{"key":"28","doi-asserted-by":"crossref","unstructured":"[28] W. Chen, Y. Zhang, and H. Isahara, \u201cAn empirical study of chinese chunking,\u201d Proc. of COLING\/ACL 2006, pp.97-104, 2006.","DOI":"10.3115\/1273073.1273086"},{"key":"29","unstructured":"[29] K. Maekawa, \u201cCorpus of Spontaneous Japanese: Its design and evaluation,\u201d Proc. of SSPR 2003, pp.7-12, ISCA, 2003."},{"key":"30","unstructured":"[30] A. Lee and T. Kawahara, \u201cRecent development of open-source speech recognition engine julius,\u201d Proc. of APSIPA ASC 2009, pp.131-137, 2009."}],"container-title":["IEICE Transactions on Information and Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.jstage.jst.go.jp\/article\/transinf\/E99.D\/10\/E99.D_2016SLP0012\/_pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,7,9]],"date-time":"2022-07-09T13:59:00Z","timestamp":1657375140000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.jstage.jst.go.jp\/article\/transinf\/E99.D\/10\/E99.D_2016SLP0012\/_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016]]},"references-count":30,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2016]]}},"URL":"https:\/\/doi.org\/10.1587\/transinf.2016slp0012","relation":{},"ISSN":["0916-8532","1745-1361"],"issn-type":[{"value":"0916-8532","type":"print"},{"value":"1745-1361","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016]]}}}