default search action
Satoshi Nakamura 0001
Person information
- affiliation: Nara Institute of Science and Technology, Ikoma, Japan
- affiliation: ATR Spoken Language Communication Labs, Kyoto, Japan
- affiliation: National Institute of Information and Communications Technology (NICT), Spoken Language Communication Group, Keihanna Science City, Japan
- affiliation: Sharp Corporation, Nara, Japan
- affiliation (PhD 1992): Kyoto University, Japan
Other persons with the same name
- Satoshi Nakamura — disambiguation page
- Satoshi Nakamura 0002 — Meiji University, School of Interdisciplinary Mathematical Sciences, Nakano, Japan (and 3 more)
- Satoshi Nakamura 0004 — NTT Secure Platform Laboratories, Musashino, Japan (and 1 more)
- Satoshi Nakamura 0005 — Port and Airport Research Institute, Yokosuka, Japan (and 1 more)
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j132]Shohei Tanaka, Konosuke Yamasaki, Akishige Yuguchi, Seiya Kawano, Satoshi Nakamura, Koichiro Yoshino:
Do as I Demand, Not as I Say: A Dataset for Developing a Reflective Life-Support Robot. IEEE Access 12: 11774-11784 (2024) - [j131]Kei Furukawa, Takeshi Kishiyama, Satoshi Nakamura, Sakriani Sakti:
Applying Syntax-Prosody Mapping Hypothesis and Boundary-Driven Theory to Neural Sequence-to-Sequence Speech Synthesis. IEEE Access 12: 160896-160917 (2024) - [j130]Akinobu Maejima, Seitaro Shinagawa, Hiroyuki Kubo, Takuya Funatomi, Tatsuo Yotsukura, Satoshi Nakamura, Yasuhiro Mukaigawa:
Continual few-shot patch-based learning for anime-style colorization. Comput. Vis. Media 10(4): 705-723 (2024) - [j129]Yuka Ko, Katsuhito Sudoh, Sakriani Sakti, Satoshi Nakamura:
Neural End-To-End Speech Translation Leveraged by ASR Posterior Distribution. IEICE Trans. Inf. Syst. 107(10): 1322-1331 (2024) - [j128]Jieyeon Woo, Kazuhiro Shidara, Catherine Achard, Hiroki Tanaka, Satoshi Nakamura, Catherine Pelachaud:
Adaptive virtual agent: Design and evaluation for real-time human-agent interaction. Int. J. Hum. Comput. Stud. 190: 103321 (2024) - [j127]Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura:
Improving Speech Translation Accuracy and Time Efficiency With Fine-Tuned wav2vec 2.0-Based Speech Segmentation. IEEE ACM Trans. Audio Speech Lang. Process. 32: 906-916 (2024) - [c562]Kosuke Doi, Katsuhito Sudoh, Satoshi Nakamura:
Automated Essay Scoring Using Grammatical Variety and Errors with Multi-Task Learning and Item Response Theory. BEA 2024: 316-329 - [c561]Jinming Zhao, Katsuhito Sudoh, Satoshi Nakamura, Yuka Ko, Kosuke Doi, Ryo Fukuda:
NAIST-SIC-Aligned: An Aligned English-Japanese Simultaneous Interpretation Corpus. LREC/COLING 2024: 12046-12052 - [c560]Roman Koshkin, Katsuhito Sudoh, Satoshi Nakamura:
TransLLaMa: LLM-based Simultaneous Translation System. EMNLP (Findings) 2024: 461-476 - [c559]Roman Koshkin, Katsuhito Sudoh, Satoshi Nakamura:
LLMs Are Zero-Shot Context-Aware Simultaneous Translators. EMNLP 2024: 1192-1207 - [c558]Yoichi Ishibashi, Sho Yokoi, Katsuhito Sudoh, Satoshi Nakamura:
Subspace Representations for Soft Set Operations and Sentence Similarities. NAACL-HLT 2024: 3512-3524 - [i75]Kenta Izumi, Hiroki Tanaka, Kazuhiro Shidara, Hiroyoshi Adachi, Daisuke Kanayama, Takashi Kudo, Satoshi Nakamura:
Response Generation for Cognitive Behavioral Therapy with Large Language Models: Comparative Study with Socratic Questioning. CoRR abs/2401.15966 (2024) - [i74]Roman Koshkin, Katsuhito Sudoh, Satoshi Nakamura:
TransLLaMa: LLM-based Simultaneous Translation System. CoRR abs/2402.04636 (2024) - [i73]Kosuke Doi, Katsuhito Sudoh, Satoshi Nakamura:
Automated Essay Scoring Using Grammatical Variety and Errors with Multi-Task Learning and Item Response Theory. CoRR abs/2406.08817 (2024) - [i72]Kosuke Doi, Yuka Ko, Mana Makinae, Katsuhito Sudoh, Satoshi Nakamura:
Word Order in English-Japanese Simultaneous Interpretation: Analyses and Evaluation using Chunk-wise Monotonic Translation. CoRR abs/2406.08940 (2024) - [i71]Roman Koshkin, Katsuhito Sudoh, Satoshi Nakamura:
LLMs Are Zero-Shot Context-Aware Simultaneous Translators. CoRR abs/2406.13476 (2024) - [i70]Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Tomoya Yanagita, Kosuke Doi, Mana Makinae, Haotian Tan, Makoto Sakai, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
NAIST Simultaneous Speech Translation System for IWSLT 2024. CoRR abs/2407.00826 (2024) - [i69]Mana Makinae, Katsuhito Sudoh, Mararu Yamada, Satoshi Nakamura:
A Word Order Synchronization Metric for Evaluating Simultaneous Interpretation and Translation. CoRR abs/2407.06650 (2024) - [i68]Bin Wu, Sakriani Sakti, Shinnosuke Takamichi, Satoshi Nakamura:
A Neural Transformer Framework for Simultaneous Tasks of Segmentation, Classification, and Caller Identification of Marmoset Vocalization. CoRR abs/2410.23279 (2024) - 2023
- [j126]Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Japanese Neural Incremental Text-to-Speech Synthesis Framework With an Accent Phrase Input. IEEE Access 11: 22355-22363 (2023) - [j125]Keisuke Toyama, Katsuhito Sudoh, Satoshi Nakamura:
Content Order-Controllable MR-to-Text. IEEE Access 11: 129353-129365 (2023) - [j124]Shohei Tanaka, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura:
Reflective action selection based on positive-unlabeled learning and causality detection model. Comput. Speech Lang. 78: 101463 (2023) - [j123]Kota Iwauchi, Hiroki Tanaka, Kosuke Okazaki, Yasuhiro Matsuda, Mitsuhiro Uratani, Tsubasa Morimoto, Satoshi Nakamura:
Eye-movement analysis on facial expression for identifying children and adults with neurodevelopmental disorders. Frontiers Digit. Health 5 (2023) - [j122]Seiya Kawano, Koichiro Yoshino, David R. Traum, Satoshi Nakamura:
End-to-end dialogue structure parsing on multi-floor dialogue based on multi-task learning. Frontiers Robotics AI 10 (2023) - [c557]Kana Miyamoto, Hiroki Tanaka, Jennifer Hamet Bagnou, Elise Prigent, Céline Clavel, Jean-Claude Martin, Satoshi Nakamura:
Social Performance Rating During Social Skills Training in Adults with Autism Spectrum Disorder and Schizophrenia. ACIIW 2023: 1-8 - [c556]Yoichi Ishibashi, Danushka Bollegala, Katsuhito Sudoh, Satoshi Nakamura:
Evaluating the Robustness of Discrete Prompts. EACL 2023: 2365-2376 - [c555]Kota Iwauchi, Hiroki Tanaka, Satoshi Nakamura:
Predicting Autistic Traits Using Eye Movement during Visual Perspective Taking and Facial Emotion Identification. EMBC 2023: 1-4 - [c554]Hiroki Tanaka, Takeshi Saga, Kota Iwauchi, Satoshi Nakamura:
Acceptability and Trustworthiness of Virtual Agents by Effects of Theory of Mind and Social Skills Training. FG 2023: 1-7 - [c553]Kazuyo Onishi, Hiroki Tanaka, Satoshi Nakamura:
Multimodal Voice Activity Prediction: Turn-taking Events Detection in Expert-Novice Conversation. HAI 2023: 13-21 - [c552]Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Self-Adaptive Incremental Machine Speech Chain for Lombard TTS with High-Granularity ASR Feedback in Dynamic Noise Condition. ICASSP 2023: 1-5 - [c551]Takeshi Saga, Jieyeon Woo, Alexis Gerard, Hiroki Tanaka, Catherine Achard, Satoshi Nakamura, Catherine Pelachaud:
An Adaptive Virtual Agent Platform for Automated Social Skills Training. ICMI Companion 2023: 109-111 - [c550]Takeshi Saga, Hiroki Tanaka, Satoshi Nakamura:
Computational analyses of linguistic features with schizophrenic and autistic traits along with formal thought disorders. ICMI 2023: 119-124 - [c549]Hiroki Tanaka, Satoshi Nakamura, Jean-Claude Martin, Catherine Pelachaud:
4th Workshop on Social Affective Multimodal Interaction for Health (SAMIH). ICMI 2023: 816-817 - [c548]Yuta Nishikawa, Satoshi Nakamura:
Inter-connection: Effective Connection between Pre-trained Encoder and Decoder for Speech Translation. INTERSPEECH 2023: 2193-2197 - [c547]Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura:
Average Token Delay: A Latency Metric for Simultaneous Translation. INTERSPEECH 2023: 4469-4473 - [c546]Sweta Agrawal, Antonios Anastasopoulos, Luisa Bentivogli, Ondrej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, Mingda Chen, William Chen, Khalid Choukri, Alexandra Chronopoulou, Anna Currey, Thierry Declerck, Qianqian Dong, Kevin Duh, Yannick Estève, Marcello Federico, Souhir Gahbiche, Barry Haddow, Benjamin Hsu, Phu Mon Htut, Hirofumi Inaguma, Dávid Javorský, John Judge, Yasumasa Kano, Tom Ko, Rishu Kumar, Pengwei Li, Xutai Ma, Prashant Mathur, Evgeny Matusov, Paul McNamee, John P. McCrae, Kenton Murray, Maria Nadejde, Satoshi Nakamura, Matteo Negri, Ha Nguyen, Jan Niehues, Xing Niu, Atul Kr. Ojha, John E. Ortega, Proyag Pal, Juan Pino, Lonneke van der Plas, Peter Polák, Elijah Rippeth, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Yun Tang, Brian Thompson, Kevin Tran, Marco Turchi, Alex Waibel, Mingxuan Wang, Shinji Watanabe, Rodolfo Zevallos:
Findings of the IWSLT 2023 Evaluation Campaign. IWSLT@ACL 2023: 1-61 - [c545]Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Yuka Ko, Tomoya Yanagita, Kosuke Doi, Mana Makinae, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
NAIST Simultaneous Speech-to-speech Translation System for IWSLT 2023. IWSLT@ACL 2023: 330-340 - [c544]Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura:
Tagged End-to-End Simultaneous Speech Translation Training Using Simultaneous Interpretation Data. IWSLT@ACL 2023: 363-375 - [c543]Keisuke Toyama, Katsuhito Sudoh, Satoshi Nakamura:
E2E Refined Dataset. O-COCOSDA 2023: 1-5 - [c542]Kei Furukawa, Satoshi Nakamura:
Investigation of Validity of Paradigmatic Diagnosis for Downstep in Japanese. O-COCOSDA 2023: 1-6 - [c541]Kana Miyamoto, Hiroki Tanaka, Kazuhiro Shidara, Satoshi Nakamura:
Emotion Prediction Using Multi-source Biosignals During Cognitive Behavior Therapy with Conversational Virtual Agents. O-COCOSDA 2023: 1-6 - [c540]Taiki Watanabe, Seitaro Shinagawa, Takuya Funatomi, Akinobu Maejima, Yasuhiro Mukaigawa, Satoshi Nakamura, Hiroyuki Kubo:
Improved Automatic Colorization by Optimal Pre-colorization. SIGGRAPH Posters 2023: 31:1-31:2 - [i67]Heli Qi, Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
SpeeChain: A Speech Toolkit for Large-Scale Machine Speech Chain. CoRR abs/2301.02966 (2023) - [i66]Yoichi Ishibashi, Danushka Bollegala, Katsuhito Sudoh, Satoshi Nakamura:
Evaluating the Robustness of Discrete Prompts. CoRR abs/2302.05619 (2023) - [i65]Seyed Mahed Mousavi, Shohei Tanaka, Gabriel Roccabruna, Koichiro Yoshino, Satoshi Nakamura, Giuseppe Riccardi:
Whats New? Identifying the Unfolding of New Events in Narratives. CoRR abs/2302.07748 (2023) - [i64]Yuka Okuda, Katsuhito Sudoh, Seitaro Shinagawa, Satoshi Nakamura:
Modeling Multiple User Interests using Hierarchical Knowledge for Conversational Recommender System. CoRR abs/2303.00311 (2023) - [i63]Jinming Zhao, Yuka Ko, Kosuke Doi, Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura:
NAIST-SIC-Aligned: Automatically-Aligned English-Japanese Simultaneous Interpretation Corpus. CoRR abs/2304.11766 (2023) - [i62]Hiroki Ouchi, Hiroyuki Shindo, Shoko Wakamiya, Yuki Matsuda, Naoya Inoue, Shohei Higashiyama, Satoshi Nakamura, Taro Watanabe:
Arukikata Travelogue Dataset. CoRR abs/2305.11444 (2023) - [i61]Yuta Nishikawa, Satoshi Nakamura:
Inter-connection: Effective Connection between Pre-trained Encoder and Decoder for Speech Translation. CoRR abs/2305.16897 (2023) - [i60]Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura:
Tagged End-to-End Simultaneous Speech Translation Training using Simultaneous Interpretation Data. CoRR abs/2306.08582 (2023) - [i59]Takeshi Saga, Hiroki Tanaka, Satoshi Nakamura:
Computational analyses of linguistic features with schizophrenic and autistic traits along with formal thought disorders. CoRR abs/2310.09494 (2023) - [i58]Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura:
Average Token Delay: A Duration-aware Latency Metric for Simultaneous Translation. CoRR abs/2311.14353 (2023) - 2022
- [j121]Kazuhiro Shidara, Hiroki Tanaka, Hiroyoshi Adachi, Daisuke Kanayama, Yukako Sakagami, Takashi Kudo, Satoshi Nakamura:
Automatic Thoughts and Facial Expressions in Cognitive Restructuring With Virtual Agents. Frontiers Comput. Sci. 4: 762424 (2022) - [j120]Kana Miyamoto, Hiroki Tanaka, Satoshi Nakamura:
Applying Meta-Learning and Iso Principle for Development of EEG-Based Emotion Induction System. Frontiers Digit. Health 4: 873822 (2022) - [j119]Takeshi Saga, Hiroki Tanaka, Hidemi Iwasaka, Satoshi Nakamura:
Multimodal Prediction of Social Responsiveness Score with BERT-Based Text Features. IEICE Trans. Inf. Syst. 105-D(3): 578-586 (2022) - [j118]Kana Miyamoto, Hiroki Tanaka, Satoshi Nakamura:
Online EEG-Based Emotion Prediction and Music Generation for Inducing Affective States. IEICE Trans. Inf. Syst. 105-D(5): 1050-1063 (2022) - [j117]Fan Yang, Zheng Wang, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Tackling multiple object tracking with complicated motions - Re-designing the integration of motion and appearance. Image Vis. Comput. 124: 104514 (2022) - [j116]Bin Wu, Sakriani Sakti, Jinsong Zhang, Satoshi Nakamura:
Modeling Unsupervised Empirical Adaptation by DPGMM and DPGMM-RNN Hybrid Model to Extract Perceptual Features for Low-Resource ASR. IEEE ACM Trans. Audio Speech Lang. Process. 30: 901-916 (2022) - [j115]Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
A Machine Speech Chain Approach for Dynamically Adaptive Lombard TTS in Static and Dynamic Noise Environments. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2673-2688 (2022) - [c539]Yuya Nakano, Seiya Kawano, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura:
Pseudo Ambiguous and Clarifying Questions Based on Sentence Structures Toward Clarifying Question Answering System. DialDoc@ACL 2022: 31-40 - [c538]Takeshi Saga, Hiroki Tanaka, Yasuhiro Matuda, Tsubasa Morimoto, Mitsuhiro Uratani, Kosuke Okazaki, Yuichiro Fujimoto, Satoshi Nakamura:
Analysis of Feedback Contents and Estimation of Subjective Scores in Social Skills Training. EMBC 2022: 1086-1089 - [c537]Kazuhiro Shidara, Hiroki Tanaka, Rumiko Asada, Kayo Higashiyama, Hiroyoshi Adachi, Daisuke Kanayama, Yukako Sakagami, Takashi Kudo, Satoshi Nakamura:
Linguistic Features of Clients and Counselors for Early Detection of Mental Health Issues in Online Text-based Counseling. EMBC 2022: 2668-2671 - [c536]Hiroki Tanaka, Satoshi Nakamura, Kazuhiro Shidara, Jean-Claude Martin, Catherine Pelachaud:
3rd Workshop on Social Affective Multimodal Interaction for Health (SAMIH). ICMI 2022: 805-806 - [c535]Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura:
Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation. INTERSPEECH 2022: 121-125 - [c534]Seiya Kawano, Muteki Arioka, Akishige Yuguchi, Kenta Yamamoto, Koji Inoue, Tatsuya Kawahara, Satoshi Nakamura, Koichiro Yoshino:
Multimodal Persuasive Dialogue Corpus using Teleoperated Android. INTERSPEECH 2022: 2308-2312 - [c533]Heli Qi, Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing. INTERSPEECH 2022: 3413-3417 - [c532]Kei Furukawa, Takeshi Kishiyama, Satoshi Nakamura:
Applying Syntax-Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis. INTERSPEECH 2022: 5258-5262 - [c531]Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura:
Simultaneous Neural Machine Translation with Prefix Alignment. IWSLT@ACL 2022: 22-31 - [c530]Antonios Anastasopoulos, Loïc Barrault, Luisa Bentivogli, Marcely Zanon Boito, Ondrej Bojar, Roldano Cattoni, Anna Currey, Georgiana Dinu, Kevin Duh, Maha Elbayad, Clara Emmanuel, Yannick Estève, Marcello Federico, Christian Federmann, Souhir Gahbiche, Hongyu Gong, Roman Grundkiewicz, Barry Haddow, Benjamin Hsu, Dávid Javorský, Vera Kloudová, Surafel Melaku Lakew, Xutai Ma, Prashant Mathur, Paul McNamee, Kenton Murray, Maria Nadejde, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, John Ortega, Juan Miguel Pino, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Yogesh Virkar, Alexander Waibel, Changhan Wang, Shinji Watanabe:
Findings of the IWSLT 2022 Evaluation Campaign. IWSLT@ACL 2022: 98-157 - [c529]Ryo Fukuda, Yuka Ko, Yasumasa Kano, Kosuke Doi, Hirotaka Tokuyama, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
NAIST Simultaneous Speech-to-Text Translation System for IWSLT 2022. IWSLT@ACL 2022: 286-292 - [c528]Yidong Wang, Hao Chen, Yue Fan, Wang Sun, Ran Tao, Wenxin Hou, Renjie Wang, Linyi Yang, Zhi Zhou, Lan-Zhe Guo, Heli Qi, Zhen Wu, Yufeng Li, Satoshi Nakamura, Wei Ye, Marios Savvides, Bhiksha Raj, Takahiro Shinozaki, Bernt Schiele, Jindong Wang, Xing Xie, Yue Zhang:
USB: A Unified Semi-supervised Learning Benchmark for Classification. NeurIPS 2022 - [i57]Kei Furukawa, Takeshi Kishiyama, Satoshi Nakamura:
Applying Syntax-Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis. CoRR abs/2203.15276 (2022) - [i56]Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura:
Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation. CoRR abs/2203.15479 (2022) - [i55]Heli Qi, Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing. CoRR abs/2205.06963 (2022) - [i54]Holy Lovenia, Hiroki Tanaka, Sakriani Sakti, Ayu Purwarianti, Satoshi Nakamura:
Speech Artifact Removal from EEG Recordings of Spoken Word Production with Tensor Decomposition. CoRR abs/2206.00635 (2022) - [i53]Yidong Wang, Hao Chen, Yue Fan, Wang Sun, Ran Tao, Wenxin Hou, Renjie Wang, Linyi Yang, Zhi Zhou, Lan-Zhe Guo, Heli Qi, Zhen Wu, Yufeng Li, Satoshi Nakamura, Wei Ye, Marios Savvides, Bhiksha Raj, Takahiro Shinozaki, Bernt Schiele, Jindong Wang, Xing Xie, Yue Zhang:
USB: A Unified Semi-supervised Learning Benchmark. CoRR abs/2208.07204 (2022) - [i52]Fan Yang, Norimichi Ukita, Sakriani Sakti, Satoshi Nakamura:
Actor-identified Spatiotemporal Action Detection - Detecting Who Is Doing What in Videos. CoRR abs/2208.12940 (2022) - [i51]Yoichi Ishibashi, Sho Yokoi, Katsuhito Sudoh, Satoshi Nakamura:
Subspace-based Set Operations on a Pre-trained Word Embedding Space. CoRR abs/2210.13034 (2022) - [i50]Keisuke Toyama, Katsuhito Sudoh, Satoshi Nakamura:
E2E Refined Dataset. CoRR abs/2211.00513 (2022) - [i49]Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura:
Average Token Delay: A Latency Metric for Simultaneous Translation. CoRR abs/2211.13173 (2022) - [i48]Fan Yang, Yang Wu, Zheng Wang, Xiang Li, Sakriani Sakti, Satoshi Nakamura:
Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval. CoRR abs/2211.14515 (2022) - 2021
- [j114]Johanes Effendi, Sakriani Sakti, Satoshi Nakamura:
End-to-End Image-to-Speech Generation for Untranscribed Unknown Languages. IEEE Access 9: 55144-55154 (2021) - [j113]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Multimodal Chain: Cross-Modal Collaboration Through Listening, Speaking, and Visualizing. IEEE Access 9: 70286-70299 (2021) - [j112]Hour Kaing, Chenchen Ding, Masao Utiyama, Eiichiro Sumita, Katsuhito Sudoh, Satoshi Nakamura:
Constituency Parsing by Cross-Lingual Delexicalization. IEEE Access 9: 141571-141578 (2021) - [j111]Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Code-Switching ASR and TTS Using Semisupervised Learning with Machine Speech Chain. IEICE Trans. Inf. Syst. 104-D(10): 1661-1677 (2021) - [j110]Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Neural Incremental Speech Recognition Toward Real-Time Machine Speech Translation. IEICE Trans. Inf. Syst. 104-D(12): 2195-2208 (2021) - [j109]Fan Yang, Xin Chang, Sakriani Sakti, Yang Wu, Satoshi Nakamura:
ReMOT: A model-agnostic refinement for multiple object tracking. Image Vis. Comput. 106: 104091 (2021) - [j108]Hour Kaing, Chenchen Ding, Masao Utiyama, Eiichiro Sumita, Sethserey Sam, Sopheap Seng, Katsuhito Sudoh, Satoshi Nakamura:
Towards Tokenization and Part-of-Speech Tagging for Khmer: Data and Discussion. ACM Trans. Asian Low Resour. Lang. Inf. Process. 20(6): 104:1-104:16 (2021) - [j107]Bin Wu, Sakriani Sakti, Jinsong Zhang, Satoshi Nakamura:
Tackling Perception Bias in Unsupervised Phoneme Discovery Using DPGMM-RNN Hybrid Model and Functional Load. IEEE ACM Trans. Audio Speech Lang. Process. 29: 348-362 (2021) - [j106]Fan Yang, Yang Wu, Zheng Wang, Xiang Li, Sakriani Sakti, Satoshi Nakamura:
Instance-Level Heterogeneous Domain Adaptation for Limited-Labeled Sketch-to-Photo Retrieval. IEEE Trans. Multim. 23: 2347-2360 (2021) - [c527]Kazuhiro Shidara, Hiroki Tanaka, Hiroyoshi Adachi, Daisuke Kanayama, Yukako Sakagami, Takashi Kudo, Satoshi Nakamura:
Relationship between Mood Improvement and Questioning to Evaluate Automatic Thoughts in Cognitive Restructuring with a Virtual Agent. ACII (Workshops and Demos) 2021: 1-5 - [c526]Kana Miyamoto, Hiroki Tanaka, Satoshi Nakamura:
Emotion Estimation from EEG Signals and Expected Subjective Evaluation. BCI 2021: 1-6 - [c525]Hiroki Tanaka, Takeshi Saga, Satoshi Nakamura:
Clustering of Human Movement Trajectories based on Distributional Representations Derived from Bi-directional LSTM Network with Geographical Coordinates. IEEE BigData 2021: 2936-2940 - [c524]Hiroki Tanaka, Satoshi Nakamura:
Virtual Agent Design for Social Skills Training Considering Autistic Traits. EMBC 2021: 4953-4956 - [c523]Kana Miyamoto, Hiroki Tanaka, Satoshi Nakamura:
Meta-Learning for Emotion Prediction from EEG while Listening to Music. ICMI Companion 2021: 324-328 - [c522]Takeshi Saga, Hiroki Tanaka, Hidemi Iwasaka, Yasuhiro Matsuda, Tsubasa Morimoto, Mitsuhiro Uratani, Kosuke Okazaki, Yuichiro Fujimoto, Satoshi Nakamura:
Multimodal Dataset of Social Skills Training in Natural Conversational Setting. ICMI Companion 2021: 395-399 - [c521]Hiroki Tanaka, Satoshi Nakamura, Jean-Claude Martin, Catherine Pelachaud:
2nd Workshop on Social Affective Multimodal Interaction for Health (SAMIH). ICMI 2021: 853-854 - [c520]Kohichi Takai, Gen Hattori, Akio Yoneyama, Keiji Yasuda, Katsuhito Sudoh, Satoshi Nakamura:
Named Entity-Factored Transformer for Proper Noun Translation. ICON 2021: 7-11 - [c519]Hour Kaing, Chenchen Ding, Katsuhito Sudoh, Masao Utiyama, Eiichiro Sumita, Satoshi Nakamura:
Multi-Source Cross-Lingual Constituency Parsing. ICON 2021: 341-346 - [c518]Shun Takahashi, Sakriani Sakti, Satoshi Nakamura:
Unsupervised Neural-Based Graph Clustering for Variable-Length Speech Representation Discovery of Zero-Resource Languages. Interspeech 2021: 1559-1563 - [c517]Johanes Effendi, Sakriani Sakti, Satoshi Nakamura:
Weakly-Supervised Speech-to-Text Mapping with Visually Connected Non-Parallel Speech-Text Data Using Cyclic Partially-Aligned Transformer. Interspeech 2021: 2257-2261 - [c516]Hirotaka Tokuyama, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Transcribing Paralinguistic Acoustic Cues to Target Language Text in Transformer-Based Speech-to-Text Translation. Interspeech 2021: 2262-2266 - [c515]Yuka Ko, Katsuhito Sudoh, Sakriani Sakti, Satoshi Nakamura:
ASR Posterior-Based Loss for Multi-Task End-to-End Speech Translation. Interspeech 2021: 2272-2276 - [c514]Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Dynamically Adaptive Machine Speech Chain Inference for TTS in Noisy Environment: Listen and Speak Louder. Interspeech 2021: 4124-4128 - [c513]Sara Asai, Koichiro Yoshino, Seitaro Shinagawa, Sakriani Sakti, Satoshi Nakamura:
Eliciting Cooperative Persuasive Dialogue by Multimodal Emotional Robot. IWSDS 2021: 143-158 - [c512]Antonios Anastasopoulos, Ondrej Bojar, Jacob Bremerman, Roldano Cattoni, Maha Elbayad, Marcello Federico, Xutai Ma, Satoshi Nakamura, Matteo Negri, Jan Niehues, Juan Miguel Pino, Elizabeth Salesky, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Alex Waibel, Changhan Wang, Matthew Wiesner:
Findings of the IWSLT 2021 Evaluation Campaign. IWSLT 2021: 1-29 - [c511]Ryo Fukuda, Yui Oka, Yasumasa Kano, Yuki Yano, Yuka Ko, Hirotaka Tokuyama, Kosuke Doi, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
NAIST English-to-Japanese Simultaneous Translation System for IWSLT 2021 Simultaneous Text-to-text Task. IWSLT 2021: 39-45 - [c510]Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura:
On Knowledge Distillation for Translating Erroneous Speech Transcriptions. IWSLT 2021: 198-205 - [c509]Kosuke Doi, Katsuhito Sudoh, Satoshi Nakamura:
Large-Scale English-Japanese Simultaneous Interpretation Corpus: Construction and Analyses with Sentence-Aligned Data. IWSLT 2021: 226-235 - [c508]Nobuya Tachimori, Sakriani Sakti, Satoshi Nakamura:
Multi-Encoder Sequential Attention Network for Context-Aware Speech Recognition in Japanese Dialog Conversation. O-COCOSDA 2021: 1-6 - [c507]Ryo Fukuda, Sashi Novitasari, Yui Oka, Yasumasa Kano, Yuki Yano, Yuka Ko, Hirotaka Tokuyama, Kosuke Doi, Tomoya Yanagita, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Simultaneous Speech-to-Speech Translation System with Transformer-Based Incremental ASR, MT, and TTS. O-COCOSDA 2021: 186-192 - [c506]Nobuyoshi Kaiki, Sakriani Sakti, Satoshi Nakamura:
Using Local Phrase Dependency Structure Information in Neural Sequence-to-Sequence Speech Synthesis. O-COCOSDA 2021: 206-211 - [c505]Shohei Tanaka, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura:
ARTA: Collection and Classification of Ambiguous Requests and Thoughtful Actions. SIGDIAL 2021: 77-88 - [c504]Akinobu Maejima, Hiroyuki Kubo, Seitaro Shinagawa, Takuya Funatomi, Tatsuo Yotsukura, Satoshi Nakamura, Yasuhiro Mukaigawa:
Anime Character Colorization using Few-shot Learning. SIGGRAPH Asia Technical Communications 2021: 8:1-8:4 - [c503]Bin Wu, Sakriani Sakti, Satoshi Nakamura:
Incorporating Discriminative DPGMM Posteriorgrams for Low-Resource ASR. SLT 2021: 201-208 - [c502]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
Transformer-Based Direct Speech-To-Speech Translation with Transcoder. SLT 2021: 958-965 - [c501]Kosuke Takahashi, Yoichi Ishibashi, Katsuhito Sudoh, Satoshi Nakamura:
Multilingual Machine Translation Evaluation Metrics Fine-tuned on Pseudo-Negative Examples for WMT 2021 Metrics Task. WMT@EMNLP 2021: 1049-1052 - [c500]Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura:
Simultaneous Neural Machine Translation with Constituent Label Prediction. WMT@EMNLP 2021: 1124-1134 - [e6]Luis Fernando D'Haro, Zoraida Callejas, Satoshi Nakamura:
Conversational Dialogue Systems for the Next Decade - 11th International Workshop on Spoken Dialogue Systems, IWSDS 2020, Madrid, Spain, 21-23 September, 2020. Lecture Notes in Electrical Engineering 704, Springer 2021, ISBN 978-981-15-8394-0 [contents] - [i47]Shohei Tanaka, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura:
ARTA: Collection and Classification of Ambiguous Requests and Thoughtful Actions. CoRR abs/2106.07999 (2021) - [i46]Yui Oka, Katsuhito Sudoh, Satoshi Nakamura:
Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation. CoRR abs/2107.13689 (2021) - [i45]Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura:
Simultaneous Neural Machine Translation with Constituent Label Prediction. CoRR abs/2110.13480 (2021) - 2020
- [j105]Seitaro Shinagawa, Koichiro Yoshino, Seyed Hossein Alavi, Kallirroi Georgila, David R. Traum, Sakriani Sakti, Satoshi Nakamura:
An Interactive Image Editing System Using an Uncertainty-Based Confirmation Strategy. IEEE Access 8: 98471-98480 (2020) - [j104]The Tung Nguyen, Koichiro Yoshino, Sakriani Sakti, Satoshi Nakamura:
Policy Reuse for Dialog Management Using Action-Relation Probability. IEEE Access 8: 159639-159649 (2020) - [j103]Seiya Kawano, Masahiro Mizukami, Koichiro Yoshino, Satoshi Nakamura:
Entrainable Neural Conversation Model Based on Reinforcement Learning. IEEE Access 8: 178283-178294 (2020) - [j102]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Recurrent Neural Network Compression Based on Low-Rank Tensor Representation. IEICE Trans. Inf. Syst. 103-D(2): 435-449 (2020) - [j101]Johanes Effendi, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Leveraging Neural Caption Translation with Visually Grounded Paraphrase Augmentation. IEICE Trans. Inf. Syst. 103-D(3): 674-683 (2020) - [j100]Hiroki Tanaka, Hidemi Iwasaka, Hideki Negoro, Satoshi Nakamura:
Analysis of conversational listening skills toward agent-based social skills training. J. Multimodal User Interfaces 14(1): 73-82 (2020) - [j99]Jingyi Zhang, Masao Utiyama, Eiichiro Sumita, Graham Neubig, Satoshi Nakamura:
Improving neural machine translation through phrase-based soft forced decoding. Mach. Transl. 34(1): 21-39 (2020) - [j98]Yuta Nishimura, Katsuhito Sudoh, Graham Neubig, Satoshi Nakamura:
Multi-Source Neural Machine Translation With Missing Data. IEEE ACM Trans. Audio Speech Lang. Process. 28: 569-580 (2020) - [j97]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Machine Speech Chain. IEEE ACM Trans. Audio Speech Lang. Process. 28: 976-989 (2020) - [j96]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
End-to-End Speech Translation With Transcoding by Multi-Task Learning for Distant Language Pairs. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1342-1355 (2020) - [j95]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Corrections to "Machine Speech Chain". IEEE ACM Trans. Audio Speech Lang. Process. 28: 1706 (2020) - [c499]Yoichi Ishibashi, Katsuhito Sudoh, Koichiro Yoshino, Satoshi Nakamura:
Reflection-based Word Attribute Transfer. ACL (student) 2020: 51-58 - [c498]Kosuke Takahashi, Katsuhito Sudoh, Satoshi Nakamura:
Automatic Machine Translation Evaluation using Source Language Inputs and Cross-lingual Language Model. ACL 2020: 3553-3558 - [c497]Koichiro Yoshino, Kana Ikeuchi, Katsuhito Sudoh, Satoshi Nakamura:
Improving Spoken Language Understanding by Wisdom of Crowds. COLING 2020: 2606-2612 - [c496]Yui Oka, Katsuki Chousa, Katsuhito Sudoh, Satoshi Nakamura:
Incorporating Noisy Length Constraints into Transformer with Length-aware Positional Encodings. COLING 2020: 3580-3585 - [c495]Haruko Yagura, Hiroki Tanaka, Taiki Kinoshita, Hiroki Watanabe, Shunnosuke Motomura, Katsuhito Sudoh, Satoshi Nakamura:
Analysis of selective attention processing on experienced simultaneous interpreters using EEG phase synchronization. EMBC 2020: 66-69 - [c494]Shunnosuke Motomura, Hiroki Tanaka, Satoshi Nakamura:
Sequential Attention-based Detection of Semantic Incongruities from EEG While Listening to Speech. EMBC 2020: 268-271 - [c493]Fan Yang, Feiran Li, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Panoramic Coordinate. ICASSP 2020: 1863-1867 - [c492]Andros Tjandra, Chunxi Liu, Frank Zhang, Xiaohui Zhang, Yongqiang Wang, Gabriel Synnaeve, Satoshi Nakamura, Geoffrey Zweig:
DEJA-VU: Double Feature Presentation and Iterated Loss in Deep Transformer Networks. ICASSP 2020: 6899-6903 - [c491]Takeshi Saga, Hiroki Tanaka, Hidemi Iwasaka, Satoshi Nakamura:
Objective Prediction of Social Skills Level for Automated Social Skills Training Using Audio and Text Information. ICMI Companion 2020: 467-471 - [c490]Kazuhiro Shidara, Hiroki Tanaka, Hiroyoshi Adachi, Daisuke Kanayama, Yukako Sakagami, Takashi Kudo, Satoshi Nakamura:
Analysis of Mood Changes and Facial Expressions during Cognitive Behavior Therapy through a Virtual Agent. ICMI Companion 2020: 477-481 - [c489]Kana Miyamoto, Hiroki Tanaka, Satoshi Nakamura:
Music Generation and Emotion Estimation from EEG Signals for Inducing Affective States. ICMI Companion 2020: 487-491 - [c488]Hiroki Tanaka, Satoshi Nakamura, Jean-Claude Martin, Catherine Pelachaud:
Social Affective Multimodal Interaction for Health. ICMI 2020: 893-894 - [c487]Kazuki Tsunematsu, Johanes Effendi, Sakriani Sakti, Satoshi Nakamura:
Neural Speech Completion. INTERSPEECH 2020: 2742-2746 - [c486]Ivan Halim Parmonangan, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Combining Audio and Brain Activity for Predicting Speech Quality. INTERSPEECH 2020: 2762-2766 - [c485]Sashi Novitasari, Andros Tjandra, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Incremental Machine Speech Chain Towards Enabling Listening While Speaking in Real-Time. INTERSPEECH 2020: 4372-4376 - [c484]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge. INTERSPEECH 2020: 4851-4855 - [c483]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Augmenting Images for ASR and TTS Through Single-Loop and Dual-Loop Multimodal Chain Framework. INTERSPEECH 2020: 4901-4905 - [c482]Koichiro Yoshino, Kohei Wakimoto, Yuta Nishimura, Satoshi Nakamura:
Caption Generation of Robot Behaviors Based on Unsupervised Learning of Action Segments. IWSDS 2020: 227-241 - [c481]Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura:
NAIST's Machine Translation Systems for IWSLT 2020 Conversational Speech Translation Task. IWSLT 2020: 172-177 - [c480]Sara Asai, Koichiro Yoshino, Seitaro Shinagawa, Sakriani Sakti, Satoshi Nakamura:
Emotional Speech Corpus for Persuasive Dialogue System. LREC 2020: 491-497 - [c479]Mayuko Okamato, Sakriani Sakti, Satoshi Nakamura:
Towards Speech Entrainment: Considering ASR Information in Speaking Rate Variation of TTS Waveform Generation. O-COCOSDA 2020: 139-144 - [c478]Daichi Ishii, Hiroyuki Kubo, Seitaro Shinagawa, Akinobu Maejima, Takuya Funatomi, Satoshi Nakamura, Yasuhiro Mukaigawa:
Confidence-aware Practical Anime-style Colorization. SIGGRAPH Talks 2020: 40:1-40:2 - [c477]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis. SLTU-CCURL@LREC 2020: 131-138 - [e5]Marcello Federico, Alex Waibel, Kevin Knight, Satoshi Nakamura, Hermann Ney, Jan Niehues, Sebastian Stüker, Dekai Wu, Joseph Mariani, François Yvon:
Proceedings of the 17th International Conference on Spoken Language Translation, IWSLT 2020, Online, July 9 - 10, 2020. Association for Computational Linguistics 2020, ISBN 978-1-952148-07-1 [contents] - [i44]Koichiro Yoshino, Kohei Wakimoto, Yuta Nishimura, Satoshi Nakamura:
Caption Generation of Robot Behaviors based on Unsupervised Learning of Action Segments. CoRR abs/2003.10066 (2020) - [i43]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge. CoRR abs/2005.11676 (2020) - [i42]Yoichi Ishibashi, Katsuhito Sudoh, Koichiro Yoshino, Satoshi Nakamura:
Reflection-based Word Attribute Transfer. CoRR abs/2007.02598 (2020) - [i41]Fan Yang, Xin Chang, Chenyu Dang, Ziqiang Zheng, Sakriani Sakti, Satoshi Nakamura, Yang Wu:
ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation. CoRR abs/2007.03200 (2020) - [i40]Dusan Varis, Katsuhito Sudoh, Satoshi Nakamura:
Image Captioning with Visual Object Representations Grounded in the Textual Modality. CoRR abs/2010.09413 (2020) - [i39]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework. CoRR abs/2011.02099 (2020) - [i38]Sashi Novitasari, Andros Tjandra, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time. CoRR abs/2011.02126 (2020) - [i37]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition. CoRR abs/2011.02127 (2020) - [i36]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis. CoRR abs/2011.02128 (2020) - [i35]Katsuhito Sudoh, Takatomo Kano, Sashi Novitasari, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS. CoRR abs/2011.04845 (2020)
2010 – 2019
- 2019
- [j94]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
End-to-End Speech Recognition Sequence Training With Reinforcement Learning. IEEE Access 7: 79758-79769 (2019) - [j93]Fan Yang, Sakriani Sakti, Yang Wu, Satoshi Nakamura:
A Framework for Knowing Who is Doing What in Aerial Surveillance Videos. IEEE Access 7: 93315-93325 (2019) - [j92]Ryohei Eguchi, Naoaki Ono, Aki Hirai, Tetsuo Katsuragi, Satoshi Nakamura, Ming Huang, Md. Altaf-Ul-Amin, Shigehiko Kanaya:
Classification of alkaloids according to the starting substances of their biosynthetic pathways using graph convolutional neural networks. BMC Bioinform. 20(1): 380:1-380:13 (2019) - [j91]Yukitoshi Murase, Koichiro Yoshino, Satoshi Nakamura:
Associative knowledge feature vector inferred on external knowledge base for dialog state tracking. Comput. Speech Lang. 54: 1-16 (2019) - [j90]Hiroki Tanaka, Hiroki Watanabe, Hayato Maki, Sakriani Sakti, Satoshi Nakamura:
Electroencephalogram-Based Single-Trial Detection of Language Expectation Violations in Listening to Speech. Frontiers Comput. Neurosci. 13: 15 (2019) - [j89]Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Neural Oscillation-Based Classification of Japanese Spoken Sentences During Speech Perception. IEICE Trans. Inf. Syst. 102-D(2): 383-391 (2019) - [j88]Yu Suzuki, Yoshitaka Matsuda, Satoshi Nakamura:
Additional Operations of Simple HITs on Microtask Crowdsourcing for Worker Quality Prediction. J. Inf. Process. 27: 51-60 (2019) - [j87]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Positive Emotion Elicitation in Chat-Based Dialogue Systems. IEEE ACM Trans. Audio Speech Lang. Process. 27(4): 866-877 (2019) - [c476]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Listening While Speaking and Visualizing: Improving ASR Through Multimodal Chain. ASRU 2019: 471-478 - [c475]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
Neural Machine Translation with Acoustic Embedding. ASRU 2019: 578-584 - [c474]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Speech-to-Speech Translation Between Untranscribed Unknown Languages. ASRU 2019: 593-600 - [c473]Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Zero-Shot Code-Switching ASR and TTS with Multilingual Machine Speech Chain. ASRU 2019: 964-971 - [c472]Holy Lovenia, Hiroki Tanaka, Sakriani Sakti, Ayu Purwarianti, Satoshi Nakamura:
Speech Artifact Removal from Eeg Recordings of Spoken Word Production with Tensor Decomposition. ICASSP 2019: 1115-1119 - [c471]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
End-to-end Feedback Loss in Speech Chain Framework via Straight-through Estimator. ICASSP 2019: 6281-6285 - [c470]Marco Vetter, Sakriani Sakti, Satoshi Nakamura:
Cross-lingual Speech-based Tobi Label Generation Using Bidirectional Lstm. ICASSP 2019: 6620-6624 - [c469]Taiki Kinoshita, Hiroki Tanaka, Koichiro Yoshino, Satoshi Nakamura:
Measuring Affective Sharing between Two People by EEG Hyperscanning. ICMI (Adjunct) 2019: 3:1-3:6 - [c468]Shunnosuke Motomura, Hiroki Tanaka, Satoshi Nakamura:
Detecting Syntactic Violations from Single-trial EEG using Recurrent Neural Networks. ICMI (Adjunct) 2019: 4:1-4:5 - [c467]Hiroki Tanaka, Hiroyoshi Adachi, Hiroaki Kazui, Manabu Ikeda, Takashi Kudo, Satoshi Nakamura:
Detecting Dementia from Face in Human-Agent Interaction. ICMI (Adjunct) 2019: 5:1-5:6 - [c466]Seiya Kawano, Koichiro Yoshino, Satoshi Nakamura:
Neural Conversation Model Controllable by Given Dialogue Act Based on Adversarial Learning and Label-aware Objective. INLG 2019: 198-207 - [c465]Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura:
VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019. INTERSPEECH 2019: 1118-1122 - [c464]Ivan Halim Parmonangan, Hiroki Tanaka, Sakriani Sakti, Shinnosuke Takamichi, Satoshi Nakamura:
Speech Quality Evaluation of Synthesized Japanese Speech Using EEG. INTERSPEECH 2019: 1228-1232 - [c463]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition. INTERSPEECH 2019: 3835-3839 - [c462]Andrei Catalin Coman, Koichiro Yoshino, Yukitoshi Murase, Satoshi Nakamura, Giuseppe Riccardi:
An Incremental Turn-Taking Model for Task-Oriented Dialog Systems. INTERSPEECH 2019: 4155-4159 - [c461]Koichiro Yoshino, Yukitoshi Murase, Nurul Lubis, Kyoshiro Sugiyama, Hiroki Tanaka, Sakriani Sakti, Shinnosuke Takamichi, Satoshi Nakamura:
Spoken Dialogue Robot for Watching Daily Life of Elderly People. IWSDS 2019: 141-146 - [c460]Fan Yang, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Make Skeleton-based Action Recognition Model Smaller, Faster and Better. MMAsia 2019: 31:1-31:6 - [c459]Satoshi Nakamura:
Oriental-COCOSDA 2019 Japan country report. O-COCOSDA 2019: 1-6 - [c458]Sahoko Nakayama, Takatomo Kano, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Recognition and translation of code-switching speech utterances. O-COCOSDA 2019: 1-6 - [c457]Mayuko Okamato, Sakriani Sakti, Satoshi Nakamura:
Phoneme-level speaking rate variation on waveform generation using GAN-TTS. O-COCOSDA 2019: 1-7 - [c456]Akinobu Maejima, Hiroyuki Kubo, Takuya Funatomi, Tatsuo Yotsukura, Satoshi Nakamura, Yasuhiro Mukaigawa:
Graph matching based anime colorization with multiple references. SIGGRAPH Posters 2019: 13:1-13:2 - [c455]Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Neural iTTS: Toward Synthesizing Speech in Real-time with End-to-end Neural Text-to-Speech Framework. SSW 2019: 183-188 - [e4]Satoshi Nakamura, Milica Gasic, Ingrid Zuckerman, Gabriel Skantze, Mikio Nakano, Alexandros Papangelis, Stefan Ultes, Koichiro Yoshino:
Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, SIGdial 2019, Stockholm, Sweden, September 11-13, 2019. Association for Computational Linguistics 2019, ISBN 978-1-950737-61-1 [contents] - [i34]Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura:
VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for Zerospeech Challenge 2019. CoRR abs/1905.11449 (2019) - [i33]Andrei Catalin Coman, Koichiro Yoshino, Yukitoshi Murase, Satoshi Nakamura, Giuseppe Riccardi:
An Incremental Turn-Taking Model For Task-Oriented Dialog Systems. CoRR abs/1905.11806 (2019) - [i32]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning. CoRR abs/1906.00579 (2019) - [i31]Shohei Tanaka, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura:
Conversational Response Re-ranking Based on Event Causality and Role Factored Tensor Event Embedding. CoRR abs/1906.09795 (2019) - [i30]Fan Yang, Sakriani Sakti, Yang Wu, Satoshi Nakamura:
Make Skeleton-based Action Recognition Model Smaller, Faster and Better. CoRR abs/1907.09658 (2019) - [i29]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Speech-to-speech Translation between Untranscribed Unknown Languages. CoRR abs/1910.00795 (2019) - [i28]Andros Tjandra, Chunxi Liu, Frank Zhang, Xiaohui Zhang, Yongqiang Wang, Gabriel Synnaeve, Satoshi Nakamura, Geoffrey Zweig:
Deja-vu: Double Feature Presentation in Deep Transformer Networks. CoRR abs/1910.10324 (2019) - [i27]Fan Yang, Feiran Li, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Using panoramic videos for multi-person localization and tracking in a 3D panoramic coordinate. CoRR abs/1911.10535 (2019) - [i26]Katsuki Chousa, Katsuhito Sudoh, Satoshi Nakamura:
Simultaneous Neural Machine Translation using Connectionist Temporal Classification. CoRR abs/1911.11933 (2019) - 2018
- [j86]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Learning Supervised Feature Transformations on Zero Resources for Improved Acoustic Unit Discovery. IEICE Trans. Inf. Syst. 101-D(1): 205-214 (2018) - [j85]Ikuo Keshi, Yu Suzuki, Koichiro Yoshino, Satoshi Nakamura:
Semantically Readable Distributed Representation Learning and Its Expandability Using a Word Semantic Vector Dictionary. IEICE Trans. Inf. Syst. 101-D(4): 1066-1078 (2018) - [j84]Nurul Lubis, Dessi Puji Lestari, Sakriani Sakti, Ayu Purwarianti, Satoshi Nakamura:
Construction of Spontaneous Emotion Corpus from Indonesian TV Talk Shows and Its Application on Multimodal Emotion Recognition. IEICE Trans. Inf. Syst. 101-D(8): 2092-2100 (2018) - [j83]Takatomo Kano, Shinnosuke Takamichi, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
An end-to-end model for cross-lingual transformation of paralinguistic information. Mach. Transl. 32(4): 353-368 (2018) - [j82]Kazuhiro Kobayashi, Tomoki Toda, Satoshi Nakamura:
Intra-gender statistical singing voice conversion with direct waveform modification using log-spectral differential. Speech Commun. 99: 211-220 (2018) - [j81]Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence Models for Emphasis Speech Translation. IEEE ACM Trans. Audio Speech Lang. Process. 26(10): 1873-1883 (2018) - [j80]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Dirichlet Process Mixture of Mixtures Model for Unsupervised Subword Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 26(11): 2027-2042 (2018) - [c454]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Eliciting Positive Emotion through Affect-Sensitive Dialogue Response Generation: A Neural Network Approach. AAAI 2018: 5293-5300 - [c453]Yuta Nishimura, Katsuhito Sudoh, Graham Neubig, Satoshi Nakamura:
Multi-Source Neural Machine Translation with Missing Data. NMT@ACL 2018: 92-99 - [c452]Naoki Hosomi, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Deception Detection and Analysis in Spoken Dialogues based on FastText. APSIPA 2018: 139-142 - [c451]Masahiro Honda, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Detecting suppression of negative emotion by time series change of cerebral blood flow using fNIRS. BHI 2018: 398-401 - [c450]Hiroaki Tanaka, Yu Suzuki, Koichiro Yoshino, Satoshi Nakamura:
TRANS-AM: Discovery Method of Optimal Input Vectors Corresponding to Objective Variables. DaWaK 2018: 216-228 - [c449]Yu Suzuki, Satoshi Nakamura:
Information Filtering Method for Twitter Streaming Data Using Human-in-the-Loop Machine Learning. DEXA (2) 2018: 167-175 - [c448]Hiroki Tanaka, Hiroki Watanabe, Hayato Maki, Sakriani Sakti, Satoshi Nakamura:
Single-Trial Detection of Semantic Anomalies From EEG During Listening to Spoken Sentences. EMBC 2018: 977-980 - [c447]Hayato Maki, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Graph Regularized Tensor Factorization for Single-Trial EEG Analysis. ICASSP 2018: 846-850 - [c446]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence Asr Optimization Via Reinforcement Learning. ICASSP 2018: 5829-5833 - [c445]Hiroki Tanaka, Hideki Negoro, Hidemi Iwasaka, Satoshi Nakamura:
Listening Skills Assessment through Computer Agents. ICMI 2018: 492-496 - [c444]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Tensor Decomposition for Compressing Recurrent Neural Network. IJCNN 2018: 1-8 - [c443]Takuma Mori, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Compressing End-to-end ASR Networks by Tensor-Train Decomposition. INTERSPEECH 2018: 806-810 - [c442]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Machine Speech Chain with One-shot Speaker Adaptation. INTERSPEECH 2018: 887-891 - [c441]Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Incremental TTS for Japanese Language. INTERSPEECH 2018: 902-906 - [c440]Tsuyoki Ujiro, Hiroki Tanaka, Hiroyoshi Adachi, Hiroaki Kazui, Manabu Ikeda, Takashi Kudo, Satoshi Nakamura:
Detection of Dementia from Responses to Atypical Questions Asked by Embodied Conversational Agents. INTERSPEECH 2018: 1691-1695 - [c439]Seiya Kawano, Koichiro Yoshino, Yu Suzuki, Satoshi Nakamura:
Dialogue Act Classification in Reference Interview Using Convolutional Neural Network with Byte Pair Encoding. IWSDS 2018: 17-25 - [c438]The Tung Nguyen, Koichiro Yoshino, Sakriani Sakti, Satoshi Nakamura:
Impact of Deception Information on Negotiation Dialog Management: A Case Study on Doctor-Patient Conversations. IWSDS 2018: 199-206 - [c437]Yuta Nishimura, Katsuhito Sudoh, Graham Neubig, Satoshi Nakamura:
Multi-Source Neural Machine Translation with Data Augmentation. IWSLT 2018: 48-53 - [c436]Johanes Effendi, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Multi-paraphrase Augmentation to Leverage Neural Caption Translation. IWSLT 2018: 181-188 - [c435]Kaho Osamura, Takatomo Kano, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Using Spoken Word Posterior Features in Neural Machine Translation. IWSLT 2018: 189-195 - [c434]Sashi Novitasari, Quoc Truong Do, Sakriani Sakti, Dessi Puji Lestari, Satoshi Nakamura:
Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas. LREC 2018 - [c433]Koichiro Yoshino, Yoko Ishikawa, Masahiro Mizukami, Yu Suzuki, Sakriani Sakti, Satoshi Nakamura:
Dialogue Scenario Collection of Persuasive Dialogue with Emotional Expressions via Crowdsourcing. LREC 2018 - [c432]Koichiro Yoshino, Hiroki Tanaka, Kyoshiro Sugiyama, Makoto Kondo, Satoshi Nakamura:
Japanese Dialogue Corpus of Information Navigation and Attentive Listening Annotated with Extended ISO-24617-2 Dialogue Act Tags. LREC 2018 - [c431]Jingyi Zhang, Masao Utiyama, Eiichiro Sumita, Graham Neubig, Satoshi Nakamura:
Guiding Neural Machine Translation with Retrieved Translation Pieces. NAACL-HLT 2018: 1325-1335 - [c430]Sashi Novitasari, Quoc Truong Do, Sakriani Sakti, Dessi Puji Lestari, Satoshi Nakamura:
Multi-Modal Multi-Task Deep Learning For Speaker And Emotion Recognition Of TV-Series Data. O-COCOSDA 2018: 37-42 - [c429]Sahoko Nakayama, Takatomo Kano, Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura:
Japanese-English Code-Switching Speech Data Construction. O-COCOSDA 2018: 67-71 - [c428]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Unsupervised Counselor Dialogue Clustering for Positive Emotion Elicitation in Neural Dialogue System. SIGDIAL Conference 2018: 161-170 - [c427]Sophie Ramassamy, Hiroyuki Kubo, Takuya Funatomi, Daichi Ishii, Akinobu Maejima, Satoshi Nakamura, Yasuhiro Mukaigawa:
Pre- and post-processes for automatic colorization using a fully convolutional network. SIGGRAPH ASIA Posters 2018: 70:1-70:2 - [c426]Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Speech Chain for Semi-Supervised Learning of Japanese-English Code-Switching ASR and TTS. SLT 2018: 182-189 - [c425]Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura:
Adaptive Wavenet Vocoder for Residual Compensation in GAN-Based Voice Conversion. SLT 2018: 282-289 - [c424]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Multi-Scale Alignment and Contextual History for Attention Mechanism in Sequence-to-Sequence Model. SLT 2018: 648-655 - [c423]Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura:
Toward Multi-Features Emphasis Speech Translation: Assessment of Human Emphasis Production and Perception with Speech and Text Clues. SLT 2018: 700-706 - [c422]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Optimizing Neural Response Generator with Emotional Impact Information. SLT 2018: 876-883 - [c421]Bin Wu, Sakriani Sakti, Jinsong Zhang, Satoshi Nakamura:
Optimizing DPGMM Clustering in Zero Resource Setting Based on Functional Load. SLTU 2018: 1-5 - [c420]Khumaisa Nur'Aini, Johanes Effendi, Sakriani Sakti, Mirna Adriani, Satoshi Nakamura:
Corpus Construction and Semantic Analysis of Indonesian Image Description. SLTU 2018: 42-46 - [i25]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
Structured-based Curriculum Learning for End-to-end English-Japanese Speech Translation. CoRR abs/1802.06003 (2018) - [i24]Seitaro Shinagawa, Koichiro Yoshino, Sakriani Sakti, Yu Suzuki, Satoshi Nakamura:
Interactive Image Manipulation with Natural Language Instruction Commands. CoRR abs/1802.08645 (2018) - [i23]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Tensor Decomposition for Compressing Recurrent Neural Network. CoRR abs/1802.10410 (2018) - [i22]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Machine Speech Chain with One-shot Speaker Adaptation. CoRR abs/1803.10525 (2018) - [i21]Jingyi Zhang, Masao Utiyama, Eiichiro Sumita, Graham Neubig, Satoshi Nakamura:
Guiding Neural Machine Translation with Retrieved Translation Pieces. CoRR abs/1804.02559 (2018) - [i20]Yuta Nishimura, Katsuhito Sudoh, Graham Neubig, Satoshi Nakamura:
Multi-Source Neural Machine Translation with Missing Data. CoRR abs/1806.02525 (2018) - [i19]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-sequence Model. CoRR abs/1807.08280 (2018) - [i18]Katsuki Chousa, Katsuhito Sudoh, Satoshi Nakamura:
Training Neural Machine Translation using Word Embedding-based Loss. CoRR abs/1807.11219 (2018) - [i17]Yuta Nishimura, Katsuhito Sudoh, Graham Neubig, Satoshi Nakamura:
Multi-Source Neural Machine Translation with Data Augmentation. CoRR abs/1810.06826 (2018) - [i16]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator. CoRR abs/1810.13107 (2018) - [i15]Ryo Nakamura, Katsuhito Sudoh, Koichiro Yoshino, Satoshi Nakamura:
Another Diversity-Promoting Objective Function for Neural Dialogue Generation. CoRR abs/1811.08100 (2018) - [i14]Hisao Katsumi, Takuya Hiraoka, Koichiro Yoshino, Kazeto Yamamoto, Shota Motoura, Kunihiko Sadamasa, Satoshi Nakamura:
Optimization of Information-Seeking Dialogue Strategy for Argumentation-Based Dialogue System. CoRR abs/1811.10728 (2018) - 2017
- [j79]Shigeki Matsuda, Teruaki Hayashi, Yutaka Ashikari, Yoshinori Shiga, Hidenori Kashioka, Keiji Yasuda, Hideo Okuma, Masao Uchiyama, Eiichiro Sumita, Hisashi Kawai, Satoshi Nakamura:
Development of the "VoiceTra" Multi-Lingual Speech Translation System. IEICE Trans. Inf. Syst. 100-D(4): 621-632 (2017) - [j78]Kou Tanaka, Tomoki Toda, Satoshi Nakamura:
A Vibration Control Method of an Electrolarynx Based on Statistical F0 Pattern Prediction. IEICE Trans. Inf. Syst. 100-D(9): 2165-2173 (2017) - [j77]Matthias Sperber, Graham Neubig, Jan Niehues, Satoshi Nakamura, Alex Waibel:
Transcribing against time. Speech Commun. 93: 20-30 (2017) - [j76]Quoc Truong Do, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Preserving Word-Level Emphasis in Speech-to-Speech Translation. IEEE ACM Trans. Audio Speech Lang. Process. 25(3): 544-556 (2017) - [c419]Nurul Lubis, Michael Heck, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Processing negative emotions through social communication: Multimodal database construction and analysis. ACII 2017: 79-85 - [c418]Yusuke Oda, Philip Arthur, Graham Neubig, Koichiro Yoshino, Satoshi Nakamura:
Neural Machine Translation via Binary Code Prediction. ACL (1) 2017: 850-860 - [c417]Makoto Morishita, Yusuke Oda, Graham Neubig, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura:
An Empirical Study of Mini-Batch Creation Strategies for Neural Machine Translation. NMT@ACL 2017: 61-68 - [c416]Yusuke Oda, Katsuhito Sudoh, Satoshi Nakamura, Masao Utiyama, Eiichiro Sumita:
A Simple and Strong Baseline: NAIST-NICT Neural Machine Translation System for WAT2017 English-Japanese Translation Task. WAT@IJCNLP 2017: 135-139 - [c415]Kazutaka Kubo, Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An investigation of how to design control parameters for statistical voice timbre control. APSIPA 2017: 1520-1523 - [c414]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Listening while speaking: Speech chain by deep learning. ASRU 2017: 301-308 - [c413]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Attention-based Wav2Text with feature transfer learning. ASRU 2017: 309-315 - [c412]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Feature optimized DPGMM clustering for unsupervised subword modeling: A contribution to zerospeech 2017. ASRU 2017: 740-746 - [c411]Yoshitaka Matsuda, Yu Suzuki, Satoshi Nakamura:
A trade-off between estimation accuracy of worker quality and task complexity. IEEE BigData 2017: 4410-4416 - [c410]Naoto Terasawa, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Tracking liking state in brain activity while watching multiple movies. ICMI 2017: 321-325 - [c409]Jingyi Zhang, Masao Utiyama, Eiichiro Sumita, Graham Neubig, Satoshi Nakamura:
Improving Neural Machine Translation through Phrase-based Forced Decoding. IJCNLP(1) 2017: 152-162 - [c408]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Local Monotonic Attention Mechanism for End-to-End Speech And Language Processing. IJCNLP(1) 2017: 431-440 - [c407]Louisa Pragst, Koichiro Yoshino, Wolfgang Minker, Satoshi Nakamura, Stefan Ultes:
Acquisition and Assessment of Semantic Content for the Generation of Elaborateness and Indirectness in Spoken Dialogue Systems. IJCNLP(1) 2017: 915-925 - [c406]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Compressing recurrent neural network with tensor train. IJCNN 2017: 4451-4458 - [c405]Kou Tanaka, Hirokazu Kameoka, Tomoki Toda, Satoshi Nakamura:
Physically Constrained Statistical F0 Prediction for Electrolaryngeal Speech Enhancement. INTERSPEECH 2017: 1069-1073 - [c404]Michael Heck, Masayuki Suzuki, Takashi Fukuda, Gakuto Kurata, Satoshi Nakamura:
Ensembles of Multi-Scale VGG Acoustic Models. INTERSPEECH 2017: 1616-1620 - [c403]Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Subject-Independent Classification of Japanese Spoken Sentences by Multiple Frequency Bands Phase Pattern of EEG Response During Speech Perception. INTERSPEECH 2017: 2431-2435 - [c402]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
Structured-Based Curriculum Learning for End-to-End English-Japanese Speech Translation. INTERSPEECH 2017: 2630-2634 - [c401]Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura:
Toward Expressive Speech Translation: A Unified Sequence-to-Sequence LSTMs Approach for Translating Words and Emphasis. INTERSPEECH 2017: 2640-2644 - [c400]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Eliciting Positive Emotional Impact in Dialogue Response Selection. IWSDS 2017: 135-148 - [c399]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Speech recognition features based on deep latent Gaussian models. MLSP 2017: 1-6 - [c398]Tamotsu Endo, Norimichi Ukita, Hiroki Tanaka, Norihiro Hagita, Satoshi Nakamura, Hiroyoshi Adachi, Manabu Ikeda, Hiroaki Kazui, Takashi Kudo:
Initial response time measurement in eye movement for dementia screening test. MVA 2017: 262-265 - [c397]Yu Suzuki, Koichiro Yoshino, Satoshi Nakamura:
A k-anonymized Text Generation Method. NBiS 2017: 1018-1026 - [c396]Johanes Effendi, Sakriani Sakti, Satoshi Nakamura:
Creation of a multi-paraphrase corpus based on various elementary operations. O-COCOSDA 2017: 1-6 - [c395]Koichiro Yoshino, Yu Suzuki, Satoshi Nakamura:
Information Navigation System with Discovering User Interests. SIGDIAL Conference 2017: 356-359 - [c394]Kohei Mukaihara, Sakriani Sakti, Satoshi Nakamura:
Recognizing Emotionally Coloured Dialogue Speech Using Speaker-Adapted DNN-CNN Bottleneck Features. SPECOM 2017: 632-641 - [c393]Ikuo Keshi, Yu Suzuki, Koichiro Yoshino, Satoshi Nakamura:
Semantically readable distributed representation learning for social media mining. WI 2017: 716-722 - [c392]Akiva Miura, Graham Neubig, Katsuhito Sudoh, Satoshi Nakamura:
Tree as a Pivot: Syntactic Matching Methods in Pivot Translation. WMT 2017: 90-98 - [c391]Jingyi Zhang, Masao Utiyama, Eiichiro Sumita, Graham Neubig, Satoshi Nakamura:
NICT-NAIST System for WMT17 Multimodal Translation Task. WMT 2017: 477-482 - [i13]Yusuke Oda, Philip Arthur, Graham Neubig, Koichiro Yoshino, Satoshi Nakamura:
Neural Machine Translation via Binary Code Prediction. CoRR abs/1704.06918 (2017) - [i12]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Compressing Recurrent Neural Network with Tensor Train. CoRR abs/1705.08052 (2017) - [i11]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Local Monotonic Attention Mechanism for End-to-End Speech Recognition. CoRR abs/1705.08091 (2017) - [i10]Koichiro Yoshino, Shinsuke Mori, Satoshi Nakamura:
Analysis of the Effect of Dependency Information on Predicate-Argument Structure Analysis and Zero Anaphora Resolution. CoRR abs/1705.10962 (2017) - [i9]Andros Tjandra, Sakriani Sakti, Ruli Manurung, Mirna Adriani, Satoshi Nakamura:
Gated Recurrent Neural Tensor Network. CoRR abs/1706.02222 (2017) - [i8]Makoto Morishita, Yusuke Oda, Graham Neubig, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura:
An Empirical Study of Mini-Batch Creation Strategies for Neural Machine Translation. CoRR abs/1706.05765 (2017) - [i7]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Listening while Speaking: Speech Chain by Deep Learning. CoRR abs/1707.04879 (2017) - [i6]Matthias Sperber, Graham Neubig, Jan Niehues, Satoshi Nakamura, Alex Waibel:
Transcribing Against Time. CoRR abs/1709.05227 (2017) - [i5]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Attention-based Wav2Text with Feature Transfer Learning. CoRR abs/1709.07814 (2017) - [i4]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence ASR Optimization via Reinforcement Learning. CoRR abs/1710.10774 (2017) - [i3]Jingyi Zhang, Masao Utiyama, Eiichiro Sumita, Graham Neubig, Satoshi Nakamura:
Improving Neural Machine Translation through Phrase-based Forced Decoding. CoRR abs/1711.00309 (2017) - 2016
- [j75]Hayato Maki, Tomoki Toda, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Enhancing Event-Related Potentials Based on Maximum a Posteriori Estimation with a Spatial Correlation Prior. IEICE Trans. Inf. Syst. 99-D(6): 1437-1446 (2016) - [j74]Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A Statistical Sample-Based Approach to GMM-Based Voice Conversion Using Tied-Covariance Acoustic Models. IEICE Trans. Inf. Syst. 99-D(10): 2490-2498 (2016) - [j73]Lasguido Nio, Sakriani Sakti, Graham Neubig, Koichiro Yoshino, Satoshi Nakamura:
Neural Network Approaches to Dialog Response Retrieval and Generation. IEICE Trans. Inf. Syst. 99-D(10): 2508-2517 (2016) - [j72]Kazuhiro Kobayashi, Tomoki Toda, Tomoyasu Nakano, Masataka Goto, Satoshi Nakamura:
Improvements of Voice Timbre Control Based on Perceived Age in Singing Voice Conversion. IEICE Trans. Inf. Syst. 99-D(11): 2767-2777 (2016) - [j71]Yuji Oshima, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Non-Native Text-to-Speech Preserving Speaker Individuality Based on Partial Correction of Prosodic and Phonetic Characteristics. IEICE Trans. Inf. Syst. 99-D(12): 3132-3139 (2016) - [j70]Takuya Hiraoka, Kallirroi Georgila, Elnaz Nouri, David R. Traum, Satoshi Nakamura:
Reinforcement Learning of Multi-Party Trading Dialog Policies. Inf. Media Technol. 11: 264-277 (2016) - [j69]Jingyi Zhang, Masao Utiyama, Eiichiro Sumita, Hai Zhao, Graham Neubig, Satoshi Nakamura:
Learning local word reorderings for hierarchical phrase-based statistical machine translation. Mach. Transl. 30(1-2): 1-18 (2016) - [j68]Takuya Hiraoka, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Learning cooperative persuasive dialogue policies using framing. Speech Commun. 84: 83-96 (2016) - [j67]Shinnosuke Takamichi, Tomoki Toda, Alan W. Black, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 24(4): 755-767 (2016) - [j66]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Tomoki Toda, Hideki Negoro, Hidemi Iwasaka, Satoshi Nakamura:
Teaching Social Communication Skills Through Human-Agent Interaction. ACM Trans. Interact. Intell. Syst. 6(2): 18:1-18:26 (2016) - [c390]Jingyi Zhang, Masao Utiyama, Eiichiro Sumita, Graham Neubig, Satoshi Nakamura:
A Continuous Space Rule Selection Model for Syntax-based Statistical Machine Translation. ACL (1) 2016 - [c389]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Hideki Negoro, Hidemi Iwasaka, Satoshi Nakamura:
Automated social skills training with audiovisual information. EMBC 2016: 2262-2265 - [c388]Hayato Maki, Tomoki Toda, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Removing noise from event-related potentials using a probabilistic generative model with grouped covariance matrices. EMBC 2016: 3728-3731 - [c387]Philip Arthur, Graham Neubig, Satoshi Nakamura:
Incorporating Discrete Translation Lexicons into Neural Machine Translation. EMNLP 2016: 1557-1567 - [c386]Oliver Adams, Graham Neubig, Trevor Cohn, Steven Bird, Quoc Truong Do, Satoshi Nakamura:
Learning a Lexicon and Translation Model from Phoneme Lattices. EMNLP 2016: 2377-2382 - [c385]Kou Tanaka, Tomoki Toda, Graham Neubig, Satoshi Nakamura:
Real-time vibration control of an electrolarynx based on statistical F0 contour prediction. EUSIPCO 2016: 1333-1337 - [c384]Soichi Yamane, Kazuhiro Kobayashi, Tomoki Toda, Tomoyasu Nakano, Masataka Goto, Satoshi Nakamura:
An estimation method of voice timbre evaluation values using feature extraction with Gaussian mixture model based on reference singer. ICASSP 2016: 5265-5269 - [c383]Kou Tanaka, Hirokazu Kameoka, Tomoki Toda, Satoshi Nakamura:
Statistical F0 prediction for electrolaryngeal speech enhancement considering generative process of F0 contours within product of experts framework. ICASSP 2016: 5665-5669 - [c382]Kazuhiro Kobayashi, Tomoki Toda, Satoshi Nakamura:
Implementation of F0 transformation for statistical singing voice conversion based on direct waveform modification. ICASSP 2016: 5670-5674 - [c381]Yusuke Tajiri, Tomoki Toda, Satoshi Nakamura:
Noise suppression method for body-conducted soft speech enhancement based on external noise monitoring. ICASSP 2016: 5935-5939 - [c380]Rui Hiraoka, Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Personalized unknown word detection in non-native language reading using eye gaze. ICMI 2016: 66-70 - [c379]Hiroki Tanaka, Hiroyoshi Adachi, Norimichi Ukita, Takashi Kudo, Satoshi Nakamura:
Automatic detection of very early stage of dementia through multimodal interaction with computer avatars. ICMI 2016: 261-265 - [c378]Wakana Maeda, Yu Suzuki, Satoshi Nakamura:
Fast text anonymization using k-anonyminity. iiWAS 2016: 340-344 - [c377]Andros Tjandra, Sakriani Sakti, Ruli Manurung, Mirna Adriani, Satoshi Nakamura:
Gated Recurrent Neural Tensor Network. IJCNN 2016: 448-455 - [c376]Patrick Lumban Tobing, Tomoki Toda, Hirokazu Kameoka, Satoshi Nakamura:
Acoustic-to-Articulatory Inversion Mapping Based on Latent Trajectory Gaussian Mixture Model. INTERSPEECH 2016: 953-957 - [c375]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Supervised Learning of Acoustic Models in a Zero Resource Setting to Improve DPGMM Clustering. INTERSPEECH 2016: 1310-1314 - [c374]Kazuhiro Kobayashi, Shinnosuke Takamichi, Satoshi Nakamura, Tomoki Toda:
The NU-NAIST Voice Conversion System for the Voice Conversion Challenge 2016. INTERSPEECH 2016: 1667-1671 - [c373]Quoc Truong Do, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Transferring Emphasis in Speech Translation Using Hard-Attentional Neural Network Models. INTERSPEECH 2016: 2533-2537 - [c372]Satoshi Tsujioka, Sakriani Sakti, Koichiro Yoshino, Graham Neubig, Satoshi Nakamura:
Unsupervised Joint Estimation of Grapheme-to-Phoneme Conversion Systems and Acoustic Model Adaptation for Non-Native Speech Recognition. INTERSPEECH 2016: 3091-3095 - [c371]Quoc Truong Do, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training. INTERSPEECH 2016: 3196-3200 - [c370]Marco Vetter, Markus Müller, Fatima Hamlaoui, Graham Neubig, Satoshi Nakamura, Sebastian Stüker, Alex Waibel:
Unsupervised Phoneme Segmentation of Previously Unseen Languages. INTERSPEECH 2016: 3544-3548 - [c369]Takuya Hiraoka, Graham Neubig, Koichiro Yoshino, Tomoki Toda, Satoshi Nakamura:
Active Learning for Example-Based Dialog Systems. IWSDS 2016: 67-78 - [c368]Nurul Lubis, Randy Gomez, Sakriani Sakti, Keisuke Nakamura, Koichiro Yoshino, Satoshi Nakamura, Kazuhiro Nakadai:
Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition. LREC 2016 - [c367]Matthias Sperber, Graham Neubig, Satoshi Nakamura, Alex Waibel:
Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces. LREC 2016 - [c366]Akiva Miura, Graham Neubig, Michael Paul, Satoshi Nakamura:
Selecting Syntactic, Non-redundant Segments in Active Learning for Machine Translation. HLT-NAACL 2016: 20-29 - [c365]Juliana Miehle, Koichiro Yoshino, Louisa Pragst, Stefan Ultes, Satoshi Nakamura, Wolfgang Minker:
Cultural Communication Idiosyncrasies in Human-Computer Interaction. SIGDIAL Conference 2016: 74-79 - [c364]Masahiro Mizukami, Koichiro Yoshino, Graham Neubig, David R. Traum, Satoshi Nakamura:
Analyzing the Effect of Entrainment on Dialogue Acts. SIGDIAL Conference 2016: 310-318 - [c363]Sakriani Sakti, Seiji Kawanishi, Graham Neubig, Koichiro Yoshino, Satoshi Nakamura:
Deep bottleneck features and sound-dependent i-vectors for simultaneous recognition of speech and environmental sounds. SLT 2016: 35-42 - [c362]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Iterative training of a DPGMM-HMM acoustic unit recognizer in a zero resource scenario. SLT 2016: 57-63 - [c361]Kazuhiro Kobayashi, Tomoki Toda, Satoshi Nakamura:
F0 transformation techniques for statistical voice conversion with direct waveform modification with spectral differential. SLT 2016: 693-700 - [c360]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Unsupervised Linear Discriminant Analysis for Supporting DPGMM Clustering in the Zero Resource Scenario. SLTU 2016: 73-79 - [c359]Yu Suzuki, Satoshi Nakamura:
Assessing the Quality of Wikipedia Editors through Crowdsourcing. WWW (Companion Volume) 2016: 1001-1006 - [i2]Philip Arthur, Graham Neubig, Satoshi Nakamura:
Incorporating Discrete Translation Lexicons into Neural Machine Translation. CoRR abs/1606.02006 (2016) - 2015
- [j65]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
NOCOA+: Multimodal Computer-Based Training for Social and Communication Skills. IEICE Trans. Inf. Syst. 98-D(8): 1536-1544 (2015) - [j64]Philip Arthur, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Semantic Parsing of Ambiguous Input through Paraphrasing and Verification. Trans. Assoc. Comput. Linguistics 3: 571-584 (2015) - [j63]Daichi Kitamura, Hiroshi Saruwatari, Hirokazu Kameoka, Yu Takahashi, Kazunobu Kondo, Satoshi Nakamura:
Multichannel Signal Separation Combining Directional Clustering and Nonnegative Matrix Factorization with Spectrogram Restoration. IEEE ACM Trans. Audio Speech Lang. Process. 23(4): 654-669 (2015) - [c358]Masahiro Mizukami, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Linguistic Individuality Transformation for Spoken Language. IWSDS 2015: 129-143 - [c357]Fajri Koto, Sakriani Sakti, Graham Neubig, Tomoki Toda, Mirna Adriani, Satoshi Nakamura:
A Study on Natural Expressive Speech: Automatic Memorable Spoken Quote Detection. IWSDS 2015: 145-152 - [c356]Takuya Hiraoka, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Evaluation of a Fully Automatic Cooperative Persuasive Dialogue System. IWSDS 2015: 153-167 - [c355]Takafumi Sasakura, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Unknown Word Detection Based on Event-Related Brain Desynchronization Responses. IWSDS 2015: 169-175 - [c354]Yuiko Tsunomori, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
An Analysis Towards Dialogue-Based Deception Detection. IWSDS 2015: 177-187 - [c353]Yusuke Oda, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Syntax-based Simultaneous Translation through Prediction of Unseen Syntactic Constituents. ACL (1) 2015: 198-207 - [c352]Akiva Miura, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Improving Pivot Translation by Remembering the Pivot. ACL (2) 2015: 573-577 - [c351]Graham Neubig, Makoto Morishita, Satoshi Nakamura:
Neural Reranking Improves Subjective Quality of Machine Translation: NAIST at WAT2015. WAT 2015: 35-41 - [c350]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura, Mirna Adriani:
Stochastic Gradient Variational Bayes for deep learning-based ASR. ASRU 2015: 175-180 - [c349]Sakriani Sakti, Faiz Ilham, Graham Neubig, Tomoki Toda, Ayu Purwarianti, Satoshi Nakamura:
Incremental sentence compression using LSTM recurrent networks. ASRU 2015: 252-258 - [c348]Quoc Truong Do, Michael Heck, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function. ASRU 2015: 654-659 - [c347]Nurul Lubis, Sakriani Sakti, Graham Neubig, Koichiro Yoshino, Tomoki Toda, Satoshi Nakamura:
A study of social-affective communication: Automatic prediction of emotion triggers and responses in television talk shows. ASRU 2015: 777-783 - [c346]Masahiro Mizukami, Hideaki Kizuki, Toshio Nomura, Graham Neubig, Koichiro Yoshino, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Adaptive selection from multiple response candidates in example-based dialogue. ASRU 2015: 784-790 - [c345]Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An Enhanced Electrolarynx with Automatic Fundamental Frequency Control based on Statistical Prediction. ASSETS 2015: 435-436 - [c344]Shinnosuke Takamichi, Kazuhiro Kobayashi, Kou Tanaka, Tomoki Toda, Satoshi Nakamura:
The NAIST Text-to-Speech System for the Blizzard Challenge 2015. Blizzard Challenge 2015 - [c343]Hayato Maki, Tomoki Toda, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
An evaluation of EEG ocular artifact removal with a multi-channel wiener filter based on probabilistic generative model. EMBC 2015: 2775-2778 - [c342]Jingyi Zhang, Masao Utiyama, Eiichiro Sumita, Graham Neubig, Satoshi Nakamura:
A Binarized Neural Network Joint Model for Machine Translation. EMNLP 2015: 2094-2099 - [c341]Yuki Murota, Daichi Kitamura, Shoichi Koyama, Hiroshi Saruwatari, Satoshi Nakamura:
Statistical modeling of binaural signal and its application to binaural source separation. ICASSP 2015: 494-498 - [c340]Hayato Maki, Tomoki Toda, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
EEG signal enhancement using multi-channel wiener filter with a spatial correlation prior. ICASSP 2015: 2639-2643 - [c339]Shinnosuke Takamichi, Tomoki Toda, Alan W. Black, Satoshi Nakamura:
Parameter generation algorithm considering Modulation Spectrum for HMM-based speech synthesis. ICASSP 2015: 4210-4214 - [c338]Andros Tjandra, Sakriani Sakti, Graham Neubig, Tomoki Toda, Mirna Adriani, Satoshi Nakamura:
Combination of two-dimensional cochleogram and spectrogram features for deep learning-based ASR. ICASSP 2015: 4525-4529 - [c337]Shinnosuke Takamichi, Tomoki Toda, Alan W. Black, Satoshi Nakamura:
Modulation spectrum-constrained trajectory training algorithm for GMM-based Voice Conversion. ICASSP 2015: 4859-4863 - [c336]Quoc Truong Do, Satoshi Nakamura, Marc Delcroix, Takaaki Hori:
WFST-based structural classification integrating dnn acoustic features and RNN language features for speech recognition. ICASSP 2015: 4959-4963 - [c335]Yuji Oshima, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Non-native speech synthesis preserving speaker individuality based on partial correction of prosodic and phonetic characteristics. INTERSPEECH 2015: 299-303 - [c334]Shinnosuke Takamichi, Tomoki Toda, Alan W. Black, Satoshi Nakamura:
Modulation spectrum-constrained trajectory training algorithm for HMM-based speech synthesis. INTERSPEECH 2015: 1206-1210 - [c333]Takashi Mieno, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Speed or accuracy? a study in evaluation of simultaneous speech translation. INTERSPEECH 2015: 2267-2271 - [c332]The Tung Nguyen, Graham Neubig, Hiroyuki Shindo, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
A latent variable model for joint pause prediction and dependency parsing. INTERSPEECH 2015: 2719-2723 - [c331]Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Statistical singing voice conversion based on direct waveform modification with global variance. INTERSPEECH 2015: 2754-2758 - [c330]Yusuke Tajiri, Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Non-audible murmur enhancement based on statistical conversion using air- and body-conductive microphones in noisy environments. INTERSPEECH 2015: 2769-2773 - [c329]Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Articulatory controllable speech modification based on Gaussian mixture models with direct waveform modification using spectrum differential. INTERSPEECH 2015: 3350-3354 - [c328]Quoc Truong Do, Shinnosuke Takamichi, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Preserving word-level emphasis in speech-to-speech translation using linear regression HSMMs. INTERSPEECH 2015: 3665-3669 - [c327]Sakriani Sakti, Oyunchimeg Shagdar, Fawzi Nashashibi, Satoshi Nakamura:
Context awareness and priority control for ITS based on automatic speech recognition. ITST 2015: 17-21 - [c326]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Tomoki Toda, Hideki Negoro, Hidemi Iwasaka, Satoshi Nakamura:
Automated Social Skills Trainer. IUI 2015: 17-27 - [c325]Quoc Truong Do, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Improving translation of emphasis with pause prediction in speech-to-speech translation systems. IWSLT 2015 - [c324]Michael Heck, Quoc Truong Do, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
The NAIST English speech recognition system for IWSLT 2015. IWSLT (Evaluation Campaign) 2015 - [c323]Makoto Morishita, Koichi Akabe, Yuto Hatakoshi, Graham Neubig, Koichiro Yoshino, Satoshi Nakamura:
Parser self-training for syntax-based machine translation. IWSLT 2015 - [c322]Yusuke Oda, Hiroyuki Fudaba, Graham Neubig, Hideaki Hata, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Learning to Generate Pseudo-Code from Source Code Using Statistical Machine Translation (T). ASE 2015: 574-584 - [c321]Hiroyuki Fudaba, Yusuke Oda, Koichi Akabe, Graham Neubig, Hideaki Hata, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Pseudogen: A Tool to Automatically Generate Pseudo-Code from Source Code. ASE 2015: 824-829 - [c320]Yusuke Oda, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Ckylark: A More Robust PCFG-LA Parser. HLT-NAACL 2015: 41-45 - [c319]Satoshi Nakamura:
Message of the O-COCOSDA Convener. O-COCOSDA/CASLRE 2015: 1 - [c318]Satoshi Nakamura:
Keynote speech 3: Toward simultaneous, natural and multimodal speech-to-speech translation. O-COCOSDA/CASLRE 2015: 1-2 - [c317]Nurul Lubis, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Construction and analysis of social-affective interaction corpus in English and Indonesian. O-COCOSDA/CASLRE 2015: 202-206 - [c316]Takuya Hiraoka, Kallirroi Georgila, Elnaz Nouri, David R. Traum, Satoshi Nakamura:
Reinforcement Learning in Multi-Party Trading Dialog. SIGDIAL Conference 2015: 32-41 - [c315]Kyoshiro Sugiyama, Masahiro Mizukami, Graham Neubig, Koichiro Yoshino, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
An Investigation of Machine Translation Evaluation Metrics in Cross-lingual Question Answering. WMT@EMNLP 2015: 442-449 - [i1]Graham Neubig, Makoto Morishita, Satoshi Nakamura:
Neural Reranking Improves Subjective Quality of Machine Translation: NAIST at WAT2015. CoRR abs/1510.05203 (2015) - 2014
- [j62]Kazuhiro Kobayashi, Tomoki Toda, Hironori Doi, Tomoyasu Nakano, Masataka Goto, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Voice Timbre Control Based on Perceived Age in Singing Voice Conversion. IEICE Trans. Inf. Syst. 97-D(6): 1419-1428 (2014) - [j61]Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation. IEICE Trans. Inf. Syst. 97-D(6): 1429-1437 (2014) - [j60]Keigo Kubo, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Structured Adaptive Regularization of Weight Vectors for a Robust Grapheme-to-Phoneme Conversion Model. IEICE Trans. Inf. Syst. 97-D(6): 1468-1476 (2014) - [j59]Yu Tsao, Ting-Yao Hu, Sakriani Sakti, Satoshi Nakamura, Lin-Shan Lee:
Variable Selection Linear Regression for Robust Speech Recognition. IEICE Trans. Inf. Syst. 97-D(6): 1477-1487 (2014) - [j58]Lasguido Nio, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Utilizing Human-to-Human Conversation Examples for a Multi Domain Chat-Oriented Dialog System. IEICE Trans. Inf. Syst. 97-D(6): 1497-1505 (2014) - [j57]Shinnosuke Takamichi, Tomoki Toda, Yoshinori Shiga, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Parameter Generation Methods With Rich Context Models for High-Quality and Flexible Text-To-Speech Synthesis. IEEE J. Sel. Top. Signal Process. 8(2): 239-250 (2014) - [j56]Ryoichi Miyazaki, Hiroshi Saruwatari, Satoshi Nakamura, Kiyohiro Shikano, Kazunobu Kondo, Jonathan Blanchette, Martin Bouchard:
Musical-noise-free blind speech extraction integrating microphone array and iterative spectral subtraction. Signal Process. 102: 226-239 (2014) - [j55]Matthias Sperber, Mirjam Simantzik, Graham Neubig, Satoshi Nakamura, Alex Waibel:
Segmentation for Efficient Supervised Language Annotation with an Explicit Cost-Utility Tradeoff. Trans. Assoc. Comput. Linguistics 2: 169-180 (2014) - [c314]Yusuke Oda, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Optimizing Segmentation Strategies for Simultaneous Speech Translation. ACL (2) 2014: 551-556 - [c313]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Linguistic and Acoustic Features for Automatic Identification of Autism Spectrum Disorders in Children's Narrative. CLPsych@ACL 2014: 88-96 - [c312]Daichi Kitamura, Hiroshi Saruwatari, Satoshi Nakamura, Yu Takahashi, Kazunobu Kondo, Hirokazu Kameoka:
Hybrid multichannel signal separation using supervised nonnegative matrix factorization with spectrogram restoration. APSIPA 2014: 1-10 - [c311]Kazuhiro Kobayashi, Tomoki Toda, Tomoyasu Nakano, Masataka Goto, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Gender-dependent spectrum differential models for perceived age control based on direct waveform modification in singing voice conversion. APSIPA 2014: 1-4 - [c310]Fajri Koto, Sakriani Sakti, Graham Neubig, Tomoki Toda, Mirna Adriani, Satoshi Nakamura:
The use of semantic and acoustic features for open-domain TED talk summarization. APSIPA 2014: 1-4 - [c309]Lasguido Nio, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Recursive neural network paraphrase identification for example-based dialog retrieval. APSIPA 2014: 1-4 - [c308]Sakriani Sakti, Yu Odagaki, Takafumi Sasakura, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
An event-related brain potential study on the impact of speech recognition errors. APSIPA 2014: 1-4 - [c307]Shinnosuke Takamichi, Tomoki Toda, Alan W. Black, Satoshi Nakamura:
Modulation spectrum-based post-filter for GMM-based Voice Conversion. APSIPA 2014: 1-4 - [c306]Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An inter-speaker evaluation through simulation of electrolarynx control based on statistical F0 prediction. APSIPA 2014: 1-4 - [c305]Sakura Tsuruta, Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An evaluation of target speech for a nonaudible murmur enhancement system in noisy environments. APSIPA 2014: 1-4 - [c304]Riki Yoshida, Takuya Hiraoka, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Unnecessary utterance detection for avoiding digressions in discussion. APSIPA 2014: 1-4 - [c303]Koichi Akabe, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Discriminative Language Models as a Tool for Machine Translation Error Analysis. COLING 2014: 1124-1132 - [c302]Takuya Hiraoka, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Reinforcement Learning of Cooperative Persuasive Dialogue Policies using Framing. COLING 2014: 1706-1717 - [c301]Hoa Trong Vu, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Acquiring a Dictionary of Emotion-Provoking Events. EACL 2014: 128-132 - [c300]Shinnosuke Takamichi, Tomoki Toda, Alan W. Black, Satoshi Nakamura:
Modified post-filter to recover modulation spectrum for HMM-based speech synthesis. GlobalSIP 2014: 547-551 - [c299]Daichi Kitamura, Hiroshi Saruwatari, Satoshi Nakamura, Yu Takahashi, Kazunobu Kondo, Hirokazu Kameoka:
Divergence optimization in nonnegative matrix factorization with spectrogram restoration for multichannel signal separation. HSCMA 2014: 92-96 - [c298]Shunsuke Nakai, Hiroshi Saruwatari, Ryoichi Miyazaki, Satoshi Nakamura, Kazunobu Kondo:
Theoretical analysis of biased MMSE short-time spectral amplitude estimator and its extension to musical-noise-free speech enhancement. HSCMA 2014: 122-126 - [c297]Fine Dwinita Aprilyanti, Hiroshi Saruwatari, Satoshi Nakamura, Tomoya Takatani:
Optimized joint noise suppression and dereverberation based on blind signal extraction for hands-free speech recognition system. HSCMA 2014: 182-186 - [c296]Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A postfilter to modify the modulation spectrum in HMM-based speech synthesis. ICASSP 2014: 290-294 - [c295]Keigo Kubo, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Narrow Adaptive Regularization of weights for grapheme-to-phoneme conversion. ICASSP 2014: 2589-2593 - [c294]Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An evaluation of excitation feature prediction in a hybrid approach to electrolaryngeal speech enhancement. ICASSP 2014: 4488-4492 - [c293]Yuki Murota, Daichi Kitamura, Shunsuke Nakai, Hiroshi Saruwatari, Satoshi Nakamura, Yu Takahashi, Kazunobu Kondo:
Music signal separation based on Bayesian spectral amplitude estimator with automatic target prior adaptation. ICASSP 2014: 7490-7494 - [c292]Kazuhiro Kobayashi, Tomoki Toda, Tomoyasu Nakano, Masataka Goto, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Regression approaches to perceptual age control in singing voice conversion. ICASSP 2014: 7904-7908 - [c291]Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Direct F0 control of an electrolarynx based on statistical excitation feature prediction and its evaluation through simulation. INTERSPEECH 2014: 31-35 - [c290]Nozomi Jinbo, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A hearing impairment simulation method using audiogram-based approximation of auditory charatecteristics. INTERSPEECH 2014: 490-494 - [c289]Keigo Kubo, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Structured soft margin confidence weighted learning for grapheme-to-phoneme conversion. INTERSPEECH 2014: 1263-1267 - [c288]Sho Matsumiya, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Data-driven generation of text balloons based on linguistic and acoustic features of a comics-anime corpus. INTERSPEECH 2014: 1801-1805 - [c287]Patrick Lumban Tobing, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura, Ayu Purwarianti:
Articulatory controllable speech modification based on statistical feature mapping with Gaussian mixture models. INTERSPEECH 2014: 2298-2302 - [c286]Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Statistical singing voice conversion with direct waveform modification based on the spectrum differential. INTERSPEECH 2014: 2514-2518 - [c285]Nurul Lubis, Sakriani Sakti, Graham Neubig, Tomoki Toda, Ayu Purwarianti, Satoshi Nakamura:
Emotion and Its Triggers in Human Spoken Dialogue: Recognition and Analysis. IWSDS 2014: 103-110 - [c284]Takuya Hiraoka, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Construction and Analysis of a Persuasive Dialogue Corpus. IWSDS 2014: 125-138 - [c283]Hiroaki Shimizu, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Collection of a Simultaneous Translation Corpus for Comparative Analysis. LREC 2014: 670-673 - [c282]Sakriani Sakti, Keigo Kubo, Sho Matsumiya, Graham Neubig, Tomoki Toda, Satoshi Nakamura, Fumihiro Adachi, Ryosuke Isotani:
Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System. LREC 2014: 2639-2643 - [c281]Quoc Truong Do, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Collection and analysis of a Japanese-English emphasized speech corpora. O-COCOSDA 2014: 1-5 - [c280]Fajri Koto, Sakriani Sakti, Graham Neubig, Tomoki Toda, Mirna Adriani, Satoshi Nakamura:
Memorable spoken quote corpora of TED public speaking. O-COCOSDA 2014: 1-4 - [c279]Nurul Lubis, Dessi Puji Lestari, Ayu Purwarianti, Sakriani Sakti, Satoshi Nakamura:
Construction and analysis of Indonesian Emotional Speech Corpus. O-COCOSDA 2014: 1-5 - [c278]Masahiro Mizukami, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Building a free, general-domain paraphrase database for Japanese. O-COCOSDA 2014: 1-4 - [c277]Satoshi Nakamura:
Message of the O-COCOSDA convener. O-COCOSDA 2014: 1 - [c276]Lasguido Nio, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Conversation dialog corpora from television and movie scripts. O-COCOSDA 2014: 1-4 - [c275]Lasguido Nio, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Improving the robustness of example-based dialog retrieval using recursive neural network paraphrase identification. SLT 2014: 306-311 - [c274]Matthias Sperber, Graham Neubig, Satoshi Nakamura, Alex Waibel:
On-the-fly user modeling for cost-sensitive correction of speech transcripts. SLT 2014: 460-465 - [c273]Nurul Lubis, Dessi Puji Lestari, Ayu Purwarianti, Sakriani Sakti, Satoshi Nakamura:
Emotion recognition on Indonesian television talk shows. SLT 2014: 466-471 - [c272]Satoshi Nakamura:
Towards real-time multilingual multimodal speech-to-speech translation. SLTU 2014: 13-15 - [c271]Sakriani Sakti, Satoshi Nakamura:
Recent progress in developing grapheme-based speech recognition for Indonesian ethnic languages: Javanese, Sundanese, Balinese and Bataks. SLTU 2014: 46-52 - [c270]Yuto Hatakoshi, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Rule-based Syntactic Preprocessing for Syntax-based Machine Translation. SSST@EMNLP 2014: 34-42 - 2013
- [j54]Sakriani Sakti, Michael Paul, Andrew M. Finch, Shinsuke Sakai, Thang Tat Vu, Noriyuki Kimura, Chiori Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park, Chai Wutiwiwatchai, Bo Xu, Hammam Riza, Karunesh Arora, Chi Mai Luong, Haizhou Li:
A-STAR: Toward translating Asian spoken languages. Comput. Speech Lang. 27(2): 509-527 (2013) - [c269]Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura, Yuji Matsumoto, Ryosuke Isotani, Yukichi Ikeda:
Towards High-Reliability Speech Translation in the Medical Domain. NLPHealthcare@IJCNLP 2013: 22-29 - [c268]Fine Dwinita Aprilyanti, Hiroshi Saruwatari, Kiyohiro Shikano, Satoshi Nakamura, Tomoya Takatani:
Semi-blind algorithm for joint noise suppression and dereverberation based on higher-order statistics and acoustic model likelihood. APSIPA 2013: 1-6 - [c267]Ryoichi Miyazaki, Hiroshi Saruwatari, Satoshi Nakamura, Kiyohiro Shikano, Kazunobu Kondo, Jonathan Blanchette, Martin Bouchard:
Toward musical-noise-free blind speech extraction: Concept and its applications. APSIPA 2013: 1-10 - [c266]Takuya Hiraoka, Yuki Yamauchi, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Dialogue management for leading the conversation in persuasive dialogue systems. ASRU 2013: 114-119 - [c265]Philip Arthur, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Inter-Sentence Features and Thresholded Minimum Error Rate Training: NAIST at CLEF 2013 QA4MRE. CLEF (Working Notes) 2013 - [c264]Philip Arthur, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
NAIST at the CLEF 2013 QA4MRE Pilot Task. CLEF (Working Notes) 2013 - [c263]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Modality and contextual differences in computer based non-verbal communication training. CogInfoCom 2013: 127-132 - [c262]Shinnosuke Takamichi, Tomoki Toda, Yoshinori Shiga, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Improvements to HMM-based speech synthesis based on parameter generation with rich context models. INTERSPEECH 2013: 364-368 - [c261]Kazuhiro Kobayashi, Hironori Doi, Tomoki Toda, Tomoyasu Nakano, Masataka Goto, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An investigation of acoustic features for singing voice conversion based on perceptual age. INTERSPEECH 2013: 1057-1061 - [c260]Hironori Doi, Tomoki Toda, Tomoyasu Nakano, Masataka Goto, Satoshi Nakamura:
Evaluation of a singing voice conversion method based on many-to-many eigenvoice conversion. INTERSPEECH 2013: 1067-1071 - [c259]Matthias Sperber, Graham Neubig, Christian Fügen, Satoshi Nakamura, Alex Waibel:
Efficient speech transcription through respeaking. INTERSPEECH 2013: 1087-1091 - [c258]Keigo Kubo, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Grapheme-to-phoneme conversion based on adaptive regularization of weight vectors. INTERSPEECH 2013: 1946-1950 - [c257]Takatomo Kano, Shinnosuke Takamichi, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Generalizing continuous-space translation of paralinguistic information. INTERSPEECH 2013: 2614-2618 - [c256]Masaya Ohgushi, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
An empirical comparison of joint optimization techniques for speech translation. INTERSPEECH 2013: 2619-2623 - [c255]Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A hybrid approach to electrolaryngeal speech enhancement based on spectral subtraction and statistical voice conversion. INTERSPEECH 2013: 3067-3071 - [c254]Takuto Moriguchi, Tomoki Toda, Motoaki Sano, Hiroshi Sato, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A digital signal processor implementation of silent/electrolaryngeal speech enhancement based on real-time statistical voice conversion. INTERSPEECH 2013: 3072-3076 - [c253]Tomoki Fujita, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Simple, lexicalized choice of translation timing for simultaneous speech translation. INTERSPEECH 2013: 3487-3491 - [c252]Michael Heck, Sebastian Stüker, Sakriani Sakti, Alex Waibel, Satoshi Nakamura:
Incremental unsupervised training for university lecture recognition. IWSLT 2013 - [c251]Sakriani Sakti, Keigo Kubo, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
The NAIST English speech recognition system for IWSLT 2013. IWSLT (Evaluation Campaign) 2013 - [c250]Hiroaki Shimizu, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Constructing a speech translation system using simultaneous interpretation data. IWSLT 2013 - [c249]Shigeki Matsuda, Xinhui Hu, Yoshinori Shiga, Hideki Kashioka, Chiori Hori, Keiji Yasuda, Hideo Okuma, Masao Uchiyama, Eiichiro Sumita, Hisashi Kawai, Satoshi Nakamura:
Multilingual Speech-to-Speech Translation System: VoiceTra. MDM (2) 2013: 229-233 - [c248]Sakriani Sakti, Satoshi Nakamura:
Towards language preservation: Design and collection of graphemically balanced and parallel speech corpora of Indonesian ethnic languages. O-COCOSDA/CASLRE 2013: 1-5 - [c247]Tatsuo Inukai, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Investigation of intra-speaker spectral parameter variation and its prediction towards improvement of spectral conversion metric. SSW 2013: 89-94 - 2012
- [j53]Hansjörg Hofmann, Sakriani Sakti, Chiori Hori, Hideki Kashioka, Satoshi Nakamura, Wolfgang Minker:
Sequence-Based Pronunciation Variation Modeling for Spontaneous ASR Using a Noisy Channel Approach. IEICE Trans. Inf. Syst. 95-D(8): 2084-2093 (2012) - [j52]Sakriani Sakti, Michael Paul, Andrew M. Finch, Xinhui Hu, Jinfu Ni, Noriyuki Kimura, Shigeki Matsuda, Chiori Hori, Yutaka Ashikari, Hisashi Kawai, Hideki Kashioka, Eiichiro Sumita, Satoshi Nakamura:
Distributed speech translation technologies for multiparty multilingual communication. ACM Trans. Speech Lang. Process. 9(2): 4:1-4:27 (2012) - [c246]Hironori Doi, Tomoki Toda, Tomoyasu Nakano, Masataka Goto, Satoshi Nakamura:
Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system. APSIPA 2012: 1-6 - [c245]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Tomoki Toda, Nick Campbell, Satoshi Nakamura:
Non-verbal cognitive skills and autistic conditions: An analysis and training tool. CogInfoCom 2012: 41-46 - [c244]Teruhisa Misu, Etsuo Mizukami, Hideki Kashioka, Satoshi Nakamura, Haizhou Li:
A bootstrapping approach for SLU portability to a new language by inducting unannotated user queries. ICASSP 2012: 4961-4964 - [c243]Shinnosuke Takamichi, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai, Sakriani Sakti, Satoshi Nakamura:
An Evaluation of Parameter Generation Methods with Rich Context Models in HMM-Based Speech Synthesis. INTERSPEECH 2012: 1139-1142 - [c242]Lasguido Nio, Sakriani Sakti, Graham Neubig, Tomoki Toda, Mirna Adriani, Satoshi Nakamura:
Developing Non-goal Dialog System Based on Examples of Drama Television. IWSDS 2012: 355-361 - [c241]Graham Neubig, Kevin Duh, Masaya Ogushi, Takatomo Kano, Tetsuo Kiso, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
The NAIST machine translation system for IWSLT2012. IWSLT 2012: 54-60 - [c240]Christian Saam, Christian Mohr, Kevin Kilgour, Michael Heck, Matthias Sperber, Keigo Kubo, Sebastian Stüker, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura, Alex Waibel:
The 2012 KIT and KIT-NAIST English ASR systems for the IWSLT evaluation. IWSLT 2012: 87-90 - [c239]Michael Heck, Keigo Kubo, Matthias Sperber, Sakriani Sakti, Sebastian Stüker, Christian Saam, Kevin Kilgour, Christian Mohr, Graham Neubig, Tomoki Toda, Satoshi Nakamura, Alex Waibel:
The KIT-NAIST (contrastive) English ASR system for IWSLT 2012. IWSLT 2012: 91-95 - [c238]Hiroaki Shimizu, Masao Utiyama, Eiichiro Sumita, Satoshi Nakamura:
Minimum Bayes-Risk decoding extended with similar examples: NAIST-NICT at IWSLT 2012. IWSLT 2012: 117-120 - [c237]Takatomo Kano, Sakriani Sakti, Shinnosuke Takamichi, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
A method for translation of paralinguistic information. IWSLT 2012: 158-163 - 2011
- [j51]Komei Sugiura, Naoto Iwahashi, Hideki Kashioka, Satoshi Nakamura:
Learning, Generation and Recognition of Motions by Reference-Point-Dependent Probabilistic Models. Adv. Robotics 25(6-7): 825-848 (2011) - [j50]Komei Sugiura, Naoto Iwahashi, Hisashi Kawai, Satoshi Nakamura:
Situated Spoken Dialogue with Robots Using Active Learning. Adv. Robotics 25(17): 2207-2232 (2011) - [j49]Xugang Lu, Masashi Unoki, Satoshi Nakamura:
Sub-band temporal modulation envelopes and their normalization for automatic speech recognition in reverberant environments. Comput. Speech Lang. 25(3): 571-584 (2011) - [j48]Andrew M. Finch, Keiji Yasuda, Hideo Okuma, Eiichiro Sumita, Satoshi Nakamura:
A Bayesian Model of Transliteration and Its Human Evaluation When Integrated into a Machine Translation System. IEICE Trans. Inf. Syst. 94-D(10): 1889-1900 (2011) - [j47]Xugang Lu, Shigeki Matsuda, Masashi Unoki, Satoshi Nakamura:
Temporal modulation normalization for robust speech feature extraction and recognition. Multim. Tools Appl. 52(1): 187-199 (2011) - [j46]Teruhisa Misu, Komei Sugiura, Tatsuya Kawahara, Kiyonori Ohtake, Chiori Hori, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Modeling spoken decision support dialogue and optimization of its dialogue strategy. ACM Trans. Speech Lang. Process. 7(3): 10:1-10:18 (2011) - [c236]Teruhisa Misu, Komei Sugiura, Tatsuya Kawahara, Kiyonori Ohtake, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Online Learning of Bayes Risk-Based Optimization of Dialogue Management for Document Retrieval Systems with Speech Interface. IWSDS 2011: 29-52 - [c235]Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Dialogue Acts Annotation to Construct Dialogue Systems for Consulting. IWSDS 2011: 231-254 - [c234]Shunta Ishii, Tomoki Toda, Hiroshi Saruwatari, Sakriani Sakti, Satoshi Nakamura:
Blind noise suppression for Non-Audible Murmur recognition with stereo signal processing. ASRU 2011: 494-499 - [c233]Seigo Enomoto, Yusuke Ikeda, Shiro Ise, Satoshi Nakamura:
3-D Sound Reproduction System for Immersive Environments Based on the Boundary Surface Control Principle. HCI (13) 2011: 174-184 - [c232]Kazuaki Kondo, Yasuhiro Mukaigawa, Yusuke Ikeda, Seigo Enomoto, Shiro Ise, Satoshi Nakamura, Yasushi Yagi:
Providing Immersive Virtual Experience with First-Person Perspective Omnidirectional Movies and Three Dimensional Sound Field. HCI (13) 2011: 204-213 - [c231]Sakriani Sakti, Andrew M. Finch, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Unsupervised determination of efficient Korean LVCSR units using a Bayesian Dirichlet process model. ICASSP 2011: 4664-4667 - [c230]Yu Tsao, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Increasing discriminative capability on MAP-based mapping function estimation for acoustic model adaptation. ICASSP 2011: 5320-5323 - [c229]Yu Tsao, Shigeki Matsuda, Shinsuke Sakai, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
A sampling-based environment population projection approach for rapid acoustic model adaptation. ICASSP 2011: 5504-5507 - [c228]Teruhisa Misu, Kiyonori Ohtake, Chiori Hori, Hisashi Kawai, Satoshi Nakamura:
User Study of Spoken Decision Support System. INTERSPEECH 2011: 797-800 - [c227]Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Adaptive Regularization Framework for Robust Voice Activity Detection. INTERSPEECH 2011: 2653-2656 - [c226]Sakriani Sakti, Andrew M. Finch, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Conditional Random Fields for Modeling Korean Pronunciation Variation. IWSDS 2011: 49-55 - [c225]Teruhisa Misu, Etsuo Mizukami, Yoshinori Shiga, Shinichi Kawamoto, Hisashi Kawai, Satoshi Nakamura:
Analysis on Effects of Text-to-Speech and Avatar Agent in Evoking Users' Spontaneous Listener's Reactions. IWSDS 2011: 77-89 - [c224]Teruhisa Misu, Etsuo Mizukami, Yoshinori Shiga, Shinichi Kawamoto, Hisashi Kawai, Satoshi Nakamura:
Toward Construction of Spoken Dialogue System that Evokes Users' Spontaneous Backchannels. SIGDIAL Conference 2011: 259-265 - [e3]Wolfgang Minker, Gary Geunbae Lee, Satoshi Nakamura, Joseph Mariani:
Spoken Dialogue Systems Technology and Design - International Workshop on Spoken Dialogue Systems Technology, IWSDS 2009, Kloster Irsee, Germany, December 9-11, 2009. Springer 2011, ISBN 978-1-4419-7933-9 [contents] - 2010
- [j45]Youzheng Wu, Hideki Kashioka, Satoshi Nakamura:
An Unsupervised Model of Redundancy for Answer Validation. IEICE Trans. Inf. Syst. 93-D(3): 624-634 (2010) - [j44]Xugang Lu, Shigeki Matsuda, Masashi Unoki, Satoshi Nakamura:
Temporal contrast normalization and edge-preserved smoothing of temporal modulation structures of speech for robust speech recognition. Speech Commun. 52(1): 1-11 (2010) - [c223]Komei Sugiura, Naoto Iwahashi, Hisashi Kawai, Satoshi Nakamura:
Active Learning for Generating Motion and Utterances in Object Manipulation Dialogue Tasks. AAAI Fall Symposium: Dialog with Robots 2010 - [c222]Satoshi Tamura, Chiyomi Miyajima, Norihide Kitaoka, Takeshi Yamada, Satoru Tsuge, Tetsuya Takiguchi, Kazumasa Yamamoto, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Shigeki Matsuda, Tetsuji Ogawa, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura:
CENSREC-1-AV: an audio-visual corpus for noisy bimodal speech recognition. AVSP 2010: 6 - [c221]Yoshinori Shiga, Tomoki Toda, Shinsuke Sakai, Jinfu Ni, Hisashi Kawai, Keiichi Tokuda, Minoru Tsuzaki, Satoshi Nakamura:
NICT Blizzard Challenge 2010 Entry. Blizzard Challenge 2010 - [c220]Kentaro Kayama, Akihiro Kobayashi, Etsuo Mizukami, Teruhisa Misu, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Spoken Dialog System on Plasma Display Panel Estimating Users' Interest by Image Processing. Intelligent Environments (Workshops) 2010: 4-13 - [c219]Xinhui Hu, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Cluster-based language model for spoken document retrieval using NMF-based document clustering. INTERSPEECH 2010: 705-708 - [c218]Kazuhiko Abe, Sakriani Sakti, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Brazilian portuguese acoustic model training based on data borrowing from other language. INTERSPEECH 2010: 861-864 - [c217]Sakriani Sakti, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Utilizing a noisy-channel approach for Korean LVCSR. INTERSPEECH 2010: 1513-1516 - [c216]Xinhui Hu, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Construction and evaluations of an annotated Chinese conversational corpus in travel domain for the language model of speech recognition. INTERSPEECH 2010: 1910-1913 - [c215]Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Voice activity detection in a reguarized reproducing kernel hilbert space. INTERSPEECH 2010: 3086-3089 - [c214]Komei Sugiura, Naoto Iwahashi, Hideki Kashioka, Satoshi Nakamura:
Active learning of confidence measure function in robot language acquisition framework. IROS 2010: 1774-1779 - [c213]Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Speech enhancement as a functional approximation and generalization. ISCSLP 2010: 18-22 - [c212]Yu Tsao, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
An environment structuring framework to facilitating suitable prior density estimation for MAPLR on robust speech recognition. ISCSLP 2010: 29-32 - [c211]Sakriani Sakti, Andrew M. Finch, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Korean pronunciation variation modeling with probabilistic Bayesian networks. IUCS 2010: 52-57 - [c210]Hansjörg Hofmann, Sakriani Sakti, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura, Wolfgang Minker:
Improving spontaneous English ASR using a joint-sequence pronunciation model. IUCS 2010: 58-61 - [c209]Teruhisa Misu, Kiyonori Ohtake, Chiori Hori, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Web text classification for response generation in spoken decision support dialogue systems. IUCS 2010: 131-134 - [c208]Naoto Kimura, Chiori Hori, Teruhisa Misu, Kiyonori Ohtake, Hisashi Kawai, Satoshi Nakamura:
Expansion of WFST-Based Dialog Management for Handling Multiple ASR Hypotheses. IWSDS 2010: 61-72 - [c207]Akihiro Kobayashi, Kentaro Kayama, Etsuo Mizukami, Teruhisa Misu, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Evaluation of Facial Direction Estimation from Cameras for Multi-modal Spoken Dialog System. IWSDS 2010: 73-84 - [c206]Hansjörg Hofmann, Sakriani Sakti, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura, Wolfgang Minker:
Sequence-Based Pronunciation Modeling Using a Noisy-Channel Approach. IWSDS 2010: 156-162 - [c205]Teruhisa Misu, Chiori Hori, Kiyonori Ohtake, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Construction and Experiment of a Spoken Consulting Dialogue System. IWSDS 2010: 169-175 - [c204]Etsuo Mizukami, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
A Study Toward an Evaluation Method for Spoken Dialogue Systems Considering User Criteria. IWSDS 2010: 176-181 - [c203]Teruhisa Misu, Chiori Hori, Kiyonori Ohtake, Etsuo Mizukami, Akihiro Kobayashi, Kentaro Kayama, Tetsuya Fujii, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Sightseeing Guidance Systems Based on WFST-Based Dialogue Manager. IWSDS 2010: 194-195 - [c202]Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems. LREC 2010 - [c201]Teruhisa Misu, Komei Sugiura, Kiyonori Ohtake, Chiori Hori, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy. SIGDIAL Conference 2010: 221-224 - [c200]Shin'ichi Kawamoto, Tatsuo Yotsukura, Satoshi Nakamura, Junya Yamamoto, Tsunenori Shirahama, Hakuei Yamamoto:
Integrating lip-synch into game production workflow: "Sengoku BASARA 3" (Copyright restrictions prevent ACM from providing the full text for this article). SIGGRAPH ASIA (Sketches) 2010: 2:1 - [c199]Teruhisa Misu, Komei Sugiura, Kiyonori Ohtake, Chiori Hori, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Dialogue strategy optimization to assist user's decision for spoken consulting dialogue systems. SLT 2010: 354-359 - [e2]Takao Kobayashi, Keikichi Hirose, Satoshi Nakamura:
11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, Makuhari, Chiba, Japan, September 26-30, 2010. ISCA 2010 [contents] - [e1]Gary Geunbae Lee, Joseph Mariani, Wolfgang Minker, Satoshi Nakamura:
Spoken Dialogue Systems for Ambient Environments - Second International Workshop on Spoken Dialogue Systems Technology, IWSDS 2010, Gotemba, Shizuoka, Japan, October 1-2, 2010. Proceedings. Lecture Notes in Computer Science 6392, Springer 2010, ISBN 978-3-642-16201-5 [contents]
2000 – 2009
- 2009
- [b1]Sakriani Sakti, Satoshi Nakamura, Konstantin Markov, Wolfgang Minker:
Incorporating Knowledge Sources into Statistical Speech Recognition. Lecture Notes in Electrical Engineering 42, Springer 2009, ISBN 978-0-387-85829-6, pp. 1-59 [contents] - [j43]Tobias Cincarek, Rainer Gruhn, Christian Hacker, Elmar Nöth, Satoshi Nakamura:
Automatic pronunciation scoring of words and sentences independent from the non-native's first language. Comput. Speech Lang. 23(1): 65-88 (2009) - [j42]Chiori Hori, Bing Zhao, Stephan Vogel, Alex Waibel, Hideki Kashioka, Satoshi Nakamura:
Consolidation-Based Speech Translation and Evaluation Approach. IEICE Trans. Inf. Syst. 92-D(3): 477-488 (2009) - [j41]Andrew M. Finch, Eiichiro Sumita, Satoshi Nakamura:
Class-Dependent Modeling for Dialog Translation. IEICE Trans. Inf. Syst. 92-D(12): 2469-2477 (2009) - [c198]Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Annotating Dialogue Acts to Construct Dialogue Systems for Consulting. ALR7@IJCNLP 2009: 32-39 - [c197]Xinhui Hu, Ryosuke Isotani, Satoshi Nakamura:
Construction of Chinese Segmented and POS-tagged Conversational Corpora and Their Evaluations on Spontaneous Speech Recognitions. ALR7@IJCNLP 2009: 70-75 - [c196]Xinhui Hu, Hideki Kashioka, Ryosuke Isotani, Satoshi Nakamura:
Japanese Spontaneous Spoken Document Retrieval Using NMF-Based Topic Models. AIRS 2009: 149-156 - [c195]Yu Tsao, Shigeki Matsuda, Satoshi Nakamura, Chin-Hui Lee:
MAP estimation of online mapping parameters in ensemble speaker and speaking environment modeling. ASRU 2009: 271-275 - [c194]Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura:
Weighted finite state transducer based statistical dialog management. ASRU 2009: 490-495 - [c193]Sakriani Sakti, Noriyuki Kimura, Michael Paul, Chiori Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park, Chai Wutiwiwatchai, Bo Xu, Hammam Riza, Karunesh Arora, Chi Mai Luong, Haizhou Li:
The Asian network-based speech-to-speech translation system. ASRU 2009: 507-512 - [c192]Ranniery Maia, Tomoki Toda, Shinsuke Sakai, Yoshinori Shiga, Jinfu Ni, Hisashi Kawai, Keiichi Tokuda, Minoru Tsuzaki, Satoshi Nakamura:
The NICT Entry for the Blizzard Challenge 2009: an Enhanced HMM-based Speech Synthesis System with Trajectory Training considering Global Variance and State-Dependent Mixed Excitation. Blizzard Challenge 2009 - [c191]Yoshihiro Adachi, Shinichi Kawamoto, Tatsuo Yotsukura, Shigeo Morishima, Satoshi Nakamura:
Automatic voice assignment tool for Instant Casting movie System. ICASSP 2009: 1897-1900 - [c190]Shinsuke Sakai, Tatsuya Kawahara, Tohru Shimizu, Satoshi Nakamura:
Optimal learning of P-Layer additive F0 models with cross-validation. ICASSP 2009: 4245-4248 - [c189]Jinfu Ni, Shinsuke Sakai, Tohru Shimizu, Satoshi Nakamura:
CART-based modeling of Chinese tonal patterns with a functional model tracing the fundamental frequency trajectories. ICASSP 2009: 4253-4256 - [c188]Xugang Lu, Shigeki Matsuda, Masashi Unoki, Tohru Shimizu, Satoshi Nakamura:
Temporal contrast normalization and edge-preserved smoothing on temporal modulation structure for robust speech recognition. ICASSP 2009: 4573-4576 - [c187]Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura:
Statistical dialog management applied to WFST-based dialog systems. ICASSP 2009: 4793-4796 - [c186]Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura:
Recent advances in WFST-based dialog system. INTERSPEECH 2009: 268-271 - [c185]Shinsuke Sakai, Ranniery Maia, Hisashi Kawai, Satoshi Nakamura:
A close look into the probabilistic concatenation model for corpus-based speech synthesis. INTERSPEECH 2009: 752-755 - [c184]Shigeki Matsuda, Yu Tsao, Jinyu Li, Satoshi Nakamura, Chin-Hui Lee:
A study on soft margin estimation of linear regression parameters for speaker adaptation. INTERSPEECH 2009: 1603-1606 - [c183]Ranniery Maia, Tomoki Toda, Keiichi Tokuda, Shinsuke Sakai, Satoshi Nakamura:
A decision tree-based clustering approach to state definition in an excitation modeling framework for HMM-based speech synthesis. INTERSPEECH 2009: 1783-1786 - [c182]Teruhisa Misu, Kiyonori Ohtake, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Annotating communicative function and semantic content in dialogue act for construction of consulting dialogue systems. INTERSPEECH 2009: 1843-1846 - [c181]Komei Sugiura, Naoto Iwahashi, Hideki Kashioka, Satoshi Nakamura:
Bayesian learning of confidence measure function for generation of utterances and motions in object manipulation dialogue task. INTERSPEECH 2009: 2483-2486 - [c180]Xugang Lu, Masashi Unoki, Satoshi Nakamura:
Subband temporal modulation spectrum normalization for automatic speech recognition in reverberant environments. INTERSPEECH 2009: 2503-2506 - [c179]Xugang Lu, Masashi Unoki, Satoshi Nakamura:
Normalization on the modulation spectrum of the subband temporal envelopes for automatic speech recognition in reverberant environments. IUCS 2009: 247-254 - [c178]Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura:
Evaluation for WFST-based dialog management. IUCS 2009: 255-260 - [c177]Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Dialogue act annotation for consulting dialogue corpus. IUCS 2009: 372-378 - [c176]Jinfu Ni, Shinsuke Sakai, Hisashi Kawai, Satoshi Nakamura:
Hyperbolic structure of fundamental frequency contour. IUCS 2009: 389-394 - [c175]Xinhui Hu, Ryosuke Isotani, Satoshi Nakamura:
Spoken document retrieval using topic models. IUCS 2009: 400-403 - [c174]Yu Tsao, Jinyu Li, Chin-Hui Lee, Satoshi Nakamura:
Soft margin estimation on improving environment structures for ensemble speaker and speaking environment modeling. IUCS 2009: 404-408 - [c173]Chiori Hori, Sakriani Sakti, Michael Paul, Noriyuki Kimura, Yutaka Ashikari, Ryosuke Isotani, Eiichiro Sumita, Satoshi Nakamura:
Network-based speech-to-speech translation. IWSLT 2009: 168 - [c172]Michael Paul, Hirofumi Yamamoto, Eiichiro Sumita, Satoshi Nakamura:
On the Importance of Pivot Language Selection for Statistical Machine Translation. HLT-NAACL (Short Papers) 2009: 221-224 - 2008
- [j40]Jinsong Zhang, Xinhui Hu, Satoshi Nakamura:
Using Mutual Information Criterion to Design an Efficient Phoneme Set for Chinese Speech Recognition. IEICE Trans. Inf. Syst. 91-D(3): 508-513 (2008) - [j39]Jinsong Zhang, Satoshi Nakamura:
An Improved Greedy Search Algorithm for the Development of a Phonetically Rich Speech Corpus. IEICE Trans. Inf. Syst. 91-D(3): 615-630 (2008) - [j38]Idomucogiin Dawa, Satoshi Nakamura:
A Study on Cross Transformation of Mongolian Language. Inf. Media Technol. 3(4): 888-906 (2008) - [j37]Shinichi Kawamoto, Tatsuo Yotsukura, Ken Anjyo, Satoshi Nakamura:
Efficient lip-synch tool for 3D cartoon animation. Comput. Animat. Virtual Worlds 19(3-4): 247-257 (2008) - [j36]Carlos Toshinori Ishi, Shigeki Matsuda, Takayuki Kanda, Takatoshi Jitsuhiro, Hiroshi Ishiguro, Satoshi Nakamura, Norihiro Hagita:
A Robust Speech Recognition System for Communication Robots in Noisy Environments. IEEE Trans. Robotics 24(3): 759-763 (2008) - [c171]Ranniery Maia, Jinfu Ni, Shinsuke Sakai, Tomoki Toda, Keiichi Tokuda, Tohru Shimizu, Satoshi Nakamura:
The NICT/ATR speech synthesis system for the Blizzard Challenge 2008. Blizzard Challenge 2008 - [c170]Michael Paul, Hideo Okuma, Hirofumi Yamamoto, Eiichiro Sumita, Shigeki Matsuda, Tohru Shimizu, Satoshi Nakamura:
Multilingual Mobile-Phone Translation Services for World Travelers. COLING (Demos) 2008: 165-168 - [c169]Sakriani Sakti, Eka Kelana, Hammam Riza, Shinsuke Sakai, Konstantin Markov, Satoshi Nakamura:
Development of Indonesian Large Vocabulary Continuous Speech Recognition System within A-STAR Project. IJCNLP 2008: 19-24 - [c168]Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura:
Dialog management using weighted finite-state transducers. INTERSPEECH 2008: 211-214 - [c167]Konstantin Markov, Satoshi Nakamura:
Improved novelty detection for online GMM based speaker diarization. INTERSPEECH 2008: 363-366 - [c166]Masato Nakayama, Takanobu Nishiura, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Tetsuji Ogawa, Shigeki Matsuda, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura:
CENSREC-4: development of evaluation framework for distant-talking speech recognition under reverberant environments. INTERSPEECH 2008: 968-971 - [c165]Keiichiro Oura, Yoshihiko Nankaku, Tomoki Toda, Keiichi Tokuda, Ranniery Maia, Shinsuke Sakai, Satoshi Nakamura:
Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTs Conversion Systems. ISCSLP 2008: 1-4 - [c164]Jinfu Ni, Shinsuke Sakai, Tohru Shimizu, Satoshi Nakamura:
Frequency Modulation Technique for Prosodic Modification. ISCSLP 2008: 117-120 - [c163]Xugang Lu, Shigeki Matsuda, Tohru Shimizu, Satoshi Nakamura:
Noise Reduction Based Random Matrix Theory. ISCSLP 2008: 285-288 - [c162]Xugang Lu, Shigeki Matsuda, Tohru Shimizu, Satoshi Nakamura:
Normalization on Temporal Modulation Transfer Function for Robust Speech Recognition. ISUC 2008: 16-23 - [c161]Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura:
A Statistical Approach to Expandable Spoken Dialog Systems using WFSTs. ISUC 2008: 24-27 - [c160]Jinfu Ni, Shinsuke Sakai, Tohru Shimizu, Satoshi Nakamura:
Prosody Modeling from Tone to Intonation in Chinese using a Functional F0 Model. ISUC 2008: 397-404 - [c159]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
Probabilistic Pronunciation Variation Model Based on Bayesian Network for Conversational Speech Recognition. ISUC 2008: 405-410 - [c158]Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Dialogue Act Annotation for Statistically Managed Spoken Dialogue Systems. ISUC 2008: 416-422 - [c157]Takanobu Nishiura, Masato Nakayama, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura:
Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -. LREC 2008 - [c156]Hideki Kashioka, Susumu Akamine, Takafumi Nakanishi, Hisashi Miyamori, Koji Zettsu, Yutaka Kidawara, Satoshi Nakamura:
Spoken Dialog System for Next Generation Knowledge Access. MDM 2008: 225-226 - [c155]Shinichi Kawamoto, Tatsuo Yotsukura, Shigeo Morishima, Satoshi Nakamura:
Post-recording tool for instant casting movie system. ACM Multimedia 2008: 893-896 - 2007
- [j35]Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui, Satoshi Nakamura:
Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics. IEEE Trans. Speech Audio Process. 15(1): 150-161 (2007) - [j34]Wolfgang Herbordt, Herbert Buchner, Satoshi Nakamura, Walter Kellermann:
Multichannel Bin-Wise Robust Frequency-Domain Adaptive Filtering and Its Application to Adaptive Beamforming. IEEE Trans. Speech Audio Process. 15(4): 1340-1351 (2007) - [j33]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
Incorporating Knowledge Sources Into a Statistical Acoustic Model for Spoken Language Communication Systems. IEEE Trans. Computers 56(9): 1199-1211 (2007) - [c154]Eiichiro Sumita, Tohru Shimizu, Satoshi Nakamura:
NICT-ATR Speech-to-Speech Translation System. ACL 2007 - [c153]Norihide Kitaoka, Kazumasa Yamamoto, Tomohiro Kusamizu, Seiichi Nakagawa, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura:
Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance. ASRU 2007: 607-612 - [c152]Konstantin Markov, Satoshi Nakamura:
Never-ending learning system for on-line speaker diarization. ASRU 2007: 699-704 - [c151]Jinfu Ni, Toshio Hirai, Hisashi Kawai, Tomoki Toda, Keiichi Tokuda, Minoru Tsuzaki, Shinsuke Sakai, Ranniery Maia, Satoshi Nakamura:
ATRECSS - ATR English speech corpus for speech synthesis. Blizzard Challenge 2007 - [c150]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
A method to integrate additional knowledge sources into HMM based on junction tree decomposition. EUSIPCO 2007: 2404-2408 - [c149]Jinfu Ni, Satoshi Nakamura:
Use of Poisson Processes to Generate Fundamental Frequency Contours. ICASSP (4) 2007: 825-828 - [c148]Konstantin Markov, Satoshi Nakamura:
Never-ending learning with dynamic hidden Markov network. INTERSPEECH 2007: 1437-1440 - [c147]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
An HMM acoustic model incorporating various additional knowledge sources. INTERSPEECH 2007: 2117-2120 - [c146]Yoshihiro Adachi, Shinichi Kawamoto, Shigeo Morishima, Satoshi Nakamura:
Acoustic Features for Estimation of Perceptional Similarity. PCM 2007: 306-314 - [c145]Shigeo Morishima, Shigeru Kuriyama, Shinichi Kawamoto, Tadamichi Suzuki, Masaaki Taira, Tatsuo Yotsukura, Satoshi Nakamura:
Data-driven efficient production of cartoon character animation. SIGGRAPH Sketches 2007: 76 - [c144]Shinsuke Sakai, Jinfu Ni, Ranniery Maia, Keiichi Tokuda, Minoru Tsuzaki, Tomoki Toda, Hisashi Kawai, Satoshi Nakamura:
Communicative speech synthesis with XIMERA: a first step. SSW 2007: 28-33 - 2006
- [j32]Satoshi Nakamura:
Special Section on Statistical Modeling for Speech Processing. IEICE Trans. Inf. Syst. 89-D(3): 867-868 (2006) - [j31]Masakiyo Fujimoto, Satoshi Nakamura:
A Non-stationary Noise Suppression Method Based on Particle Filtering and Polyak Averaging. IEICE Trans. Inf. Syst. 89-D(3): 922-930 (2006) - [j30]Sakriani Sakti, Satoshi Nakamura, Konstantin Markov:
Improving Acoustic Model Precision by Incorporating a Wide Phonetic Context Based on a Bayesian Framework. IEICE Trans. Inf. Syst. 89-D(3): 946-953 (2006) - [j29]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
A Hybrid HMM/BN Acoustic Model Utilizing Pentaphone-Context Dependency. IEICE Trans. Inf. Syst. 89-D(3): 954-961 (2006) - [j28]Konstantin Markov, Satoshi Nakamura:
Using Hybrid HMM/BN Acoustic Models: Design and Implementation Issues. IEICE Trans. Inf. Syst. 89-D(3): 981-988 (2006) - [j27]Shigeki Matsuda, Takatoshi Jitsuhiro, Konstantin Markov, Satoshi Nakamura:
ATR Parallel Decoding Based Speech Recognition System Robust to Noise and Speaking Styles. IEICE Trans. Inf. Syst. 89-D(3): 989-997 (2006) - [j26]Masakiyo Fujimoto, Kazuya Takeda, Satoshi Nakamura:
CENSREC-3: An Evaluation Framework for Japanese Speech Recognition in Real Car-Driving Environments. IEICE Trans. Inf. Syst. 89-D(11): 2783-2793 (2006) - [j25]Konstantin Markov, Jianwu Dang, Satoshi Nakamura:
Integration of articulatory and spectrum features based on the hybrid HMM/BN modeling framework. Speech Commun. 48(2): 161-175 (2006) - [j24]Akira Sasou, Futoshi Asano, Satoshi Nakamura, Kazuyo Tanaka:
HMM-based noise-robust feature compensation. Speech Commun. 48(9): 1100-1111 (2006) - [j23]Satoshi Nakamura, Konstantin Markov, Hiromi Nakaiwa, Gen-ichiro Kikui, Hisashi Kawai, Takatoshi Jitsuhiro, Jinsong Zhang, Hirofumi Yamamoto, Eiichiro Sumita, Seiichi Yamamoto:
The ATR multilingual speech-to-speech translation system. IEEE Trans. Speech Audio Process. 14(2): 365-376 (2006) - [c143]Tomoki Toda, Hisashi Kawai, Toshio Hirai, Jinfu Ni, Nobuyuki Nishizawa, Junichi Yamagishi, Minoru Tsuzaki, Keiichi Tokuda, Satoshi Nakamura:
Developing a Test Bed of English Text-to-Speech System XIMERA for the Blizzard Challenge 2006. Blizzard Challenge 2006 - [c142]Carlos Toshinori Ishi, Shigeki Matsuda, Takayuki Kanda, Takatoshi Jitsuhiro, Hiroshi Ishiguro, Satoshi Nakamura, Norihiro Hagita:
Robust Speech Recognition System for Communication Robots in Real Environments. Humanoids 2006: 340-345 - [c141]Jinsong Zhang, Xinhui Hu, Satoshi Nakamura:
Automatic Derivation of a Phoneme Set with Tone Information for Chinese Speech Recognition Based on Mutual Information Criterion. ICASSP (1) 2006: 337-340 - [c140]Masakiyo Fujimoto, Satoshi Nakamura:
Sequential Non-Stationary Noise Tracking Using Particle Filtering with Switching Dynamical System. ICASSP (1) 2006: 769-772 - [c139]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
Incorporation of Pentaphone-Context Dependency Based on Hybrid Hmm/Bn Acoustic Modeling Framework. ICASSP (1) 2006: 1177-1180 - [c138]Konstantin Markov, Satoshi Nakamura:
Forward-backwards training of hybrid HMM/BN acoustic models. INTERSPEECH 2006 - [c137]Satoshi Nakamura, Masakiyo Fujimoto, Kazuya Takeda:
CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition. INTERSPEECH 2006 - [c136]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
The use of Bayesian network for incorporating accent, gender and wide-context dependency information. INTERSPEECH 2006 - [c135]Hirofumi Yamamoto, Gen-ichiro Kikui, Satoshi Nakamura, Yoshinori Sagisaka:
Speech recognition of foreign out-of-vocabulary words using a hierarchical language model. INTERSPEECH 2006 - [c134]Tohru Shimizu, Yutaka Ashikari, Eiichiro Sumita, Hideki Kashioka, Satoshi Nakamura:
Development of client-server speech translation system on a multi-lingual speech communication platform. IWSLT 2006: 213-216 - [c133]Shuichi Itahashi, Chiu-yu Tseng, Satoshi Nakamura:
Oriental COCOSDA: Past, Present and Future. LREC 2006: 753-756 - [c132]Tohru Shimizu, Yutaka Ashikari, Toshiyuki Takezawa, Masahide Mizushima, Gen-ichiro Kikui, Yutaka Sasaki, Satoshi Nakamura:
Developing Client-Server Speech Translation Platform. MDM 2006: 141 - [c131]Shinichi Kawamoto, Tatsuo Yotsukura, Satoshi Nakamura:
Key-frame removal method for blendshape-based cartoon lip-sync animation. SIGGRAPH Research Posters 2006: 12 - [c130]Tatsuo Yotsukura, Shinichi Kawamoto, Satoshi Nakamura:
Lip-sync animation from HMM using dynamic features. SIGGRAPH Research Posters 2006: 13 - 2005
- [j22]Takatoshi Jitsuhiro, Satoshi Nakamura:
Automatic Generation of Non-uniform and Context-Dependent HMMs Based on the Variational Bayesian Approach. IEICE Trans. Inf. Syst. 88-D(3): 391-400 (2005) - [j21]Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui, Satoshi Nakamura:
Dialogue Speech Recognition by Combining Hierarchical Topic Classification and Language Model Switching. IEICE Trans. Inf. Syst. 88-D(3): 446-454 (2005) - [j20]Satoshi Nakamura, Kazuya Takeda, Kazumasa Yamamoto, Takeshi Yamada, Shingo Kuroiwa, Norihide Kitaoka, Takanobu Nishiura, Akira Sasou, Mitsunori Mizumachi, Chiyomi Miyajima, Masakiyo Fujimoto, Toshiki Endo:
AURORA-2J: An Evaluation Framework for Japanese Noisy Speech Recognition. IEICE Trans. Inf. Syst. 88-D(3): 535-544 (2005) - [j19]Tatsuo Yotsukura, Shigeo Morishima, Satoshi Nakamura:
Construction of Audio-Visual Speech Corpus Using Motion-Capture System and Corpus Based Facial Animation. IEICE Trans. Inf. Syst. 88-D(11): 2477-2483 (2005) - [j18]Jinsong Zhang, Satoshi Nakamura, Keikichi Hirose:
Tone nucleus-based multi-level robust acoustic tonal modeling of sentential F0 variations for Chinese continuous speech tone recognition. Speech Commun. 46(3-4): 440-454 (2005) - [j17]Donglai Zhu, Satoshi Nakamura, Kuldip K. Paliwal, Ren-Hua Wang:
Maximum likelihood sub-band adaptation for robust speech recognition. Speech Commun. 47(3): 243-264 (2005) - [c129]Wolfgang Herbordt, Satoshi Nakamura, Walter Kellermann:
Joint optimization of LCMV beamforming and acoustic echo cancellation for automatic speech recognition. ICASSP (3) 2005: 77-80 - [c128]Masakiyo Fujimoto, Satoshi Nakamura:
Particle Filter Based Non-Stationary Noise Tracking for Robust Speech Recognition. ICASSP (1) 2005: 257-260 - [c127]Tor André Myrvoll, Satoshi Nakamura:
Online cepstral filtering using a sequential EM approach with Polyak averaging and feedback. ICASSP (1) 2005: 261-264 - [c126]Konstantin Markov, Satoshi Nakamura:
Modeling Successive Frame Dependencies with Hybrid HMM/BN Acoustic Model. ICASSP (1) 2005: 701-704 - [c125]Masakiyo Fujimoto, Satoshi Nakamura, Toshiki Endo, Kazuya Takeda, Chiyomi Miyajima, Shingo Kuroiwa, Takeshi Yamada, Norihide Kitaoka, Kazumasa Yamamoto, Mitsunori Mizumachi, Takanobu Nishiura, Akira Sasou:
CENSREC-3: Data Collection for In-Car Speech Recognition and Its Common Evaluation Framework. ICDE Workshops 2005: 1208 - [c124]Takatoshi Jitsuhiro, Shigeki Matsuda, Yutaka Ashikari, Satoshi Nakamura, Ikuko Eguchi Yairi, Seiji Igi:
Spoken dialog system and its evaluation of geographic information system for elderly persons' mobility support. INTERSPEECH 2005: 197-200 - [c123]Sakriani Sakti, Satoshi Nakamura, Konstantin Markov:
Incorporating a Bayesian wide phonetic context model for acoustic rescoring. INTERSPEECH 2005: 1629-1632 - [c122]Shigeki Matsuda, Wolfgang Herbordt, Satoshi Nakamura:
Outlier detection for acoustic model training using robust statistics. INTERSPEECH 2005: 3337-3340 - [c121]Satoshi Nakamura, Takeshi Shoji, Masahiko Tsukamoto, Shojiro Nishio:
SoundWeb: Hyperlinked Voice Data for Wearable Computing Environment. ISWC 2005: 14-19 - [c120]Tatsuo Yotsukura, Shigeo Morishima, Satoshi Nakamura:
Speech to talking heads system based on hidden Markov models. SIGGRAPH Posters 2005: 27 - [c119]Shinichi Kawamoto, Tatsuo Yotsukura, Shigeo Morishima, Satoshi Nakamura:
Automatic head-movement control for emotional speech. SIGGRAPH Posters 2005: 28 - 2004
- [j16]Shigeo Morishima, Satoshi Nakamura:
Multimodal Translation System Using Texture-Mapped Lip-Sync Images for Video Mail and Automatic Dubbing Applications. EURASIP J. Adv. Signal Process. 2004(11): 1637-1647 (2004) - [j15]Toshiki Endo, Shingo Kuroiwa, Satoshi Nakamura:
Missing Feature Theory Applied to Robust Speech Recognition over IP Network. IEICE Trans. Inf. Syst. 87-D(5): 1119-1126 (2004) - [j14]Takatoshi Jitsuhiro, Tomoko Matsui, Satoshi Nakamura:
Automatic Generation of Non-uniform HMM Topologies Based on the MDL Criterion. IEICE Trans. Inf. Syst. 87-D(8): 2121-2129 (2004) - [j13]Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura:
Noise adaptive speech recognition based on sequential noise parameter estimation. Speech Commun. 42(1): 5-23 (2004) - [j12]Sadaoki Furui, Mary E. Beckman, Julia Hirschberg, Shuichi Itahashi, Tatsuya Kawahara, Satoshi Nakamura, Shrikanth S. Narayanan:
Introduction to the Special Issue on Spontaneous Speech Processing. IEEE Trans. Speech Audio Process. 12(4): 349-350 (2004) - [j11]Kazumasa Murai, Satoshi Nakamura:
A Robust Bimodal Speech Section Detection. J. VLSI Signal Process. 36(2-3): 81-90 (2004) - [j10]Panikos Heracleous, Satoshi Nakamura, Kiyohiro Shikano:
Simultaneous Recognition of Distant-Talking Speech of Multiple Talkers Based on the 3-D N-Best Search Method. J. VLSI Signal Process. 36(2-3): 105-116 (2004) - [c118]Wolfgang Herbordt, Walter Kellermann, Satoshi Nakamura:
Joint optimization of LCMV beamforming and acoustic echo cancellation. EUSIPCO 2004: 2003-2006 - [c117]Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui, Satoshi Nakamura:
Out-of-domain detection based on confidence measures from multiple topic classification. ICASSP (1) 2004: 757-760 - [c116]Takatoshi Jitsuhiro, Satoshi Nakamura:
Automatic generation of non-uniform HMM structures based on variational Bayesian approach. ICASSP (1) 2004: 805-808 - [c115]Tor André Myrvoll, Satoshi Nakamura:
Minimum mean square error filtering of noisy cepstral coefficients with applications to ASR. ICASSP (1) 2004: 977-980 - [c114]Tor André Myrvoll, Satoshi Nakamura:
Online minimum mean square error filtering of noisy cepstral coefficients using a sequential EM algorithm. INTERSPEECH 2004: 117-120 - [c113]Akira Sasou, Kazuyo Tanaka, Satoshi Nakamura, Futoshi Asano:
HMM-based feature compensation method: an evaluation using the AURORA2. INTERSPEECH 2004: 121-124 - [c112]Frank K. Soong, Wai Kit Lo, Satoshi Nakamura:
Optimal acoustic and language model weights for minimizing word verification errors. INTERSPEECH 2004: 441-444 - [c111]Konstantin Markov, Satoshi Nakamura, Jianwu Dang:
Integration of articulatory dynamic parameters in HMM/BN based speech recognition system. INTERSPEECH 2004: 561-564 - [c110]Takatoshi Jitsuhiro, Satoshi Nakamura:
Increasing the mixture components of non-uniform HMM structures based on a variational Bayesian approach. INTERSPEECH 2004: 697-700 - [c109]Sakriani Sakti, Arry Akhmad Arman, Satoshi Nakamura, Paulus Hutagaol:
Indonesian speech recognition for hearing and speaking impaired people. INTERSPEECH 2004: 1037-1040 - [c108]Rainer Gruhn, Konstantin Markov, Satoshi Nakamura:
A statistical lexicon for non-native speech recognition. INTERSPEECH 2004: 1497-1500 - [c107]Tobias Cincarek, Rainer Gruhn, Satoshi Nakamura:
Speech recognition for multiple non-native accent groups with speaker-group-dependent acoustic models. INTERSPEECH 2004: 1509-1512 - [c106]Wai Kit Lo, Frank K. Soong, Satoshi Nakamura:
Robust verification of recognized words in noise. INTERSPEECH 2004: 1665-1668 - [c105]Tatsuya Kawahara, Ian Richard Lane, Tomoko Matsui, Satoshi Nakamura:
Topic classification and verification modeling for out-of-domain utterance detection. INTERSPEECH 2004: 2197-2200 - [c104]Shigeki Matsuda, Takatoshi Jitsuhiro, Konstantin Markov, Satoshi Nakamura:
Speech recognition system robust to noise and speaking styles. INTERSPEECH 2004: 2817-2820 - [c103]Jinsong Zhang, Satoshi Nakamura, Keikichi Hirose:
Efficient tone classification of speaker independent continuous Chinese speech using anchoring based discriminating features. INTERSPEECH 2004: 2977-2980 - [c102]Wai Kit Lo, Frank K. Soong, Satoshi Nakamura:
Generalized posterior probability for minimizing verification errors at subword, word and sentence levels. ISCSLP 2004: 13-16 - [c101]Satoshi Nakamura, Konstantin Markov, Takatoshi Jitsuhiro, Jinsong Zhang, Hirofumi Yamamoto, Gen-ichiro Kikui:
Multi-lingual speech recognition system for speech-to-speech translation. IWSLT 2004: 147-154 - [c100]Tatsuo Yotsukura, Shigeo Morishima, Satoshi Nakamura:
Face expression synthesis based on a facial motion distribution chart. SIGGRAPH Posters 2004: 85 - [p1]Shinichi Kawamoto, Hiroshi Shimodaira, Tsuneo Nitta, Takuya Nishimoto, Satoshi Nakamura, Katsunobu Itou, Shigeo Morishima, Tatsuo Yotsukura, Atsuhiko Kai, Akinobu Lee, Yoichi Yamashita, Takao Kobayashi, Keiichi Tokuda, Keikichi Hirose, Nobuaki Minematsu, Atsushi Yamada, Yasuharu Den, Takehito Utsuro, Shigeki Sagayama:
Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents. Life-like characters 2004: 187-212 - 2003
- [j9]Takanobu Nishiura, Ryousuke Nishioka, Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
Multiple beamforming with source localization based on CSP analysis. Syst. Comput. Jpn. 34(5): 69-80 (2003) - [j8]Jingdong Chen, Kuldip K. Paliwal, Satoshi Nakamura:
Cepstrum derived from differentiated power spectrum for robust speech recognition. Speech Commun. 41(2-3): 469-484 (2003) - [c99]Takanobu Nishiura, Masato Nakayama, Satoshi Nakamura:
An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition. ICASSP (1) 2003: 668-671 - [c98]Jinsong Zhang, Keikichi Hirose, Satoshi Nakamura:
A multilevel framework to model the inherently confounding nature of sentential F0sentential F0 contours contours for recognizing Chinese lexical tones. ICASSP (1) 2003: 776-779 - [c97]Konstantin Markov, Satoshi Nakamura:
Hybrid HMM/BN LVCSR system integrating multiple acoustic features. ICASSP (1) 2003: 840-843 - [c96]Takanobu Nishiura, Masato Nakayama, Satoshi Nakamura:
An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition. ICME 2003: 209-212 - [c95]Akira Sasou, Futoshi Asano, Kazuyo Tanaka, Satoshi Nakamura:
Adaptation of acoustic model using the gain-adapted HMM decomposition method. INTERSPEECH 2003: 29-32 - [c94]Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui, Satoshi Nakamura:
Hierarchical topic classification for dialog speech recognition based on language model switching. INTERSPEECH 2003: 429-432 - [c93]Panikos Heracleous, Satoshi Nakamura, Kiyohiro Shikano:
A semi-blind source separation method for hands-free speech recognition of multiple talkers. INTERSPEECH 2003: 509-512 - [c92]Mitsunori Mizumachi, Satoshi Nakamura:
Noise reduction using paired-microphones on non-equally-spaced microphone arrangement. INTERSPEECH 2003: 585-588 - [c91]Donglai Zhu, Satoshi Nakamura, Kuldip K. Paliwal, Ren-Hua Wang:
Maximum likelihood sub-band weighting for robust speech recognition. INTERSPEECH 2003: 673-676 - [c90]Konstantin Markov, Jianwu Dang, Yosuke Iizuka, Satoshi Nakamura:
Hybrid HMM/BN ASR system integrating spectrum and articulatory features. INTERSPEECH 2003: 965-968 - [c89]Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura:
Model based noisy speech recognition with environment parameters estimated by noise adaptive speech recognition with prior. INTERSPEECH 2003: 1273-1276 - [c88]Takeshi Yamada, Jiro Okada, Kazuya Takeda, Norihide Kitaoka, Masakiyo Fujimoto, Shingo Kuroiwa, Kazumasa Yamamoto, Takanobu Nishiura, Mitsunori Mizumachi, Satoshi Nakamura:
Integration of noise reduction algorithms for Aurora2 task. INTERSPEECH 2003: 1769-1772 - [c87]Takanobu Nishiura, Satoshi Nakamura, Kazuhiro Miki, Kiyohiro Shikano:
Environmental sound source identification based on hidden Markov model for robust speech recognition. INTERSPEECH 2003: 2157-2160 - [c86]Futoshi Asano, Yoichi Motomura, Hideki Asoh, Takashi Yoshimura, Naoyuki Ichimura, Kiyoshi Yamamoto, Nobuhiko Kitawaki, Satoshi Nakamura:
Detection and separation of speech segment using audio and video information fusion. INTERSPEECH 2003: 2257-2260 - [c85]Takatoshi Jitsuhiro, Tomoko Matsui, Satoshi Nakamura:
Automatic generation of non-uniform context-dependent HMM topologies based on the MDL criterion. INTERSPEECH 2003: 2721-2724 - [c84]Toshiki Endo, Shingo Kuroiwa, Satoshi Nakamura:
Missing feature theory applied to robust speech recognition over IP network. INTERSPEECH 2003: 3081-3084 - [c83]Tatsuo Yotsukura, Shigeo Morishima, Satoshi Nakamura:
Model-based talking face synthesis for anthropomorphic spoken dialog agent system. ACM Multimedia 2003: 351-354 - 2002
- [j7]Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
Distant-talking speech recognition based on a 3-D Viterbi search using a microphone array. IEEE Trans. Speech Audio Process. 10(2): 48-56 (2002) - [j6]Satoshi Nakamura:
Statistical multimodal integration for audio-visual speech processing. IEEE Trans. Neural Networks 13(4): 854-866 (2002) - [c82]Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura:
Noise adaptive speech recognition in time-varying noise based on sequential kullback proximal algorithm. ICASSP 2002: 189-192 - [c81]Satoshi Nakamura, Ken'ichi Kumatani, Satoshi Tamura:
Robust bi-modal speech recognition based on state synchronous modeling and stream weight optimization. ICASSP 2002: 309-312 - [c80]Takanobu Nishiura, Satoshi Nakamura, Kiyohiro Shikano:
Talker localization in a real acoustic environment based on DOA estimation and statistical sound source identification. ICASSP 2002: 893-896 - [c79]Shigeo Morishima, Shin Ogata, Kazumasa Murai, Satoshi Nakamura:
Audio-visual speech translation with automatic lip syncqronization and face tracking based on 3-D head model. ICASSP 2002: 2117-2120 - [c78]Satoshi Nakamura, Kazuo Hiyane, Futoshi Asano, Yutaka Kaneda, Takeshi Yamada, Takanobu Nishiura, Tetsunori Kobayashi, Shiro Ise, Hiroshi Saruwatari:
Design and collection of acoustic sound data for hands-free speech recognition and sound scene understanding. ICME (2) 2002: 161-164 - [c77]Takanobu Nishiura, Satoshi Nakamura:
An evaluation of sound source identification with RWCP sound scene database in real acoustic environments. ICME (2) 2002: 265-268 - [c76]Kazumasa Murai, Satoshi Nakamura:
Real time face detection for multimodal speech recognition. ICME (2) 2002: 373-376 - [c75]Satoshi Nakamura, Panikos Heracleous:
3-D N-Best Search for Simultaneous Recognition of Distant-Talking Speech of Multiple Talkers. ICMI 2002: 59-63 - [c74]Shigeo Morishima, Satoshi Nakamura:
Multi-Modal Translation System and Its Evaluation. ICMI 2002: 241-246 - [c73]Satoshi Nakamura, Ken'ichi Kumatani, Satoshi Tamura:
Multi-Modal Temporal Asynchronicity Modeling by Product HMMs for Robust. ICMI 2002: 305-312 - [c72]Masaki Ida, Satoshi Nakamura:
HMM COmposition-based rapid model adaptation using a priori noise GMM adaptation evaluation on Aurora2 corpus. INTERSPEECH 2002: 437-440 - [c71]Kaisheng Yao, Donglai Zhu, Satoshi Nakamura:
Evaluation of a noise adaptive speech recognition system on the Aurora 3 database. INTERSPEECH 2002: 457-460 - [c70]Konstantin Markov, Satoshi Nakamura:
Modeling HMM state distributions with Bayesian networks. INTERSPEECH 2002: 1013-1016 - [c69]Sheng Gao, Jinsong Zhang, Satoshi Nakamura, Chin-Hui Lee, Tat-Seng Chua:
Weighted graph based decision tree optimization for high accuracy acoustic modeling. INTERSPEECH 2002: 1233-1236 - [c68]Takanobu Nishiura, Satoshi Nakamura, Yuka Okada, Takeshi Yamada, Kiyohiro Shikano:
Suitable design of adaptive beamformer based on average speech spectrum for noisy speech recognition. INTERSPEECH 2002: 1789-1792 - [c67]Mitsunori Mizumachi, Satoshi Nakamura:
The 2ch hybrid subtractive beamformer applied to line sound sources. INTERSPEECH 2002: 1833-1836 - [c66]Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura:
Noise adaptive speech recognition with acoustic models trained from noisy speech evaluated on Aurora-2 database. INTERSPEECH 2002: 2437-2440 - [c65]Kozo Okuda, Tatsuya Kawahara, Satoshi Nakamura:
Speaking rate compensation based on likelihood criterion in acoustic model training and decoding. INTERSPEECH 2002: 2589-2592 - [c64]Jinsong Zhang, Satoshi Nakamura:
Modeling varying pauses to develop robust acoustic models for recognizing noisy conversational speech. INTERSPEECH 2002: 2601-2604 - [c63]Hisao Kuwabara, Shuichi Itahashi, Mikio Yamamoto, Toshiyuki Takezawa, Satoshi Nakamura, Kazuya Takeda:
The Present Status of Speech Database in Japan: Development, Management, and Application to Speech Research. LREC 2002 - 2001
- [j5]Tetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano:
HMM-separation-based speech recognition for a distant moving speaker. IEEE Trans. Speech Audio Process. 9(2): 127-140 (2001) - [j4]Satoshi Nakamura, Eli Yamamoto:
Speech-to-Lip Movement Synthesis by Maximizing Audio-Visual Joint Probability Based on the EM Algorithm. J. VLSI Signal Process. 27(1-2): 119-126 (2001) - [c62]Satoshi Nakamura:
Fusion of Audio-Visual Information for Integrated Speech Processing. AVBPA 2001: 127-143 - [c61]Shigeo Morishima, Shin Ogata, Satoshi Nakamura:
Multimodal translation. AVSP 2001: 98-103 - [c60]Panikos Heracleous, Satoshi Nakamura, Kiyohiro Shikano:
A microphone array-based 3-D N-best search algorithm for the simultaneous recognition of multiple sound sources in real environments. ICASSP 2001: 193-196 - [c59]Konstantin Markov, Seiichi Nakagawa, Satoshi Nakamura:
Discriminative training of HMM using maximum normalized likelihood algorithm. ICASSP 2001: 497-500 - [c58]Ken'ichi Kumatani, Satoshi Nakamura, Kiyohiro Shikano:
An Adaptive Integration Based On Product Hmm For Audio-Visual Speech Recognition. ICME 2001 - [c57]Takafumi Misawa, Kazumasa Murai, Satoshi Nakamura, Shigeo Morishima:
Automatic Face Tracking And Model Match-Move In Video Sequence Using 3d Face Model. ICME 2001 - [c56]Shigeo Morishima, Shin Ogata, Satoshi Nakamura:
Trends of Learning Technology Standard. ICME 2001 - [c55]Kazumasa Murai, Ken'ichi Kumatani, Satoshi Nakamura:
Speech Detection By Facial Image For Multimodal Speech Recognition. ICME 2001 - [c54]Takanobu Nishiura, Rainer Gruhn, Satoshi Nakamura:
Automatic Steering Of Microphone Array And Video Camera Toward Multi-Lingual Tele-Conference Through Speech-To-Speech Translation. ICME 2001 - [c53]Shin Ogata, Kazumasa Murai, Satoshi Nakamura, Shigeo Morishima:
Model-Based Lip Synchronization With Automatically Translated Systhetic Voice Toward A Multi-Modal Translation System. ICME 2001 - [c52]Kaisheng Yao, Jingdong Chen, Kuldip K. Paliwal, Satoshi Nakamura:
Feature extraction and model-based noise compensation for noisy speech recognition evaluated on AURORA 2 task. INTERSPEECH 2001: 233-236 - [c51]Jingdong Chen, Kuldip K. Paliwal, Satoshi Nakamura:
Sub-band based additive noise removal for robust speech recognition. INTERSPEECH 2001: 571-574 - [c50]Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura:
Sequential noise compensation by a sequential kullback proximal algorithm. INTERSPEECH 2001: 1139-1142 - [c49]Kozo Okuda, Tomoko Matsui, Satoshi Nakamura:
Towards the creation of acoustic models for stressed Japanese speech. INTERSPEECH 2001: 1653-1656 - [c48]Jinsong Zhang, Shuwu Zhang, Yoshinori Sagisaka, Satoshi Nakamura:
A hybrid approach to enhance task portability of acoustic models in Chinese speech recognition. INTERSPEECH 2001: 1661-1664 - [c47]Mitsunori Mizumachi, Satoshi Nakamura:
Noise reduction using paired-microphones for both far-field and near-field sound sources. INTERSPEECH 2001: 2607-2610 - [c46]Takanobu Nishiura, Satoshi Nakamura, Kiyohiro Shikano:
Statistical sound source identification in a real acoustic environment for robust speech recognition using a microphone array. INTERSPEECH 2001: 2611-2614 - [c45]Satoshi Nakamura, Masahiko Tsukamoto, Shojiro Nishio:
A Method of Key Input with Two Mice. ISWC 2001: 13-20 - 2000
- [j3]Tetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano:
Model adaptation by HMM decomposition and composition in noisy reverberant environments. Syst. Comput. Jpn. 31(5): 77-85 (2000) - [j2]Futoshi Asano, Satoru Hayamizu, Takeshi Yamada, Satoshi Nakamura:
Speech enhancement based on the subspace method. IEEE Trans. Speech Audio Process. 8(5): 497-507 (2000) - [c44]Takanobu Nishiura, Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
Localization of multiple sound sources based on a CSP analysis with a microphone array. ICASSP 2000: 1053-1056 - [c43]Tetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano:
Speech recognition for a distant moving speaker based on HMM composition and separation. ICASSP 2000: 1403-1406 - [c42]Kiyotsugu Kakihara, Satoshi Nakamura, Kiyohiro Shikano:
Speech-to-Face Movement Synthesis based on HMMS. IEEE International Conference on Multimedia and Expo (I) 2000: 427- - [c41]Satoshi Nakamura, Hidetoshi Ito, Kiyohiro Shikano:
Stream weight optimization of speech and lip image sequence for audio-visual speech recognition. INTERSPEECH 2000: 20-24 - [c40]Satoshi Nakamura, Keiko Watanuki, Toshiyuki Takezawa, Satoru Hayamizu:
Multimodal corpora for human-machine interaction research. INTERSPEECH 2000: 25-28 - [c39]Mitsunori Mizumachi, Masato Akagi, Satoshi Nakamura:
Design of robust subtractive beamformer for noisy speech recognition. INTERSPEECH 2000: 57-60 - [c38]Jinsong Zhang, Satoshi Nakamura, Keikichi Hirose:
Discriminating Chinese lexical tones by anchoring F0 features. INTERSPEECH 2000: 87-90 - [c37]Jingdong Chen, Kuldip K. Paliwal, Satoshi Nakamura:
A block cosine transform and its application in speech recognition. INTERSPEECH 2000: 117-120 - [c36]Rainer Gruhn, Harald Singer, Hajime Tsukada, Masaki Naito, Atsushi Nishino, Atsushi Nakamura, Yoshinori Sagisaka, Satoshi Nakamura:
Cellular-phone based speech-to-speech translation system ATR-MATRIX. INTERSPEECH 2000: 448-451 - [c35]Tomoko Matsui, Masaki Naito, Yoshinori Sagisaka, Kozo Okuda, Satoshi Nakamura:
Analysis of acoustic models trained on a large-scale Japanese speech database. INTERSPEECH 2000: 503-506 - [c34]Kaisheng Yao, Bertram E. Shi, Satoshi Nakamura, Zhigang Cao:
Residual noise compensation by a sequential EM algorithm for robust speech recognition in nonstationary noise. INTERSPEECH 2000: 770-773 - [c33]Yoshinori Atake, Toshio Irino, Hideki Kawahara, Jinlin Lu, Satoshi Nakamura, Kiyohiro Shikano:
Robust fundamental frequency estimation using instantaneous frequencies of harmonic components. INTERSPEECH 2000: 907-910 - [c32]Konstantin P. Markov, Satoshi Nakamura:
Frame level likelihood transformations for ASR and utterance verification. INTERSPEECH 2000: 1038-1041 - [c31]Satoshi Nakamura, Kazuo Hiyane, Futoshi Asano, Takanobu Nishiura, Takeshi Yamada:
Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition. LREC 2000
1990 – 1999
- 1999
- [c30]Panikos Heracleous, Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
Simultaneous recognition of multiple sound sources based on 3-d n-best search using microphone array. EUROSPEECH 1999: 69-72 - [c29]Satoshi Nakamura, Kazuo Hiyane, Futoshi Asano, Takeshi Yamada, Takashi Endo:
Data collection in real acoustical environments for sound scene understanding and hands-free speech recognition. EUROSPEECH 1999 - 1998
- [j1]Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano:
Lip movement synthesis from speech based on Hidden Markov Models. Speech Commun. 26(1-2): 105-115 (1998) - [c28]Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano:
Subjective Evaluation for HMM-Based Speech-To-Lip Movement Synthesis. AVSP 1998: 227-232 - [c27]Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano:
Lip Movement Synthesis from Speech Based on Hidden Markov Models. FG 1998: 154-159 - [c26]Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
Hands-free speech recognition based on 3-D Viterbi search using a microphone array. ICASSP 1998: 245-248 - [c25]Makoto Shozakai, Satoshi Nakamura, Kiyohiro Shikano:
Robust speech recognition in car environments. ICASSP 1998: 269-272 - [c24]Hideki Banno, Jinlin Lu, Satoshi Nakamura, Kiyohiro Shikano, Hideki Kawahara:
Efficient representation of short-time phase based on group delay. ICASSP 1998: 861-864 - [c23]Alexandre Girardi, Kiyohiro Shikano, Satoshi Nakamura:
Creating speaker independent HMM models for restricted database using STRAIGHT-TEMPO morphing. ICSLP 1998 - [c22]Tetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano, Masatoshi Morishima, Toshihiro Isobe:
Evaluation of model adaptation by HMM decomposition on telephone speech recognition. ICSLP 1998 - [c21]Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
An effect of adaptive beamforming on hands-free speech recognition based on 3-d viterbi search. ICSLP 1998 - [c20]Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano:
Speech-to-lip movement synthesis based on the EM algorithm using audio-visual HMMs. ICSLP 1998 - [c19]Norimichi Yodo, Kiyohiro Shikano, Satoshi Nakamura:
Compression algorithm of trigram language models based on maximum likelihood estimation. ICSLP 1998 - [c18]Satoshi Nakamura, Eli Yamamoto, Kiyohiro Shikano:
Speech-to-lip movement synthesis maximizing audio-visual joint probability based on EM algorithm. MMSP 1998: 53-58 - 1997
- [c17]Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano:
Speech to lip movement synthesis by HMM. AVSP 1997: 137-140 - [c16]Tetsuya Takiguchi, Satoshi Nakamura, Qiang Hou, Kiyohiro Shikano:
Model adaptation based on HMM decomposition for reverberant speech recognition. ICASSP 1997: 827-830 - [c15]Alexandre Girardi, Harald Singer, Kiyohiro Shikano, Satoshi Nakamura:
Maximum likelihood successive state splitting algorithm for tied-mixture HMNET. EUROSPEECH 1997: 119-122 - [c14]Makoto Shozakai, Satoshi Nakamura, Kiyohiro Shikano:
A non-iterative model-adaptive e-CMN/PMC approach for speech recognition in car environments. EUROSPEECH 1997: 287-290 - [c13]Masaaki Inoue, Satoshi Nakamura, Takeshi Yamada, Kiyohiro Shikano:
Microphone array design measures for hands-free speech recognition. EUROSPEECH 1997: 331-334 - [c12]Satoshi Nakamura, Ron Nagai, Kiyohiro Shikano:
Improved bimodal speech recognition using tied-mixture HMMs and 5000 word audio-visual synchronous database. EUROSPEECH 1997: 1623-1626 - [c11]Satoshi Nakamura, Kiyohiro Shikano:
Room acoustics and reverberation: impact on hands-free recognition. EUROSPEECH 1997: 2419-2422 - 1996
- [c10]Satoshi Nakamura, Tetsuya Takiguchi, Kiyohiro Shikano:
Noise and room acoustics distorted speech recognition by HMM composition. ICASSP 1996: 69-72 - [c9]Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
Robust speech recognition with speaker localization by a microphone array. ICSLP 1996: 1317-1320 - 1993
- [c8]Satoshi Nakamura, Toshio Akabane, Seiji Hamaguchi:
Robust word spotting in adverse car environments. EUROSPEECH 1993: 1045-1048 - 1991
- [c7]Satoshi Nakamura, Toshio Akabane:
A neural speaker model for speaker clustering. ICASSP 1991: 853-856 - 1990
- [c6]Toshiyuki Hanazawa, Kenji Kita, Satoshi Nakamura, Takeshi Kawabata, Kiyohiro Shikano:
ATR HMM-LR continuous speech recognition system. ICASSP 1990: 53-56 - [c5]Hiroaki Hattori, Satoshi Nakamura, Kiyohiro Shikano:
Supplementation of HMM for articulatory variation in speaker adaptation. ICASSP 1990: 153-156 - [c4]Satoshi Nakamura, Kiyohiro Shikano:
A comparative study of spectral mapping for speaker adaptation. ICASSP 1990: 157-160 - [c3]Hiroaki Hattori, Satoshi Nakamura, Kiyohiro Shikano, Shigeki Sagayama:
Speaker weighted training of HMM using multiple reference speakers. ICSLP 1990: 149-152
1980 – 1989
- 1989
- [c2]Satoshi Nakamura, Kiyohiro Shikano:
Speaker adaptation applied to HMM and neural networks. ICASSP 1989: 89-92 - 1988
- [c1]Masanobu Abe, Satoshi Nakamura, Kiyohiro Shikano, Hisao Kuwabara:
Voice conversion through vector quantization. ICASSP 1988: 655-658
Coauthor Index
aka: Shin'ichi Kawamoto
aka: Konstantin P. Markov
aka: Alexander Waibel
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-10 20:49 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint