SciTePress - Publication Details
loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Jasmin Menges 1 ; Johannes Walter 2 ; Jasmin Bächle 3 and Klemens Schnattinger 3

Affiliations: 1 iRIX Software Engineering AG, Dornacherstrasse 192, 4053 Basel, Switzerland ; 2 Fraunhofer-Institut fuer Kurzzeitdynamik, Ernst-Mach-Institut, Am Klingelberg 1, 79588 Efringen-Kirchen, Germany ; 3 Business Innovation Center, Baden-Wuerttemberg Cooperative State University (DHBW), Hangstrasse 46-50, Loerrach, Germany

Keyword(s): Deep Learning, Speech Production, MRI Data.

Abstract: This paper investigates the potential of Deep Learning in the area of speech production. The purpose is to study whether algorithms are able to classify the spoken content based only on images of the oral region. With the real-time MRI data of Lim et al. more detailed insights into the speech production of the vocal tract could be obtained. In this project, the data was applied to recognize spoken letters from tongue movements using a vector-based image detection approach. In addition, to generate more data, randomization was applied. The pixel vectors of a video clip during which a certain letter was spoken could then be passed into a Deep Learning model. For this purpose, the neural networks LSTM and 3D-CNN were used. It has been proven that it is possible to classify letters with an accuracy of 93% using a 3D-CNN model.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 8.209.245.224

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Menges, J., Walter, J., Bächle, J. and Schnattinger, K. (2023). Speech Detection of Real-Time MRI Vocal Tract Data. In Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR; ISBN 978-989-758-671-2; ISSN 2184-3228, SciTePress, pages 182-187. DOI: 10.5220/0012155600003598

@conference{kdir23,
author={Jasmin Menges and Johannes Walter and Jasmin Bächle and Klemens Schnattinger},
title={Speech Detection of Real-Time MRI Vocal Tract Data},
booktitle={Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR},
year={2023},
pages={182-187},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012155600003598},
isbn={978-989-758-671-2},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR
TI - Speech Detection of Real-Time MRI Vocal Tract Data
SN - 978-989-758-671-2
IS - 2184-3228
AU - Menges, J.
AU - Walter, J.
AU - Bächle, J.
AU - Schnattinger, K.
PY - 2023
SP - 182
EP - 187
DO - 10.5220/0012155600003598
PB - SciTePress