Gerasimos Potamianos
2020 – today
- 2024
- [j17]Katerina Papadimitriou, Galini Sapountzaki, Kyriaki Vasilaki, Eleni Efthimiou, Stavroula-Evita Fotinea, Gerasimos Potamianos:
A large corpus for the recognition of Greek Sign Language gestures. Comput. Vis. Image Underst. 249: 104212 (2024) - 2023
- [c120]Katerina Papadimitriou, Gerasimos Potamianos:
Sign Language Recognition via Deformable 3D Convolutions and Modulated Graph Convolutional Networks. ICASSP 2023: 1-5 - [c119]Katerina Papadimitriou, Galini Sapountzaki, Kyriaki Vasilaki, Eleni Efthimiou, Stavroula-Evita Fotinea, Gerasimos Potamianos:
SL-REDU GSL: A Large Greek Sign Language Recognition Corpus. ICASSP Workshops 2023: 1-5 - [c118]Katerina Papadimitriou, Gerasimos Potamianos:
Multimodal Locally Enhanced Transformer for Continuous Sign Language Recognition. INTERSPEECH 2023: 1513-1517 - 2022
- [j16]Niki Efthymiou, Panagiotis Paraskevas Filntisis, Petros Koutras, Antigoni Tsiami, Jack Hadfield, Gerasimos Potamianos, Petros Maragos:
ChildBot: Multi-robot perception and interaction with children. Robotics Auton. Syst. 150: 103975 (2022) - [c117]Maria Parelli, Katerina Papadimitriou, Gerasimos Potamianos, Georgios Pavlakos, Petros Maragos:
Spatio-Temporal Graph Convolutional Networks for Continuous Sign Language Recognition. ICASSP 2022: 8457-8461 - [c116]Alexandros Koumparoulis, Gerasimos Potamianos:
Accurate and Resource-Efficient Lipreading with Efficientnetv2 and Transformers. ICASSP 2022: 8467-8471 - 2021
- [j15]Spyridon Thermos, Gerasimos Potamianos, Petros Daras:
Joint Object Affordance Reasoning and Segmentation in RGB-D Videos. IEEE Access 9: 89699-89713 (2021) - [c115]Alexandros Koumparoulis, Gerasimos Potamianos, Samuel Thomas, Edmilson da Silva Morais:
Resource-efficient TDNN Architectures for Audio-visual Speech Recognition. EUSIPCO 2021: 506-510 - [c114]Panagiotis Giannoulis, Gerasimos Potamianos, Petros Maragos:
Overlapped Sound Event Classification via Multi-Channel Sound Separation Network. EUSIPCO 2021: 571-575 - [c113]Panagiotis Paraskevas Filntisis, Niki Efthymiou, Gerasimos Potamianos, Petros Maragos:
An Audiovisual Child Emotion Recognition System for Child-Robot Interaction Applications. EUSIPCO 2021: 791-795 - [c112]Eleni Efthimiou, Stavroula-Evita Fotinea, Christina Flouda, Theodore Goulas, Gkioulan Ametoglou, Galini Sapountzaki, Katerina Papadimitriou, Gerasimos Potamianos:
The SL-ReDu Environment for Self-monitoring and Objective Learner Assessment in Greek Sign Language. HCI (8) 2021: 72-81 - [c111]Katerina Papadimitriou, Maria Parelli, Galini Sapountzaki, Georgios Pavlakos, Petros Maragos, Gerasimos Potamianos:
Multimodal Fusion and Sequence Learning for Cued Speech Recognition from Videos. HCI (8) 2021: 277-290 - [c110]Niki Efthymiou, Panagiotis Paraskevas Filntisis, Gerasimos Potamianos, Petros Maragos:
A robotic edutainment framework for designing child-robot interaction scenarios. PETRA 2021: 160-166 - 2020
- [j14]Spyridon Thermos, Georgios Th. Papadopoulos, Petros Daras, Gerasimos Potamianos:
Deep sensorimotor learning for RGB-D object recognition. Comput. Vis. Image Underst. 190 (2020) - [c109]Maria Parelli, Katerina Papadimitriou, Gerasimos Potamianos, Georgios Pavlakos, Petros Maragos:
Exploiting 3D Hand Pose Estimation in Deep Learning-Based Sign Language Recognition from RGB Videos. ECCV Workshops (2) 2020: 249-263 - [c108]Panagiotis Paraskevas Filntisis, Niki Efthymiou, Gerasimos Potamianos, Petros Maragos:
Emotion Understanding in Videos Through Body, Context, and Visual-Semantic Embedding Loss. ECCV Workshops (1) 2020: 747-755 - [c107]Katerina Papadimitriou, Gerasimos Potamianos:
A Fully Convolutional Sequence Learning Approach for Cued Speech Recognition from Videos. EUSIPCO 2020: 326-330 - [c106]Spyridon Thermos, Petros Daras, Gerasimos Potamianos:
A Deep Learning Approach to Object Affordance Segmentation. ICASSP 2020: 2358-2362 - [c105]Alexandros Koumparoulis, Gerasimos Potamianos, Samuel Thomas, Edmilson da Silva Morais:
Audio-Assisted Image Inpainting for Talking Faces. ICASSP 2020: 7664-7668 - [c104]Katerina Papadimitriou, Gerasimos Potamianos:
Multimodal Sign Language Recognition via Temporal Deformable Convolutional Sequence Learning. INTERSPEECH 2020: 2752-2756 - [c103]Alexandros Koumparoulis, Gerasimos Potamianos, Samuel Thomas, Edmilson da Silva Morais:
Resource-Adaptive Deep Learning for Visual Speech Recognition. INTERSPEECH 2020: 3510-3514 - [c102]Gerasimos Potamianos, Katerina Papadimitriou, Eleni Efthimiou, Stavroula-Evita Fotinea, Galini Sapountzaki, Petros Maragos:
SL-ReDu: Greek sign language recognition for educational applications. Project description and early results. PETRA 2020: 59:1-59:6 - [i5]Spyridon Thermos, Petros Daras, Gerasimos Potamianos:
A Deep Learning Approach to Object Affordance Segmentation. CoRR abs/2004.08644 (2020) - [i4]Niki Efthymiou, Panagiotis Paraskevas Filntisis, Petros Koutras, Antigoni Tsiami, Jack Hadfield, Gerasimos Potamianos, Petros Maragos:
ChildBot: Multi-Robot Perception and Interaction with Children. CoRR abs/2008.12818 (2020) - [i3]Panagiotis Paraskevas Filntisis, Niki Efthymiou, Gerasimos Potamianos, Petros Maragos:
Emotion Understanding in Videos Through Body, Context, and Visual-Semantic Embedding Loss. CoRR abs/2010.16396 (2020)
2010 – 2019
- 2019
- [j13]Panagiotis Giannoulis, Gerasimos Potamianos, Petros Maragos:
Room-localized speech activity detection in multi-microphone smart homes. EURASIP J. Audio Speech Music. Process. 2019: 15 (2019) - [j12]Panagiotis Paraskevas Filntisis, Niki Efthymiou, Petros Koutras, Gerasimos Potamianos, Petros Maragos:
Fusing Body Posture With Facial Expressions for Joint Recognition of Affect in Child-Robot Interaction. IEEE Robotics Autom. Lett. 4(4): 4011-4018 (2019) - [c101]Sotirios Panagiotis Chytas, Gerasimos Potamianos:
Hierarchical Detection of Sound Events and their Localization Using Convolutional Neural Networks with Adaptive Thresholds. DCASE 2019: 50-54 - [c100]Katerina Papadimitriou, Gerasimos Potamianos:
Fingerspelled Alphabet Sign Recognition in Upper-Body Videos. EUSIPCO 2019: 1-5 - [c99]Katerina Papadimitriou, Gerasimos Potamianos:
End-to-End Convolutional Sequence Learning for ASL Fingerspelling Recognition. INTERSPEECH 2019: 2315-2319 - [c98]Alexandros Koumparoulis, Gerasimos Potamianos:
MobiLipNet: Resource-Efficient Deep Learning Based Lipreading. INTERSPEECH 2019: 2763-2767 - [e3]Sharon L. Oviatt, Björn W. Schuller, Philip R. Cohen, Daniel Sonntag, Gerasimos Potamianos, Antonio Krüger:
The Handbook of Multimodal-Multisensor Interfaces: Language Processing, Software, Commercialization, and Emerging Directions - Volume 3. Association for Computing Machinery 2019, ISBN 978-1-970001-75-4 - [i2]Panagiotis Paraskevas Filntisis, Niki Efthymiou, Petros Koutras, Gerasimos Potamianos, Petros Maragos:
Fusing Body Posture with Facial Expressions for Joint Recognition of Affect in Child-Robot Interaction. CoRR abs/1901.01805 (2019) - 2018
- [c97]Panagiotis Giannoulis, Gerasimos Potamianos, Petros Maragos:
Multi-Channel Non-Negative Matrix Factorization for Overlapped Acoustic Event Detection. EUSIPCO 2018: 857-861 - [c96]Katerina Papadimitriou, Gerasimos Potamianos:
A Hybrid Approach to Hand Detection and Type Classification in Upper-Body Videos. EUVIP 2018: 1-6 - [c95]Antigoni Tsiami, Panagiotis Paraskevas Filntisis, Niki Efthymiou, Petros Koutras, Gerasimos Potamianos, Petros Maragos:
Far-Field Audio-Visual Scene Perception of Multi-Party Human-Robot Interaction for Children and Adults. ICASSP 2018: 6568-6572 - [c94]Spyridon Thermos, Georgios Th. Papadopoulos, Petros Daras, Gerasimos Potamianos:
Attention-Enhanced Sensorimotor Object Recognition. ICIP 2018: 336-340 - [c93]Niki Efthymiou, Petros Koutras, Panagiotis Paraskevas Filntisis, Gerasimos Potamianos, Petros Maragos:
Multi-View Fusion for Action Recognition in Child-Robot Interaction. ICIP 2018: 455-459 - [c92]Antigoni Tsiami, Petros Koutras, Niki Efthymiou, Panagiotis Paraskevas Filntisis, Gerasimos Potamianos, Petros Maragos:
Multi3: Multi-Sensory Perception System for Multi-Modal Child Interaction with Multiple Robots. ICRA 2018: 1-8 - [c91]Jack Hadfield, Petros Koutras, Niki Efthymiou, Gerasimos Potamianos, Costas S. Tzafestas, Petros Maragos:
Object Assembly Guidance in Child-Robot Interaction using RGB-D based 3D Tracking. IROS 2018: 347-354 - [c90]Alexandros Koumparoulis, Gerasimos Potamianos:
Deep View2View Mapping for View-Invariant Lipreading. SLT 2018: 588-594 - [e2]Sharon L. Oviatt, Björn W. Schuller, Philip R. Cohen, Daniel Sonntag, Gerasimos Potamianos, Antonio Krüger:
The Handbook of Multimodal-Multisensor Interfaces: Signal Processing, Architectures, and Detection of Emotion and Cognition - Volume 2. Association for Computing Machinery 2018, ISBN 978-1-970001-71-6 - 2017
- [j11]Isidoros Rodomagoulakis, Athanasios Katsamanis, Gerasimos Potamianos, Panagiotis Giannoulis, Antigoni Tsiami, Petros Maragos:
Room-localized spoken command recognition in multi-room, multi-microphone environments. Comput. Speech Lang. 46: 419-443 (2017) - [c89]Alexandros Koumparoulis, Gerasimos Potamianos, Youssef Mroueh, Steven J. Rennie:
Exploring ROI size in deep learning based lipreading. AVSP 2017: 64-69 - [c88]Spyridon Thermos, Georgios Th. Papadopoulos, Petros Daras, Gerasimos Potamianos:
Deep Affordance-Grounded Sensorimotor Object Recognition. CVPR 2017: 49-57 - [c87]Panagiotis Giannoulis, Gerasimos Potamianos, Petros Maragos:
On the Joint Use of NMF and Classification for Overlapping Acoustic Event Detection. IWCIM@EUSIPCO 2017: 90 - [p4]Gerasimos Potamianos, Etienne Marcheret, Youssef Mroueh, Vaibhava Goel, Alexandros Koumbaroulis, Argyrios Vartholomaios, Spyridon Thermos:
Audio and visual modality combination in speech processing applications. The Handbook of Multimodal-Multisensor Interfaces, Volume 1 (1) 2017: 489-543 - [e1]Sharon L. Oviatt, Björn W. Schuller, Philip R. Cohen, Daniel Sonntag, Gerasimos Potamianos, Antonio Krüger:
The Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations - Volume 1. ACM 2017, ISBN 978-1-970001-67-9 - [i1]Spyridon Thermos, Georgios Th. Papadopoulos, Petros Daras, Gerasimos Potamianos:
Deep Affordance-grounded Sensorimotor Object Recognition. CoRR abs/1704.02787 (2017) - 2016
- [c86]Panagiotis Giannoulis, Gerasimos Potamianos, Petros Maragos, Athanasios Katsamanis:
Improved Dictionary Selection and Detection Schemes in Sparse-CNMF-Based Overlapping Acoustic Event Detection. DCASE 2016: 25-29 - [c85]Spyridon Thermos, Gerasimos Potamianos:
Audio-visual speech activity detection in a two-speaker scenario incorporating depth information from a profile or frontal view. SLT 2016: 579-584 - 2015
- [c84]Etienne Marcheret, Gerasimos Potamianos, Josef Vopicka, Vaibhava Goel:
Scattering vs. discrete cosine transform features in visual speech processing. AVSP 2015: 175-180 - [c83]Panagiotis Giannoulis, Alessio Brutti, Marco Matassoni, Alberto Abad, Athanasios Katsamanis, Miguel Matos, Gerasimos Potamianos, Petros Maragos:
Multi-room speech activity detection using a distributed microphone network in domestic environments. EUSIPCO 2015: 1271-1275 - [c82]Z.-I. Skordilis, Antigoni Tsiami, Petros Maragos, Gerasimos Potamianos, Luca Spelgatti, Roberto Sannino:
Multichannel speech enhancement using MEMS microphones. ICASSP 2015: 2729-2733 - [c81]Etienne Marcheret, Gerasimos Potamianos, Josef Vopicka, Vaibhava Goel:
Detecting audio-visual synchrony using deep neural networks. INTERSPEECH 2015: 548-552 - 2014
- [c80]Panagiotis Giannoulis, Gerasimos Potamianos, Athanasios Katsamanis, Petros Maragos:
Multi-microphone fusion for detection of speech and acoustic events in smart spaces. EUSIPCO 2014: 2375-2379 - [c79]Antigoni Tsiami, Athanasios Katsamanis, Petros Maragos, Gerasimos Potamianos:
Experiments in acoustic source localization using sparse arrays in adverse indoors environments. EUSIPCO 2014: 2390-2394 - [c78]Georgios Floros, Konstantinos Kyritsis, Gerasimos Potamianos:
Database and baseline system for detecting degraded traffic signs in urban environments. EUVIP 2014: 1-5 - [c77]Panagiotis Giannoulis, Antigoni Tsiami, Isidoros Rodomagoulakis, Athanasios Katsamanis, Gerasimos Potamianos, Petros Maragos:
The Athena-RC system for speech activity detection and speaker localization in the DIRHA smart home. HSCMA 2014: 167-171 - [c76]Athanasios Katsamanis, Isidoros Rodomagoulakis, Gerasimos Potamianos, Petros Maragos, Antigoni Tsiami:
Robust far-field spoken command recognition for home automation combining adaptation and multichannel processing. ICASSP 2014: 5547-5551 - [c75]Antigoni Tsiami, Isidoros Rodomagoulakis, Panagiotis Giannoulis, Athanasios Katsamanis, Gerasimos Potamianos, Petros Maragos:
ATHENA: a Greek multi-sensory database for home automation control. INTERSPEECH 2014: 1608-1612 - 2013
- [c74]Isidoros Rodomagoulakis, Gerasimos Potamianos, Petros Maragos:
Advances in Large Vocabulary Continuous Speech Recognition in Greek: Modeling and nonlinear features. EUSIPCO 2013: 1-5 - [c73]Georgios Galatas, Gerasimos Potamianos, Fillia Makedon:
Robust Multi-Modal Speech Recognition in Two Languages Utilizing Video and Distance Information from the Kinect. HCI (4) 2013: 43-48 - [c72]Isidoros Rodomagoulakis, Panagiotis Giannoulis, Z.-I. Skordilis, Petros Maragos, Gerasimos Potamianos:
Experiments on far-field multichannel speech processing in smart homes. DSP 2013: 1-6 - 2012
- [c71]Georgios Galatas, Gerasimos Potamianos, Fillia Makedon:
Audio-visual speech recognition incorporating facial depth information captured by the Kinect. EUSIPCO 2012: 2714-2717 - [c70]Panagiotis Giannoulis, Gerasimos Potamianos:
A hierarchical approach with feature selection for emotion recognition from speech. LREC 2012: 1203-1206 - [c69]Georgios Galatas, Gerasimos Potamianos, Fillia Makedon:
Audio-visual speech recognition using depth information from the Kinect in noisy video conditions. PETRA 2012: 2 - 2011
- [j10]S.-H. Gary Chan, J. Li, Pascal Frossard, Gerasimos Potamianos:
Special Section on Interactive Multimedia. IEEE Trans. Multim. 13(5): 841-843 (2011) - [c68]Georgios Galatas, Gerasimos Potamianos, Dimitrios I. Kosmopoulos, Christopher McMurrough, Fillia Makedon:
Bilingual corpus for AVASR using multiple sensors and depth information. AVSP 2011: 103-106 - [c67]Georgios Galatas, Gerasimos Potamianos, Alexandros Papangelis, Fillia Makedon:
Audio visual speech recognition in noisy visual environments. PETRA 2011: 19 - 2010
- [c66]Lae-Hoon Kim, Mark Hasegawa-Johnson, Gerasimos Potamianos, Vit Libal:
Joint estimation of DOA and speech based on EM beamforming. ICASSP 2010: 121-124 - [p3]Alexander Waibel, Rainer Stiefelhagen, Rolf Carlson, Josep R. Casas, Jan Kleindienst, Lori Lamel, Oswald Lanz, Djamel Mostefa, Maurizio Omologo, Fabio Pianesi, Lazaros Polymenakos, Gerasimos Potamianos, John Soldatos, Gerhard Sutschet, Jacques M. B. Terken:
Computers in the Human Interaction Loop. Handbook of Ambient Intelligence and Smart Environments 2010: 1071-1116
2000 – 2009
- 2009
- [c65]Gerasimos Potamianos:
Audio-visual automatic speech recognition and related bimodal speech technologies: A review of the state-of-the-art and open problems. ASRU 2009: 22 - [c64]Kshitiz Kumar, Jirí Navrátil, Etienne Marcheret, Vit Libal, Ganesh N. Ramaswamy, Gerasimos Potamianos:
Audio-visual speech synchronization detection using a bimodal linear prediction model. CVPR Workshops 2009: 53-59 - [c63]Xiaodan Zhuang, Jing Huang, Gerasimos Potamianos, Mark Hasegawa-Johnson:
Acoustic fall detection using Gaussian mixture models and GMM supervectors. ICASSP 2009: 69-72 - [c62]Jing Huang, Xiaodan Zhuang, Vit Libal, Gerasimos Potamianos:
Long-time span acoustic activity analysis from far-field sensors in smart homes. ICASSP 2009: 4173-4176 - [c61]Kshitiz Kumar, Jirí Navrátil, Etienne Marcheret, Vit Libal, Gerasimos Potamianos:
Robust audio-visual speech synchrony detection by generalized bimodal linear prediction. INTERSPEECH 2009: 2251-2254 - [c60]Vit Libal, Bhuvana Ramabhadran, Nadia Mana, Fabio Pianesi, Paul Chippendale, Oswald Lanz, Gerasimos Potamianos:
Multimodal Classification of Activities of Daily Living Inside Smart Homes. IWANN (2) 2009: 687-694 - [p2]Keni Bernardin, Rainer Stiefelhagen, Aristodemos Pnevmatikakis, Oswald Lanz, Alessio Brutti, Josep R. Casas, Gerasimos Potamianos:
Person Tracking. Computers in the Human Interaction Loop 2009: 11-22 - [p1]Gerasimos Potamianos, Lori Lamel, Matthias Wölfel, Jing Huang, Etienne Marcheret, Claude Barras, Xuan Zhu, John W. McDonough, Javier Hernando, Dusan Macho, Climent Nadeu:
Automatic Speech Recognition. Computers in the Human Interaction Loop 2009: 43-59 - 2008
- [c59]Patrick Lucey, Gerasimos Potamianos, Sridha Sridharan:
Patch-based analysis of visual speech from multiple views. AVSP 2008: 69-74 - [c58]Rajesh Balchandran, Mark E. Epstein, Gerasimos Potamianos, Ladislav Serédi:
A multi-modal spoken dialog system for interactive TV. ICMI 2008: 191-192 - 2007
- [j9]Djamel Mostefa, Nicolas Moreau, Khalid Choukri, Gerasimos Potamianos, Stephen M. Chu, Ambrish Tyagi, Josep R. Casas, Jordi Turmo, Luca Cristoforetti, Francesco Tobia, Aristodemos Pnevmatikakis, Vasileios Mylonakis, Fotios Talantzis, Susanne Burger, Rainer Stiefelhagen, Keni Bernardin, Cedrick Rochet:
The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms. Lang. Resour. Evaluation 41(3-4): 389-407 (2007) - [j8]ZhenQiu Zhang, Gerasimos Potamianos, Andrew W. Senior, Thomas S. Huang:
Joint face and head tracking inside multi-camera smart rooms. Signal Image Video Process. 1(2): 163-178 (2007) - [c57]Patrick Lucey, Gerasimos Potamianos, Sridha Sridharan:
An extended pose-invariant lipreading system. AVSP 2007 - [c56]Jing Huang, Etienne Marcheret, Karthik Visweswariah, Vit Libal, Gerasimos Potamianos:
The IBM Rich Transcription 2007 Speech-to-Text Systems for Lecture Meetings. CLEAR 2007: 429-441 - [c55]Jing Huang, Etienne Marcheret, Karthik Visweswariah, Gerasimos Potamianos:
The IBM RT07 Evaluation Systems for Speaker Diarization on Lecture Meetings. CLEAR 2007: 497-508 - [c54]Ambrish Tyagi, Mark A. Keck, James W. Davis, Gerasimos Potamianos:
Kernel-Based 3D Tracking. CVPR 2007 - [c53]Etienne Marcheret, Vit Libal, Gerasimos Potamianos:
Dynamic Stream Weight Modeling for Audio-Visual Speech Recognition. ICASSP (4) 2007: 945-948 - [c52]Patrick Lucey, Gerasimos Potamianos, Sridha Sridharan:
A unified approach to multi-pose audio-visual ASR. INTERSPEECH 2007: 650-653 - [c51]Jing Huang, Etienne Marcheret, Karthik Visweswariah, Vit Libal, Gerasimos Potamianos:
Detection, diarization, and transcription of far-field lecture speech. INTERSPEECH 2007: 2161-2164 - [c50]Vit Libal, Jonathan Connell, Gerasimos Potamianos, Etienne Marcheret:
An Embedded System for In-Vehicle Visual Speech Activity Detection. MMSP 2007: 255-258 - 2006
- [c49]Gerasimos Potamianos, ZhenQiu Zhang:
A Joint System for Single-Person 2D-Face and 3D-Head Tracking in CHIL Seminars. CLEAR 2006: 105-118 - [c48]ZhenQiu Zhang, Gerasimos Potamianos, Ming Liu, Thomas S. Huang:
Robust Multi-View Multi-Camera Face Detection inside Smart Rooms Using Spatio-Temporal Dynamic Programming. FGR 2006: 407-412 - [c47]ZhenQiu Zhang, Gerasimos Potamianos, Stephen M. Chu, Jilin Tu, Thomas S. Huang:
Person Tracking in Smart Rooms using Dynamic Programming and Adaptive Subspace Learning. ICME 2006: 2061-2064 - [c46]Gerasimos Potamianos, Patrick Lucey:
Audio-Visual ASR from Multiple Views inside Smart Rooms. MFI 2006: 35-40 - [c45]Etienne Marcheret, Gerasimos Potamianos, Karthik Visweswariah, Jing Huang:
The IBM RT06s Evaluation System for Speech Activity Detection in CHIL Seminars. MLMI 2006: 323-335 - [c44]Jing Huang, Martin Westphal, Stanley F. Chen, Olivier Siohan, Daniel Povey, Vit Libal, Alvaro Soneiro, Henrik Schulz, Thomas Ross, Gerasimos Potamianos:
The IBM Rich Transcription Spring 2006 Speech-to-Text System for Lecture Meetings. MLMI 2006: 432-443 - [c43]Patrick Lucey, Gerasimos Potamianos:
Lipreading Using Profile Versus Frontal Views. MMSP 2006: 24-28 - 2005
- [c42]Gerasimos Potamianos, Patricia Scanlon:
Exploiting lower face symmetry in appearance-based automatic speechreading. AVSP 2005: 79-84 - [c41]ZhenQiu Zhang, Gerasimos Potamianos, Andrew W. Senior, Stephen M. Chu, Thomas S. Huang:
A Joint System for Person Tracking and Face Detection. ICCV-HCI 2005: 47-59 - [c40]Dusan Macho, Jaume Padrell, Alberto Abad, Climent Nadeu, Javier Hernando, John W. McDonough, Matthias Wölfel, Ulrich Klee, Maurizio Omologo, Alessio Brutti, Piergiorgio Svaizer, Gerasimos Potamianos, Stephen M. Chu:
Automatic Speech Activity Detection, Source Localization, and Speech Recognition on the Chil Seminar Corpus. ICME 2005: 876-879 - [c39]Jintao Jiang, Gerasimos Potamianos, Giridharan Iyengar:
Improved face finding in visually challenging environments. ICME 2005: 1078-1081 - [c38]Etienne Marcheret, Karthik Visweswariah, Gerasimos Potamianos:
Speech activity detection fusing acoustic phonetic and energy features. INTERSPEECH 2005: 241-244 - [c37]Stephen M. Chu, Etienne Marcheret, Gerasimos Potamianos:
Automatic Speech Recognition and Speech Activity Detection in the CHIL Smart Room. MLMI 2005: 332-343 - 2004
- [j7]Jing Huang, Gerasimos Potamianos, Jonathan Connell, Chalapathy Neti:
Audio-visual speech recognition using an infrared headset. Speech Commun. 44(1-4): 83-96 (2004) - [c36]Gerasimos Potamianos, Chalapathy Neti, Jing Huang, Jonathan H. Connell, Stephen M. Chu, Vit Libal, Etienne Marcheret, Norman Haas, Jintao Jiang:
Towards practical deployment of audio-visual speech recognition. ICASSP (3) 2004: 777-780 - [c35]Jintao Jiang, Gerasimos Potamianos, Harriet J. Nock, Giridharan Iyengar, Chalapathy Neti:
Improved face and feature finding for audio-visual speech recognition in visually challenging environments. ICASSP (5) 2004: 873-876 - [c34]Stephen M. Chu, Vit Libal, Etienne Marcheret, Chalapathy Neti, Gerasimos Potamianos:
Multistage information fusion for audio-visual speech recognition. ICME 2004: 1651-1654 - [c33]Patricia Scanlon, Gerasimos Potamianos, Vit Libal, Stephen M. Chu:
Mutual information based visual feature selection for lipreading. INTERSPEECH 2004: 2037-2040 - [c32]Etienne Marcheret, Stephen M. Chu, Vaibhava Goel, Gerasimos Potamianos:
Efficient likelihood computation in multi-stream HMM based audio-visual speech recognition. INTERSPEECH 2004: 2297-2300 - 2003
- [j6]Gerasimos Potamianos, Chalapathy Neti, Guillaume Gravier, Ashutosh Garg, Andrew W. Senior:
Recent advances in the automatic recognition of audiovisual speech. Proc. IEEE 91(9): 1306-1326 (2003) - [c31]Gerasimos Potamianos, Chalapathy Neti, Sabine Deligne:
Joint audio-visual speech processing for recognition and enhancement. AVSP 2003: 95-104 - [c30]Jing Huang, Gerasimos Potamianos, Chalapathy Neti:
Improving audio-visual speech recognition with an infrared headset. AVSP 2003: 175-178 - [c29]Ashutosh Garg, Gerasimos Potamianos, Chalapathy Neti, Thomas S. Huang:
Frame-dependent multi-stream reliability indicators for audio-visual speech recognition. ICASSP (1) 2003: 24-27 - [c28]Upendra V. Chaudhari, Ganesh N. Ramaswamy, Gerasimos Potamianos, Chalapathy Neti:
Audio-visual speaker recognition using time-varying stream reliability prediction. ICASSP (5) 2003: 712-715 - [c27]Upendra V. Chaudhari, Ganesh N. Ramaswamy, Gerasimos Potamianos, Chalapathy Neti:
Information fusion and decision cascading for audio-visual speaker recognition based on time-varying stream reliability prediction. ICME 2003: 9-12 - [c26]Jonathan H. Connell, Norman Haas, Etienne Marcheret, Chalapathy Neti, Gerasimos Potamianos, Senem Velipasalar:
A real-time prototype for small-vocabulary audio-visual ASR. ICME 2003: 469-472 - [c25]Ashutosh Garg, Gerasimos Potamianos, Chalapathy Neti, Thomas S. Huang:
Frame-dependent multi-stream reliability indicators for audio-visual speech recognition. ICME 2003: 605-608 - [c24]Gerasimos Potamianos, Chalapathy Neti:
Audio-visual speech recognition in challenging environments. INTERSPEECH 2003: 1293-1296 - 2002
- [j5]Chalapathy Neti, Gerasimos Potamianos, Juergen Luettin, Eric Vatikiotis-Bateson:
Editorial. EURASIP J. Adv. Signal Process. 2002(11): 1151-1153 (2002) - [c23]Guillaume Gravier, Scott Axelrod, Gerasimos Potamianos, Chalapathy Neti:
Maximum entropy and MCE based HMM stream weight estimation for audio-visual ASR. ICASSP 2002: 853-856 - [c22]Roland Goecke, Gerasimos Potamianos, Chalapathy Neti:
Noisy audio feature enhancement using audio-visual speech data. ICASSP 2002: 2025-2028 - [c21]Sabine Deligne, Gerasimos Potamianos, Chalapathy Neti:
Audio-visual speech enhancement with AVCDCN (audio-visual codebook dependent cepstral normalization). INTERSPEECH 2002: 1449-1452 - 2001
- [j4]Gerasimos Potamianos, Chalapathy Neti, Giridharan Iyengar, Andrew W. Senior, Ashish Verma:
A Cascade Visual Front End for Speaker Independent Automatic Speechreading. Int. J. Speech Technol. 4(3-4): 193-208 (2001) - [c20]Gerasimos Potamianos, Chalapathy Neti:
Automatic speechreading of impaired speech. AVSP 2001: 177-182 - [c19]Gerasimos Potamianos, Juergen Luettin, Chalapathy Neti:
Hierarchical discriminant features for audio-visual LVCSR. ICASSP 2001: 165-168 - [c18]Juergen Luettin, Gerasimos Potamianos, Chalapathy Neti:
Asynchronous stream modeling for large vocabulary audio-visual speech recognition. ICASSP 2001: 169-172 - [c17]Hervé Glotin, D. Vergyri, Chalapathy Neti, Gerasimos Potamianos, Juergen Luettin:
Weighting schemes for audio-visual fusion in speech recognition. ICASSP 2001: 173-176 - [c16]Gerasimos Potamianos, Chalapathy Neti:
Improved ROI and within frame discriminant features for lipreading. ICIP (3) 2001: 250-253 - [c15]Iain A. Matthews, Gerasimos Potamianos, Chalapathy Neti, Juergen Luettin:
A Comparison Of Model And Transform-Based Visual Features For Audio-Visual LVCSR. ICME 2001 - [c14]Gerasimos Potamianos, Chalapathy Neti, Giridharan Iyengar, Eric Helmuth:
Large-vocabulary audio-visual speech recognition by machines and humans. INTERSPEECH 2001: 1027-1030 - [c13]Giridharan Iyengar, Gerasimos Potamianos, Chalapathy Neti, Tanveer A. Faruquie, Ashish Verma:
Robust detection of visual ROI for automatic speechreading. MMSP 2001: 79-84 - [c12]Chalapathy Neti, Gerasimos Potamianos, Juergen Luettin, Iain A. Matthews, Hervé Glotin, Dimitra Vergyri:
Large-vocabulary audio-visual speech recognition: a summary of the Johns Hopkins Summer 2000 Workshop. MMSP 2001: 619-624 - 2000
- [c11]Eric Cosatto, Gerasimos Potamianos, Hans Peter Graf:
Audio-Visual Unit Selection for the Synthesis of Photo-Realistic Talking-Heads. IEEE International Conference on Multimedia and Expo (II) 2000: 619-622 - [c10]Gerasimos Potamianos, Ashish Verma, Chalapathy Neti, Giridharan Iyengar, Sankar Basu:
A Cascade Image Transform for Speaker Independent Automatic Speech Reading. IEEE International Conference on Multimedia and Expo (II) 2000: 1097- - [c9]Chalapathy Neti, Giridharan Iyengar, Gerasimos Potamianos, Andrew W. Senior, Benoît Maison:
Perceptual interfaces for information interaction: joint processing of audio and visual information for human-computer interaction. INTERSPEECH 2000: 11-14 - [c8]Gerasimos Potamianos, Chalapathy Neti:
Stream confidence estimation for audio-visual speech recognition. INTERSPEECH 2000: 746-749
1990 – 1999
- 1999
- [c7]Gerasimos Potamianos, Alexandros Potamianos:
Speaker adaptation for audio-visual speech recognition. EUROSPEECH 1999: 1291-1294 - 1998
- [j3]Gerasimos Potamianos, Frederick Jelinek:
A study of n-gram and decision tree letter language modeling methods. Speech Commun. 24(3): 171-192 (1998) - [c6]Gerasimos Potamianos, Hans Peter Graf:
Discriminative training of HMM stream exponents for audio-visual speech recognition. ICASSP 1998: 3733-3736 - [c5]Gerasimos Potamianos, Hans Peter Graf, Eric Cosatto:
An Image Transform Approach for HMM based Automatic Lipreading. ICIP (3) 1998: 173-177 - [c4]Gerasimos Potamianos, Hans Peter Graf:
Linear discriminant analysis for speechreading. MMSP 1998: 221-226 - 1997
- [j2]Gerasimos Potamianos, John K. Goutsias:
Stochastic approximation algorithms for partition function estimation of Gibbs random fields. IEEE Trans. Inf. Theory 43(6): 1948-1965 (1997) - [c3]Gerasimos Potamianos, Eric Cosatto, Hans Peter Graf, David B. Roe:
Speaker independent audio-visual database for bimodal ASR. AVSP 1997: 65-68 - 1993
- [j1]Gerasimos Potamianos, John K. Goutsias:
Partition function estimation of Gibbs random field images using Monte Carlo simulations. IEEE Trans. Inf. Theory 39(4): 1322-1332 (1993) - [c2]Gerasimos Potamianos, John Goutsias:
An analysis of Monte Carlo methods for likelihood estimation of Gibbsian images. ICASSP (5) 1993: 519-522 - 1991
- [c1]Gerasimos Potamianos, John Goutsias:
A novel method for computing the partition function of Markov random field images using Monte Carlo simulations. ICASSP 1991: 2325-2328
last updated on 2024-12-08 01:32 CET by the dblp team
all metadata released as open data under CC0 1.0 license