Learning Siamese Features for Finger Spelling Recognition

Kwolek, Bogdan; Sako, Shinji

doi:10.1007/978-3-319-70353-4_20

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 10617)

Included in the following conference series:

International Conference on Advanced Concepts for Intelligent Vision Systems

Abstract

This paper addresses finger spelling recognition from images acquired by a single color camera. Recognition is based on learned low-dimensional embeddings, which are computed by both single and multiple siamese-based convolutional neural networks. We train classifiers operating on these features, as well as convolutional neural networks operating on raw images. The evaluations are performed on a freely available dataset of Japanese Sign Language finger spellings. The best results are achieved by a classifier trained on the concatenated features of multiple siamese networks.
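
The abstract does not give the network architecture or the training loss, so the following is only a minimal sketch of the general idea, under stated assumptions. It assumes a contrastive loss, a common choice for training siamese networks, and it uses hypothetical stand-in branches (fixed random linear maps followed by tanh) in place of trained convolutional networks. The names contrastive_loss, make_branch and concat_features are illustrative and do not come from the paper.

```python
import numpy as np


def contrastive_loss(e1, e2, same, margin=1.0):
    """Contrastive loss for a batch of embedding pairs (assumed loss, not confirmed by the paper).

    same[i] is 1 when pair i shows the same sign and 0 otherwise.
    """
    d = np.linalg.norm(e1 - e2, axis=1)                    # Euclidean distance per pair
    pull = same * d ** 2                                   # similar pairs: shrink the distance
    push = (1 - same) * np.maximum(0.0, margin - d) ** 2   # dissimilar pairs: enforce the margin
    return float(np.mean(pull + push))


def make_branch(in_dim, out_dim, seed):
    """Hypothetical stand-in for one trained siamese branch: a fixed linear map plus tanh."""
    w = np.random.default_rng(seed).normal(size=(in_dim, out_dim))
    return lambda x: np.tanh(x @ w)


def concat_features(branches, x):
    """Concatenate the low-dimensional embeddings produced by several branches."""
    return np.concatenate([branch(x) for branch in branches], axis=1)


# Toy data: 4 flattened "images" of 64 values each (random, for illustration only).
rng = np.random.default_rng(0)
images = rng.normal(size=(4, 64))

# Three branches with 8-dimensional embeddings give a 24-dimensional feature vector.
branches = [make_branch(64, 8, seed=s) for s in range(3)]
features = concat_features(branches, images)
print(features.shape)
```

In a pipeline of the kind the abstract describes, the concatenated feature vectors would then be passed to a separate classifier. Which classifier, embedding size and number of networks the authors used is not stated in the abstract.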

Acknowledgment

This work was supported by the Polish National Science Center (NCN) under research grant 2014/15/B/ST6/02808, as well as by JSPS KAKENHI Grant Numbers 17H06114 and 15KK0008.

Author information

Authors and Affiliations

AGH University of Science and Technology, 30 Mickiewicza Av., 30-059, Krakow, Poland
Bogdan Kwolek
Frontier Research Institute for Information Science, Nagoya Institute of Technology, Gokiso-cho, Showa-ku Nagoya, 466-8555, Japan
Shinji Sako

Corresponding author

Correspondence to Bogdan Kwolek.

Editor information

Editors and Affiliations

DGA, Paris, France
Jacques Blanc-Talon
University of Antwerp, Antwerp, Belgium
Rudi Penne
Ghent University - imec, Ghent, Belgium
Wilfried Philips
CSIRO Data 61, Canberra, Australian Capital Territory, Australia
Dan Popescu
University of Antwerp, Wilrijk, Belgium
Paul Scheunders

About this paper

Cite this paper

Kwolek, B., Sako, S. (2017). Learning Siamese Features for Finger Spelling Recognition. In: Blanc-Talon, J., Penne, R., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2017. Lecture Notes in Computer Science, vol 10617. Springer, Cham. https://doi.org/10.1007/978-3-319-70353-4_20

DOI: https://doi.org/10.1007/978-3-319-70353-4_20
Published: 23 November 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70352-7
Online ISBN: 978-3-319-70353-4
eBook Packages: Computer Science, Computer Science (R0)