Initial Explorations on Chaotic Behaviors of Recurrent Neural Networks

Myrzakhmetov, Bagdat; Takhanov, Rustem; Assylbekov, Zhenisbek

doi:10.1007/978-3-031-24337-0_26

Bagdat Myrzakhmetov^8,9,
Rustem Takhanov⁸ &
Zhenisbek Assylbekov⁸

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13451))

Included in the following conference series:

International Conference on Computational Linguistics and Intelligent Text Processing

417 Accesses

Abstract

In this paper we analyzed the dynamics of Recurrent Neural Network architectures. We explored the chaotic nature of state-of-the-art Recurrent Neural Networks: Vanilla Recurrent Network and Recurrent Highway Networks. Our experiments showed that they exhibit chaotic behavior in the absence of input data. We also proposed a way of removing chaos from Recurrent Neural Networks. Our findings show that initialization of the weight matrices during the training plays an important role, as initialization with the matrices whose norm is smaller than one will lead to the non-chaotic behavior of the Recurrent Neural Networks. The advantage of the non-chaotic cells is stable dynamics. At the end, we tested our chaos-free version of the Recurrent Highway Networks (RHN) in a real-world application. In the language modeling task, chaos-free versions of RHN perform on par with the original version.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 11439; Price includes VAT (Japan)

Softcover Book: JPY 14299; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recurrent Neural Network for the Identification of Nonlinear Dynamical Systems: A Comparative Study

Recurrent Polynomial and Neural Structures in Modelling of a Neutralisation Process

New Results for Prediction of Chaotic Systems Using Deep Recurrent Neural Networks

Article 07 March 2021

References

Zilly, J.G., Srivastava, R.K., Koutník, J., Schmidhuber, J.: Recurrent highway networks. In: International Conference on Machine Learning, pp. 4189–4198 (2017)
Google Scholar
Sussillo, D., Barak, O.: Opening the black box: low-dimensional dynamics in high-dimensional recurrent neural networks. Neural Comput. 25(3), 626–649 (2013)
Article MathSciNet MATH Google Scholar
Cho, K., et al.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014)
Pascanu, R., Mikolov, T., Bengio, Y.: On the difficulty of training recurrent neural networks. In: International Conference on Machine Learning, pp. 1310–1318 (2013)
Google Scholar
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Strogatz, S.H.: Nonlinear Dynamics and Chaos: With Applications to Physics, Biology, Chemistry, and Engineering. Westview press, Boulder (2014)
MATH Google Scholar
Laurent, T., von Brecht, J.: A recurrent neural network without chaos. arXiv preprint arXiv:1612.06212 (2017)
Ott, E.: Chaos in Dynamical Systems. Cambridge University Press, Cambridge (2002)
Book MATH Google Scholar
Abadi, M., et al.: Tensorflow: a system for large-scale machine learning. In: OSDI, vol. 16, pp. 265–283 (2016)
Google Scholar
Mikolov, T., Karafiát, M., Burget, L., Černockỳ, J., Khudanpur, S.: Recurrent neural network based language model. In: Eleventh Annual Conference of the International Speech Communication Association (2010)
Google Scholar
Marcus, M.P., Marcinkiewicz, M.A., Santorini, B.: Building a large annotated corpus of English: the Penn treebank. Comput. Linguist. 19(2), 313–330 (1993)
Google Scholar
Srivastava, R.K., Greff, K., Schmidhuber, J.: Training very deep networks. In: Advances in Neural Information Processing Systems (NIPS), pp. 2377–2385 (2015)
Google Scholar
Kuznetsov, Y. A.: Elements of Applied Bifurcation Theory, vol. 112. Springer, Cham (2013)
Google Scholar
Wolf, A., Swift, J.B., Swinney, H.L., Vastano, J.A.: Determining Lyapunov exponents from a time series. Physica D 16(3), 285–317 (1985)
Article MathSciNet MATH Google Scholar
Lyapunov, A.M.: The general problem of the stability of motion. Int. J. Control 55(3), 531–534 (1992)
Article MathSciNet Google Scholar
Elman, J.L.: Finding structure in time. Cogn. Sci. 14(2), 179–211 (1990)
Article Google Scholar
Gers̆gorin, S.: Über die Abgrenzung der Eigenwerte einer Matrix Bulletin de l’Académie des Sciences de l’URSS. Classe des sciences mathématiques et na, no. 6, 749–754 (1932)
Google Scholar
Srivastava, R.K., Steunebrink, B.R., Schmidhuber, J.: First experiments with POWERPLAY. Neural Netw. Official J. Int. Neural Netw. Soc. 41, 130–136 (2013)
Article Google Scholar
Graves, A.: Adaptive computation time for recurrent neural networks. arXiv preprint arXiv:1603.08983 (2016)
Zoph , B., Le, Q.V.: Neural architecture search with reinforcement learning. arXiv preprint arXiv:1611.01578 (2016)

Download references

Acknowledgement

This work has been funded by the Committee of Science of the Ministry of Education and Science of the Republic of Kazakhstan, IRN AP05133700. The work of Bagdat Myrzakhmetov partially has been funded by the Committee of Science of the Ministry of Education and Science of the Republic of Kazakhstan under the research grant AP05134272. The authors would like to thank Professor Anastasios Bountis for his valuable feedback.

Author information

Authors and Affiliations

School of Science and Technology, Nazarbayev University, Astana, Kazakhstan
Bagdat Myrzakhmetov, Rustem Takhanov & Zhenisbek Assylbekov
National Laboratory Astana, Nazarbayev University, Astana, Kazakhstan
Bagdat Myrzakhmetov

Authors

Bagdat Myrzakhmetov
View author publications
You can also search for this author in PubMed Google Scholar
Rustem Takhanov
View author publications
You can also search for this author in PubMed Google Scholar
Zhenisbek Assylbekov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bagdat Myrzakhmetov .

Editor information

Editors and Affiliations

Instituto Politécnico Nacional, Mexico City, Mexico
Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Myrzakhmetov, B., Takhanov, R., Assylbekov, Z. (2023). Initial Explorations on Chaotic Behaviors of Recurrent Neural Networks. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2019. Lecture Notes in Computer Science, vol 13451. Springer, Cham. https://doi.org/10.1007/978-3-031-24337-0_26

Download citation

DOI: https://doi.org/10.1007/978-3-031-24337-0_26
Published: 26 February 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-24336-3
Online ISBN: 978-3-031-24337-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Initial Explorations on Chaotic Behaviors of Recurrent Neural Networks

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Recurrent Neural Network for the Identification of Nonlinear Dynamical Systems: A Comparative Study

Recurrent Polynomial and Neural Structures in Modelling of a Neutralisation Process

New Results for Prediction of Chaotic Systems Using Deep Recurrent Neural Networks

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Initial Explorations on Chaotic Behaviors of Recurrent Neural Networks

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Recurrent Neural Network for the Identification of Nonlinear Dynamical Systems: A Comparative Study

Recurrent Polynomial and Neural Structures in Modelling of a Neutralisation Process

New Results for Prediction of Chaotic Systems Using Deep Recurrent Neural Networks

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation