Chinese-Vietnamese Word Alignment Method Based on Bidirectional RNN and Linguistic Features | SpringerLink
Skip to main content

Chinese-Vietnamese Word Alignment Method Based on Bidirectional RNN and Linguistic Features

  • Conference paper
  • First Online:
Computer Supported Cooperative Work and Social Computing (ChineseCSCW 2018)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 917))

  • 867 Accesses

Abstract

We propose an automatically word alignment method of Chinese-Vietnamese based on bidirectional RNN and linguistic features. With the bidirectional RNN, we can obtain the context information of both forward and backward direction. Moreover, some bilingual features are also integrated. In the process of training model. The experiments show that the approach proposed outperform the previous method, and suggests that linguistic features and context information can effectively enhance the effect of word alignment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 5719
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 7149
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Auli, M., Galley, M., Quirk, C., Zweig, G.: Joint language and translation modeling with recurrent neural networks. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Seattle, Washington, USA, pp. 1044–1054 (2013)

    Google Scholar 

  2. Brown, P.F., Pietra, V.J.D., Pietra, S.A.D., Mercer, R.L.: The mathematics of statistical machine translation: parameter estimation. Comput. Linguist. 19(2), 263–311 (1993)

    Google Scholar 

  3. Dyer, C., Clark, J.H., Lavie, A., Smith, N.A.: Unsupervised word alignment with arbitrary features. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, Oregon, USA, pp. 409–419 (2011)

    Google Scholar 

  4. Gutmann, M., Hyvärinen, A.: Noise-contrastive estimation: a new estimation principle for unnormalized statistical models. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2010), Chia Laguna Resort, Sardinia, Italy, pp. 297–304 (2010)

    Google Scholar 

  5. Kalchbrenner, N., Blunsom, P.: Recurrent continuous translation models. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Seattle, Washington, USA, pp. 1700–1709 (2013)

    Google Scholar 

  6. Mikolov, T., Karafiát, M., Burget, L., Ĉernocký, J., Khudanpur, S.: Recurrent neural network based language model. In: Proceedings of INTERSPEECH 2010 11th Annual Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, pp. 1045–1048 (2010)

    Google Scholar 

  7. Mikolov, T., Zweig, G.: Context dependent recurrent neural network language model. In: In Proceedings of IEEE Spoken Language Technology Workshop (SLT), Miami, Florida, USA, pp. 234–239 (2012)

    Google Scholar 

  8. Mnih, A., Teh, Y.W.: A fast and simple algorithm for training neural probabilistic language models. In: Proceedings of the 29th International Conference on Machine Learning (ICML 2012), Edinburgh, Scotland, UK, pp. 419–426 (2012)

    Google Scholar 

  9. Mo, Y., Guo, J., Yu, Z., Luo, L., Gao, S.: A bilingual word alignment algorithm of Vietnamese-Chinese based on feature constraint. Int. J. Mach. Learn. Cybern. 6(4), 537–543 (2015)

    Article  Google Scholar 

  10. Sundermeyer, M., Oparin, I., Gauvain, J.L., Freiberg, B., Schlüter, R., Ney, H.: Comparison of feedforward and recurrent neural network language models. In: Proceedings of IEEE International Conference on Acoustics Speech and Signal (ICASSP), Vancouver, BC, Canada, pp. 8430–8434 (2013)

    Google Scholar 

  11. Tamura, A., Watanabe, T., Sumita, E.: Recurrent neural networks for word alignment model. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), Baltimore, MD, USA, pp. 1470–1480 (2014)

    Google Scholar 

  12. Vogel, S., Ney, H., Tillmann, C.: Hmm-based word alignment in statistical translation. In: Proceedings of International Conference on Computational Linguistics (COLING), Copenhagen, Denmark, pp. 836–841 (1996)

    Google Scholar 

  13. Yang, N., Liu, S., Li, M., Zhou, M., Yu, N.: Word alignment modeling with context dependent deep neural network. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), Sofia, Bulgaria, pp. 166–175 (2013)

    Google Scholar 

  14. Yuanyuan, M., Jianyi, G., Zhengtao, Y., Cunli, M., Yitong, N.: A bilingual word alignment method of Vietnamese-Chinese based on deep neural network. J. Shandong Univ. (Nat. Sci.) 51(1), 77–83 (2016)

    Google Scholar 

Download references

Acknowledgment

This work was supported by the National Natural Science Foundation of China (Grant Nos.61761026, 61732005, 61672271, 61472168), the Natural Science Fundation of Yunnan Province (Grant No.2018FB104), Innovation Talent Fund For Technology of Yunnan Province (Grant No.2014HE001).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhengtao Yu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gao, S., Zhu, H., Wang, Z., Yu, Z., Wang, X. (2019). Chinese-Vietnamese Word Alignment Method Based on Bidirectional RNN and Linguistic Features. In: Sun, Y., Lu, T., Xie, X., Gao, L., Fan, H. (eds) Computer Supported Cooperative Work and Social Computing. ChineseCSCW 2018. Communications in Computer and Information Science, vol 917. Springer, Singapore. https://doi.org/10.1007/978-981-13-3044-5_33

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-3044-5_33

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-3043-8

  • Online ISBN: 978-981-13-3044-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics