Abstract
This paper presents the systems developed by Beijing Jiaotong University for the CCMT 2020 quality estimation (QE) task. We propose an effective method for exploiting pretrained language models to improve QE performance. Our model combines three popular pretrained models, BERT, XLM and XLM-R, to build a strong baseline for both sentence-level and word-level QE. We explore several strategies, including further pretraining on bilingual input, multi-task learning across granularities, and a weighted loss for the unbalanced word labels. To produce more accurate predictions, we ensemble models at both granularities. Experimental results show high accuracy in both translation directions and surpass last year's winning system at the sentence level, demonstrating the effectiveness of the proposed method.
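The abstract describes a multi-task setup over pretrained encoders with a weighted loss for the unbalanced OK/BAD word labels, but gives no architectural details. The following PyTorch sketch is therefore only an illustrative assumption, not the authors' implementation: it uses a single HuggingFace encoder (xlm-roberta-base as a stand-in for any of the BERT/XLM/XLM-R backbones), a sentence-level HTER regression head, and a word-level OK/BAD tagging head trained jointly, with the rarer BAD class up-weighted in the cross-entropy loss.

    import torch
    import torch.nn as nn
    from transformers import AutoModel  # assumption: a HuggingFace backbone is used

    class MultiGranularityQE(nn.Module):
        """Hypothetical multi-task QE model (sentence-level + word-level)."""

        def __init__(self, encoder_name="xlm-roberta-base", bad_weight=3.0):
            super().__init__()
            self.encoder = AutoModel.from_pretrained(encoder_name)
            hidden = self.encoder.config.hidden_size
            self.sent_head = nn.Linear(hidden, 1)   # HTER regression from first token
            self.word_head = nn.Linear(hidden, 2)   # per-token OK (0) / BAD (1)
            self.sent_loss = nn.MSELoss()
            # BAD labels are rare, so they are up-weighted; padded positions use -100
            self.word_loss = nn.CrossEntropyLoss(
                weight=torch.tensor([1.0, bad_weight]), ignore_index=-100)

        def forward(self, input_ids, attention_mask, hter=None, word_labels=None):
            out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
            states = out.last_hidden_state                      # (batch, seq, hidden)
            sent_pred = self.sent_head(states[:, 0]).squeeze(-1)  # sentence score
            word_logits = self.word_head(states)                  # token-level logits
            loss = None
            if hter is not None and word_labels is not None:
                # joint multi-task objective: regression + weighted classification
                loss = self.sent_loss(sent_pred, hter) + \
                       self.word_loss(word_logits.reshape(-1, 2), word_labels.reshape(-1))
            return loss, sent_pred, word_logits

Under this sketch, ensembling for both granularities would simply average the sentence scores and the per-token BAD probabilities produced by several such models trained on different encoders; the exact ensembling scheme used by the authors is not stated in the abstract.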
Acknowledgement
This work is supported by the National Natural Science Foundation of China (Contracts 61976015, 61976016, 61876198 and 61370130), the Beijing Municipal Natural Science Foundation (Contract 4172047), and the International Science and Technology Cooperation Program of the Ministry of Science and Technology (K11F100010).