Abstract
The influenza virus changes its antigenicity frequently due to rapid mutations, leading to immune escape and failure of vaccination. Rapid determination of the influenza antigenicity could help identify the antigenic variants in time. Here, we built a stacked auto-encoder (SAE) model for predicting the antigenic variant of human influenza A(H3N2) viruses based on the hemagglutinin (HA) protein sequences. The model achieved an accuracy of 0.95 in five-fold cross-validations, better than the logistic regression model did. Further analysis of the model shows that most of the active nodes in the hidden layer reflected the combined contribution of multiple residues to antigenic variation. Besides, some features (residues on HA protein) in the input layer were observed to take part in multiple active nodes, such as residue 189, 145 and 156, which were also reported to mostly determine the antigenic variation of influenza A(H3N2) viruses. Overall, this work is not only useful for rapidly identifying antigenic variants in influenza prevention, but also an interesting attempt in inferring the mechanisms of biological process through analysis of SAE model, which may give some insights into interpretation of the deep learning models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Thompson, W.W., Shay, D.K., Weintraub, E., Brammer, L., Cox, N., Anderson, L.J., et al.: Mortality associated with influenza and respiratory syncytial virus in the United States. JAMA 289, 179–186 (2003)
Taubenberger, J.K., Kash, J.C.: Influenza virus evolution, host adaptation, and pandemic formation. Cell Host Microbe 7, 440–451 (2010)
Stephenson, I., Heath, A., Major, D., Newman, R.W., Hoschler, K., Junzi, W., et al.: Reproducibility of serologic assays for influenza virus A(H5N1). Emerg. Infect. Dis. 15, 1252–1259 (2009)
Lee, M.S., Chen, J.S.E.: Predicting antigenic variants of influenza A/H3N2 viruses. Emerg. Infect. Dis. 10, 1385–1390 (2004)
Wu, A., Peng, Y., Du, X., Shu, Y., Jiang, T.: Correlation of influenza virus excess mortality with antigenic variation: application to rapid estimation of influenza mortality burden. PLoS Comput. Biol. 6(8), e1000882 (2010)
Du, X., Dong, L., Lan, Y., Peng, Y., Wu, A., Zhang, Y., et al.: Mapping of H3N2 influenza antigenic evolution in China reveals a strategy for vaccine strain recommendation. Nat. Commun. 3, 709 (2012)
Li, H., Peng, Y., Zou, Y., Huang, Z., Wu, A., Li, K., et al.: PREDAC-H5: a user-friendly tool for the automated surveillance of antigenic variants for the HPAI H5N1 virus. Infect. Genet. Evol. 28, 62–63 (2014)
Bengio, Y.: Learning deep architectures for AI. Found. Trends® Mach. Learn. 2(1), 1–127 (2009)
Eickholt, J., Cheng, J.: Predicting protein residue-residue contacts using deep networks and boosting. Bioinformatics 28, 3066–3072 (2012)
Chen, Y., Li, Y., Narayan, R., Subramanian, A., Xie, X.: Gene expression inference with deep learning. Bioinformatics 32, 1832–1839 (2016)
Leung, M.K., Xiong, H.Y., Lee, L.J., Frey, B.J.: Deep learning of the tissue-regulated splicing code. Bioinformatics 30, i121–i129 (2014)
Smith, D.J., Lapedes, A.S., de Jong, J.C., Bestebroer, T.M., Rimmelzwaan, G.F., Osterhaus, A.D., et al.: Mapping the antigenic and genetic evolution of influenza virus. Science 305, 371–376 (2004)
Bao, Y.M., Bolotov, P., Dernovoy, D., Kiryutin, B., Zaslavsky, L., Tatusova, T., et al.: The influenza virus resource at the national center for biotechnology information. J. Virol. 82, 596–601 (2008)
Palm, R.B.: Prediction as a candidate for learning deep hierarchical models of data, Master, Informatics and Mathematical Modelling. Technical University of Denmark (2012)
Koel, B.F., Burke, D.F., Bestebroer, T.M., van der Vliet, S., Zondag, G.C., Vervaet, G., et al.: Substitutions near the receptor binding site determine major antigenic change during influenza virus evolution. Science 342, 976–979 (2013)
Acknowledgement
This study was supported by the National Natural Science Foundation (31500126 and 31371338), the National Key Plan for Scientific Research and Development of China (2016YFD0500300 and 2016YFC1200200), and the Chinese Academy of Medical Sciences (2016-I2 M-1-005).
There are no conflicts of interest.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Tan, Z., Xu, B., Li, K., Jiang, T., Peng, Y. (2017). Predicting the Antigenic Variant of Human Influenza A(H3N2) Virus with a Stacked Auto-Encoder Model. In: Zou, B., Han, Q., Sun, G., Jing, W., Peng, X., Lu, Z. (eds) Data Science. ICPCSEE 2017. Communications in Computer and Information Science, vol 728. Springer, Singapore. https://doi.org/10.1007/978-981-10-6388-6_25
Download citation
DOI: https://doi.org/10.1007/978-981-10-6388-6_25
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6387-9
Online ISBN: 978-981-10-6388-6
eBook Packages: Computer ScienceComputer Science (R0)