i6mA-word2vec: A Newly Model Which Used Distributed Features for Predicting DNA N6-Methyladenine Sites in Genomes | SpringerLink
Skip to main content

i6mA-word2vec: A Newly Model Which Used Distributed Features for Predicting DNA N6-Methyladenine Sites in Genomes

  • Conference paper
  • First Online:
Intelligent Computing Theories and Application (ICIC 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13394))

Included in the following conference series:

  • 1737 Accesses

Abstract

DNA N6 methyladenine (6mA) is a widely studied and widespread epigenetic modification, which plays a vital role in cell growth and development. 6mA is present in many biological cellular processes, such as the regulation of gene expression and the rule of cross dialogue between transposon and histone modification. Therefore, in some biological research, the prediction of the 6mA site is very significant. Unfortunately, the existing biological experimental methods are expensive both in time and money. And they cannot meet the needs of existing research. So it is high time to develop a targeted and efficient computing model. Consequently, this paper proposes an intelligent and efficient calculation model i6mA-word2vec for the discrimination of 6mA sites. In our work, we use word2vec from the field of natural language processing to carry out distributed feature encoding. The word2vec model automatically represents the target class topic. Then, the extracted feature space was sent into the convolutional neural network as prediction input. The experimental prediction results show that our computational model has better performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 13727
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 17159
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Luo, G.Z., Blanco, M.A., Greer, E.L., et al.: DNA N(6)-methyladenine: a new epigenetic mark in eukaryotes? Nat. Rev. Mol. Cell Biol. 16, 705–710 (2015)

    Article  Google Scholar 

  2. Liu, B., et al.: iRO-3wPseKNC: identify DNA replication origins by three-window-based PseKNC. Bioinformatics 34(18), 3086–3093 (2018). https://doi.org/10.1093/bioinformatics/bty312

    Article  Google Scholar 

  3. Wahab, A., Ali, S.D., Tayara, H., et al.: iIM-CNN: intelligent identifier of 6mA sites on different species by using convolution neural network. IEEE Access 7, 178577–178583 (2019)

    Article  Google Scholar 

  4. Alam, W., Ali, S.D., Tayara, H., et al.: A CNN-based RNA N6-methyladenosine site predictor for multiple species using heterogeneous features representation. IEEE Access 8, 138203–138209 (2020)

    Article  Google Scholar 

  5. Dunn, D.B., Smith, J.D.: Occurrence of a new base in the deoxyribonucleic acid of a strain of bacterium coli. Nature 175, 336–337 (1955)

    Article  Google Scholar 

  6. Vanyushin, B.F., Tkacheva, S.G., Belozersky, A.N.: Rare bases in animal DNA. Nature 225(5236), 948–949 (1970)

    Article  Google Scholar 

  7. Xiao, C.L., Zhu, S., He, M., et al.: N6-methyladenine DNA modification in the human genome. Mol. Cell 71(2), 306–318 (2018)

    Article  Google Scholar 

  8. Chao, Z., Wang, C., Liu, H., et al.: Identification and analysis of adenine N6-methylation sites in the rice genome. Nat. Plants 4, 554–563 (2018)

    Article  Google Scholar 

  9. Pomraning, K.R., Smith, K.M., Freitag, M.: Genome-wide high throughput analysis of DNA methylation in eukaryotes. Methods 47(3), 142–150 (2009)

    Article  Google Scholar 

  10. Krais, A.M., et al.: Genomic N6-methyladenine determination by MEKC with LIF. Electrophoresis 31, 3548–3551 (2010)

    Article  Google Scholar 

  11. Flusberg, B.A., Webster, D.R., Lee, J.H., et al.: Direct detection of DNA methylation during single-molecule, real-time sequencing. Nat. Methods 7(6), 461–465 (2010)

    Article  Google Scholar 

  12. de Araujo Oliveira, J.V., et al.: SnoReport 2.0: new features and a refined Support Vector Machine to improve snoRNA identification. BMC Bioinform. 17(18), 464 (2016)

    Article  Google Scholar 

  13. Gupta, Y., et al.: ptRNApred: computational identification and classification of post-transcriptional RNA. Nucleic Acids Res. 42(22), e167 (2014)

    Article  Google Scholar 

  14. Jana, H., Hofacker, I.L., Stadler, P.F.: SnoReport: computational identification of snoRNAs with unknown targets. Bioinformatics 2, 158–164 (2008)

    Google Scholar 

  15. Song, K.: Recognition of prokaryotic promoters based on a novel variable-window Z-curve method. Nucleic Acids Res. 40(3), 963–971 (2012)

    Article  Google Scholar 

  16. Umarov, R.Kh., Solovyev, V.V.: Recognition of prokaryotic and eukaryotic promoters using convolutional deep learning neural networks. PLoS One 12(2), e0171410 (2017)

    Google Scholar 

  17. Wu, Q., Wang, J., Yan, H.: An improved position weight matrix method based on an entropy measure for the recognition of prokaryotic promoters. Int. J. Data Min. Bioinform. 5(1), 22 (2011)

    Article  Google Scholar 

  18. Chen, W., et al.: iDNA4mC: identifying DNA N4-methylcytosine sites based on nucleotide chemical properties. Bioinformatics 33, 3518–3523 (2017)

    Article  Google Scholar 

  19. He, W., et al.: 4mCPred: machine learning methods for DNA N4-methylcytosine sites prediction. Bioinformatics 35, 593–601 (2018)

    Article  Google Scholar 

  20. Liu, Z., Xiao, X., Qiu, W.R., et al.: IDNA-methyl: identifying DNA methylation sites via pseudo trinucleotide composition. Anal. Biochem. 474, 69 (2015)

    Article  Google Scholar 

  21. Fu, Y., Luo, G.Z., Chen, K., et al.: N6-methyldeoxyadenosine marks active transcription start sites in chlamydomonas. Cell 161(4), 879–892 (2015)

    Article  Google Scholar 

  22. Iyer, L.M., Abhiman, S., Aravind, L.: Natural history of eukaryotic DNA methylation systems. Progress Mol. Biol. Transl. Sci. 101(101), 25–104 (2011)

    Article  Google Scholar 

  23. Chen, W., Lv, H., Nie, F., et al.: i6mA-Pred: identifying DNA N6-methyladenine sites in the rice genome. Bioinformatics 35(11), 2796–2800 (2019)

    Article  Google Scholar 

  24. Huang, Q., Zhang, J., Wei, L., et al.: 6mA-RicePred: a method for identifying DNA N 6-methyladenine sites in the rice genome based on feature fusion. Front. Plant Sci. 11, 4 (2020)

    Article  Google Scholar 

  25. Xu, H., Hu, R., Jia, P., et al.: 6mA-Finder: a novel online tool for predicting DNA N6-methyladenine sites in genomes. Bioinformatics 36(10), 3257–3259 (2020)

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported in part by the University Innovation Team Project of Jinan (2019GXRC015), the Natural Science Foundation of Shandong Province, China (Grant No. ZR2021MF036).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yi Cao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Fu, W., Zhong, Y., Chen, B., Cao, Y., Chen, J., Cong, H. (2022). i6mA-word2vec: A Newly Model Which Used Distributed Features for Predicting DNA N6-Methyladenine Sites in Genomes. In: Huang, DS., Jo, KH., Jing, J., Premaratne, P., Bevilacqua, V., Hussain, A. (eds) Intelligent Computing Theories and Application. ICIC 2022. Lecture Notes in Computer Science, vol 13394. Springer, Cham. https://doi.org/10.1007/978-3-031-13829-4_58

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-13829-4_58

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-13828-7

  • Online ISBN: 978-3-031-13829-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics