Abstract
This paper addresses the task of learning a fuzzy context-free grammar from data. The induction process is divided into two phases: first, a generic grammar is derived from the positive sentences; next, membership grades are assigned to the productions according to how often each production occurs in the learning set. The approach is examined on the problem of predicting the location of promoters in Escherichia coli. The language of bacterial sequences can be described by a formal system such as a context-free grammar, so the problem of promoter region recognition can be recast as grammar induction. The induced fuzzy grammar is compared with other machine learning methods.
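The second phase and the resulting fuzzy membership test can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the abstract does not say how occurrence counts are turned into grades, so `assign_grades` assumes a simple scaling by the largest count, and the toy grammar, its counts, and both function names are hypothetical. The membership of a sentence follows the standard max-min definition for fuzzy grammars (the grade of a derivation is the minimum of its production grades, and the grade of a sentence is the maximum over its derivations), computed here with a CYK-style table for a grammar in Chomsky normal form.

```python
def assign_grades(usage_counts):
    """Turn production usage counts into membership grades in (0, 1].

    Assumption (not from the paper): each count is divided by the
    largest count, so the most frequently used production gets grade 1.0.
    """
    top = max(usage_counts.values())
    return {production: count / top for production, count in usage_counts.items()}


def membership(sentence, grades, start="S"):
    """Max-min membership of `sentence` in a fuzzy grammar in Chomsky normal form.

    `grades` maps (lhs, rhs) to a grade, where rhs is either a 1-tuple
    holding a terminal or a 2-tuple holding two nonterminals.
    """
    n = len(sentence)
    if n == 0:
        return 0.0
    # table[i][k] maps a nonterminal to the best grade with which it
    # derives the substring of length k + 1 starting at position i.
    table = [[{} for _ in range(n)] for _ in range(n)]
    for i, symbol in enumerate(sentence):
        for (lhs, rhs), grade in grades.items():
            if rhs == (symbol,):
                table[i][0][lhs] = max(table[i][0].get(lhs, 0.0), grade)
    for span in range(2, n + 1):
        for i in range(n - span + 1):
            cell = table[i][span - 1]
            for split in range(1, span):
                left = table[i][split - 1]
                right = table[i + split][span - split - 1]
                for (lhs, rhs), grade in grades.items():
                    if len(rhs) == 2 and rhs[0] in left and rhs[1] in right:
                        # A derivation is only as strong as its weakest production.
                        score = min(grade, left[rhs[0]], right[rhs[1]])
                        # Keep the strongest derivation found so far.
                        if score > cell.get(lhs, 0.0):
                            cell[lhs] = score
    return table[0][n - 1].get(start, 0.0)


# Hypothetical toy grammar over a DNA-like alphabet, with made-up usage counts.
counts = {
    ("S", ("A", "B")): 8,
    ("S", ("B", "A")): 2,
    ("A", ("a",)): 10,
    ("B", ("t",)): 5,
}
grades = assign_grades(counts)
grade_at = membership("at", grades)
grade_ta = membership("ta", grades)
grade_aa = membership("aa", grades)
```

In this toy setting "at" receives a higher grade than "ta" because the production `S -> A B` was used more often than `S -> B A`, while "aa" has no derivation and gets grade 0. A classifier for promoter regions could then threshold such a grade, although the threshold and the grading rule would have to come from the paper itself.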
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Unold, O. (2010). Learning Fuzzy Context-Free Grammar—A Preliminary Report. In: Sempere, J.M., García, P. (eds) Grammatical Inference: Theoretical Results and Applications. ICGI 2010. Lecture Notes in Computer Science, vol 6339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15488-1_33
DOI: https://doi.org/10.1007/978-3-642-15488-1_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15487-4
Online ISBN: 978-3-642-15488-1
eBook Packages: Computer Science (R0)