Abstract
In this paper, we introduce our efficient simple method which can locate all occurrences of pattern P of k subpatterns with “don’t cares” of length m in text S of length n. Our algorithm employs advanced data structure and the Kangaroo method, which can be applied to selected suffixes of the suffix tree of S to answer subsequent queries in O(k) time using a predefined computational method to find all occurrences of pattern P with “don’t cares” in text S in a fast and effective manner.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Aho, A.V., Corasick, M.J.: Efficient string matching: an aid to bibliographic search. Commun. ACM 18(6), 333–340 (1975)
Alamro, H., Badkobeh, G., Belazzougui, D., Iliopoulos, C.S., Puglisi, S.J.: Computing the antiperiod (s) of a string. In: 30th Annual Symposium on Combinatorial Pattern Matching (CPM 2019). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik (2019)
Alstrup, S., Husfeldt, T., Rauhe, T.: Marked ancestor problems. In: Proceedings 39th Annual Symposium on Foundations of Computer Science (Cat. no. 98CB36280), pp. 534–543. IEEE (1998)
Amir, A., Lewenstein, M., Porat, E.: Faster algorithms for string matching with k mismatches. J. Algorithms 50(2), 257–275 (2004)
Bille, P., Gørtz, I.L., Vildhøj, H.W., Vind, S.: String indexing for patterns with wildcards. Theory Comput. Syst. 55(1), 41–60 (2014)
Boyer, R.S., Moore, J.S.: A fast string searching algorithm. Commun. ACM 20(10), 762–772 (1977). https://doi.org/10.1145/359842.359859
Chen, G., Wu, X., Zhu, X., Arslan, A.N., He, Y.: Efficient string matching with wildcards and length constraints. Knowl. Inf. Syst. 10(4), 399–419 (2006)
Clifford, R., Efremenko, K., Porat, E., Rothschild, A.: Pattern matching with don’t cares and few errors. J. Comput. Syst. Sci. 76(2), 115–124 (2010)
Clifford, R., Porat, E.: A filtering algorithm for k-mismatch with don’t cares. In: International Symposium on String Processing and Information Retrieval, pp. 130–136. Springer (2007)
Cole, R., Gottlieb, L.A., Lewenstein, M.: Dictionary matching and indexing with errors and don’t cares. In: Proceedings of the Thirty-Sixth Annual ACM Symposium on Theory of Computing, pp. 91–100 (2004)
Cole, R., Hariharan, R.: Approximate string matching: a simpler faster algorithm. SIAM J. Comput. 31(6), 1761–1782 (2002)
Cole, R., Hariharan, R.: Verifying candidate matches in sparse and wildcard matching. In: Proceedings of the Thirty-Fourth Annual ACM Symposium on Theory of Computing, pp. 592–601 (2002)
Commentz-Walter, B.: A string matching algorithm fast on the average. In: International Colloquium on Automata, Languages, and Programming, pp. 118–132. Springer (1979)
Crochemore, M., Hancart, C.: Pattern matching in strings. In: Mikhail, J.A. (ed.) Algorithms and Theory of Computation Handbook, pp. 11.1–11.28. CRC Press (1998). https://hal-upec-upem.archives-ouvertes.fr/hal-00620790
Crochemore, M., Hancart, C., Lecroq, T.: Algorithms on Strings. Cambridge University Press, Cambridge (2007)
Crochemore, M., Perrin, D.: Pattern matching in strings. In: Image Analysis and Processing II, pp. 67–79. Springer (1988)
Farach, M.: Optimal suffix tree construction with large alphabets. In: Proceedings 38th Annual Symposium on Foundations of Computer Science, pp. 137–143. IEEE (1997)
Fischer, J., Huson, D.H.: New common ancestor problems in trees and directed acyclic graphs. Inf. Process. Lett. 110(8–9), 331–335 (2010)
Fischer, M.J., Paterson, M.S.: String-matching and other products. Tech. rep. Massachusetts Institute of Technology, Cambridge Project MAC (1974)
Galil, Z., Giancarlo, R.: Improved string matching with k mismatches. ACM SIGACT News 17(4), 52–54 (1986)
Gusfield, D.: Algorithms on stings, trees, and sequences: computer science and computational biology. ACM SIGACT News 28(4), 41–60 (1997)
Iliopoulos, C.S., Rahman, M.S.: Pattern matching algorithms with don’t cares. In: Proceedings of 33rd SOFSEM, pp. 116–126 (2007)
Indyk, P.: Faster algorithms for string matching problems: matching the convolution bound. In: Proceedings 39th Annual Symposium on Foundations of Computer Science (Cat. no. 98CB36280), pp. 166–173. IEEE (1998)
Kalai, A.T.: Efficient pattern-matching with don’t cares. In: SODA 2002 Proceedings of the Thirteenth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 655–656. ACM Press (January 2002). https://www.microsoft.com/en-us/research/publication/efficient-pattern-matching-dont-cares/
Karp, R.M., Rabin, M.O.: Efficient randomized pattern-matching algorithms. IBM J. Res. Dev. 31(2), 249–260 (1987)
Knuth, D.E., Morris Jr., J.H., Pratt, V.R.: Fast pattern matching in strings. SIAM J. Comput. 6(2), 323–350 (1977)
Lam, T.W., Sung, W.K., Tam, S.L., Yiu, S.M.: Space efficient indexes for string matching with don’t cares. In: International Symposium on Algorithms and Computation, pp. 846–857. Springer (2007)
Landau, G.M., Vishkin, U.: Fast parallel and serial approximate string matching. J. Algorithms 10(2), 157–169 (1989)
Liu, N., Xie, F., Wu, X.: Multi-pattern matching with variable-length wildcards using suffix tree. Pattern Anal. Appl. 21(4), 1151–1165 (2018)
Nicolae, M., Rajasekaran, S.: On pattern matching with k mismatches and few don’t cares. Inf. Process. Lett. 118, 78–82 (2017)
Pinter, R.Y.: Efficient string matching with don’t-care patterns. In: Combinatorial Algorithms on Words, pp. 11–29. Springer (1985)
Rahman, M.S., Iliopoulos, C.S., Lee, I., Mohamed, M., Smyth, W.F.: Finding patterns with variable length gaps or don’t cares. In: International Computing and Combinatorics Conference, pp. 146–155. Springer (2006)
Sahinalp, S.C., Vishkin, U.: Efficient approximate and dynamic matching of patterns using a labeling paradigm. In: Proceedings of 37th Conference on Foundations of Computer Science, pp. 320–328. IEEE (1996)
Sung, W.K.: Algorithms in Bioinformatics: A Practical Introduction. CRC Press, Boca Raton (2009)
Ukkonen, E.: On-line construction of suffix trees. Algorithmica 14(3), 249–260 (1995)
Wu, S., Manber, U., et al.: A fast algorithm for multi-pattern searching. University of Arizona, Department of Computer Science (1994)
Ziviani, N., Baeza-Yates, R.: String processing and information retrieval. In: 14th International Symposium, SPIRE 2007 Santiago, Chile, October 29–31, 2007 Proceedings, vol. 4726. Springer (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Alamro, H., Iliopoulos, C. (2020). Detecting Pattern Efficiently with Don’t Cares. In: Iliadis, L., Angelov, P., Jayne, C., Pimenidis, E. (eds) Proceedings of the 21st EANN (Engineering Applications of Neural Networks) 2020 Conference. EANN 2020. Proceedings of the International Neural Networks Society, vol 2. Springer, Cham. https://doi.org/10.1007/978-3-030-48791-1_35
Download citation
DOI: https://doi.org/10.1007/978-3-030-48791-1_35
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-48790-4
Online ISBN: 978-3-030-48791-1
eBook Packages: Computer ScienceComputer Science (R0)