Abstract
Shellfish farming and harvesting have experienced a surge in popularity in recent years. However, the presence of fecal bacteria can contaminate shellfish, posing a risk to human health. This can result in the reclassification of shellfish production areas or even prohibit harvesting, leading to significant economic losses. Therefore, it is crucial to establish effective strategies for predicting contamination of shellfish by the bacteria Escherichia coli (E. coli). In this study, various univariate and multivariate time series forecasting models were investigated to address this problem. These models include autoregressive integrated moving average (ARIMA), vector autoregressive (VAR), and long short-term memory (LSTM) networks. The data used for this study consisted of measurements of both E. coli concentrations and meteorological variables, which were obtained from the Portuguese Institute of Sea and Atmosphere (IPMA) for four shellfish production areas. Overall, the ARIMA models performed the best with the lowest root mean squared error (RMSE) compared to the other models tested. The ARIMA models were able to accurately predict the concentrations of E. coli one week in advance. Additionally, the models were able to detect the peaks of E. coli for all areas, except for one, with recall values ranging from 0.75 to 1. This work represents the initial steps in the search for candidate forecasting models to help the shellfish production sector in anticipating harvesting prohibitions and hence supporting management and regulation decisions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Mateus, M., et al.: Early warning systems for shellfish safety: the pivotal role of computational science. In: Rodrigues, J.M.F., et al. (eds.) ICCS 2019. LNCS, vol. 11539, pp. 361–375. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-22747-0_28
Matarazzo Suplicy, F.: A review of the multiple benefits of mussel farming. Rev. Aquac. 12(1), 204–223 (2020)
Hallegraeff, G., Anderson, D., Cembella, A., Enevoldsen, H.: Manual on Harmful Marine Microalgae, 2nd edn. UNESCO (2004)
Mok, J.S., Shim, K.B., Kwon, J.Y., Kim, P.H.: Bacterial quality evaluation on the shellfish-producing area along the south coast of Korea and suitability for the consumption of shellfish products therein. Fisheries Aquatic Sci. 21(36), (2018)
European Union: Commission Implementing Regulation (EU) 2019/ 627 - of 15 March 2019 - Laying down Uniform Practical Arrangements for the Performance of Official Controls on Products of Animal Origin Intended for Human Consumption in Accordance with Regulation (EU) 2017. Offic. J. Eur. Union, 131, 51–100, (2019)
Schmidt, W., et al.: A generic approach for the development of short-term predictions of Escherichia coli and biotoxins in shellfish. In: Aquaculture Environment Interactions, vol. 10, pp. 173–185 (2018)
Chen, Q., Guan, T., Yun, L., Li, R., Recknagel, F.: Online forecasting chlorophyll a concentrations by an auto-regressive integrated moving average model: feasibilities and potentials. In: Harmful Algae, Elsevier B. V., vol. 43, pp. 58–65 (2015)
Cho, H., Choi, U.-J., Park, H.: Deep learning application to time-series prediction of daily chlorophyll-a concentration. In: WIT Transactions on Ecology and the Environment, vol. 215, pp. 157–163. https://doi.org/10.2495/EID180141
Lee, S., Lee, D.: Improved prediction of harmful algal blooms in four Major South Korea’s rivers using deep learning models. Int. J. Environ. Res. Public Health 15 (2018)
Cruz, R.C., Costa, P.R., Krippahl, L., Lopes, M.B.: Forecasting biotoxin contamination in mussels across production areas of the Portuguese coast with artificial neural networks. Knowl. Based Syst. 257 (2022)
Ciccarelli, C., et al.: Assessment of relationship between rainfall and Escherichia coli in clams (Chamelea gallina) using the Bayes Factor. Italian J. Food Saf. 6(6826) (2017)
Jang, J., Hur, H.G., Sadowsky, M.J., Byappanahalli, M.N., Yan, T., Ishii, S.: Environmental Escherichia coli: ecology and public health implications-a review. J. Appl. Microbiol. 123(3), 570–581 (2017)
Anacleto, P., Pedro, S., Nunes, M.L., Rosa, R., Marques, A.: Microbiological composition of native and exotic clams from Tagus estuary: effect of season and environmental parameters. Mar. Pollut. Bull. 74(1), 116–124 (2013)
Campos, C.J.A., Kershaw, S.R., Lee, R.J.: Environmental influences on faecal indicator organisms in coastal waters and their accumulation in bivalve shellfish. Estuaries Coasts 36, 834–853 (2013)
Brockwell, P.J., Davis, R.A.: Introduction to Time Series and Forecasting, 2nd edn. Springer, Berlin (2002)
Wei, W.W.S.: Multivariate Time Series Analysis and Applications, 1st edn. Wiley, Hoboken (2019)
Chatfield, C.: Time-Series Forecasting. CHAPMAN & HALL/CRC (2001)
Cowpertwait, P.S.P., Metcalfe, A.V.: Introductory Time Series with R. Springer, Berlin (2009)
Tsay, R.S.: Multivariate Time Series Analysis: With R and Financial Applications, 1st edn. Willey, Hopboken (2014)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9, 1735–80 (1997)
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016)
Hewamalage, H., Bergmeir, C., Bandara, K.: Recurrent neural networks for time series forecasting: current status and future directions. Int. J. Forecast. 37(1), 388–427 (2021)
Acknowledgements
This work was supported by national funds through Fundação para a Ciência e a Tecnologia (FCT) through projects UIDB/00297/2020 and UIDP/00297/2020 (NOVA Math), UIDB/00667/2020 and UIDP/00667/2020 (UNIDEMI), UIDB/50008/2020 (IT), UIDB/50021/2020 (INESC-ID), and also the project MATISSE (DSAIPA/DS/0026/2019), and CEECINST/00042/2021, PTDC/CCI-BIO/4180/2020, and PTDC/CTM-REF/2679/2020. This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 951970 (OLISSIPO project).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Ferraz, F., Ribeiro, D., Lopes, M.B., Pedro, S., Vinga, S., Carvalho, A.M. (2024). Comparative Analysis of Machine Learning Models for Time-Series Forecasting of Escherichia Coli Contamination in Portuguese Shellfish Production Areas. In: Nicosia, G., Ojha, V., La Malfa, E., La Malfa, G., Pardalos, P.M., Umeton, R. (eds) Machine Learning, Optimization, and Data Science. LOD 2023. Lecture Notes in Computer Science, vol 14505. Springer, Cham. https://doi.org/10.1007/978-3-031-53969-5_14
Download citation
DOI: https://doi.org/10.1007/978-3-031-53969-5_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-53968-8
Online ISBN: 978-3-031-53969-5
eBook Packages: Computer ScienceComputer Science (R0)