Abstract
Enabling the knowledge discovery in Series of Satellite Images is a challenging task. Spatiotemporal and non-traditional data constitute this domain, i.e., images and semi-structured textual data. Although there is a massive amount of applicability for this knowledge, we still face a lack of work that addresses the challenges. The gap is even worse when working with the Series of Solar Satellite Images (SSSI). Thus aiming to enable the extraction of knowledge in SSSI, we proposed a new Extraction, Transformation, and Load (ETL) architecture called Multiple data-source Solar ETL (MS-ETL). MS-ETL extracts SSSI data from multiples source and joins it in a single source of truth, enabling knowledge extraction: The solar images are extracted from a data source that is a different SSSI textual data source. After the extraction, MS-ETL transforms the textual data into structured data and loads it to a database allowing simple out-of-the-box analysis. In the end, we obtain a single source of truth for SSSI of almost two solar cycles in a reasonable amount of time. Now that information is available and can be used to apply advanced machine learning techniques. Notably, the proposed approach collects well-suited data for input in deep learning techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Barbosa-Santillan, L.I., Sanchez-Escobar, J.J., Barbosa-Santillan, L.F., Meneses-Viveros, A., Gao, Z., Roa-Gil, J.C., León Paredes, G.A.: Streaming of global navigation satellite system data from the global system of navigation. Int. J. Adv. Comput. Sci. Appl. 12(2) (2021)
Benson, B., Brown, E., Bonasera, S., Acciarini, G., Pérez-Hernández, J.A., Sutton, E., Jah, M.K., Bridges, C., Jin, M., Baydin, A.G.: Simultaneous multivariate forecast of space weather indices using deep neural network ensembles (2021)
Brovelli, M.A., Cannata, M., Rogora, M.: Simile, a geospatial enabler of the monitoring of sustainable development goal 6 (ensure availability and sustainability of water for all). In: The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences XLII-4/W20, pp. 3–10 (2019)
Denis, A., Desclee, B., Migdall, S., Hansen, H., Bach, H., Ott, P., Kouadio, A.L., Tychon, B.: Multispectral remote sensing as a tool to support organic crop certification: Assessment of the discrimination level between organic and conventional maize. Remote Sens. 13(1) (2021)
Deshmukh, V., Baskar, S., Bradley, E., Berger, T., Meiss, J.D.: Machine learning approaches to solar-flare forecasting: Is complex better? (2022)
Fayyad, U., Piatetsky-Shapiro, G., Smyth, P.: From data mining to knowledge discovery in databases. AI Mag. 17(3), 37 (1996)
Sunspot Index and Long term Solar Observations. Solar cycles min/max (2022)
Nishizuka, N., Kubo, Y., Sugiura, K., Den, M., Ishii, M.: Operational solar flare prediction model using deep flare net. Earth, Planets Space 73(1) (2021)
Pesnell, D., Scherrer, P.H., Cheung, M., Woods, T., Fink, D.: Solar dynamics observatory—solar dynamics observatory (2022)
Sha, D., Miao, X., Xu, M., Yang, C., Xie, H., Mestas-Nuñez, A.M., Li, Y., Liu, Q., Yang, J.: An on-demand service for managing and analyzing arctic sea ice high spatial resolution imagery. Data 5(2) (2020)
Silveira-Junior, C.R.: Multiple data-source solar extraction, transformation, and load (2022)
Silveira-Junior, C.R., Santos, M.P., Ribeiro, M.: A flexible architecture for the pre-processing of solar satellite image time series data - the setl architecture. Int. J. Data Mining Model. Manag. 11, 129–143 (2019)
SolarMonitor.org. Noaa active regions (2022)
Acknowledgment
The authors would like to thank NASA for the full access and support to downloading the Solar Satellite Images that enabled this research work and to SolarMonitor.org for the full access to the Solar Flares data. The authors also thank the Federal University of São Carlos for supporting this project and Miro (miro.com) for enabling remote collaboration among the authors and the whole research project organization. Furthermore, we thank Miro for the offered financial support.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Silveira Junior, C.R., Ribeiro, M.X. (2024). MS-ETL: An Architecture for the Multiple Data Source Extraction, Transformation, and Load Applied to Solar Flares Data. In: Arai, K. (eds) Intelligent Systems and Applications. IntelliSys 2023. Lecture Notes in Networks and Systems, vol 823. Springer, Cham. https://doi.org/10.1007/978-3-031-47724-9_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-47724-9_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-47723-2
Online ISBN: 978-3-031-47724-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)