Abstract
High-quality, meaningful data are crucial for successfully implementing analytics solutions that apply artificial intelligence (AI) and perform simulations using physics-based models. In such context, this paper proposes a semi-automated approach for the semantic enrichment of the building energy consumption data of Sofia, delivering a more meaningful dataset for further analytics and simulations. The aim is to enrich the building energy consumption dataset of the City of Sofia, Bulgaria, from the Sustainable Energy Development Agency with cadastral and spatial data, including а cadastral identifier, geometry, coordinates, built-up area, floors, etc. The data enrichment process is rather time-consuming since it requires substantial manual work. For this reason, a semi-automated data enrichment pipeline has been developed, including various processing activities such as data classification, cleaning, filtering, validation, aggregation, augmentation and formatting. A dedicated crawler is developed to collect additional data needed for the enrichment. As a result, 1991 of a total of 2586 building data points have been successfully enriched. The enriched dataset is used for statistical and clustering analyses and applied to elaborate the energy atlas of Sofia.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
De Mulder, C., Flameling, T., Weijers, S., Amerlinck, Y., Nopens, I.: An open software package for data reconciliation and gap filling in preparation of Water and resource recovery facility modeling. Environ Model Softw. 107, 186–198 (2018)
Sukhobok, D., Nikolov, N., Roman, D.: Tabular data anomaly patterns. In: IEEE Big Data Innovations and Applications, pp. 25–34 (2017)
Mobasheri, A., Zipf, A., Francis, L.: OpenStreetMap data quality enrichment through awareness raising and collective action tools-experiences from a European project. Geo-Spatial Inf. Sci. 21(3), 234–246 (2018)
Mobasheri, A., Huang, H., Degrossi, L.C., Zipf, A.: Enrichment of OpenStreetMap data completeness with sidewalk geometries using data mining techniques. Sensors 18(2), 509 (2018)
Ciavotta, M., Cutrona, V., De Paoli, F., Nikolov, N., Palmonari, M., Roman, D.: Supporting semantic data enrichment at scale. In: Technologies and Applications for Big Data Value, pp. 19–39 (2022)
Senaratne, H., Mobasheri, A., Ali, A.L., Capineri, C., Haklay, M.: A review of volunteered geographic information quality assessment methods. Int. J. Geogr. Inf. Sci. 31, 139–167 (2017)
Cutrona, V.: Semantic enrichment for large-scale data analytics (2019)
Barberán, R., Diego, C., Pilar, E.: Water supply and energy in residential buildings: potential savings and financial profitability. Sustainability 11(1), 295 (2019)
IEA: 2013 Modernising Building Energy Codes. IEA Publications: Paris. Accessed Jan 2024
IEA: 2013 Transition to Sustainable Buildings. Strategies and Opportunities to 2050. IEA Publications, Paris. Accessed Jan 2024
Hao, L., Yuhuan, Z., Jia-Ning, K., Song, W.: Identifying sectoral energy-carbon-water nexus characteristics of China. J. Clean. Prod. 249, 1–13 (2020)
Fenner, A.E., et al.: The carbon footprint of buildings: a review of methodologies and applications. Renew. Sustain. Energy Rev. 94, 1142–1152 (2018)
European Parliament and Council: Directive 2010/31/EU on the energy performance of buildings. Off. J. Eur. Union. https://doi.org/10.3000/17252555.L_2010.153.eng. Accessed Jan 2024
Building certificates of energy characteristics 2023, Sustainable Energy Development Agency. https://portal.seea.government.bg/en/IndustrialSystemsReport. Accessed Dec 2023
Vitanova, L., Petrova-Antonova, D., Hristov, P.O., Shirinyan, E.: Towards energy Atlas of Sofia City in Bulgaria. Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci. 48, 123–129 (2023)
Shirinyan, E., Petrova-Antonova, D.: Modeling Buildings in CityGML LOD1: building parts, terrain intersection curve, and address features. ISPRS Int. J. Geo-Inf. 11, 166 (2022). https://doi.org/10.3390/ijgi11030166
Enriched energy consumption dataset. https://github.com/Teodora18/enriched_energy_consumption_data. Accessed Jan 2024
Acknowledgements
This research is part of the GATE project funded by the Horizon 2020 WIDESPREAD-2018–2020 TEAMING Phase 2 programme under agreement no. 857155 and Operational Programme Science and Education for Smart Growth under Grant Agreement No. BG05M2OP001–1.003–0002-C01, the enrRichMyData project, funded by Horizon Europe research and innovation programme under agreement No. 101070284, and the FLEdg project, funded under the Driving Urban Transitions (DUT) Partnership program, under agreement no. KP-06-D002/5.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 IFIP International Federation for Information Processing
About this paper
Cite this paper
Koleva, T., Vitanova, L., Petrova-Antonova, D., Kostadinov, A. (2024). Semantic Enrichment and Analysis of Building Energy Consumption Data for the City of Sofia. In: Maglogiannis, I., Iliadis, L., Karydis, I., Papaleonidas, A., Chochliouros, I. (eds) Artificial Intelligence Applications and Innovations. AIAI 2024 IFIP WG 12.5 International Workshops. AIAI 2024. IFIP Advances in Information and Communication Technology, vol 715. Springer, Cham. https://doi.org/10.1007/978-3-031-63227-3_33
Download citation
DOI: https://doi.org/10.1007/978-3-031-63227-3_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-63226-6
Online ISBN: 978-3-031-63227-3
eBook Packages: Computer ScienceComputer Science (R0)