Abstract
Since buildings are one of the largest sources of energy consumption in most cities of the world, energy management is one of the major concerns in their design. To ameliorate this problem, buildings are becoming smarter by the incorporation of intelligent supervision and control systems. Data captured by the sensors can be interpreted and processed by rule-based computation methods of biological inspiration (such as genetic fuzzy systems, GFS) for predicting the future behavior of the building in a knowledge-based interpretable human-like manner. GFS are computational models inspired in human cognition which use evolutionary computation (inspired in the natural evolution) to automatically learn fuzzy rules which contain explicit imprecise knowledge about a system or process. This knowledge, represented using fuzzy rules that involve fuzzy linguistic variables and values, is used to perform approximate reasoning on the input values for obtaining inferred values for the output variables. In energy management of buildings, these rules allow a smart control of the system actuators to reduce the building average energy consumption. However, the large amount of data produced on a per second basis complicates the generation of accurate and interpretable models by means of traditional methods. In this paper, we present an evolutionary computation-based approach, namely a genetic fuzzy system, to build scalable and interpretable knowledge bases for predicting energy consumption in smart buildings. For accomplishing this task, we propose a cognitive computation system for multi-step prediction based on S-FRULER, a state-of-the-art scalable distributed GFS, coupled with a feature subset selection method to automatically select the most relevant features for different time steps. S-FRULER is able to learn a fuzzy rule-based system made up of Takagi-Sugeno-Kang (TSK) rules that are able to predict the output values using both linguistic imprecise knowledge (represented by fuzzy sets) and fuzzy inference. Experiments with real data on two different problems related with the energy management revealed an average improvement of 6% on accuracy with respect to S-FRULER without feature selection, and with knowledge bases with a lower number of variables.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Alcalá R, Alcalá-Fdez J, Herrera F. A proposal for the genetic lateral tuning of linguistic fuzzy systems and its interaction with rule selection. IEEE Trans Fuzzy Syst 2007;15(4):616–35.
Alcalá R, Alcalá-Fdez J, Herrera F, Otero J. Genetic learning of accurate and compact fuzzy rule based systems based on the 2-tuples linguistic representation. Int J Approx Reason 2007;44(1):45–64.
Alcalá R, Gacto MJ, Herrera F. A fast and scalable multiobjective genetic fuzzy system for linguistic fuzzy modeling in high-dimensional regression problems. IEEE Trans Fuzzy Syst 2011;19(4):666–81.
Aljarah I, Ala’M AZ, Faris H, Hassonah MA, Mirjalili S, Saadeh H. Simultaneous feature selection and support vector machine optimization using the grasshopper optimization algorithm. Cogn Comput 2018; 10(3):478–495.
Atli BG, Miche Y, Kalliola A, Oliver I, Holtmanns S, Lendasse A. Anomaly-based intrusion detection using extreme learning machine and aggregation of network traffic statistics in probability space. Cogn Comput 2018;10(5):848–863.
Balac N, Sipes T, Wolter N, Nunes K, Sinkovits B, Karimabadi H. Large scale predictive analytics for real-time energy management. IEEE international conference on Big Data, 2013; 2013. p. 657–64.
Bontempi G, Taieb S, Borgne YL. Machine learning strategies for time series forecasting. Business Intelligence. 2013;62–77.
Chen T, Guestrin C. Xgboost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, CA, USA, August 13–17, 2016; 2016. p. 785–794.
Cordón O, Herrera F, Hoffmann F, Magdalena L, Cordon O, Herrera F, Hoffmann F. Genetic fuzzy systems. Singapore: World Scientific Publishing Company; 2001.
Ding S, Xi X, Liu Z, Qiao H, Zhang B. A novel manifold regularized online semi-supervised learning model. Cogn Comput 2018;10(1):49–61.
Fan C, Xiao F, Wang S. Development of prediction models for next-day building energy consumption and peak power demand using data mining techniques. Appl Energy 2014;127:1–10.
Fernández A, López V, del Jesus MJ, Herrera F. 2015. Revisiting evolutionary fuzzy systems: taxonomy, applications, new trends and challenges. Knowl-Based Syst. 80:109–121.
Fernández A, del Río S, López V, Bawakid A, del Jesus MJ, Benítez JM, Herrera F. Big data with cloud computing: an insight on the computing environment, mapreduce, and programming frameworks. Wiley Interdiscip Rev: Data Min Knowl Disc 2014;4(5):380–409.
Foucquier A, Robert S, Suard F, Stéphan L, Jay A. State of the art in building modelling and energy performances prediction: a review. Renew Sust Energ Rev 2013;23:272–88.
Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. 2001;29(5):1189–1232.
Guyon I, Elisseeff A. An introduction to variable and feature selection. J Mach Learn Res 2003;3:1157–82.
Herrera F. Genetic fuzzy systems: taxonomy, current research trends and prospects. Evol Intel 2008;1(1):27–46.
Ishibuchi H, Nozaki K, Tanaka H, Hosaka Y, Matsuda M. Empirical study on learning in fuzzy systems by rice taste analysis. Fuzzy Sets Syst 1994;64(2):129–44.
L’Heureux A, Grolinger K, ElYamany HF, Capretz MAM. Machine learning with big data: challenges and approaches. IEEE Access 2017;5:7776–97.
Li C, Deng C, Zhou S, Zhao B, Huang GB. Conditional random mapping for effective elm feature representation. Cogn Comput 2018;10(5):827–847.
Life-OPERE. 2016. http://www.life-opere.org/. Online; Accessed 19 Feb 2018.
Lu T, Viljanen M. Prediction of indoor temperature and relative humidity using neural network models: model comparison. Neural Comput Appl 2009;18(4):345–57.
Luo X, Zhu X, Lim EG, Huang Y. A semi-blind model with parameter identification for building temperature estimation. Cogn Comput 2018;10(1):105–16.
Marchiori E. Class conditional nearest neighbor for large margin instance selection. IEEE Trans Pattern Anal Mach Intell 2010;32(2):364–70.
Márquez AA, Márquez FA, Roldán AM, Peregrín A. An efficient adaptive fuzzy inference system for complex and high dimensional regression problems in linguistic fuzzy modelling. Knowl-Based Syst 2013; 54:42–52.
Mechaqrane A, Zouak M. A comparison of linear and neural network ARX models applied to a prediction of the indoor temperature of a building. Neural Comput Appl 2004;13(1):32–7.
Meteogalicia: Galician meteorological web page. http://www.meteogalicia.gal (2016). Online; Accessed 19 Feb 2018.
Nian X, Sun M, Guo H, Wang H, Dai L. Observer-based stabilization control of time-delay t-s fuzzy systems via the non-uniform delay partitioning approach. Cogn Comput 2017;9(1):225–36.
Parliament E, Council E. On the energy performance of buildings. Off J Eur Union 2010;153:13–35.
Ramírez-Gallego S, Fernández A, García S, Chen M, Herrera F. Big data: tutorial and guidelines on information and process fusion for analytics algorithms with mapreduce. Inf Fusion 2018;42:51–61.
Reyes-Ortiz JL, Oneto L, Anguita D. Big data analytics in the cloud: spark on Hadoop vs MPI/OpenMP on Beowulf. Procedia Comput Sci 2015;53:121–30.
Riza LS, Bergmeir CN, Herrera F, Benítez Sánchez JM. 2015. FRBS: Fuzzy rule-based systems for classification and regression in R. American Statistical Association.
Rodríguez FJ, García A, Pardo PJ, Chávez F, Luque-Baena RM. Study and classification of plum varieties using image analysis and deep learning techniques. Progr Artif Intell 2018;7(2):119–27.
Rodríguez-Fdez I, Mucientes M, Bugarín A. An instance selection algorithm for regression and its application in variance reduction. Proceedings of the IEEE international conference on fuzzy systems (FUZZ-IEEE); 2013. p. 1–8.
Rodríguez-Fdez I, Mucientes M, Bugarín A. Reducing the complexity in genetic learning of accurate regression TSK rule-based systems. Proceedings of the 2015 IEEE international conference on fuzzy systems (FUZZ-IEEE). IEEE; 2015. p. 1–8.
Rodríguez-Fdez I, Mucientes M, Bugarín A. FRULER: fuzzy rule learning through evolution for regression. Inf Sci 2016;354:1–18.
Rodríguez-Fdez I, Mucientes M, Bugarín A. S-FRULER: scalable fuzzy rule learning through evolution for regression. Knowl-Based Syst 2016;110:255–66.
Rodriguez-Mier P, Mucientes M, Bugarín A. Scalable modeling of thermal dynamics in buildings using fuzzy rules for regression. 2017 IEEE international conference on fuzzy systems (FUZZ-IEEE); 2017. p. 1–6.
Ruano A, Crispim E, Conceicao E, Lúcio MM. Prediction of building’s temperature using neural networks models. Energy Build 2006;38(6):682–94.
Shaikh PH, Nor NBM, Nallagownden P, Elamvazuthi I, Ibrahim T. A review on optimized control systems for building energy and comfort management of smart sustainable buildings. Renew Sust Energ Rev 2014;34: 409–29.
Teodosiu C, Hohota R, Rusaouën G, Woloszyn M. Numerical prediction of indoor air humidity and its effect on indoor environment. Build Environ 2003;38(5):655–64.
Thomas B, Soleimani-Mohseni M. Artificial neural network models for indoor temperature prediction: investigations in two buildings. Neural Comput Appl 2006;16(1):81–9.
White T. 2012. Hadoop: the definitive guide. O’Reilly Media, Inc.
Yaqoob I, Hashem IAT, Gani A, Mokhtar S, Ahmed E, Anuar NB, Vasilakos AV. Big data: from beginning to future. Int J Inf Manag 2016;36(6):1231–47.
Zaharia M, Chowdhury M, Franklin MJ, Shenker S, Stoica I. Spark: cluster computing with working sets. Proceedings of the 2nd USENIX conference on hot topics in cloud computing, vol 10, p 10; 2010.
Zhao HX, Magoulès F. A review on the prediction of building energy consumption. Renew Sust Energ Rev 2012;16(6):3586–92.
Zhao J, Lasternas B, Lam KP, Yun R, Loftness V. Occupant behavior and schedule modeling for building energy simulation through office appliance power consumption data mining. Energy Build 2014;82:341–55.
Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Stat Soc 2005;67(2):301–20.
Funding
This research was supported by the European Union LIFE programme (grant LIFE12 ENV/ES/001173). Also, the following support is acknowledged: Spanish Ministry of Economy and Competitiveness (grant TIN2017-84796-C2-1-R), Galician Ministry of Education, Culture and Universities (grants GRC2014/030 and accreditation 2016-2019, ED431G/08). These grants are co-funded by the European Regional Development Fund (ERDF/FEDER program).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare that they have no conflict of interest.
Additional information
Informed Consent
Informed consent was not required as no human or animals were involved.
Human and Animal Rights
This article does not contain any studies with human or animal subjects performed by any of the authors.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Rodriguez-Mier, P., Mucientes, M. & Bugarín, A. Feature Selection and Evolutionary Rule Learning for Big Data in Smart Building Energy Management. Cogn Comput 11, 418–433 (2019). https://doi.org/10.1007/s12559-019-09630-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12559-019-09630-6