Abstract
Recent trends focus on incentivizing consumers to reduce their demand consumption during peak hours for sustainable demand response. To minimize the loss, the distributor companies should target the right set of consumers and demand the right amount of electricity reductions. Almost all the existing algorithms focus on demanding single unit reductions from the selected consumers and thus have limited practical applicability. Even for single unit reductions, none of the work provides a polynomial time constant approximation factor algorithm to minimize the loss to the distributor company. This paper proposes a novel bounded integer min-knapsack algorithm (MinKPDR) and shows that the algorithm, while allowing for multiple unit reduction, also optimizes the loss to the distributor company within a factor of two (multiplicative) and a problem dependant additive constant. The loss is a function of the cost of buying the electricity from the market, costs incurred by the consumers, and compliance probabilities of the consumers. When the compliance probabilities of the consumers are not known, the problem can be formulated as a combinatorial multi-armed bandit (CMAB) problem. Existing CMAB algorithms fail to work in this setting due to the non-monotonicity of a reward function and time varying optimal sets. We propose a novel algorithm (Twin-MinKPDR-CB) to learn these compliance probabilities efficiently. Twin-MinKPDR-CB works for non-monotone reward functions, bounded min-knapsack constraints, and time-varying optimal sets. We theoretically show that Twin-MinKPDR-CB achieves sub-linear regret of \(O(\log T)\) with T being the number of rounds for which demand response is run.
A. Singh and P. M. Reddy—contributed equally.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Agrawal, S., Devanur, N.R.: Bandits with concave rewards and convex knapsacks. In: Proceedings of the Fifteenth ACM Conference on Economics and Computation, pp. 989–1006 (2014)
Akasiadis, C., et al.: Incentives for rescheduling residential electricity consumption to promote renewable energy usage. In: 2015 SAI Intelligent Systems Conference (IntelliSys), pp. 328–337, November 2015
Anderson, R., Fuloria, S.: On the security economics of electricity metering. In: Proceedings of the WEIS (2010)
Chao, H.: Competitive electricity markets with consumer subscription service in a smart grid. J. Regul. Econ. 1–26 (2012)
Chen, W., Hu, W., Li, F., Li, J., Liu, Y., Lu, P.: Combinatorial multi-armed bandit with general reward functions. Adv. Neural Inf. Process. Syst. 29, 1659–1667 (2016)
Chen, W., Wang, Y., Yuan, Y.: Combinatorial multi-armed bandit: General framework and applications. In: International Conference on Machine Learning, pp. 151–159 (2013)
Chen, X., Nie, Y., Li, N.: Online residential demand response via contextual multi-armed bandits. arXiv preprint arXiv:2003.03627 (2020)
Csirik, J.: Heuristics for the 0–1 min-knapsack problem. Acta Cybernetica 10(1–2), 15–20 (1991)
Gai, Y., Krishnamachari, B., Jain, R.: Combinatorial network optimization with unknown variables: multi-armed bandits with linear rewards and individual observations. IEEE/ACM Trans. Netw. 20(5), 1466–1478 (2012)
Gurobi Optimization L: Gurobi optimizer reference manual (2021). http://www.gurobi.com
Hsu, Y.Y., Su, C.C.: Dispatch of direct load control using dynamic programming. IEEE Trans. Power Syst. 6(3), 1056–1061 (1991)
Jain, S., Balakrishnan, N., Narahari, Y., Hussain, S.A., Voo, N.Y.: Constrained tâtonnement for fast and incentive compatible distributed demand management in smart grids. In: Proceedings of the Fourth International Conference on Future Energy Systems, pp. 125–136. ACM (2013)
Jain, S., Gujar, S., Bhat, S., Zoeter, O., Narahari, Y.: A quality assuring, cost optimal multi-armed bandit mechanism for expertsourcing. Artif. Intell. 254, 44–63 (2018)
Jain, S., Narayanaswamy, B., Narahari, Y.: A multiarmed bandit incentive mechanism for crowdsourcing demand response in smart grids (2014)
Li, Y., Hu, Q., Li, N.: Learning and selecting the right customers for reliability: a multi-armed bandit approach (2018)
Ma, H., Parkes, D.C., Robu, V.: Generalizing demand response through reward bidding. In: Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, pp. 60–68. AAMAS 2017, International Foundation for Autonomous Agents and Multiagent Systems, Richland, SC (2017)
Ma, H., Robu, V., Li, N.L., Parkes, D.C.: Incentivizing reliability in demand-side response. In: the proceedings of The 25th International Joint Conference on Artificial Intelligence (IJCAI 2016), pp. 352–358 (2016)
Methenitis, G., Kaisers, M., La Poutré, H.: Forecast-based mechanisms for demand response. In: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, pp. 1600–1608. International Foundation for Autonomous Agents and Multiagent Systems (2019)
Mittal, S., Schulz, A.S.: An FPTAS for optimizing a class of low-rank functions over a polytope. Math. Program. 141(1–2), 103–120 (2013)
Park, S., Jin, Y., Song, H., Yoon, Y.: Designing a critical peak pricing scheme for the profit maximization objective considering price responsiveness of customers. Energy 83, 521–531 (2015)
Ramchurn, S., Vytelingum, P., Rogers, A., Jennings, N.: Agent-based control for decentralised demand side management in the smart grid. In: AAMAS, pp. 5–12, February 2011
Robu, V., Chalkiadakis, G., Kota, R., Rogers, A., Jennings, N.R.: Rewarding cooperative virtual power plant formation using scoring rules. Energy 117, 19–28 (2016). https://doi.org/10.1016/j.energy.2016.10.077
Shweta, J., Sujit, G.: A multiarmed bandit based incentive mechanism for a subset selection of customers for demand response in smart grids. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 2046–2053 (2020)
Xu, H., Liu, Y., Lau, W.C., Li, R.: Combinatorial multi-armed bandits with concave rewards and fairness constraints. In: Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, pp. 2554–2560 (2020)
Zhang, Q., Wang, X., Fu, M.: Optimal implementation strategies for critical peak pricing. In: 2009 6th International Conference on the European Energy Market, pp. 1–6. IEEE (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Singh, A., Reddy, P.M., Jain, S., Gujar, S. (2021). Designing Bounded Min-Knapsack Bandits Algorithm for Sustainable Demand Response. In: Pham, D.N., Theeramunkong, T., Governatori, G., Liu, F. (eds) PRICAI 2021: Trends in Artificial Intelligence. PRICAI 2021. Lecture Notes in Computer Science(), vol 13031. Springer, Cham. https://doi.org/10.1007/978-3-030-89188-6_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-89188-6_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-89187-9
Online ISBN: 978-3-030-89188-6
eBook Packages: Computer ScienceComputer Science (R0)