{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,19]],"date-time":"2025-03-19T10:49:21Z","timestamp":1742381361012},"reference-count":23,"publisher":"Institute for Operations Research and the Management Sciences (INFORMS)","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Management Science"],"published-print":{"date-parts":[[2022,3]]},"abstract":" Managing large-scale systems often involves simultaneously solving thousands of unrelated stochastic optimization problems, each with limited data. Intuition suggests that one can decouple these unrelated problems and solve them separately without loss of generality. We propose a novel data-pooling algorithm called Shrunken-SAA that disproves this intuition. In particular, we prove that combining data across problems can outperform decoupling, even when there is no a priori structure linking the problems and data are drawn independently. Our approach does not require strong distributional assumptions and applies to constrained, possibly nonconvex, nonsmooth optimization problems such as vehicle-routing, economic lot-sizing, or facility location. We compare and contrast our results to a similar phenomenon in statistics (Stein\u2019s phenomenon), highlighting unique features that arise in the optimization setting that are not present in estimation. We further prove that, as the number of problems grows large, Shrunken-SAA learns if pooling can improve upon decoupling and the optimal amount to pool, even if the average amount of data per problem is fixed and bounded. Importantly, we highlight a simple intuition based on stability that highlights when and why data pooling offers a benefit, elucidating this perhaps surprising phenomenon. This intuition further suggests that data pooling offers the most benefits when there are many problems, each of which has a small amount of relevant data. Finally, we demonstrate the practical benefits of data pooling using real data from a chain of retail drug stores in the context of inventory management. <\/jats:p> This paper was accepted by Chung Piaw Teo, Management Science Special Section on Data-Driven Prescriptive Analytics. <\/jats:p>","DOI":"10.1287\/mnsc.2020.3933","type":"journal-article","created":{"date-parts":[[2021,3,17]],"date-time":"2021-03-17T14:12:52Z","timestamp":1615990372000},"page":"1595-1615","source":"Crossref","is-referenced-by-count":21,"title":["Data Pooling in Stochastic Optimization"],"prefix":"10.1287","volume":"68","author":[{"ORCID":"http:\/\/orcid.org\/0000-0003-4371-9114","authenticated-orcid":false,"given":"Vishal","family":"Gupta","sequence":"first","affiliation":[{"name":"Data Science and Operations, USC Marshall School of Business, Los Angles, California 90089;"}]},{"ORCID":"http:\/\/orcid.org\/0000-0003-1672-0507","authenticated-orcid":false,"given":"Nathan","family":"Kallus","sequence":"additional","affiliation":[{"name":"School of Operations Research and Information Engineering and Cornell Tech, Cornell University, New York, New York 10044"}]}],"member":"109","reference":[{"key":"B1","first-page":"91","volume-title":"Research Developments in Probability and Statistics","author":"Beran R","year":"1996"},{"key":"B2","first-page":"499","volume":"2","author":"Bousquet O","year":"2002","journal-title":"J. Machine Learn. Res."},{"key":"B3","doi-asserted-by":"publisher","DOI":"10.1214\/aoms\/1177693318"},{"key":"B4","doi-asserted-by":"publisher","DOI":"10.1214\/11-STS382"},{"key":"B5","doi-asserted-by":"publisher","DOI":"10.1214\/aop\/1176996359"},{"key":"B6","doi-asserted-by":"publisher","DOI":"10.1016\/j.orl.2017.10.005"},{"key":"B7","doi-asserted-by":"publisher","DOI":"10.1016\/0047-259X(88)90153-4"},{"key":"B8","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbankfin.2013.04.033"},{"key":"B9","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9781316576533"},{"key":"B10","doi-asserted-by":"publisher","DOI":"10.1038\/scientificamerican0577-119"},{"key":"B11","doi-asserted-by":"publisher","DOI":"10.1007\/s10107-017-1172-1"},{"key":"B12","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-21606-5"},{"key":"B13","doi-asserted-by":"publisher","DOI":"10.1287\/mnsc.2019.3554"},{"key":"B14","doi-asserted-by":"publisher","DOI":"10.2307\/2331042"},{"key":"B15","doi-asserted-by":"publisher","DOI":"10.1137\/S1052623499363220"},{"key":"B16","doi-asserted-by":"publisher","DOI":"10.1287\/opre.2015.1422"},{"key":"B17","doi-asserted-by":"publisher","DOI":"10.2307\/3314676"},{"key":"B19","doi-asserted-by":"publisher","DOI":"10.1214\/cbms\/1462061091"},{"key":"B20","first-page":"2635","volume":"11","author":"Shalev-Shwartz S","year":"2010","journal-title":"J. Machine Learn. Res."},{"key":"B21","doi-asserted-by":"publisher","DOI":"10.1137\/1.9780898718751"},{"key":"B22","doi-asserted-by":"crossref","unstructured":"Stein C (1956) Inadmissibility of the usual estimator for the mean of a multivariate normal distribution. Proc. 3rd Berkeley Sympos. Math. Statist. Probab., vol. 1 (University of California Press, Berkeley), 197\u2013206.","DOI":"10.1525\/9780520313880-018"},{"key":"B23","doi-asserted-by":"publisher","DOI":"10.1214\/ss\/1177012274"},{"key":"B24","doi-asserted-by":"publisher","DOI":"10.3150\/13-BEJSP14"}],"container-title":["Management Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/pubsonline.informs.org\/doi\/pdf\/10.1287\/mnsc.2020.3933","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,4,2]],"date-time":"2023-04-02T11:50:49Z","timestamp":1680436249000},"score":1,"resource":{"primary":{"URL":"https:\/\/pubsonline.informs.org\/doi\/10.1287\/mnsc.2020.3933"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3]]},"references-count":23,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2022,3]]}},"alternative-id":["10.1287\/mnsc.2020.3933"],"URL":"https:\/\/doi.org\/10.1287\/mnsc.2020.3933","relation":{},"ISSN":["0025-1909","1526-5501"],"issn-type":[{"value":"0025-1909","type":"print"},{"value":"1526-5501","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,3]]}}}