Multi-fidelity surrogate-based optimization for decomposed buffer allocation problems | OR Spectrum Skip to main content
Log in

Multi-fidelity surrogate-based optimization for decomposed buffer allocation problems

  • Regular Article
  • Published:
OR Spectrum Aims and scope Submit manuscript

Abstract

The buffer allocation problem (BAP) for flow lines has been extensively addressed in the literature. In the framework of iterative approaches, algorithms alternate an evaluative method and a generative method. Since an accurate estimation of system performance typically requires high computational effort, an efficient generative method reducing the number of iterations is desirable, for searching for the optimal buffer configuration in a reasonable time. In this work, an iterative optimization algorithm is proposed in which a highly accurate simulation is used as the evaluative method and a surrogate-based optimization is used as the generative method. The surrogate model of the system performance is built to select promising solutions so that an expensive simulation budget is avoided. The performance of the surrogate model is improved with the help of fast but rough estimators obtained with approximated analytical methods. The algorithm is embedded in a problem decomposition framework: several problem portions are solved hierarchically to reduce the solution space and to ease the search of the optimum solution. Further, the paper investigates a jumping strategy for practical application of the approach so that the algorithm response time is reduced. Numerical results are based on balanced and unbalanced flow lines composed of single-machine stations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

References

  • Alfieri A, Matta A (2012) Mathematical programming formulations for approximate simulation of multistage production systems. Eur J Oper Res 219(3):773–783

    Article  Google Scholar 

  • Colledani M, Gershwin S (2013) A decomposition method for approximate evaluation of continuous flow multi-stage lines with general markovian machines. Ann Oper Res 209(1):5–40

    Article  Google Scholar 

  • Dallery Y, David R, Xiaolan X (1988) An efficient algorithm for analysis of transfer lines with unreliable machines and finite buffers. IIE Trans 20:280–283

    Article  Google Scholar 

  • Dolgui A, Eremeev AV, Sigaev VS (2007) Hbba: hybrid algorithm for buffer allocation in tandem production lines. J Intell Manuf 18(3):411–420

    Article  Google Scholar 

  • Frigerio N, Lin Z, Matta A (2018) Multi-fidelity models for decomposed simulation optimization. In: Rabe M et al (ed) Proceedings of the 2018 Winter Simulation Conference (WSC), Gothenburg, Sweden, pp 2217–2248

  • Gershwin SB (1987) An efficient decomposition method for the approximate evaluation of tandem queues with finite storage space and blocking. Oper Res 35(2):291–305

    Article  Google Scholar 

  • Gershwin SB, Schor JE (2000) Efficient algorithms for buffer space allocation. Ann Oper Res 93(1–4):117–144

    Article  Google Scholar 

  • Helber S, Schimmelpfeng K, Stolletz R, Lagershausen S (2011) Using linear programming to analyze and optimize stochastic flow lines. Ann Oper Res 182(1):193–211

    Article  Google Scholar 

  • Hillier MS (2000) Characterizing the optimal allocation of storage space in production line systems with variable processing times. IIE Trans 32(1):1–8

    Google Scholar 

  • Jones DR, Schonlau M, Welch WJ (1998) Efficient global optimization of expensive black-box functions. J Global Optim 13(4):455–492

    Article  Google Scholar 

  • Kose SY, Kilincci O (2015) Hybrid approach for buffer allocation in open serial production lines. Comput Oper Res 60:67–78

    Article  Google Scholar 

  • Law AM, Kelton WD (2000) Simulation modeling & analysis, 3rd edn. McGraw-Hill Inc, New York

    Google Scholar 

  • Li J, Meerkov S (2009) Production Systems Engineering

  • Liberopoulos G, Papadopoulos C, Tan B, MacGregor Smith J, Gershwin S (2006) Stochastic modeling of manufacturing systems: advances in design, performance evaluation, and control issues. Springer, New York

    Book  Google Scholar 

  • Lin Z, Matta A, Shanthikumar J (2020) A matlab extended kernel regression toolbox. https://doi.org/10.13140/rg.2.2.16632.19206

  • Lin Z, Matta A, Shanthikumar JG (2019) Combining simulation experiments and analytical models with area-based accuracy for performance evaluation of manufacturing systems. IISE Trans. 51(3):266–283

    Article  Google Scholar 

  • Matta A (2008) Simulation optimization with mathematical programming representation of discrete event systems. In: Mason SJ et al (ed) Proceedings of the 2008 Winter Simulation Conference, IEEE, Miami, FL, pp 1393–1400

  • Matta A, Pezzoni M, Semeraro Q (2012) A kriging-based algorithm to optimize production systems approximated by analytical models. J Intell Manuf 23(3):587–597

    Article  Google Scholar 

  • McKay MD, Beckman RJ, Conover WJ (1979) A comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics 21(2):239–245

    Google Scholar 

  • Mockus J, Tiesis V, Zilinskas A (1978) The application Bayesian methods for seeking the extremum. Towards Global Optim 2:117–129

    Google Scholar 

  • Papadopoulos C, O’Kelly M, Vidalis M, Spinellis D (2009) Analysis and design of discrete part production lines, vol 31

  • Sacks J, Welch WJ, Mitchell TJ, Wynn HP (1989) Design and analysis of computer experiments. Stat Sci 4:409–423

    Article  Google Scholar 

  • Shi C, Gershwin SB (2016) A segmentation approach for solving buffer allocation problems in large production systems. Int J Prod Res 54(20):6121–6141

    Article  Google Scholar 

  • Shi L, Men S (2003) Optimal buffer allocation in production lines. IIE Trans 35(1):1–10

    Article  Google Scholar 

  • Soyster AL, Schmidt J, Rohrer M (1979) Allocation of buffer capacities for a class of fixed cycle production lines. AIIE Trans 11(2):140–146

    Article  Google Scholar 

  • Stolletz R, Weiss S (2013) Buffer allocation using exact linear programming formulations and sampling approaches. IFAC Proc Vol 46(9):1435–1440

    Article  Google Scholar 

  • Tolio T, Matta A (1998) A method for performance evaluation of automated flow lines. CIRP Ann Manuf Technol 47(1):373–376

    Article  Google Scholar 

  • Wand MP, Jones MC (1995) Kernel Smoothing. Monographs on Statistics & Applied Probability. Chapman and Hall/CRC, New York

    Book  Google Scholar 

  • Weiss S, Matta A, Stolletz R (2018) Optimization of buffer allocations in flow lines with limited supply. IISE Trans 50(3):191–202

    Article  Google Scholar 

  • Weiss S, Schwarz JA, Stolletz R (2019) The buffer allocation problem in production lines: formulations, solution methods, and instances. IISE Trans 51(5):456–485

    Article  Google Scholar 

  • Weiss S, Stolletz R (2015) Buffer allocation in stochastic flow lines via sample-based optimization with initial bounds. OR Spectr 37(4):869–902

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nicla Frigerio.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

This section describes briefly how to build the surrogate model using the EKR method. A MATLAB EKR toolbox (both the EKR code and a manual) can be found at Lin et al. (2020) (https://doi.org/10.13140/RG.2.2.16632.19206). More details about the method can be found in Lin et al. (2019).

Given a system configuration \(\varvec{x}\) under which the high-fidelity system performance \(y_h(\varvec{x})\) is unknown, its low-fidelity outputs \(y_{l_j}(\varvec{x}), \forall j\) are firstly corrected by scaling functions. Two scaling functions are considered:

  1. 1.

    Additive scaling function: \({\tilde{y}}^{l_j}_i(\varvec{x})=y_{l_j}(\varvec{x})+(y_h(\varvec{x}^0_i)-y_{l_j}(\varvec{x}^0_i)),\forall i\in {\mathcal {N}}, \forall j\in {\mathcal {J}};\)

  2. 2.

    Multiplicative scaling function: \({\tilde{y}}^{l_j}_i(\varvec{x})=\frac{y_h(\varvec{x}^0_i)}{y_{l_j}(\varvec{x}^0_i)} \cdot y_{l_j}(\varvec{x}),\forall i\in {\mathcal {N}}, \forall j\in {\mathcal {J}}.\)

where \(y_h(\varvec{x}_i^0)\) and \(y_{l_j}(\varvec{x}^0_i)\) are the outputs of the high-fidelity model and the jth low-fidelity model at the ith design point \(\varvec{x}^0_i\), respectively. Different scaling functions can be used for different low-fidelity models.

These corrected outputs are expected to have more reliable prediction performance if their scaling functions are estimated by the initial design points close to the unobserved point. Therefore, for the jth low-fidelity model, Kernel regression (Wand and Jones 1995) is used to locally fit a polynomial on the corrected data \({\tilde{y}}^{l_j}_i(\varvec{x})\) with distance-based weights. The system performance estimate at the unobserved point \(\varvec{x}\), using the jth low-fidelity model’s corrected data, has a closed form:

$$\begin{aligned} \hat{y}_{l_j}(\varvec{x})=\varvec{e}_1^T(\varvec{X_x}^T\varvec{W_x}\varvec{X_x})^{-1}\varvec{X_x}^T\varvec{W_x}\tilde{\varvec{Y}}_{l_j}, \forall j \in {\mathcal {J}}, \end{aligned}$$
(15)

where \(\varvec{e}_1\) is a \((dp+1)\)-dimensional vector whose first element is 1 and the rest are 0,

$$\begin{aligned} {\varvec{X}}_{\varvec{x}}=\left[ \begin{matrix} 1 &{}\quad (\varvec{x}^0_1-\varvec{x})^T &{}\quad \cdots &{}\quad \left[ (\varvec{x}^0_1-\varvec{x})^p\right] ^T \\ 1 &{}\quad (\varvec{x}^0_2-\varvec{x})^T &{}\quad \cdots &{}\quad \left[ (\varvec{x}^0_2-\varvec{x})^p\right] ^T \\ \vdots &{}\quad \vdots &{}\quad \ddots &{}\quad \vdots \\ 1 &{}\quad (\varvec{x}^0_n-\varvec{x})^T &{}\quad \cdots &{}\quad \left[ (\varvec{x}^0_n-\varvec{x})^p\right] ^T \\ \end{matrix} \right] \end{aligned}$$

is an \(n \times (dp+1)\) matrix, p is the degree of the fitted polynomial and \(\tilde{\varvec{Y}}_{l_j}=[{\tilde{y}}^{l_j}_1(\varvec{x}),\ldots ,{\tilde{y}}^{l_j}_n(\varvec{x})]^T\). \(\varvec{W_x}=\mathrm{diag}\{K_{1,\varvec{\varTheta }_1}(\varvec{x}^0_1-\varvec{x}),\ldots ,K_{1,\varvec{\varTheta }_1}(\varvec{x}^0_n-\varvec{x})\}\) is an \(n \times n\) diagonal matrix where

$$\begin{aligned} K_{1,\varvec{\varTheta }_1}(\varvec{x}^0_i-\varvec{x})=\prod _{k=1}^{d}\exp \left\{ -\frac{1}{2\theta _{1,k}}(x_{ik}^0-x_k)^2\right\} , \forall i \in {\mathcal {N}}, \end{aligned}$$

\(\varvec{\varTheta }_1=\mathrm{diag}\{\theta _{1,1},\ldots ,\theta _{1,d}\}\), and \(\theta _{1,k}>0, k=1,\cdots ,d\) are parameters to be selected.

Finally, the estimates from different low-fidelity models are combined with the weights related to the estimated weighted square error:

$$\begin{aligned} \hat{y}_{\text {EKR}}(\varvec{x})=\sum _{j \in {\mathcal {J}}}w_{l_j}(\varvec{x})\hat{y}_{l_j}(\varvec{x}), \end{aligned}$$

where

$$\begin{aligned} w_{l_j}(\varvec{x})= & {} \frac{K_{2,\theta _2}(\hat{WSE}_{l_j}(\varvec{x}))}{\sum _{i \in {\mathcal {J}}} K_{2,\theta _2}(\hat{WSE}_{l_i}(\varvec{x}))},\\ \hat{WSE}_{l_j}(\varvec{x})= & {} (\mathrm{tr}(\varvec{W_x}))^{-1}\tilde{\varvec{Y}}_{l_j}^T(\varvec{W_x}-\varvec{W_x}^T\varvec{X_x}(\varvec{X_x}^T\varvec{W_x}\varvec{X_x})^{-1}\varvec{X_x}^T\varvec{W_x})\tilde{\varvec{Y}}_{l_j}, \forall j \in {\mathcal {J}}, \end{aligned}$$

and \(\mathrm{tr}(\varvec{W_x})\) is the trace of \(\varvec{W_x}\). \(K_{2,\theta _2}(\cdot )\) has the following form:

$$\begin{aligned} K_{2,\theta _2}(\hat{WSE}_{l_j}(\varvec{x}))=\exp \left\{ {-\frac{\hat{WSE}_{l_j}(\varvec{x})}{2\theta _2WSE_\mathrm{min}(\varvec{x})}}\right\} , \forall j \in {\mathcal {J}}, \end{aligned}$$

where

$$\begin{aligned} WSE_\mathrm{min}(\varvec{x})=\min _{j\in {\mathcal {J}}}\{\hat{WSE}_{l_j}(\varvec{x})\}, \end{aligned}$$

and \(\theta _2\) is an unknown parameter to be selected.

The model parameters \(\theta _{1,k}, \theta _2\) are selected according to the cross-validation. Where \(p=1\), the estimated root square error of \(\hat{y}_{\text {EKR}}(\varvec{x})\) as an estimate of the high-fidelity response is provided as:

$$\begin{aligned} \hat{s}_{\text {EKR}}(\varvec{x})=\sqrt{\hat{WSE}(\varvec{x})\left( 1+\frac{1}{2^{d/2}\mathrm{tr}(\varvec{W_x})}\right) } \end{aligned}$$

where

$$\begin{aligned} \hat{WSE}(\varvec{x})=(\mathrm{tr}(\varvec{W_x}))^{-1}\tilde{\varvec{Y}}^T(\varvec{W_x}-\varvec{W_x}^T\varvec{X_x}(\varvec{X_x}^T\varvec{W_x}\varvec{X_x})^{-1}\varvec{X_x}^T\varvec{W_x})\tilde{\varvec{Y}} \end{aligned}$$

and:

$$\begin{aligned} \tilde{\varvec{Y}}=\left[ \sum _{j\in {\mathcal {J}}}w_{l_j}(\varvec{x}){\tilde{y}}^{l_j}_1(\varvec{x}),\ldots ,\sum _{j\in {\mathcal {J}}}w_{l_j}(\varvec{x}){\tilde{y}}^{l_j}_n(\varvec{x})\right] ^T. \end{aligned}$$

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lin, Z., Frigerio, N., Matta, A. et al. Multi-fidelity surrogate-based optimization for decomposed buffer allocation problems. OR Spectrum 43, 223–253 (2021). https://doi.org/10.1007/s00291-020-00603-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00291-020-00603-y

Keywords

Navigation