Application of robust optimization to automated test assembly | Annals of Operations Research Skip to main content
Log in

Application of robust optimization to automated test assembly

  • Published:
Annals of Operations Research Aims and scope Submit manuscript

Abstract

In automated test assembly (ATA), 0-1 linear programming (0-1 LP) methods are applied to select questions (items) from an item bank to assemble an optimal test. The objective in this 0-1 LP optimization problem is to assemble a test that measures, in as precise a way as possible, the ability of candidates. Item response theory (IRT) is commonly applied to model the relationship between the responses of candidates and their ability level. Parameters that describe the characteristics of each item, such as difficulty level and the extent to which an item differentiates between more and less able test takers (discrimination) are estimated in the application of the IRT model. Unfortunately, since all parameters in IRT models have to be estimated, they do have a level of uncertainty to them. Some of the other parameters in the test assembly model, such as average response times, have been estimated with uncertainty as well. General 0-1 LP methods do not take this uncertainty into account, and overestimate the predicted level of measurement precision. In this paper, alternative robust optimization methods are applied. It is demonstrated how the Bertsimas and Sim method can be applied to take this uncertainty into account in ATA. The impact of applying this method is illustrated in two numerical examples. Implications are discussed, and some directions for future research are presented.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

References

  • Adema, J. J., Boekkooi-Timminga, E., & van der Linden, W. J. (1991). Achievement test construction using 0-1 linear programming. European Journal of Operations Research, 55, 103–111.

    Article  Google Scholar 

  • Alvarez, P. P., & Vera, J. R. (2011). Application of robust optimization to the sawmill planning problem. Annals of Operations Research. Advance online publication. doi:10.1007/s10479-011-1002-4.

    Google Scholar 

  • Ariel, A., Veldkamp, B. P., & Breithaupt, K. (2006). Optimal testlet pool assembly for multi-stage testing designs. Applied Psychological Measurement, 30, 204–215.

    Article  Google Scholar 

  • Armstrong, R., Belov, D., & Weissman, A. (2005). Developing and assembling the law school admission test. Interfaces, 35, 140–151.

    Article  Google Scholar 

  • Atamtürk, A. (2006). Strong formulations of robust mixed 0-1 programming. Mathematical Programming, 108, 235–250.

    Article  Google Scholar 

  • Ben-Tal, A., El Ghaoui, L., & Nemirovski, A. (2009). Robust optimization. Princeton: Princeton University Press.

    Google Scholar 

  • Ben-Tal, A., & Nemirovski, A. (2000). Robust solutions of linear programming problems contaminated with uncertain data. Mathematical Programming, 88, 411–424.

    Article  Google Scholar 

  • Bertsimas, D., & Sim, M. (2003). Robust discrete optimization and network flows. Mathematical Programming, 98, 49–71.

    Article  Google Scholar 

  • Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991). Fundamentals of item response theory. Newbury Park: Sage.

    Google Scholar 

  • Huitzing, H. A., Veldkamp, B. P., & Verschoor, A. J. (2005). Infeasibility in automatic test assembly models: a comparison study of different methods. Journal of Educational Measurement, 42, 223–243.

    Article  Google Scholar 

  • De Jong, M. G., Steenkamp, J. B. E. M., & Veldkamp, B. P. (2009). A model for the construction of country-specific, yet internationally comparable short-form marketing scales. Marketing Science, 28, 674–689.

    Article  Google Scholar 

  • Lord, F. M. (1980). Applications of item response theory to practical testing problems. Hillsdale: Erlbaum.

    Google Scholar 

  • Soyster, A. L. (1973). Convex programming with set-inclusive constraints and applications to inexact linear programming. Operations Research, 21, 1154–1157.

    Article  Google Scholar 

  • Theunissen, T. J. J. M. (1985). Binary programming and test design. Psychometrika, 50, 411–420.

    Article  Google Scholar 

  • van der Linden, W. J. (1998). Optimal assembly of psychological and educational test. Applied Psychological Measurement, 22, 195–211.

    Article  Google Scholar 

  • van der Linden, W. J. (2005). Linear models for optimal test design. New York: Springer.

    Google Scholar 

  • van der Linden, W. J., & Boekkooi-Timminga, E. (1989). A maximin model for test design with practical constraints. Psychometrika, 54, 237–247.

    Article  Google Scholar 

  • van der Linden, W. J., & Hambleton, R. K. (1997). Handbook of modern item response theory. New York: Springer.

    Book  Google Scholar 

  • Veldkamp, B. P. (2002). Multidimensional constrained test assembly. Applied Psychological Measurement, 26, 133–146.

    Article  Google Scholar 

  • Veldkamp, B. P. (1999). Multi-objective test assembly problems. Journal of Educational Measurement, 36, 253–266.

    Article  Google Scholar 

  • Veldkamp, B. P., Matteucci, M., & de Jong, M. (2012, submitted). Uncertainties in the item parameter estimates and automated test assembly.

Download references

Acknowledgements

This study received funding from the Law School Admission Council (LSAC). The opinions and conclusions contained in this report are those of the author and do not necessarily reflect the position or policy of LSAC.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bernard P. Veldkamp.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Veldkamp, B.P. Application of robust optimization to automated test assembly. Ann Oper Res 206, 595–610 (2013). https://doi.org/10.1007/s10479-012-1218-y

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10479-012-1218-y

Keywords

Navigation