Investigating the multi-objective optimization of quality and efficiency using deep reinforcement learning

Wang, Zhenhui; Lu, Juan; Chen, Chaoyi; Ma, Junyan; Liao, Xiaoping

doi:10.1007/s10489-022-03326-5

Investigating the multi-objective optimization of quality and efficiency using deep reinforcement learning

Published: 15 February 2022

Volume 52, pages 12873–12887, (2022)
Cite this article

Applied Intelligence Aims and scope Submit manuscript

Zhenhui Wang¹,
Juan Lu^1,2,
Chaoyi Chen¹,
Junyan Ma¹ &
…
Xiaoping Liao¹

1118 Accesses
1 Altmetric
Explore all metrics

Abstract

This study proposes a pipeline, which is based on deep reinforcement learning, aims to solve the multi-objective problem (MOP) on efficiency and quality in manufacturing. The rapid development in the area of artificial intelligent has caused a series of reactions that stirred the traditional manufacturing, pushing for the better machining quality and higher productivity. Despite all this, there has been very little research applying reinforcement learning to solve practical problems in milling process. The proposed pipeline is a two-step algorithm and makes full use of double deep Q network (DDQN) to settle the MOP of milling parameters. Firstly, surface roughness (Ra) and material removal rate (MRR) are selected as quality and efficiency indicators, respectively. In specific, the reliable prediction model of Ra is constructed on a small batch raw data via DDQN improved support vector regression (DDQN-SVR) rather than sophisticated and complex physical modeling. The MRR model is constructed by an accepted empirical formula. Then, DDQN is employed again to solve the MOP of satisfying minimum Ra and maximum MRR and compared to other accepted algorithms. Eventually, the optimal combination of machining parameters determined by entropy method was validated by experiment.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1

Multi-objective optimization enabling CFRP energy-efficient milling based on deep reinforcement learning

Article 18 September 2024

Deep representation learning and reinforcement learning for workpiece setup optimization in CNC milling

Article Open access 29 June 2023

Deep Reinforcement Learning for Continuous Control of Material Thickness

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Attar H, Ehtemam-Haghighi S, Kent D, Dargusch MS (2018) Recent developments and opportunities in additive manufacturing of titanium-based matrix composites: a review. Int J Mach Tools Manuf 133:85–102
Article Google Scholar
Lu X, Zhang H, Jia Z, Feng Y, Liang SY (2018) Cutting parameters optimization for MRR under the constraints of surface roughness and cutter breakage in micro-milling process. J Mech Sci Technol 32:3379–3388
Article Google Scholar
Wu D, Wang H, Zhang K, Zhao B, Lin X (2020) Research on adaptive CNC machining arithmetic and process for near-net-shaped jet engine blade. J Intell Manuf 31:717–744
Article Google Scholar
Zhu L, Li H, Yang J, Wang WS (2012) Research on theoretical modeling of 3D chip of orthogonal turn-milling. Dongbei Daxue Xuebao/Journal of Northeastern University 33:111–115
Google Scholar
Bakhtiari H, Karimi M, Rezazadeh S (2016) Modeling, analysis and multi-objective optimization of twist extrusion process using predictive models and meta-heuristic approaches, based on finite element results. J Intell Manuf 27:463–473
Article Google Scholar
Lu J, Liao X, Li S, Ouyang H, Chen K, Huang B (2019) An effective ABC-SVM approach for surface roughness prediction in manufacturing processes. Complexity 2019:1–13
Article Google Scholar
Xiao Z, Liao X, Long Z, Li M (2017) Effect of cutting parameters on surface roughness using orthogonal array in hard turning of AISI 1045 steel with YT5 tool. Int J Adv Manuf Technol 93:273–282
Article Google Scholar
Tangjitsitcharoen S, Thesniyom P, Ratanakuakangwan S (2017) Prediction of surface roughness in ball-end milling process by utilizing dynamic cutting force ratio. J Intell Manuf 28:13–21
Article Google Scholar
Mumtaz J, Li Z, Imran M, Yue L, Jahanzaib M, Sarfraz S, Shehab E, Ismail SO, Afzal K (2019) Multi-objective optimisation for minimum quantity lubrication assisted milling process based on hybrid response surface methodology and multi-objective genetic algorithm. Adv Mech Eng 11
Soepangkat B, Norcahyo R, Pramujati B, Wahid M (2019) Multi-objective optimization in face milling process with cryogenic cooling using grey fuzzy analysis and BPNN-GA methods, engineering computations, ahead-of-print
Sugumaran V (2013) Developing Gaussian process model to predict the surface roughness in boring operation. International Journal of Engineering Trends and Technology 4:219–223
Google Scholar
Zhang GJ, Li J, Chen Y, Huang Y, Shao XY, Li MZ (2014) Prediction of surface roughness in end face milling based on Gaussian process regression and cause analysis considering tool vibration. Int J Adv Manuf Tech 75:1357–1370
Article Google Scholar
Aich U, Banerjee S (2014) Modeling of EDM responses by support vector machine regression with parameters selected by particle swarm optimization. Appl Math Model 38:2800–2818
Article MATH Google Scholar
Cao WD, Liu X, Ni JJ (2020) Parameter optimization of support vector regression using Henry gas solubility optimization algorithm. Ieee Access 8:88633–88642
Article Google Scholar
Lela B, Bajic D, Jozic S (2009) Regression analysis, support vector machines, and Bayesian neural network approaches to modeling surface roughness in face milling. Int J Adv Manuf Tech 42:1082–1088
Article Google Scholar
Zuperl U, Cus F (2012) System for off-line feedrate optimization and neural force control in end milling. International Journal of Adaptive Control and Signal Processing 26:105–123
Article MATH Google Scholar
Saravanan R, Asokan P, Sachidanandam M (2002) A multi-objective genetic algorithm (GA) approach for optimization of surface grinding operations. Int J Mach Tools Manuf 42:1327–1334
Article Google Scholar
Warsi SS, Agha MH, Ahmad R, Jaffery SHI, Khan M (2019) Sustainable turning using multi-objective optimization: a study of Al 6061 T6 at high cutting speeds. Int J Adv Manuf Technol 100:843–855
Article Google Scholar
Zhang Y, Wang G-G, Li K, Yeh W-C, Jian M, Dong J (2020) Enhancing MOEA/D with information feedback models for large-scale many-objective optimization. Inf Sci 522:1–16
Article MathSciNet MATH Google Scholar
Cheng R, Jin Y, Olhofer M, Sendhoff B (2016) A reference vector guided evolutionary algorithm for many-objective optimization. IEEE T Evolut Comput 20:773–791
Article Google Scholar
Trivedi A, Srinivasan D, Sanyal K, Ghosh A (2016) A survey of multiobjective evolutionary algorithms based on decomposition. IEEE T Evolut Comput 21:440–462
Google Scholar
Acherjee B, Maity D, Kuar A (2020) Optimization of correlated and conflicting responses of ECM process using flower pollination algorithm. International Journal of Applied Metaheuristic Computing 11:1–15
Google Scholar
Ghosh T, Wang Y, Martinsen K, Wang K (2020) A surrogate-assisted optimization approach for multi-response end milling of aluminum alloy AA3105. Int J Adv Manuf Technol 111:2419–2439
Article Google Scholar
Naik S, Das SR, Dhupal D (2020) Experimental investigation, predictive modeling, parametric optimization and cost analysis in electrical discharge machining of Al-SiC metal matrix composite. Silicon 2020:1–24
Google Scholar
Bhavsar SN, Aravindan S, Rao PV (2015) Investigating material removal rate and surface roughness using multi-objective optimization for focused ion beam (FIB) micro-milling of cemented carbide. Precis Eng-J Int Soc Precis Eng Nanotechnol 40:131–138
Google Scholar
Ishibuchi H, Setoguchi Y, Masuda H, Nojima Y (2017) Performance of decomposition-based many-objective algorithms strongly depends on Pareto front shapes. IEEE T Evolut Comput 21:169–190
Article Google Scholar
Du W, Ding S (2021) A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications. Artif Intell Rev 54:3215–3238
Article Google Scholar
Levine S, Pastor P, Krizhevsky A, Ibarz J, Quillen D (2018) Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection. Int J Robot Res 37:421–436
Article Google Scholar
Li X, Serlin Z, Yang G, Belta C (2019) A formal methods approach to interpretable reinforcement learning for robotic planning. Science Robotics 4:eaay6276
Article Google Scholar
Silver D, Huang A, Maddison CJ, Guez A, Sifre L, van den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M, Dieleman S, Grewe D, Nham J, Kalchbrenner N, Sutskever I, Lillicrap T, Leach M, Kavukcuoglu K, Graepel T, Hassabis D (2016) Mastering the game of Go with deep neural networks and tree search. Nature 529:484−+
Article Google Scholar
Silver D, Schrittwieser J, Simonyan K, Antonoglou I, Huang A, Guez A, Hubert T, Baker L, Lai M, Bolton A, Chen Y, Lillicrap T, Hui F, Sifre L, Driessche G, Graepel T, Hassabis D (2017) Mastering the game of go without human knowledge. Nature 550:354–359
Article Google Scholar
Ding S, Zhao X, Xu X, Sun T, Jia W (2019) An effective asynchronous framework for small scale reinforcement learning problems. Appl Intell 49:4303–4318
Article Google Scholar
Li J, Monroe W, Ritter A (2016) D. Jurafsky. Deep Reinforcement Learning for Dialogue Generation
B. Dhingra, L. Li, X. Li, J. Gao, Y.-N. Chen, F. Ahmed, L Deng, Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access, 2017
Book Google Scholar
Watkins JCH, Dayan P (1992) Q-learning. Mach Learn 8:279–292
Article MATH Google Scholar
Mnih V, Kavukcuoglu K, Silver D, Graves A, Antonoglou I, Wierstra D, Riedmiller M (2013) Playing Atari with deep reinforcement learning. Comput Sci 2013:1–9
Google Scholar
Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G, Petersen S, Beattie C, Sadik A, Antonoglou I, King H, Kumaran D, Wierstra D, Legg S, Hassabis D (2015) Human-level control through deep reinforcement learning. Nature 518:529–533
Article Google Scholar
H. Van Hasselt, A. Guez, D. Silver, Deep Reinforcement Learning with Double Q-learning, (2015)
Z. Wang, N. Freitas, M. Lanctot, Dueling network architectures for deep reinforcement learning, (2015) 1995–2003
Google Scholar
Hasan MM, Lwin K, Imani M, Shabut A, Bittencourt LF, Hossain MA (2019) Dynamic multi-objective optimisation using deep reinforcement learning: benchmark, algorithm and an application to identify vulnerable zones based on water quality. Eng Appl Artif Intell 86:107–135
Article Google Scholar
Du W, Ding S, Zhang C, Du S (2021) Modified action decoder using Bayesian reasoning for multi-agent deep reinforcement learning. Int J Mach Learn Cybern 12:2947–2961
Article Google Scholar
Li K, Zhang T, Wang R (2020) Deep reinforcement learning for multi-objective optimization. IEEE Transactions on Cybernetics 2020:1–12
Google Scholar
Lu R, Li Y-C, Li Y, Jiang J, Ding Y (2020) Multi-agent deep reinforcement learning based demand response for discrete manufacturing systems energy management. Appl Energy 276:115473
Article Google Scholar
W. Gang, Z. Mianhao, Optimization of cutting parameters in machining surface to reduce errors, 2011
Book Google Scholar
C.C. Chang, C.J. Lin, LIBSVM: a library for support vector machines, Acm T Intel Syst Tec, 2 (2011), LIBSVM
Vapnik V (1995) The nature of statistical learning theory
Brereton RG, Lloyd GR (2010) Support vector machines for classification and regression. Analyst 135:230–267
Article Google Scholar
Han F, Li L, Cai W, Li C, Deng X, Sutherland JW (2020) Parameters optimization considering the trade-off between cutting power and MRR based on linear decreasing particle swarm algorithm in milling. J Clean Prod 262:121388
Article Google Scholar
Moreira LC, Li WD, Lu X, Fitzpatrick ME (2019) Energy-efficient machining process analysis and optimisation based on BS EN24T alloy steel as case studies. Robot Comput Integr Manuf 58:1–12
Article Google Scholar
Cherkassky V, Ma YQ (2004) Practical selection of SVM parameters and noise estimation for SVM regression. Neural Netw 17:113–126
Article MATH Google Scholar
Levis AA, Papageorgiou LG (2005) Customer demand forecasting via support vector regression analysis. Chem Eng Res Des 83:1009–1018
Article Google Scholar
Gupta AK, Guntuku SC, Desu RK, Balu A (2015) Optimisation of turning parameters by integrating genetic algorithm with support vector regression and artificial neural networks. Int J Adv Manuf Technol 77:331–339
Article Google Scholar
Deb K, Jain H (2014) An evolutionary many-objective optimization algorithm using reference-point-based nondominated sorting approach, part I: solving problems with box constraints. IEEE T Evolut Comput 18:577–601
Article Google Scholar
Hou Y, Wu N, Li Z, Zhang Y, Qu T, Zhu Q (2020) Many-objective optimization for scheduling of crude oil operations based on NSGA-III with consideration of energy efficiency. Swarm and Evolutionary Computation 57:100714
Article Google Scholar
Lowe R, Wu Y, Tamar A, Harb J, Abbeel P (2017) I. Mordatch. Multi-agent actor-critic for mixed cooperative-competitive environments
Lei W, Wen H, Wu J, Hou W (2021) MADDPG-based security situational awareness for smart grid with intelligent edge. Appl Sci 11:3101
Article Google Scholar
Behnamian J, Zandieh M, Ghomi SMTF (2010) A multi-phase covering Pareto-optimal front method to multi-objective parallel machine scheduling. Int J Prod Res 48:4949–4976
Article MATH Google Scholar

Download references

Acknowledgments

This research is supported by the National Natural Science Foundation of China (NSFC) (Grant No. 51665005 and 52165062), Natural Science Foundation of Guangxi Province (Grant No. 2020JJD160004 and 2019JJB160048), and Middle-aged and Young Teachers’ Basic Ability Promotion Project of Guangxi (Grant No. 2020KY10014).

Author information

Authors and Affiliations

Guangxi Key Laboratory of Manufacturing Systems and Advance Manufacturing Technology, Guangxi University, Nanning, 535004, China
Zhenhui Wang, Juan Lu, Chaoyi Chen, Junyan Ma & Xiaoping Liao
Department of Mechanical and Marine Engineering, Beibu Gulf University, Qinzhou, 535011, China
Juan Lu

Authors

Zhenhui Wang
View author publications
You can also search for this author in PubMed Google Scholar
Juan Lu
View author publications
You can also search for this author in PubMed Google Scholar
Chaoyi Chen
View author publications
You can also search for this author in PubMed Google Scholar
Junyan Ma
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoping Liao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiaoping Liao.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, Z., Lu, J., Chen, C. et al. Investigating the multi-objective optimization of quality and efficiency using deep reinforcement learning. Appl Intell 52, 12873–12887 (2022). https://doi.org/10.1007/s10489-022-03326-5

Download citation

Accepted: 25 January 2022
Published: 15 February 2022
Issue Date: September 2022
DOI: https://doi.org/10.1007/s10489-022-03326-5

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Institutional subscriptions

Investigating the multi-objective optimization of quality and efficiency using deep reinforcement learning

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Multi-objective optimization enabling CFRP energy-efficient milling based on deep reinforcement learning

Deep representation learning and reinforcement learning for workpiece setup optimization in CNC milling

Deep Reinforcement Learning for Continuous Control of Material Thickness

Explore related subjects

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now