Abstract
This study proposes a pipeline, which is based on deep reinforcement learning, aims to solve the multi-objective problem (MOP) on efficiency and quality in manufacturing. The rapid development in the area of artificial intelligent has caused a series of reactions that stirred the traditional manufacturing, pushing for the better machining quality and higher productivity. Despite all this, there has been very little research applying reinforcement learning to solve practical problems in milling process. The proposed pipeline is a two-step algorithm and makes full use of double deep Q network (DDQN) to settle the MOP of milling parameters. Firstly, surface roughness (Ra) and material removal rate (MRR) are selected as quality and efficiency indicators, respectively. In specific, the reliable prediction model of Ra is constructed on a small batch raw data via DDQN improved support vector regression (DDQN-SVR) rather than sophisticated and complex physical modeling. The MRR model is constructed by an accepted empirical formula. Then, DDQN is employed again to solve the MOP of satisfying minimum Ra and maximum MRR and compared to other accepted algorithms. Eventually, the optimal combination of machining parameters determined by entropy method was validated by experiment.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Attar H, Ehtemam-Haghighi S, Kent D, Dargusch MS (2018) Recent developments and opportunities in additive manufacturing of titanium-based matrix composites: a review. Int J Mach Tools Manuf 133:85–102
Lu X, Zhang H, Jia Z, Feng Y, Liang SY (2018) Cutting parameters optimization for MRR under the constraints of surface roughness and cutter breakage in micro-milling process. J Mech Sci Technol 32:3379–3388
Wu D, Wang H, Zhang K, Zhao B, Lin X (2020) Research on adaptive CNC machining arithmetic and process for near-net-shaped jet engine blade. J Intell Manuf 31:717–744
Zhu L, Li H, Yang J, Wang WS (2012) Research on theoretical modeling of 3D chip of orthogonal turn-milling. Dongbei Daxue Xuebao/Journal of Northeastern University 33:111–115
Bakhtiari H, Karimi M, Rezazadeh S (2016) Modeling, analysis and multi-objective optimization of twist extrusion process using predictive models and meta-heuristic approaches, based on finite element results. J Intell Manuf 27:463–473
Lu J, Liao X, Li S, Ouyang H, Chen K, Huang B (2019) An effective ABC-SVM approach for surface roughness prediction in manufacturing processes. Complexity 2019:1–13
Xiao Z, Liao X, Long Z, Li M (2017) Effect of cutting parameters on surface roughness using orthogonal array in hard turning of AISI 1045 steel with YT5 tool. Int J Adv Manuf Technol 93:273–282
Tangjitsitcharoen S, Thesniyom P, Ratanakuakangwan S (2017) Prediction of surface roughness in ball-end milling process by utilizing dynamic cutting force ratio. J Intell Manuf 28:13–21
Mumtaz J, Li Z, Imran M, Yue L, Jahanzaib M, Sarfraz S, Shehab E, Ismail SO, Afzal K (2019) Multi-objective optimisation for minimum quantity lubrication assisted milling process based on hybrid response surface methodology and multi-objective genetic algorithm. Adv Mech Eng 11
Soepangkat B, Norcahyo R, Pramujati B, Wahid M (2019) Multi-objective optimization in face milling process with cryogenic cooling using grey fuzzy analysis and BPNN-GA methods, engineering computations, ahead-of-print
Sugumaran V (2013) Developing Gaussian process model to predict the surface roughness in boring operation. International Journal of Engineering Trends and Technology 4:219–223
Zhang GJ, Li J, Chen Y, Huang Y, Shao XY, Li MZ (2014) Prediction of surface roughness in end face milling based on Gaussian process regression and cause analysis considering tool vibration. Int J Adv Manuf Tech 75:1357–1370
Aich U, Banerjee S (2014) Modeling of EDM responses by support vector machine regression with parameters selected by particle swarm optimization. Appl Math Model 38:2800–2818
Cao WD, Liu X, Ni JJ (2020) Parameter optimization of support vector regression using Henry gas solubility optimization algorithm. Ieee Access 8:88633–88642
Lela B, Bajic D, Jozic S (2009) Regression analysis, support vector machines, and Bayesian neural network approaches to modeling surface roughness in face milling. Int J Adv Manuf Tech 42:1082–1088
Zuperl U, Cus F (2012) System for off-line feedrate optimization and neural force control in end milling. International Journal of Adaptive Control and Signal Processing 26:105–123
Saravanan R, Asokan P, Sachidanandam M (2002) A multi-objective genetic algorithm (GA) approach for optimization of surface grinding operations. Int J Mach Tools Manuf 42:1327–1334
Warsi SS, Agha MH, Ahmad R, Jaffery SHI, Khan M (2019) Sustainable turning using multi-objective optimization: a study of Al 6061 T6 at high cutting speeds. Int J Adv Manuf Technol 100:843–855
Zhang Y, Wang G-G, Li K, Yeh W-C, Jian M, Dong J (2020) Enhancing MOEA/D with information feedback models for large-scale many-objective optimization. Inf Sci 522:1–16
Cheng R, Jin Y, Olhofer M, Sendhoff B (2016) A reference vector guided evolutionary algorithm for many-objective optimization. IEEE T Evolut Comput 20:773–791
Trivedi A, Srinivasan D, Sanyal K, Ghosh A (2016) A survey of multiobjective evolutionary algorithms based on decomposition. IEEE T Evolut Comput 21:440–462
Acherjee B, Maity D, Kuar A (2020) Optimization of correlated and conflicting responses of ECM process using flower pollination algorithm. International Journal of Applied Metaheuristic Computing 11:1–15
Ghosh T, Wang Y, Martinsen K, Wang K (2020) A surrogate-assisted optimization approach for multi-response end milling of aluminum alloy AA3105. Int J Adv Manuf Technol 111:2419–2439
Naik S, Das SR, Dhupal D (2020) Experimental investigation, predictive modeling, parametric optimization and cost analysis in electrical discharge machining of Al-SiC metal matrix composite. Silicon 2020:1–24
Bhavsar SN, Aravindan S, Rao PV (2015) Investigating material removal rate and surface roughness using multi-objective optimization for focused ion beam (FIB) micro-milling of cemented carbide. Precis Eng-J Int Soc Precis Eng Nanotechnol 40:131–138
Ishibuchi H, Setoguchi Y, Masuda H, Nojima Y (2017) Performance of decomposition-based many-objective algorithms strongly depends on Pareto front shapes. IEEE T Evolut Comput 21:169–190
Du W, Ding S (2021) A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications. Artif Intell Rev 54:3215–3238
Levine S, Pastor P, Krizhevsky A, Ibarz J, Quillen D (2018) Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection. Int J Robot Res 37:421–436
Li X, Serlin Z, Yang G, Belta C (2019) A formal methods approach to interpretable reinforcement learning for robotic planning. Science Robotics 4:eaay6276
Silver D, Huang A, Maddison CJ, Guez A, Sifre L, van den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M, Dieleman S, Grewe D, Nham J, Kalchbrenner N, Sutskever I, Lillicrap T, Leach M, Kavukcuoglu K, Graepel T, Hassabis D (2016) Mastering the game of Go with deep neural networks and tree search. Nature 529:484−+
Silver D, Schrittwieser J, Simonyan K, Antonoglou I, Huang A, Guez A, Hubert T, Baker L, Lai M, Bolton A, Chen Y, Lillicrap T, Hui F, Sifre L, Driessche G, Graepel T, Hassabis D (2017) Mastering the game of go without human knowledge. Nature 550:354–359
Ding S, Zhao X, Xu X, Sun T, Jia W (2019) An effective asynchronous framework for small scale reinforcement learning problems. Appl Intell 49:4303–4318
Li J, Monroe W, Ritter A (2016) D. Jurafsky. Deep Reinforcement Learning for Dialogue Generation
B. Dhingra, L. Li, X. Li, J. Gao, Y.-N. Chen, F. Ahmed, L Deng, Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access, 2017
Watkins JCH, Dayan P (1992) Q-learning. Mach Learn 8:279–292
Mnih V, Kavukcuoglu K, Silver D, Graves A, Antonoglou I, Wierstra D, Riedmiller M (2013) Playing Atari with deep reinforcement learning. Comput Sci 2013:1–9
Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G, Petersen S, Beattie C, Sadik A, Antonoglou I, King H, Kumaran D, Wierstra D, Legg S, Hassabis D (2015) Human-level control through deep reinforcement learning. Nature 518:529–533
H. Van Hasselt, A. Guez, D. Silver, Deep Reinforcement Learning with Double Q-learning, (2015)
Z. Wang, N. Freitas, M. Lanctot, Dueling network architectures for deep reinforcement learning, (2015) 1995–2003
Hasan MM, Lwin K, Imani M, Shabut A, Bittencourt LF, Hossain MA (2019) Dynamic multi-objective optimisation using deep reinforcement learning: benchmark, algorithm and an application to identify vulnerable zones based on water quality. Eng Appl Artif Intell 86:107–135
Du W, Ding S, Zhang C, Du S (2021) Modified action decoder using Bayesian reasoning for multi-agent deep reinforcement learning. Int J Mach Learn Cybern 12:2947–2961
Li K, Zhang T, Wang R (2020) Deep reinforcement learning for multi-objective optimization. IEEE Transactions on Cybernetics 2020:1–12
Lu R, Li Y-C, Li Y, Jiang J, Ding Y (2020) Multi-agent deep reinforcement learning based demand response for discrete manufacturing systems energy management. Appl Energy 276:115473
W. Gang, Z. Mianhao, Optimization of cutting parameters in machining surface to reduce errors, 2011
C.C. Chang, C.J. Lin, LIBSVM: a library for support vector machines, Acm T Intel Syst Tec, 2 (2011), LIBSVM
Vapnik V (1995) The nature of statistical learning theory
Brereton RG, Lloyd GR (2010) Support vector machines for classification and regression. Analyst 135:230–267
Han F, Li L, Cai W, Li C, Deng X, Sutherland JW (2020) Parameters optimization considering the trade-off between cutting power and MRR based on linear decreasing particle swarm algorithm in milling. J Clean Prod 262:121388
Moreira LC, Li WD, Lu X, Fitzpatrick ME (2019) Energy-efficient machining process analysis and optimisation based on BS EN24T alloy steel as case studies. Robot Comput Integr Manuf 58:1–12
Cherkassky V, Ma YQ (2004) Practical selection of SVM parameters and noise estimation for SVM regression. Neural Netw 17:113–126
Levis AA, Papageorgiou LG (2005) Customer demand forecasting via support vector regression analysis. Chem Eng Res Des 83:1009–1018
Gupta AK, Guntuku SC, Desu RK, Balu A (2015) Optimisation of turning parameters by integrating genetic algorithm with support vector regression and artificial neural networks. Int J Adv Manuf Technol 77:331–339
Deb K, Jain H (2014) An evolutionary many-objective optimization algorithm using reference-point-based nondominated sorting approach, part I: solving problems with box constraints. IEEE T Evolut Comput 18:577–601
Hou Y, Wu N, Li Z, Zhang Y, Qu T, Zhu Q (2020) Many-objective optimization for scheduling of crude oil operations based on NSGA-III with consideration of energy efficiency. Swarm and Evolutionary Computation 57:100714
Lowe R, Wu Y, Tamar A, Harb J, Abbeel P (2017) I. Mordatch. Multi-agent actor-critic for mixed cooperative-competitive environments
Lei W, Wen H, Wu J, Hou W (2021) MADDPG-based security situational awareness for smart grid with intelligent edge. Appl Sci 11:3101
Behnamian J, Zandieh M, Ghomi SMTF (2010) A multi-phase covering Pareto-optimal front method to multi-objective parallel machine scheduling. Int J Prod Res 48:4949–4976
Acknowledgments
This research is supported by the National Natural Science Foundation of China (NSFC) (Grant No. 51665005 and 52165062), Natural Science Foundation of Guangxi Province (Grant No. 2020JJD160004 and 2019JJB160048), and Middle-aged and Young Teachers’ Basic Ability Promotion Project of Guangxi (Grant No. 2020KY10014).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wang, Z., Lu, J., Chen, C. et al. Investigating the multi-objective optimization of quality and efficiency using deep reinforcement learning. Appl Intell 52, 12873–12887 (2022). https://doi.org/10.1007/s10489-022-03326-5
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-03326-5