Discounted Sampling Policy Gradient for Robot Multi-objective Visual Control

  • Conference paper
Evolutionary Multi-Criterion Optimization (EMO 2021)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 12654)

Abstract

Robot visual control often involves multiple objectives such as achieving high efficiency, maintaining stability, and avoiding failure. This paper proposes a novel Vision-Based Control (VBC) method that combines the Discounted Sampling Policy Gradient (DSPG) with Cosine Annealing (CA) to achieve strong multi-objective control performance. In the proposed visual control framework, a DSPG learning agent learns a policy that estimates continuous kinematics for VBC. The deep policy maps visual observations to actions in an end-to-end manner. Using shaped rewards from the environment, the DSPG agent updates the policy toward an optimal or near-optimal solution. The proposed VBC-DSPG model is optimized using a heuristic method. Experimental results show that the proposed method compares favorably with several classical competitors in the multi-objective visual control scenario.
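The abstract names two generic ingredients, cosine annealing and discounted rewards in a policy-gradient setting, without giving their formulas. The sketch below illustrates only the textbook versions of both: a cosine-annealed learning rate and per-step discounted returns for one episode. It is not the authors' DSPG algorithm; the function names, default learning rates, and discount factor are assumptions chosen for illustration.

```python
import math


def cosine_annealing(step, total_steps, lr_max=1e-3, lr_min=1e-5):
    """Textbook cosine schedule: lr_max at step 0, lr_min at total_steps.

    lr_max and lr_min are illustrative defaults, not values from the paper.
    """
    # Clamp the step so the schedule stays at lr_min after total_steps.
    progress = min(max(step, 0), total_steps) / total_steps
    return lr_min + 0.5 * (lr_max - lr_min) * (1.0 + math.cos(math.pi * progress))


def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode.

    gamma is an assumed discount factor, not a value from the paper.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each return reuses the one after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns
```

In a standard policy-gradient loop, the returns would weight the log-probabilities of the sampled actions, and the schedule would set the optimizer's step size at each update. How DSPG samples or discounts beyond this is not stated in the abstract.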

This work was supported by the National Key Research and Development Project, Ministry of Science and Technology, China (Grant No. 2018AAA0101301), and the National Natural Science Foundation of China (Grant No. 61876163), and in part by the Science and Technology Innovation Committee Foundation of Shenzhen (Grant No. JCYJ20200109143223052) and the Hong Kong Research Grant Council (GRF 11200220).

Corresponding author

Correspondence to Meng Xu.

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Cite this paper

Xu, M., Zhang, Q., Wang, J. (2021). Discounted Sampling Policy Gradient for Robot Multi-objective Visual Control. In: Ishibuchi, H., et al. Evolutionary Multi-Criterion Optimization. EMO 2021. Lecture Notes in Computer Science, vol 12654. Springer, Cham. https://doi.org/10.1007/978-3-030-72062-9_35

  • DOI: https://doi.org/10.1007/978-3-030-72062-9_35

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-72061-2

  • Online ISBN: 978-3-030-72062-9

  • eBook Packages: Computer Science, Computer Science (R0)
