A Cascaded Deep Neural Network for Position Estimation of Industrial Robots

  • Chapter
  • First Online:
Deep Learning for Unmanned Systems

Part of the book series: Studies in Computational Intelligence ((SCI,volume 984))

  • 1701 Accesses

Abstract

The estimation of an object’s position and orientation from images plays an important role in the field of industrial robots and visual servo, the performance of the vision control system is deeply dependent on the image processing model and algorithm. Before deep learning is widely used in computer vision, the traditional image processing methods are successful in handling the low dimension information of image features, but the traditional image processing methods always fail in complex images with high dimension feature information. In this research chapter, our main contribution is to propose a cascaded convolution network that could obtain high precision pose estimates. Where Single Shot MultiBox Detector (SSD) is utilized to obtain the bounding box of the object to narrow down the recognition range. And a convolutional neural network is utilized to detect the orientation of the object. The method is designed for industrial detection tasks, so the optimized method can run in real-time and extract weak features of sample images. To verify the effect of the detection method based on deep learning in the industrial system, a hand-eye system is built for detecting Radio Remote Unit. A series of experiments have been carried out on the system with the proposed method and the traditional method. In general, the proposed method has advantages in accuracy and recognition rate compared with the traditional algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Adams S, Nolte L (1975) Bayes optimum array detection of targets of known location. J Acoust Soc Am 58(3):656–669

    Article  Google Scholar 

  2. Bhasin M, Raghava G Eslpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and psi-blast. Nucleic Acids Res 32(suppl\_2):W414–W419

    Google Scholar 

  3. Cao W, Wang X, Ming Z, Gao J (2018) A review on neural networks with random weights. Neurocomputing 275:278–287

    Article  Google Scholar 

  4. Cortes C, Vapnik V (1995) Support-vector networks. Machine Learn 20(3):273–297

    Google Scholar 

  5. Fan Y, Levine MD, Wen G, Qiu S (2017) A deep neural network for real-time detection of falling humans in naturally occurring scenes. Neurocomputing 260:43–58

    Article  Google Scholar 

  6. Fang S, Huang X, Chen H, ** N (2016) Dual-arm robot assembly system for 3c product based on vision guidance. In: 2016 IEEE international conference on robotics and biomimetics (ROBIO). IEEE, pp 807–812

    Google Scholar 

  7. Gall J, Yao A, Razavi N, Van Gool L, Lempitsky V (2011) Hough forests for object detection, tracking, and action recognition. IEEE Trans Pattern Anal Mach Intell 33(11):2188–2202

    Article  Google Scholar 

  8. Girshick R (2015) Fast R-CNN. In: Proceedings of the IEEE international conference on computer vision, pp 1440–1448

    Google Scholar 

  9. Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587

    Google Scholar 

  10. Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press

    Google Scholar 

  11. Gu J, Wang Z, Kuen J, Ma L, Shahroudy A, Shuai B, Liu T, Wang X, Wang G, Cai J et al (2018) Recent advances in convolutional neural networks. Pattern Recogn 77:354–377

    Article  Google Scholar 

  12. Gutierrez-Galan D, Dominguez-Morales JP, Cerezuela-Escudero E, Rios-Navarro A, Tapiador-Morales R, Rivas-Perez M, Dominguez-Morales M, Jimenez-Fernandez A, Linares-Barranco A (2018) Embedded neural network for real-time animal behavior classification. Neurocomputing 272:17–26

    Article  Google Scholar 

  13. He K, Gkioxari G, Dollár P, Girshick R (2017) Mask R-CNN. In: Proceedings of the IEEE international conference on computer vision, pp 2961–2969

    Google Scholar 

  14. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

    Google Scholar 

  15. He T, Li X, Jiang Y (2014) Improved ht object detection algorithm based on canny edge operator. J Multimedia 9(9):1089

    Article  Google Scholar 

  16. Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. ar**v:1704.04861

  17. Hu J, Wang Z, Alsaadi FE, Hayat T (2017) Event-based filtering for time-varying nonlinear systems subject to multiple missing measurements with uncertain missing probabilities. Inf Fusion 38:74–83

    Article  Google Scholar 

  18. Hu J, Wang Z, Liu S, Gao H (2016) A variance-constrained approach to recursive state estimation for time-varying complex networks with missing measurements. Automatica 64:155–162

    Article  MathSciNet  Google Scholar 

  19. Hu J, Wang Z, Shen B, Gao H (2013) Quantised recursive filtering for a class of nonlinear systems with multiplicative noises and missing measurements. Int J Control 86(4):650–663

    Article  MathSciNet  Google Scholar 

  20. Hui-jiang W (2016) Edge detection of axis parts via combining canny operator with hough transformation. Mech Electr Eng Technol 9:7

    Google Scholar 

  21. Jeong J, Park H, Kwak N (2017) Enhancement of ssd by concatenating feature maps for object detection. ar**v:1705.09587

  22. Jiang LJ, Wang GR (2005) A stereovision method for resuming the position and pose of symmetry revolution object. J Eng Graph 5:17

    Google Scholar 

  23. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105

    Google Scholar 

  24. LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444

    Article  Google Scholar 

  25. LeCun Y, Boser BE, Denker JS, Henderson D, Howard RE, Hubbard WE, Jackel LD (1990) Handwritten digit recognition with a back-propagation network. In: Advances in neural information processing systems, pp 396–404

    Google Scholar 

  26. LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324

    Article  Google Scholar 

  27. Li H, Lin Z, Shen X, Brandt J, Hua G (2015) A convolutional neural network cascade for face detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 325–5334

    Google Scholar 

  28. Lin TY, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980–2988

    Google Scholar 

  29. Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) SSD: single shot multibox detector. In: European conference on computer vision. Springer, pp 21–37

    Google Scholar 

  30. Ma J, Lu Y, Li L, Lu X, He F, Zhong Z (2017) Edge location method based on canny operator and improved hough transform. Autom Inf Eng 38(03):32–36

    Google Scholar 

  31. Ng AY (2004) Feature selection, l1 versus l2 regularization, and rotational invariance. In: Proceedings of the twenty-first international conference on machine learning, p 78

    Google Scholar 

  32. Radman A, Zainal N, Suandi SA (2017) Automated segmentation of iris images acquired in an unconstrained environment using hog-svm and growcut. Digital Signal Process 64:60–70

    Article  MathSciNet  Google Scholar 

  33. Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: Unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788

    Google Scholar 

  34. Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99

    Google Scholar 

  35. Rublee E, Rabaud V, Konolige K, Bradski G (2011) ORB: An efficient alternative to sift or surf. In: 2011 International conference on computer vision. IEEE, pp 2564–2571

    Google Scholar 

  36. Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323(6088):533–536

    Article  Google Scholar 

  37. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. ar**v:1409.1556

  38. Spong MW, Vidyasagar M (2008) Robot dynamics and control. Wiley

    Google Scholar 

  39. Stewart D (1965) A platform with six degrees of freedom. Proc Inst Mech Eng 180(1):371–386

    Article  Google Scholar 

  40. Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9

    Google Scholar 

  41. Tavana M, Abtahi AR, Di Caprio D, Poortarigh M (2018) An artificial neural network and bayesian network model for liquidity risk assessment in banking. Neurocomputing 275:2525–2554

    Article  Google Scholar 

  42. Thrun S, Saul LK, Schölkopf B (2004) Advances in neural information processing systems 16: proceedings of the 2003 conference, vol 16. MIT Press

    Google Scholar 

  43. Wang G, **a XG, Chen VC, Fielder R (2004) Detection, location, and imaging of fast moving targets using multifrequency antenna array sar. IEEE Transactions on Aerospace and Electronic Systems 40(1):345–355

    Article  Google Scholar 

  44. Wang L, Guo S, Huang W, Qiao Y (2015) Places205-vggnet models for scene recognition. ar**v:1508.01667

  45. ** J, Zhang JZ (2012) Edge detection from remote sensing images based on canny operator and hough transform. In: Advances in computer science and engineering. Springer, pp 807–814

    Google Scholar 

  46. Zhang J, Dong Y, Wangning H, Donghai L (2017) Hardware and algorithm design of industrial robot for 3c products manufacturing. Mach Des Res 231:2912–2924

    Google Scholar 

  47. Zhang Y, Xu Y, Ding G (2008) License plate character recognition algorithm based on filled function method training bp neural network. In: 2008 Chinese control and decision conference. IEEE, pp 3886–3891

    Google Scholar 

  48. Zhou Z, Zhang J (2007) Object detection and tracking based on adaptive canny operator and gm (1, 1) model. In: 2007 IEEE international conference on Grey systems and intelligent services. IEEE, pp 434–439

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Weiyang Lin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Lin, W., Ye, C., Zhou, J., Ren, X., Tong, M. (2021). A Cascaded Deep Neural Network for Position Estimation of Industrial Robots. In: Koubaa, A., Azar, A.T. (eds) Deep Learning for Unmanned Systems. Studies in Computational Intelligence, vol 984. Springer, Cham. https://doi.org/10.1007/978-3-030-77939-9_5

Download citation

Publish with us

Policies and ethics

Navigation