A Cascaded Deep Neural Network for Position Estimation of Industrial Robots

Lin, Weiyang; Ye, Chao; Zhou, Jiaoju; Ren, Xinyang; Tong, Mingsi

doi:10.1007/978-3-030-77939-9_5

Part of the book series: Studies in Computational Intelligence (SCI, volume 984)

Abstract

Estimating an object’s position and orientation from images plays an important role in industrial robotics and visual servoing, and the performance of a vision control system depends heavily on its image processing model and algorithm. Before deep learning was widely adopted in computer vision, traditional image processing methods handled low-dimensional image features successfully but tended to fail on complex images with high-dimensional feature information. The main contribution of this chapter is a cascaded convolutional network that obtains high-precision pose estimates. A Single Shot MultiBox Detector (SSD) first obtains the bounding box of the object to narrow the recognition range, and a convolutional neural network then detects the orientation of the object. The method is designed for industrial detection tasks, so the optimized method can run in real time and extract weak features from sample images. To verify the effect of the deep-learning-based detection method in an industrial system, a hand-eye system is built for detecting a Radio Remote Unit. A series of experiments is carried out on the system with both the proposed method and the traditional method. In general, the proposed method shows advantages in accuracy and recognition rate over the traditional algorithm.
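The two-stage structure described in the abstract (a detector that narrows the search to a bounding box, followed by a second model that estimates orientation inside that box) can be illustrated with a minimal sketch. This is not the chapter's implementation: the names `cascade_pose`, `threshold_detector`, and `moment_angle` are hypothetical, the SSD stage is replaced by a simple intensity threshold, and the orientation network is replaced by a classical image-moment estimate, purely so that the data flow of the cascade can be run end to end.

```python
import math
from dataclasses import dataclass
from typing import Callable, List, Tuple

Image = List[List[float]]        # grayscale image as rows of pixel intensities
Box = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max), max exclusive


@dataclass
class Pose:
    cx: float         # bounding-box centre, x (pixels)
    cy: float         # bounding-box centre, y (pixels)
    angle_deg: float  # in-plane orientation (degrees)


def crop(image: Image, box: Box) -> Image:
    """Return the sub-image inside `box`, clamped to the image bounds."""
    height, width = len(image), len(image[0])
    x0, y0, x1, y1 = box
    x0, y0 = max(0, x0), max(0, y0)
    x1, y1 = min(width, x1), min(height, y1)
    return [row[x0:x1] for row in image[y0:y1]]


def cascade_pose(image: Image,
                 detect: Callable[[Image], Box],
                 estimate_angle: Callable[[Image], float]) -> Pose:
    """Stage 1 localises the object; stage 2 sees only the cropped region."""
    box = detect(image)
    angle = estimate_angle(crop(image, box))
    x0, y0, x1, y1 = box
    return Pose((x0 + x1) / 2, (y0 + y1) / 2, angle)


def threshold_detector(image: Image, level: float = 0.5) -> Box:
    """Hypothetical stand-in for the SSD stage: box around bright pixels."""
    xs = [x for row in image for x, v in enumerate(row) if v > level]
    ys = [y for y, row in enumerate(image) if any(v > level for v in row)]
    if not xs:
        raise ValueError("no object found")
    return (min(xs), min(ys), max(xs) + 1, max(ys) + 1)


def moment_angle(roi: Image) -> float:
    """Hypothetical stand-in for the orientation CNN: principal-axis angle
    from second-order central image moments."""
    cells = [(x, y, v) for y, row in enumerate(roi) for x, v in enumerate(row)]
    total = sum(v for _, _, v in cells)
    if total == 0:
        raise ValueError("empty region")
    mx = sum(x * v for x, _, v in cells) / total
    my = sum(y * v for _, y, v in cells) / total
    mu20 = sum((x - mx) ** 2 * v for x, _, v in cells)
    mu02 = sum((y - my) ** 2 * v for _, y, v in cells)
    mu11 = sum((x - mx) * (y - my) * v for x, y, v in cells)
    return math.degrees(0.5 * math.atan2(2 * mu11, mu20 - mu02))


# Usage: a 6x8 image containing a horizontal bar on row 2, columns 1..5.
image = [[0.0] * 8 for _ in range(6)]
for x in range(1, 6):
    image[2][x] = 1.0
pose = cascade_pose(image, threshold_detector, moment_angle)
print(pose)
```

Because the two stages are passed in as callables, either stand-in could be swapped for a trained model without changing `cascade_pose`. The point of the sketch is only that the second stage operates on the cropped region rather than the full frame.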

Author information

Authors and Affiliations

Harbin Institute of Technology, The Research of Intelligent Control and System, Mechanical Building, 92 Xi Da Zhi Street, Nan Gang District, Harbin, China
Weiyang Lin, Chao Ye, Jiaoju Zhou, Xinyang Ren & Mingsi Tong

Corresponding author

Correspondence to Weiyang Lin.

Editor information

Editors and Affiliations

College of Computer and Information Sciences, Prince Sultan University, Riyadh, Saudi Arabia
Anis Koubaa
College of Computer and Information Sciences, Prince Sultan University, Riyadh, Saudi Arabia
Ahmad Taher Azar

About this chapter

Cite this chapter

Lin, W., Ye, C., Zhou, J., Ren, X., Tong, M. (2021). A Cascaded Deep Neural Network for Position Estimation of Industrial Robots. In: Koubaa, A., Azar, A.T. (eds) Deep Learning for Unmanned Systems. Studies in Computational Intelligence, vol 984. Springer, Cham. https://doi.org/10.1007/978-3-030-77939-9_5

DOI: https://doi.org/10.1007/978-3-030-77939-9_5
Published: 02 October 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-77938-2
Online ISBN: 978-3-030-77939-9
eBook Packages: Intelligent Technologies and Robotics (R0)
