Abstract
The estimation of an object’s position and orientation from images plays an important role in the field of industrial robots and visual servo, the performance of the vision control system is deeply dependent on the image processing model and algorithm. Before deep learning is widely used in computer vision, the traditional image processing methods are successful in handling the low dimension information of image features, but the traditional image processing methods always fail in complex images with high dimension feature information. In this research chapter, our main contribution is to propose a cascaded convolution network that could obtain high precision pose estimates. Where Single Shot MultiBox Detector (SSD) is utilized to obtain the bounding box of the object to narrow down the recognition range. And a convolutional neural network is utilized to detect the orientation of the object. The method is designed for industrial detection tasks, so the optimized method can run in real-time and extract weak features of sample images. To verify the effect of the detection method based on deep learning in the industrial system, a hand-eye system is built for detecting Radio Remote Unit. A series of experiments have been carried out on the system with the proposed method and the traditional method. In general, the proposed method has advantages in accuracy and recognition rate compared with the traditional algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Adams S, Nolte L (1975) Bayes optimum array detection of targets of known location. J Acoust Soc Am 58(3):656–669
Bhasin M, Raghava G Eslpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and psi-blast. Nucleic Acids Res 32(suppl\_2):W414–W419
Cao W, Wang X, Ming Z, Gao J (2018) A review on neural networks with random weights. Neurocomputing 275:278–287
Cortes C, Vapnik V (1995) Support-vector networks. Machine Learn 20(3):273–297
Fan Y, Levine MD, Wen G, Qiu S (2017) A deep neural network for real-time detection of falling humans in naturally occurring scenes. Neurocomputing 260:43–58
Fang S, Huang X, Chen H, ** N (2016) Dual-arm robot assembly system for 3c product based on vision guidance. In: 2016 IEEE international conference on robotics and biomimetics (ROBIO). IEEE, pp 807–812
Gall J, Yao A, Razavi N, Van Gool L, Lempitsky V (2011) Hough forests for object detection, tracking, and action recognition. IEEE Trans Pattern Anal Mach Intell 33(11):2188–2202
Girshick R (2015) Fast R-CNN. In: Proceedings of the IEEE international conference on computer vision, pp 1440–1448
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587
Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press
Gu J, Wang Z, Kuen J, Ma L, Shahroudy A, Shuai B, Liu T, Wang X, Wang G, Cai J et al (2018) Recent advances in convolutional neural networks. Pattern Recogn 77:354–377
Gutierrez-Galan D, Dominguez-Morales JP, Cerezuela-Escudero E, Rios-Navarro A, Tapiador-Morales R, Rivas-Perez M, Dominguez-Morales M, Jimenez-Fernandez A, Linares-Barranco A (2018) Embedded neural network for real-time animal behavior classification. Neurocomputing 272:17–26
He K, Gkioxari G, Dollár P, Girshick R (2017) Mask R-CNN. In: Proceedings of the IEEE international conference on computer vision, pp 2961–2969
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
He T, Li X, Jiang Y (2014) Improved ht object detection algorithm based on canny edge operator. J Multimedia 9(9):1089
Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. ar**v:1704.04861
Hu J, Wang Z, Alsaadi FE, Hayat T (2017) Event-based filtering for time-varying nonlinear systems subject to multiple missing measurements with uncertain missing probabilities. Inf Fusion 38:74–83
Hu J, Wang Z, Liu S, Gao H (2016) A variance-constrained approach to recursive state estimation for time-varying complex networks with missing measurements. Automatica 64:155–162
Hu J, Wang Z, Shen B, Gao H (2013) Quantised recursive filtering for a class of nonlinear systems with multiplicative noises and missing measurements. Int J Control 86(4):650–663
Hui-jiang W (2016) Edge detection of axis parts via combining canny operator with hough transformation. Mech Electr Eng Technol 9:7
Jeong J, Park H, Kwak N (2017) Enhancement of ssd by concatenating feature maps for object detection. ar**v:1705.09587
Jiang LJ, Wang GR (2005) A stereovision method for resuming the position and pose of symmetry revolution object. J Eng Graph 5:17
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444
LeCun Y, Boser BE, Denker JS, Henderson D, Howard RE, Hubbard WE, Jackel LD (1990) Handwritten digit recognition with a back-propagation network. In: Advances in neural information processing systems, pp 396–404
LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324
Li H, Lin Z, Shen X, Brandt J, Hua G (2015) A convolutional neural network cascade for face detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 325–5334
Lin TY, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980–2988
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) SSD: single shot multibox detector. In: European conference on computer vision. Springer, pp 21–37
Ma J, Lu Y, Li L, Lu X, He F, Zhong Z (2017) Edge location method based on canny operator and improved hough transform. Autom Inf Eng 38(03):32–36
Ng AY (2004) Feature selection, l1 versus l2 regularization, and rotational invariance. In: Proceedings of the twenty-first international conference on machine learning, p 78
Radman A, Zainal N, Suandi SA (2017) Automated segmentation of iris images acquired in an unconstrained environment using hog-svm and growcut. Digital Signal Process 64:60–70
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: Unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788
Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99
Rublee E, Rabaud V, Konolige K, Bradski G (2011) ORB: An efficient alternative to sift or surf. In: 2011 International conference on computer vision. IEEE, pp 2564–2571
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323(6088):533–536
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. ar**v:1409.1556
Spong MW, Vidyasagar M (2008) Robot dynamics and control. Wiley
Stewart D (1965) A platform with six degrees of freedom. Proc Inst Mech Eng 180(1):371–386
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9
Tavana M, Abtahi AR, Di Caprio D, Poortarigh M (2018) An artificial neural network and bayesian network model for liquidity risk assessment in banking. Neurocomputing 275:2525–2554
Thrun S, Saul LK, Schölkopf B (2004) Advances in neural information processing systems 16: proceedings of the 2003 conference, vol 16. MIT Press
Wang G, **a XG, Chen VC, Fielder R (2004) Detection, location, and imaging of fast moving targets using multifrequency antenna array sar. IEEE Transactions on Aerospace and Electronic Systems 40(1):345–355
Wang L, Guo S, Huang W, Qiao Y (2015) Places205-vggnet models for scene recognition. ar**v:1508.01667
** J, Zhang JZ (2012) Edge detection from remote sensing images based on canny operator and hough transform. In: Advances in computer science and engineering. Springer, pp 807–814
Zhang J, Dong Y, Wangning H, Donghai L (2017) Hardware and algorithm design of industrial robot for 3c products manufacturing. Mach Des Res 231:2912–2924
Zhang Y, Xu Y, Ding G (2008) License plate character recognition algorithm based on filled function method training bp neural network. In: 2008 Chinese control and decision conference. IEEE, pp 3886–3891
Zhou Z, Zhang J (2007) Object detection and tracking based on adaptive canny operator and gm (1, 1) model. In: 2007 IEEE international conference on Grey systems and intelligent services. IEEE, pp 434–439
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Lin, W., Ye, C., Zhou, J., Ren, X., Tong, M. (2021). A Cascaded Deep Neural Network for Position Estimation of Industrial Robots. In: Koubaa, A., Azar, A.T. (eds) Deep Learning for Unmanned Systems. Studies in Computational Intelligence, vol 984. Springer, Cham. https://doi.org/10.1007/978-3-030-77939-9_5
Download citation
DOI: https://doi.org/10.1007/978-3-030-77939-9_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-77938-2
Online ISBN: 978-3-030-77939-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)