Abstract
This paper describes preliminary work in the recent promising approach of generating synthetic training data for facilitating the learning procedure of deep learning (DL) models, with a focus on aerial photos produced by unmanned aerial vehicles (UAV). The general concept and methodology are described, and preliminary results are presented, based on a classification problem of fire identification in forests as well as a counting problem of estimating number of houses in urban areas. The proposed technique constitutes a new possibility for the DL community, especially related to UAV-based imagery analysis, with much potential, promising results, and unexplored ground for further research.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Python Imaging Library. https://pypi.python.org/pypi/PIL.
- 2.
- 3.
Open AI Tanzania Challenge. https://blog.werobotics.org/2018/08/06/welcome-to-the-open-ai-tanzania-challenge/.
- 4.
Unity. https://unity.com.
References
Amara, J., Bouaziz, B., Algergawy, A., et al.: A deep learning-based approach for banana leaf diseases classification. In: BTW (Workshops), pp. 79–88 (2017)
Canziani, A., Paszke, A., Culurciello, E.: An analysis of deep neural network models for practical applications. ar**v preprint ar**v:1605.07678 (2016)
Douarre, C., Schielein, R., Frindel, C., Gerth, S., Rousseau, D.: Deep learning based root-soil segmentation from x-ray tomography. bioRxiv, p. 071662 (2016)
Dyrmann, M., Mortensen, A.K., Midtiby, H.S., Jorgensen, R.N., et al.: Pixel-wise classification of weeds and crops in images by using a fully convolutional neural network. In: Proceedings of the International Conference on Agricultural Engineering, Aarhus, Denmark, pp. 26–29 (2016)
Gaidon, A., Wang, Q., Cabon, Y., Vig, E.: Virtual worlds as proxy for multi-object tracking analysis. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4340–4349 (2016)
Goodfellow, I., et al.: Generative adversarial nets. In: Advances in Neural Information Processing Systems, pp. 2672–2680 (2014)
Kamilaris, A.: Simulating training data for deep learning models. In: Machine Learning in the Environmental Sciences Workshop, in Proceedings of EnviroInfo 2018, Munich, Germany, September 2018
Kamilaris, A., Assumpcio, A., Blasi, A.B., Torrellas, M., Prenafeta-Boldú, F.X.: Estimating the environmental impact of agriculture by means of geospatial and big data analysis: the case of catalonia. In: Otjacques, B., Hitzelberger, P., Naumann, S., Wohlgemuth, V. (eds.) From Science to Society. PI, pp. 39–48. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-65687-8_4
Kamilaris, A., Prenafeta-Boldu, F.X.: Disaster monitoring using unmanned aerial vehicles and deep learning. In: Disaster Management for Resilience and Public Safety Workshop, in Proceedings of EnviroInfo 2017, Luxembourg, September 2017
Kamilaris, A., Prenafeta-Boldu, F.X.: Deep learning in agriculture: a survey. Comput. Electron. Agric. 147, 70–90 (2018)
Kar, A., et al.: Meta-sim: learning to generate synthetic datasets. ar**v preprint ar**v:1904.11621 (2019)
Lehmussola, A., Ruusuvuori, P., Selinummi, J., Huttunen, H., Yli-Harja, O.: Computational framework for simulating fluorescence microscope images with cell populations. IEEE Trans. Med. Imaging 26(7), 1010–1016 (2007)
Lempitsky, V., Zisserman, A.: Learning to count objects in images. In: Advances in Neural Information Processing Systems, pp. 1324–1332 (2010)
Li, P., Liang, X., Jia, D., **ng, E.P.: Semantic-aware grad-GAN for virtual-to-real urban scene adaption. ar**v preprint ar**v:1801.01726 (2018)
Olah, C., et al.: The building blocks of interpretability. Distill 3(3), e10 (2018)
Prakash, A., et al.: Structured domain randomization: Bridging the reality gap by context-aware synthetic data. ar**v preprint ar**v:1810.10093 (2018)
Puig, X., et al.: Virtualhome: simulating household activities via programs. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8494–8502 (2018)
Rahnemoonfar, M., Sheppard, C.: Deep count: fruit counting based on deep simulated learning. Sensors 17(4), 905 (2017)
Richter, S.R., Vineet, V., Roth, S., Koltun, V.: Playing for data: ground truth from computer games. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 102–118. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_7
Ros, G., Sellart, L., Materzynska, J., Vazquez, D., Lopez, A.M.: The synthia dataset: a large collection of synthetic images for semantic segmentation of urban scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3234–3243 (2016)
Schmidhuber, J.: Deep learning in neural networks: an overview. Neural Netw. 61, 85–117 (2015)
Shah, S., Dey, D., Lovett, C., Kapoor, A.: Aerial Informatics and Robotics Platform. Microsoft Research, Washigton (2017)
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818–2826 (2016)
Walach, E., Wolf, L.: Learning to count with CNN boosting. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 660–676. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_41
Wu, Y., Wu, Y., Gkioxari, G., Tian, Y.: Building generalizable agents with a realistic and rich 3D environment. ar**v preprint ar**v:1801.02209 (2018)
Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2223–2232 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Kamilaris, A., van den Brink, C., Karatsiolis, S. (2019). Training Deep Learning Models via Synthetic Data: Application in Unmanned Aerial Vehicles. In: Vento, M., et al. Computer Analysis of Images and Patterns. CAIP 2019. Communications in Computer and Information Science, vol 1089. Springer, Cham. https://doi.org/10.1007/978-3-030-29930-9_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-29930-9_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-29929-3
Online ISBN: 978-3-030-29930-9
eBook Packages: Computer ScienceComputer Science (R0)