A New Augmented Method for Processing Video Datasets Based on Deep Neural Network

Wang, Wei; Wang, Haiyan; Ni, Fuchuan

doi:10.1007/978-981-16-9423-3_16

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 854)

Abstract

Deep learning needs large datasets to perform well, but many research fields lack sufficient training data, which limits computer vision applications. This article proposes a new data augmentation method for building training sets from small datasets. The method has two steps: (1) unbalanced sampling based on information density, and (2) splicing images to form a dataset. Combinations of datasets with different information densities were used to test model generalization. An enhanced loss function, consisting of a label smoothing loss and a cross-entropy loss, was used to reduce model preference during training. With the same amount of data, the Mean Absolute Error (MAE) of the model trained with our sampling method improved by 55% compared with the traditional sampling method, and the best MAE reached 0.98 when the splicing method was adopted. The results show that this augmentation method suits scenarios with small sample sizes, especially video datasets, and that splicing is a good choice for optimizing model generalization.
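
The abstract names the ingredients of the method but gives no formulas, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes that each frame already has a non-negative "information density" score, that unbalanced sampling means drawing frames with probability proportional to that score (with replacement), that splicing means tiling four equally sized frames into a 2x2 image, and that the enhanced loss is a weighted sum of the two terms. All function names and the parameters `alpha`, `eps`, and `seed` are hypothetical.

```python
import math
import random


def sample_by_density(frames, densities, k, seed=0):
    """Draw k frames with probability proportional to a per-frame density score.

    Assumption: sampling is with replacement; the abstract does not say.
    """
    rng = random.Random(seed)
    return rng.choices(frames, weights=densities, k=k)


def splice_2x2(a, b, c, d):
    """Tile four equally sized frames (lists of pixel rows) into one 2x2 image."""
    top = [row_a + row_b for row_a, row_b in zip(a, b)]
    bottom = [row_c + row_d for row_c, row_d in zip(c, d)]
    return top + bottom


def cross_entropy(probs, target):
    """Cross-entropy of predicted class probabilities against a hard label."""
    return -math.log(probs[target])


def label_smoothing_loss(probs, target, eps=0.1):
    """Cross-entropy against a smoothed target distribution.

    The true class gets weight (1 - eps) + eps / n, every other class eps / n.
    Assumes all probabilities are strictly positive.
    """
    n = len(probs)
    return -sum(
        ((1 - eps) * (i == target) + eps / n) * math.log(p)
        for i, p in enumerate(probs)
    )


def enhanced_loss(probs, target, alpha=0.5, eps=0.1):
    """Hypothetical weighted sum of cross-entropy and label smoothing loss."""
    ce = cross_entropy(probs, target)
    ls = label_smoothing_loss(probs, target, eps)
    return alpha * ce + (1 - alpha) * ls


def mean_absolute_error(predicted, actual):
    """MAE between predicted and true values, the metric named in the abstract."""
    return sum(abs(p - t) for p, t in zip(predicted, actual)) / len(actual)


if __name__ == "__main__":
    # Four tiny 2x2 "frames" with made-up density scores.
    frames = [[[i, i], [i, i]] for i in range(4)]
    picked = sample_by_density(frames, densities=[0.1, 0.2, 0.3, 0.4], k=4)
    spliced = splice_2x2(*picked)
    print(len(spliced), len(spliced[0]))  # height and width of the spliced image
    print(enhanced_loss([0.7, 0.2, 0.1], target=0))
```

Setting `eps=0` reduces the label smoothing term to plain cross-entropy, which is a quick way to check the combined loss against a known value.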

Acknowledgements

This work was supported by the China Scholarship Council (No. 201906765023) and the Hubei Chenguang Talented Youth Development Foundation.

Author information

Authors and Affiliations

College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
Wei Wang, Haiyan Wang & Fuchuan Ni
Hubei Engineering Technology Research Center of Agricultural Big Data, Wuhan, 430070, Hubei, China
Wei Wang, Haiyan Wang & Fuchuan Ni

Corresponding author

Correspondence to Haiyan Wang.

Editor information

Editors and Affiliations

Department of Electrical Engineering, University of Texas at Arlington, Arlington, TX, USA
Qilian Liang
Tianjin Normal University, Tianjin, China
Wei Wang
Tianjin Normal University, Tianjin, China
Jiasong Mu
Dalian University of Technology, Dalian, China
Xin Liu
School of Information Science and Technology, Dalian Maritime University, Dalian, China
Zhenyu Na

Cite this paper

Wang, W., Wang, H., Ni, F. (2022). A New Augmented Method for Processing Video Datasets Based on Deep Neural Network. In: Liang, Q., Wang, W., Mu, J., Liu, X., Na, Z. (eds) Artificial Intelligence in China. Lecture Notes in Electrical Engineering, vol 854. Springer, Singapore. https://doi.org/10.1007/978-981-16-9423-3_16

DOI: https://doi.org/10.1007/978-981-16-9423-3_16
Published: 22 March 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-9422-6
Online ISBN: 978-981-16-9423-3
eBook Packages: Computer Science, Computer Science (R0)
