Using Hidden Feature Space of Diffusion Neural Networks for Image Blending Problem

Karachev, D.; Shtekhin, S.; Stadnik, A.

doi:10.1134/S1063779624030468

Using Hidden Feature Space of Diffusion Neural Networks for Image Blending Problem

Published: 06 June 2024

Volume 55, pages 347–350, (2024)
Cite this article

Physics of Particles and Nuclei Aims and scope Submit manuscript

D. Karachev¹,
S. Shtekhin¹ &
A. Stadnik¹

5 Accesses
Explore all metrics

Abstract

In this paper, a new augmentation algorithm based on the idea of blending two images is proposed. The method is developed using state-of-the-art generative diffusion neural networks and can be used to solve the problem of data scarcity, improve the training quality and robustness of neural networks. Flexible customization of the algorithm allows adding snow, rain and other weather conditions to the selected.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price includes VAT (Germany)

Instant access to the full article PDF.

Institutional subscriptions

An overview of mixing augmentation methods and augmentation strategies

Article Open access 30 June 2022

A survey on Image Data Augmentation for Deep Learning

Article Open access 06 July 2019

PatchMix: patch-level mixup for data augmentation in convolutional neural networks

Article 30 May 2024

REFERENCES

D. **ng and A. Tzes, “Synthetic aerial dataset for UAV detection via text-to-image diffusion models,” in Proceedings of IEEE Conference on Artificial Intelligence (CAI), Santa Clara, CA, USA, 2023. https://doi.org/10.1109/CAI54212.2023.00030
A. Nair and N. Mehendale, “Dronescape: A high-resolution drone footage dataset for tree region segmentation.” https://ssrn.com/abstract=4512595. Accessed July 17, 2023.https://doi.org/10.2139/ssrn.4512595
R. Rombach, A. Blattmann, D. Lorenz, P. Esser, and B. Ommer, “High-resolution image synthesis with latent diffusion models,” (2022). https://doi.org/10.48550/ar**v.2112.10752
A. Lugmayr, M. Danelljan, A. Romero, F. Yu, R. Timofte, and L. Van Gool, “Repaint: Inpainting using denoising diffusion probabilistic models,” (2022). ar**v:2201.09865.
L. Zhang and M. Agrawala, “Adding conditional control to text-to-image diffusion models,” (2023). ar**v: 2302.05543.
T. Brooks, A. Holynski, and A. A. Efros, “Instructpix2pix: Learning to follow image editing instructions,” (2022). ar**v:2211.09800.
A. Radford, J. W. Kim, C. Hallacy, A. Ramesh, G. Goh, S. Agarwal, G. Sastry, A. Askell, P. Mishkin, J. Clark, G. Krueger, and I. Sutskever, “Learning transferable visual models from natural language supervision,” (2021). ar**v:2103.00020.
A. Kirillov, E. Mintun, N. Ravi, H. Mao, C. Rolland, L. Gustafson, T. **ao, S. Whitehead, A. C. Berg, W.‑Y. Lo, et al., “Segment anything,” (2023). ar**v: 2304.02643.
J. Yu, Z. Wang, V. Vasudevan, L. Yeung, M. Seyedhosseini, and Y. Wu, “Coca: Contrastive captioners are image-text foundation models,” (2022). ar**v:2205.01917.

Download references

Funding

This work was supported by ongoing institutional funding. No additional grants to carry out or direct this particular research were obtained.

Author information

Authors and Affiliations

Industry Center for Information Systems’ Development and Deployment, Sochi, Russia
D. Karachev, S. Shtekhin & A. Stadnik

Authors

D. Karachev
View author publications
You can also search for this author in PubMed Google Scholar
S. Shtekhin
View author publications
You can also search for this author in PubMed Google Scholar
A. Stadnik
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to D. Karachev.

Ethics declarations

The authors of this work declare that they have no conflicts of interest.

Additional information

Publisher’s Note.

Pleiades Publishing remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Karachev, D., Shtekhin, S. & Stadnik, A. Using Hidden Feature Space of Diffusion Neural Networks for Image Blending Problem. Phys. Part. Nuclei 55, 347–350 (2024). https://doi.org/10.1134/S1063779624030468

Download citation

Received: 18 September 2023
Revised: 06 November 2023
Accepted: 01 December 2023
Published: 06 June 2024
Issue Date: June 2024
DOI: https://doi.org/10.1134/S1063779624030468