Log in

Using Hidden Feature Space of Diffusion Neural Networks for Image Blending Problem

  • Published:
Physics of Particles and Nuclei Aims and scope Submit manuscript

Abstract

In this paper, a new augmentation algorithm based on the idea of blending two images is proposed. The method is developed using state-of-the-art generative diffusion neural networks and can be used to solve the problem of data scarcity, improve the training quality and robustness of neural networks. Flexible customization of the algorithm allows adding snow, rain and other weather conditions to the selected.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Germany)

Instant access to the full article PDF.

Fig. 1.
Fig. 2.
Fig. 3.
Fig. 4.
Fig. 5.
Fig. 6.

Similar content being viewed by others

REFERENCES

  1. D. **ng and A. Tzes, “Synthetic aerial dataset for UAV detection via text-to-image diffusion models,” in Proceedings of IEEE Conference on Artificial Intelligence (CAI), Santa Clara, CA, USA, 2023. https://doi.org/10.1109/CAI54212.2023.00030

  2. A. Nair and N. Mehendale, “Dronescape: A high-resolution drone footage dataset for tree region segmentation.” https://ssrn.com/abstract=4512595. Accessed July 17, 2023.https://doi.org/10.2139/ssrn.4512595

  3. R. Rombach, A. Blattmann, D. Lorenz, P. Esser, and B. Ommer, “High-resolution image synthesis with latent diffusion models,” (2022). https://doi.org/10.48550/ar**v.2112.10752

  4. A. Lugmayr, M. Danelljan, A. Romero, F. Yu, R. Timofte, and L. Van Gool, “Repaint: Inpainting using denoising diffusion probabilistic models,” (2022). ar**v:2201.09865.

  5. L. Zhang and M. Agrawala, “Adding conditional control to text-to-image diffusion models,” (2023). ar**v: 2302.05543.

  6. T. Brooks, A. Holynski, and A. A. Efros, “Instructpix2pix: Learning to follow image editing instructions,” (2022). ar**v:2211.09800.

  7. A. Radford, J. W. Kim, C. Hallacy, A. Ramesh, G. Goh, S. Agarwal, G. Sastry, A. Askell, P. Mishkin, J. Clark, G. Krueger, and I. Sutskever, “Learning transferable visual models from natural language supervision,” (2021). ar**v:2103.00020.

  8. A. Kirillov, E. Mintun, N. Ravi, H. Mao, C. Rolland, L. Gustafson, T. **ao, S. Whitehead, A. C. Berg, W.‑Y. Lo, et al., “Segment anything,” (2023). ar**v: 2304.02643.

  9. J. Yu, Z. Wang, V. Vasudevan, L. Yeung, M. Seyedhosseini, and Y. Wu, “Coca: Contrastive captioners are image-text foundation models,” (2022). ar**v:2205.01917.

Download references

Funding

This work was supported by ongoing institutional funding. No additional grants to carry out or direct this particular research were obtained.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to D. Karachev.

Ethics declarations

The authors of this work declare that they have no conflicts of interest.

Additional information

Publisher’s Note.

Pleiades Publishing remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Karachev, D., Shtekhin, S. & Stadnik, A. Using Hidden Feature Space of Diffusion Neural Networks for Image Blending Problem. Phys. Part. Nuclei 55, 347–350 (2024). https://doi.org/10.1134/S1063779624030468

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1134/S1063779624030468

Navigation