Abstract
This paper presents a novel method for facilitating user-friendly image retrieval by attaching names to image regions. We first detect only the most prominent regions in images when such entities exist, using our own nonlinear image segmentation technique. Besides their visual features, the layout and relations between selected regions are also emphasized. Next, we apply an adaptive and multi-modal classification and naming of image regions using subsequent clustering methods to the features of the regions and related words as well as relevancy information. For both the naming and the testing, we have added a set of illustrations acting as abstract prototypes of the regions to randomly selected natural images. Experiments on 20,000 natural images show the efficacy of using this multilayer region naming model as well as of extensively interacting with users, enabling them to present their queries by a combination of region names, sketches and example images or regions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Smeulders, A., et al.: Content-based image retrieval at the end of the early years. IEEE Trans. PAMI 22(12), 1349–1380 (2000)
Zhou, X.S., Huang, T.S.: Unifying Keywords and Visual Contents in Image Retrieval. IEEE Multimedia 9(2), 23–33 (2002)
Zhao, R., Grosky, W.I.: Negotiating the semantic gap: from feature maps to semantic landscapes. Pattern Recognition 35, 593–600 (2002)
Barnard, K., et al.: Matching Words and Pictures. Journal of Machine Learning Research 3, 1107–1135 (2003)
Hofmann, T.: Learning and Representing Topic. A Hierarchical Mixture Model for Word Occurrences in Document Databases. In: Proc. of CONALD, Pittsburgh (1998)
Wang, J.Z., Li, J.: Learning-based linguistic indexing of pictures with 2-D MHMMs. In: Proc. ACM Multimedia, pp. 436–445 (2002)
Lim, J.-H., Tian, Q., Mulhem, P.: Home Photo Content Modeling for Personalized Event-Based Retrieval. IEEE Multimedia 9(2), 28–37 (2003)
Carbonetto, P., de Freitas, N., Barnard, K.: A Statistical Model for General Contextual Object Recognition. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3021, pp. 350–362. Springer, Heidelberg (2004)
Gabrilovich, E., Markovitch, S.: Feature Generation for Text Categorization Using World Knowledge. In: Proc. of The 19th International Joint Conf. for Artificial Intelligence (2005)
Metzler, D., Manmatha, R.: An inference network approach to image retrieval. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 42–50. Springer, Heidelberg (2004)
Kutics, A., Nakagawa, A.: Detecting Prominent Objects for Image Retrieval. In: IEEE International Conference on ICIP, vol. 3, pp. 445–448 (2005)
Hendee, W.R., Wells, P.N.T.: The Perception of Visual Information. Springer, Heidelberg (1997)
Mojsilovic, A.: A method for color naming and description of color composition in images. In: Proc. of the ICIP (2002)
Bhusnan, N., Rao, A.R., Lohse, G.L.: The texture lexicon: Understanding the categorization of visual texture terms and their relationship to texture images. Cognitive Science 21(2), 219–246 (1997)
Hanbury, A.: MUSCLE, Guide to annotation, Version 2.0, Tech. Univ. of Vienna (2005)
Fellbaum, C. (ed.): WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kutics, A., Nakagawa, A. (2006). Naming of Image Regions for User-Friendly Image Retrieval. In: Campilho, A., Kamel, M.S. (eds) Image Analysis and Recognition. ICIAR 2006. Lecture Notes in Computer Science, vol 4141. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11867586_57
Download citation
DOI: https://doi.org/10.1007/11867586_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44891-4
Online ISBN: 978-3-540-44893-8
eBook Packages: Computer ScienceComputer Science (R0)