Abstract
We propose a novel approach for semantics-based image annotation and retrieval. Our approach is based on the monotonic tree, a derivation of the contour tree for discrete data. The monotonic tree provides a way to bridge the gap between high-level semantics and low-level features. Each branch (subtree) of the monotonic tree is termed a structural element if its area is within a given scale. The structural elements are classified and clustered based on their low-level features, such as color, spatial location, harshness, and shape. Each cluster corresponds to some semantic feature. Category keywords indicating the semantic features are automatically attached to the images as annotations. Based on the semantic features extracted from images, high-level (semantics-based) querying and browsing of images can be achieved. The experimental results demonstrate the effectiveness of our approach.
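To illustrate the selection step described in the abstract, the following is a minimal sketch of picking out structural elements: every subtree whose area falls within a given scale. It does not construct a monotonic tree; it assumes a tree is already available. The `Node` class, the `structural_elements` function, the area bounds, and the toy node names and areas are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Iterator, List


@dataclass
class Node:
    """Hypothetical tree node; `area` is the area covered by the subtree rooted here."""
    name: str
    area: int
    children: List["Node"] = field(default_factory=list)


def structural_elements(root: Node, min_area: int, max_area: int) -> Iterator[Node]:
    """Yield the root of every subtree whose area lies within [min_area, max_area]."""
    stack = [root]
    while stack:
        node = stack.pop()
        if min_area <= node.area <= max_area:
            yield node
        # Keep descending: nested subtrees may also fall within the scale.
        stack.extend(node.children)


# Toy tree; names and areas are invented for illustration.
tree = Node("image", 10000, [
    Node("sky", 4000, [Node("cloud", 300), Node("bird", 12)]),
    Node("ground", 6000, [Node("rock", 250, [Node("crack", 5)])]),
])

selected = sorted(n.name for n in structural_elements(tree, 10, 500))
print(selected)
```

In this sketch the selected subtrees would then be described by low-level features and clustered; that stage is omitted here because the abstract does not specify the classifier or the clustering method.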
Copyright information
© 2002 Springer Science+Business Media New York
About this chapter
Cite this chapter
Song, Y., Wang, W., Zhang, A. (2002). Automatic Annotation and Retrieval of Images. In: Zhou, X., Pu, P. (eds) Visual and Multimedia Information Management. VDB 2002. IFIP — The International Federation for Information Processing, vol 88. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-35592-4_19
DOI: https://doi.org/10.1007/978-0-387-35592-4_19
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4757-6935-7
Online ISBN: 978-0-387-35592-4
eBook Packages: Springer Book Archive