SceneNet: A Perceptual Ontology for Scene Understanding

Kadar, Ilan; Ben-Shahar, Ohad

doi:10.1007/978-3-319-16181-5_27

Ilan Kadar¹⁶ &
Ohad Ben-Shahar¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8926))

Included in the following conference series:

European Conference on Computer Vision

4470 Accesses
1 Citations

Abstract

Scene recognition systems which attempt to deal with a large number of scene categories currently lack proper knowledge about the perceptual ontology of scene categories and would enjoy significant advantage from a perceptually meaningful scene representation. In this work we perform a large-scale human study to create “SceneNet”, an online ontology database for scene understanding that organizes scene categories according to their perceptual relationships. This perceptual ontology suggests that perceptual relationships do not always conform the semantic structure between categories, and it entails a lower dimensional perceptual space with “perceptually meaningful” Euclidean distance, where each embedded category is represented by a single prototype. Using the SceneNet ontology and database we derive a computational scheme for learning non-linear map** of scene images into the perceptual space, where each scene image is closest to its category prototype than to any other prototype by a large margin. Then, we demonstrate how this approach facilitates improvements in large-scale scene categorization over state-of-the-art methods and existing semantic ontologies, and how it reveals novel perceptual findings about the discriminative power of visual attributes and the typicality of scenes.

Download to read the full chapter text

Chapter PDF

SUN Database: Exploring a Large Collection of Scene Categories

Article 13 August 2014

Can computer vision problems benefit from structured hierarchical classification?

Article Open access 06 May 2016

ConceptFusion: A Flexible Scene Classification Framework

Keywords

References

**ao, J., Hays, J., Ehinger, K., Oliva, A., Torralba, A.: Sun database: Large scale scene recognition from abbey to zoo. In: CVPR (2010)
Google Scholar
SceneNet: An Online Perceptual Ontology Database for Scene Understanding. (2013) Anonymous URL. Concealed for blind review
Google Scholar
Fei-Fei, L., Perona, P.: A bayesian hierarchy model for learning natural scene categories. In: CVPR (2005)
Google Scholar
Bosch, A., Zisserman, A., Muñoz, X.: Scene Classification Via pLSA. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 517–530. Springer, Heidelberg (2006)
Chapter Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: CVPR, pp. 2169–2178 (2006)
Google Scholar
Griffin, G., Perona, P.: Learning and using taxonomies for fast visual categorization. In: CVPR (2008)
Google Scholar
Bart, E., Porteous, I., Perona, P., Welling, M.: Unsupervised learning of visual taxonomies. In: CVPR (2008)
Google Scholar
Ahuja, N., Todorovic, S.: Learning the taxonomy and models of categories present in arbitrary images. In: ICCV (2007)
Google Scholar
Marszałek, M., Schmid, C.: Constructing Category Hierarchies for Visual Recognition. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part IV. LNCS, vol. 5305, pp. 479–491. Springer, Heidelberg (2008)
Chapter Google Scholar
Sivic, J., Russell, B., Zisserman, A., Freeman, W., Efros, A.: Unsupervised discovery of visual object class hierarchies. In: CVPR (2008)
Google Scholar
Li, L., Wang, C., Lim, Y., Blei, D., Fei-Fei, L.: Building and using a semantivisual image hierarchy. In: CVPR (2010)
Google Scholar
Marszalek, M., Schmid, C.: Semantic hierarchies for visual object recognition. In: CVPR (2007)
Google Scholar
Torralba, A., Fergus, R., W.T., F.: 80 million tiny images: a large dataset for non-parametric object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell. 30(11), 1958–1970 (2008)
Article Google Scholar
Fergus, R., Bernal, H., Weiss, Y., Torralba, A.: Semantic Label Sharing for Learning with Many Categories. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 762–775. Springer, Heidelberg (2010)
Chapter Google Scholar
Deselaers, T., Ferrari, V.: Visual and semantic similarity in imagenet. In: CVPR, pp. 1777–1784 (2011)
Google Scholar
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: A large-scale hierarchical image database. In: CVPR (2009)
Google Scholar
Verma, N., Mahajan, D., Sellamanickam, S., Nair, V.: Learning hierarchical similarity metrics. In: CVPR (2012)
Google Scholar
Miller, G.: Wordnet: A lexical database for english. In: Communications of the ACM (1995)
Google Scholar
Deng, J., Berg, A., Fei-Fei, L.: Hierarchical semantic indexing for large scale image retrieval. In: CVPR (2011)
Google Scholar
Weinberger, K., Chapelle, O.: Large margin taxonomy embedding for document categorization. In: NIPS, pp. 1737–1744 (2008)
Google Scholar
Kadar, I., Ben-Shahar, O.: Small sample scene categorization from perceptual relations. In: CVPR, pp. 2711–2718 (2012)
Google Scholar
Rousselet, G.A., Fabre-Thorpe, M., Thorpe, S.J.: Parallel processing in high-level categorization of natural images. Nature Neuroscience 5(7), 629–630 (2002)
Google Scholar
Torgerson, W.S.: Multidimensional scaling: theory and method. Psychometrika 17(6), 401–419 (1952)
Article MATH MathSciNet Google Scholar
Oliva, A., Torralba, A.: Modeling the shape of the scene: A holistic representation of the spatial envelope. Int. J. Comput. Vision 42(3), 145–175 (2001)
Article MATH Google Scholar
Greene, M., Oliva, A.: Forest before the trees: the precedence of global features in visual perception. Cognit. Sci. 58, 137–179 (2009)
Google Scholar
Patterson, G., Hays, J.: SUN attribute database: Discovering, annotating, and recognizing scene attributes. In: CVPR (2012)
Google Scholar
Saunders, C., Gammerman, A., Vovk, V.: Ridge regression learning algorithm in dual variables. In: ICML, p. 515521 (1998)
Google Scholar
Boyd, S., Vandenberghe, L. (eds.): Convex Optimization. Cambridge University Press (2004)
Google Scholar
Weinberger, K., Saul, L.: Fast solvers and efficient implementations for distance metric learning. In: ICML, pp. 1160–1167 (2008)
Google Scholar
Vogel, J., Schiele, B.: Semantic typicality measure for natural scene categorization. In: Annual Pattern Recognition Symposium (2004)
Google Scholar
Ehinger, K., **ao, J., Torralba, A., Oliva, A.: Estimating scene typicality from human ratings and image features. In: Proceedings of the 33rd Annual Conference of the Cognitive Science Society, pp. 2562–2567 (2011)
Google Scholar
Murphy, G.L. (ed.): The big book of concepts. MIT Press (2002)
Google Scholar
Rosch, E.: Cognitive representations of semantic categories. J. Exp. Psych. (1975)
Google Scholar
Mervis, C., Pani, J.: Acquisition of basic object categories. Cognit. Sci. 12 (1980)
Google Scholar

Download references

Author information

Authors and Affiliations

Ben-Gurion University of the Negev, Beer-Sheva, Israel
Ilan Kadar & Ohad Ben-Shahar

Authors

Ilan Kadar
View author publications
You can also search for this author in PubMed Google Scholar
Ohad Ben-Shahar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ilan Kadar .

Editor information

Editors and Affiliations

University College London, London, United Kingdom
Lourdes Agapito
University of Lugano, Lugano, Switzerland
Michael M. Bronstein
Technische Universität Dresden, Dresden, Germany
Carsten Rother

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kadar, I., Ben-Shahar, O. (2015). SceneNet: A Perceptual Ontology for Scene Understanding. In: Agapito, L., Bronstein, M., Rother, C. (eds) Computer Vision - ECCV 2014 Workshops. ECCV 2014. Lecture Notes in Computer Science(), vol 8926. Springer, Cham. https://doi.org/10.1007/978-3-319-16181-5_27

Download citation

DOI: https://doi.org/10.1007/978-3-319-16181-5_27
Published: 20 March 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16180-8
Online ISBN: 978-3-319-16181-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

SceneNet: A Perceptual Ontology for Scene Understanding

Abstract

Chapter PDF

Similar content being viewed by others

SUN Database: Exploring a Large Collection of Scene Categories

Can computer vision problems benefit from structured hierarchical classification?

ConceptFusion: A Flexible Scene Classification Framework

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

SceneNet: A Perceptual Ontology for Scene Understanding

Abstract

Chapter PDF

Similar content being viewed by others

SUN Database: Exploring a Large Collection of Scene Categories

Can computer vision problems benefit from structured hierarchical classification?

ConceptFusion: A Flexible Scene Classification Framework

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation