The Importance of Ontological Structure: Why Validation by ‘Fit-to-Data’ Is Insufficient

  • Chapter
  • First Online:
Simulating Social Complexity

Part of the book series: Understanding Complex Systems ((UCS))

Abstract

This chapter will briefly describe some common methods by which people make quantitative estimates of how well they expect empirical models to make predictions. However, the chapter’s main argument is that fit-to-data, the traditional yardstick for establishing confidence in models, is not quite the solid ground on which to build such belief some people think it is, especially for the kind of system agent-based modelling is usually applied to. Further, the chapter will show that the amount of data required to establish confidence in an arbitrary model by fit-to-data is often infeasible, unless there is some appropriate ‘big data’ available. This arbitrariness can be reduced by constraining the choice of model. In agent-based models, these constraints are introduced by their descriptiveness rather than by removing variables from consideration or making assumptions for the sake of simplicity. By comparing with neural networks, we show that agent-based models have a richer ontological structure. For agent-based models, in particular, this richness means that the ontological structure has a greater significance and yet is all too commonly taken for granted or assumed to be ‘common sense’. The chapter therefore also discusses some approaches to validating ontologies.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    Less naively, you would use a calculated inflation figure for the old basket of goods as input to the model; however, if people are not buying things in the old basket, the model may still not be providing meaningful output.

  2. 2.

    The case for agent-based modelling being that it is necessary to represent all the agents if you want to understand the emergent system-level dynamics.

  3. 3.

    Part of this is the confusion between ‘free parameters’, which can be adjusted to make the results fit data, and parameters with values that are, at least in theory, empirically observable, even if currently unknown. Agent-based models have a lot of the latter but relatively few of the former.

  4. 4.

    Macbeth, Act V, Scene V.

  5. 5.

    http://www.earth-system-dynamics.net/ <Accessed May 2017>.

  6. 6.

    https://www.commod.org/en <Accessed May 2017>.

  7. 7.

    Its popularity in the social simulation community is reflected by the fact that tools have been built to link it with Wilensky’s (1999) Netlogo (Thiele et al. 2012).

  8. 8.

    http://www.r-project.org/ <Accessed May 2017>.

  9. 9.

    https://www.wikipedia.org/ <Accessed May 2017>.

References

  • Abu-Mostafa, Y. S. (1989). The Vapnik-Chervonenkis dimension: Information versus complexity in learning. Neural Computation, 1(3), 312–317.

    Article  Google Scholar 

  • Aha, D. W. (1992). Tolerating noisy, irrelevant and novel attributes in instance-based learning algorithms. International Journal of Man-Machine Studies, 36(2), 267–287.

    Article  Google Scholar 

  • Baader, F., & Nutt, W. (2003). Basic description logics. In F. Baader, D. Calvanese, D. L. McGuinness, D. Nardi, & P. F. Patel-Schneider (Eds.), The description logic handbook (pp. 43–95). New York, NY: Cambridge University Press.

    Google Scholar 

  • Baader, F., Küsters, R., & Wolter, F. (2003). Extensions to description logics. In F. Baader, D. Calvanese, D. L. McGuinness, D. Nardi, & P. F. Patel-Schneider (Eds.), The description logic handbook (pp. 219–261). New York, NY: Cambridge University Press.

    Google Scholar 

  • Bagosi, T., Calvanese, D., Hardi, J., Komla-Ebri, S., Lanti, D., Rezk, M., et al. (2014, August 8–12). The ontop framework for ontology based data access. In D. Zhao, J. Du, H. Wang, P. Wang, J. Donghong, & J. Z. Pan (Eds.), The semantic web and web science. 8th Chinese conference, CSWS, revised selected papers (pp. 67–77). Berlin: Springer-Verlag, Wuhan, China.

    Google Scholar 

  • Barwise, J., & Seligman, J. (1997). Information flow: The logic of distributed systems. Cambridge: Cambridge University Press.

    Book  MATH  Google Scholar 

  • Bellatreche, L., Xuan Dong, N., Peirra, G., & Hondjack, D. (2006). Contribution of ontology-based data modeling to automatic integration of electronic catalogues within engineering databases. Computers in Industry, 57, 711–724.

    Article  Google Scholar 

  • Becu, N., Bousquet, F., Barreteau, O., Perez, P., & Walker, A. (2003). A methodology for eliciting and modelling stakeholders’ representations with agent based modelling. In D. Hales, B. Edmonds, E. Norling, & J. Rouchier (Eds.), Multi-Agent-Based Simulation III. MABS 2003. Lecture Notes in Computer Science 2927 (pp. 131–148). Berlin, Heidelberg: Springer.

    Google Scholar 

  • Bergman, M. (2014). 50 ontology map** and alignment tools. http://www.mkbergman.com/1769/50-ontology-map**-and-alignment-tools/. Accessed May 2017.

  • Berners-Lee, T., Hendler, J., & Lassila, O. (2001). The semantic web: A new form of web content that is meaningful to computers will unleash a revolution of new possibilities. Scientific American, 284(5), 28–37.

    Article  Google Scholar 

  • Bharwani, S., Besa, M. C., Taylor, R., Fischer, M., Devisscher, T., & Kenfack, C. (2015). Identifying salient drivers of livelihood decision-making in the forest communities of Cameroon: Adding value to social simulation models. Journal of Artificial Societies and Social Simulation, 18(1), 3. http://jasss.soc.surrey.ac.uk/18/1/3.html. Accessed May 2017.

  • Bishop, C. M. (1995). Neural networks for pattern recognition. Oxford: Oxford University Press.

    MATH  Google Scholar 

  • Brewer, M. J., Butler, A., & Cooksley, S. (2016). The relative performance of AIC, AICC and BIC in the presence of unobserved heterogeneity. Methods in Ecology and Evolution, 7, 679–692.

    Article  Google Scholar 

  • Calvanese, D., & De Giacomo, G. (2003). Expressive description logics. In F. Baader, D. Calvanese, D. L. McGuinness, D. Nardi, & P. F. Patel-Schneider (Eds.), The description logic handbook (pp. 178–218). New York, NY: Cambridge University Press.

    Google Scholar 

  • Cheng, B., & Titterington, D. M. (1994). Neural networks: A review from a statistical perspective. Statistical Science, 9(1), 2–30.

    Article  MathSciNet  MATH  Google Scholar 

  • Chenoweth, S. V. (1991). On the NP-hardness of blocks world. In AAAI-91 proceedings (pp. 623–628).

    Google Scholar 

  • Chester, D. L. (1990, January 15–19). Why two hidden layers are better than one. In Proceedings of the international joint conference on neural networks, (Vol. 1, pp. 265–268), Washington DC.

    Google Scholar 

  • Clarke, K. A. (2005). The phantom menace: Omitted variable bias in econometric research. Conflict Management and Peace Science, 22(4), 341–352.

    Article  Google Scholar 

  • Cuenca Grau, B., Horrocks, I., Motik, B., Parsia, B., Patel-Schneider, P., & Sattler, U. (2008). OWL 2: The next step for OWL. Journal of Web Semantics, 6(4), 309–322.

    Article  Google Scholar 

  • Cybenko, G. (1989). Approximation by superposition of a sigmoidal function. Mathematics of Control, Signals, and Systems, 2(4), 303–314.

    Article  MathSciNet  MATH  Google Scholar 

  • Devlin, K. (1991). Logic and information. Cambridge, Cambridge University Press.

    Google Scholar 

  • Do, H.-H., & Rahm, E. (2002, August 20–23) COMA: A system for flexible combination of schema matching approaches. In VLDB 2002: 28th International Conference on Very Large Data Bases, Kowloon Shangri-La Hotel, Hong Kong, China. http://www.vldb.org/conf/2002/S17P03.pdf. Accessed May 2017.

  • Doan, A., Madhavan, J., Domingos, P., & Halevy, A. (2004). Ontology matching: A machine learning approach. In S. Staab & R. Studer (Eds.), Handbook on ontologies (pp. 385–403). Berlin: Springer-Verlag.

    Chapter  Google Scholar 

  • Drchal, J., Čertický, M., & Jakob, M. (2016). VALFRAM: Validation framework for activity-based models. Journal of Artificial Societies and Social Simulation, 19(3), 15. http://jasss.soc.surrey.ac.uk/19/3/15.html. Accessed May 2017.

  • Edmonds, B. (2002, June 3). Simplicity is not truth-indicative. In Centre for policy modelling discussion papers CPM-02-99. http://cfpm.org/discussionpapers/111/simplicity-is-not-truth-indicative. Accessed May 2017.

  • Edmonds, B., & Moss, S. (2005, July 19). From KISS to KIDS: An ‘anti-simplistic’ modelling approach. In P. Davidsson, B. Logan, & K. Takadama (Eds.), Multi-agent and multi-agent-based simulation, joint workshop MABS 2004, Revised selected papers. Lecture notes in artificial intelligence 3415 (pp. 130–114), New York, NY, USA.

    Google Scholar 

  • Elsenbroich, C. (2012). Explanation in agent-based modelling: Functions, causality or mechanisms? Journal of Artificial Societies and Social Simulation, 15(3), 1. http://jasss.soc.surrey.ac.uk/15/3/1.html. Accessed May 2017.

  • Epstein, J. M. (2008). Why model? Journal of Artificial Societies and Social Simulation, 11(4), 12. http://jasss.soc.surrey.ac.uk/11/4/12.html. Accessed May 2017.

  • Etienne, M. (2014). Companion modelling: A participatory approach to support sustainable development. The Netherlands: Springer.

    Book  Google Scholar 

  • Evans, J. S. B. T., & Over, D. E. (2004). If. Oxford: Oxford University Press.

    Book  Google Scholar 

  • Faria, D., Pesquita, C., Santos, E., Palmonari, M., Cruz, I. F., & Couto, F. M. (2013, September 9–13). The agreementmakerlight ontology matching system. In R. Meersman, H. Panetto, T. Dillon, J. Eder, Z. Bellahsene, N. Ritter, P. De Leenheer, & D. Dou (Eds.), On the move to meaningful internet systems: OTM 2013 conferences. Confederated international conferences CoopIS, DOA-trusted cloud, and ODBASE 2013, Proceedings. lecture notes in computer science 8185 (pp. 527–541), , Graz, Austria.

    Google Scholar 

  • Filatova, T., Polhill, J. G., & van Ewijk, S. (2016). Regime shifts in coupled socio-environmental systems: Review of modelling challenges and approaches. Environmental Modelling and Software, 75, 333–347.

    Article  Google Scholar 

  • Funahashi, K. (1989). On the approximate realisation of continuous map**s by neural networks. Neural Networks, 2(3), 183–192.

    Article  Google Scholar 

  • Ge, J., & Polhill, J. G. (2016). Exploring the combined effect of factors influencing commuting patterns and CO2 emissions in Aberdeen using an agent-based model. Journal of Artificial Societies and Social Simulation, 19(3), 11. http://jasss.soc.surrey.ac.uk/19/3/11.html. Accessed May 2017.

  • Giunchiglia, F., Autayeu, A., & Pane, J. (2012). S-match: An open source framework for matching lightweight ontologies. Semantic Web, 3(3), 307–317.

    Google Scholar 

  • Gotts, N. M., & Polhill, J. G. (2009, October 5–6). Narrative scenarios, mediating formalisms, and the agent-based simulation of land use change. In F. Squazzoni (Ed.), Epistemological aspects of computer simulation in the social sciences. Second international workshop EPOS, Revised selected and invited papers. Lecture notes in artificial intelligence 5466 (pp. 99–116), Brescia, Italy.

    Google Scholar 

  • Gotts, N. M., & Polhill, J. G. (2010). Size matters: Large-scape replications of experiments with FEARLUS. Advances in Complex Systems, 13(4), 453–467.

    Article  MathSciNet  Google Scholar 

  • Grimm, V., Frank, K., Jeltsch, F., Brandl, R., Uchmański, J., & Wissel, C. (1996). Pattern-oriented modelling in population ecology. The Science of the Total Environment, 153, 151–166.

    Article  Google Scholar 

  • Gruber, T. R. (1993). A translation approach to portable ontology specification. Knowledge Acquisition, 5(2), 199–220.

    Article  Google Scholar 

  • Grubic, T., & Fan, I.-S. (2010). Supply chain ontology: Review, analysis and synthesis. Computers in Industry, 61, 776–786.

    Article  Google Scholar 

  • Guarino, N., & Welty, C. A. (2009). An overview of ontoclean. In S. Staab & R. Studer (Eds.), Handbook on ontologies (pp. 201–220). Berlin: Springer Verlag.

    Chapter  Google Scholar 

  • Gurney, K. (1997). An introduction to neural networks. London: UCL Press.

    Book  Google Scholar 

  • Hanson, S. J., & Burr, D. J. (1990). What connectionist models learn: Learning and representation in connectionist networks. The Behavioral and Brain Sciences, 13, 471–518.

    Article  Google Scholar 

  • Hertz, J., Krogh, A., & Palmer, R. G. (1991). Introduction to the theory of neural computation. Boston, MA: Addison-Wesley.

    Google Scholar 

  • Holland, J. H. (1986). Esca** brittleness: The possibilities of general-purpose learning algorithms applied to parallel rule-based systems. In R. S. Michalski, J. G. Carbonell, & T. M. Mitchell (Eds.), Machine learning: An artificial intelligence approach (Vol. II). Burlington, MA: Morgan Kaufmann.

    Google Scholar 

  • Hornik, K., Stinchcombe, M., & White, H. (1989). Multilayer feedforward networks are universal approximators. Neural Networks, 2(5), 359–366.

    Article  Google Scholar 

  • Horrocks, I., Patel-Schneider, P. F., & van Harmelen, F. (2003). From SHIQ and RDF to OWL: The making of a web ontology language. Journal of Web Semantics, 1(1), 7–26.

    Article  Google Scholar 

  • Hu, W., & Qu, Y. (2008). Falcon-AO: A practical ontology matching system. Web Semantics: Science, Services and Agents on the World Wide Web, 6(3), 237–239.

    Article  Google Scholar 

  • Hu, W., Qu, Y., & Cheng, G. (2008). Matching large ontologies: A divide-and-conquer approach. Data & Knowledge Engineering, 67, 140–160.

    Article  Google Scholar 

  • Huhn, U., & Schulz, S. (2004). Building a very large ontology from medical thesauri. In S. Staab & R. Studer (Eds.), Handbook on ontologies (pp. 133–150). Berlin: Springer-Verlag.

    Chapter  Google Scholar 

  • Jean-Mary, Y. R., Shironoshita, E. P., & Kabuka, M. R. (2009). Ontology matching with semantic verification. Web Semantics: Science, Services and Agents on the World Wide Web, 7(3), 235–251.

    Article  Google Scholar 

  • Jones, D. M., Bench-Capon, T. J. M., & Visser, P. R. S. (1998, 31 August–4 September). Methodologies for ontology development. In J. Cuena (Ed.), IT & knows: Information technologies and knowledge systems. Proceedings of a conference held as part of the XV IFIP world computer congress (pp. 62–75.), Vienna, Austria and Budapest, Hungary. http://cgi.csc.liv.ac.uk/~tbc/publications/itknows.pdf. Accessed May 2017.

  • Kalfoglou, Y., & Schorlemmer, M. (2003). Ontology map**: The state of the art. The Knowledge Engineering Review, 18(1), 1–31.

    Article  MATH  Google Scholar 

  • Klein, H. K., & Hirschheim, R. A. (1987). A comparative framework of data modelling paradigms and approaches. The Computer Journal, 30(1), 8–15.

    Article  Google Scholar 

  • Livet, P., Muller, J.-P., Phan, D., & Sanders, L. (2010). Ontology, a mediator for agent-based modeling in social science. Journal of Artificial Societies and Social Simulation, 13(1), 3. http://jasss.soc.surrey.ac.uk/13/1/3.html. Accessed May 2017.

  • Moss, S. (2002). Agent based modelling for integrated assessment. Integrated Assessment, 3(1), 63–77.

    Article  Google Scholar 

  • Moss, S., & Edmonds, B. (2005). Sociology and simulation: Statistical and qualitative cross-validation. American Journal of Sociology, 110(4), 1095–1131.

    Article  Google Scholar 

  • Moss, S. (2008). Alternative approaches to the empirical validation of agent-based models. Journal of Artificial Societies and Social Simulation, 11(1), 5. http://jasss.soc.surrey.ac.uk/11/1/5.html. Accessed May 2017.

  • Müller, J. P. (2010). A framework for integrated modeling using a knowledge-driven approach. In D. A. Swayne, W. Yang, A. A. Voinov, A. Rizzoli, & T. Filatova (Eds.), Fifth Biennial international congress on environmental modelling and software, Ottawa, Canada.http:// www.iemss.org/iemss2010/papers/S21/S.21.08.A%20framework%20foceling%20using%20a%20knowledgedriven%20approach%20-%20JEAN-PIERRE %20MULLER.pdf. Accessed May 2017.

  • Ngo, D., & Bellahsene, Z. (2012, October 8–12). YAM++: A multi-strategy based approach for ontology matching task. In A. ten Teije, J. Völker, S. Handschuh, H. Stuckenschmidt, M. d’Acquin, A. Nikolov, N. Aussenac-Gilles, & N. Hernandez (Eds.), Knowledge engineering and knowledge management. 18th international conference, EKAW. Proceedings. Lecture notes in computer science 7603 (pp. 421–425), Galway City, Ireland.

    Google Scholar 

  • Object Modelling Group. (2014). Ontology definition metamodel version 1.1. In OMG Document Number: Formal/2014–09-02. http://www.omg.org/spec/ODM/1.1/PDF/. Accessed May 2017.

  • Oreskes, N., Shrader-Frechette, K., & Belitz, K. (1994). Verification, validation, and confirmation of numerical models in the earth sciences. Science, 263(5147), 641–646.

    Article  Google Scholar 

  • Perez, P., Dray, A., Dietze, P., Moore, D., Jenkinson, R., Siokou, C., et al. (2009). An ontology-based simulation model exploring the social contexts of psychostimulant use among young Australians. International Society for the Study of Drug Policy. http://ro.uow.edu.au/smartpapers/36. Accessed May 2017.

  • Polhill, J. G. (2015). Extracting OWL ontologies from agent-based models: A Netlogo extension. Journal of Artificial Societies and Social Simulation, 18(2), 15. http://jasss.soc.surrey.ac.uk/18/2/15.html. Accessed May 2017.

  • Polhill, J. G., & Gotts, N. M. (2009). Ontologies for transparent integrated human-natural systems modelling. Landscape Ecology, 24, 1255–1267.

    Article  Google Scholar 

  • Polhill, J. G., Sutherland, L.-A., & Gotts, N. M. (2010). Using qualitative evidence to enhance an agent-based modelling system for studying land use change. Journal of Artificial Societies and Social Simulation, 13(2), 10. http://jasss.soc.surrey.ac.uk/13/2/10.html. Accessed May 2017.

  • Radax, W., & Rengs, B. (2010). Prospects and pitfalls of statistical testing: Insights from replicating the demographic prisoner’s dilemma. Journal of Artificial Societies and Social Simulation, 13(4), 1. http://jasss.soc.surrey.ac.uk/13/4/1.html. Accessed May 2017.

  • Rossiter, S., Noble, J., & Bell, K. R. W. (2010). Social simulations: Improving interdisciplinary understanding of scientific positioning and validity. Journal of Artificial Societies and Social Simulation, 13(1), 10. http://jasss.soc.surrey.ac.uk/13/1/10.html. Accessed May 2017.

  • Rumbaugh, J. (2003). Object-oriented analysis and design (OOAD). In A. Ralston, E. D. Reilly, & D. Hemmendinger (Eds.), Encyclopedia of computer science (4th ed., pp. 1275–1279). Chichester: John Wiley and Sons Ltd..

    Google Scholar 

  • Schulze, J., Müller, B., Groeneveld, J., & Grimm, V. (2017). Agent-based modelling of social-ecological systems: Achievements, challenges, and a way forward. Journal of Artificial Societies and Social Simulation, 20(2), 8. http://jasss.soc.surrey.ac.uk/20/2/8.html. Accessed May 2017.

  • Shalizi, C. R. (2006). Methods and techniques of complex systems science: An overview. In T. S. Deisboeck & J. Y. Kresh (Eds.), Complex systems science in biomedicine (pp. 33–114). New York, NY: Springer.

    Chapter  Google Scholar 

  • Shearer, R., Motik, B. and Horrocks, I. (2008, 26–27 October). HermiT: A highly-efficient OWL reasoner. In OWLED 2008. OWL: Experiences and Directions. Fifth International Workshop, Karlsruhe, Germany. http://webont.org/owled/2008/papers/owled2008eu_submission_12.pdf. Accessed May 2017.

  • Shvaiko, P., & Euzenat, J. (2013). Ontology matching: State of the art and future challenges. IEEE Transactions on Knowledge and Data Engineering, 25(1), 158–176.

    Article  Google Scholar 

  • Sirin, E., Parsia, B., Cuenca Grau, B., Kalyanpur, A., & Katz, Y. (2007). Pellet: A practical OWL-DL reasoner. Web Semantics: Science, Services and Agents on the World Wide Web, 5(2), 51–53.

    Article  Google Scholar 

  • Sowa, J. (1999). Knowledge representation: Logical, philosophical, and computational foundations. Pacific Grove, CA: Brooks/Cole.

    Google Scholar 

  • Sure, Y., Staab, S., & Studer, R. (2004). On-to-knowledge methodology (OTKM). In S. Staab & R. Studer (Eds.), Handbook on ontologies (pp. 117–132). Berlin: Springer-Verlag.

    Chapter  Google Scholar 

  • ten Broeke, G., van Voorn, G., & Ligtenberg, A. (2016). Which sensitivity analysis method should I use for my agent-based model? Journal of Artificial Societies and Social Simulation, 19(1), 5. http://jasss.soc.surrey.ac.uk/19/1/5.html. Accessed May 2017.

  • Thiele, J. C., Kurth, W., & Grimm, V. (2012). Agent-based modelling: Tools for linking NetLogo and R. Journal of Artificial Societies and Social Simulation, 15(3), 8. http://jasss.soc.surrey.ac.uk/15/3/8.html. Accessed May 2017.

  • Thompson, N. S., & Derr, P. (2009). Contra Epstein, good explanations predict. Journal of Artificial Societies and Social Simulation, 12(1), 9. http://jasss.soc.surrey.ac.uk/12/1/9.html. Accessed May 2017.

  • Troitzsch, K. G. (2009). Not all explanations predict satisfactorily, and not all good predictions explain. Journal of Artificial Societies and Social Simulation, 12(1), 10. http://jasss.soc.surrey.ac.uk/12/1/10.html. Accessed May 2017.

  • Troitzsch, K. G. (2015). What one can learn from extracting OWL ontologies from a NetLogo model that was not designed for such an exercise. Journal of Artificial Societies and Social Simulation, 18(2), 14. http://jasss.soc.surrey.ac.uk/18/2/14.html. Accessed May 2017.

  • Tsarkov, D., & Horrocks, I. (2006, August 17–20). FaCT++ description logic reasoner: System description. In U. Furbach & N. Shankar (Eds.), Automated reasoning. Third international joint conference, IJCAR 2006. Proceedings. Lecture notes in computer science 4130 (pp. 292–297), Seattle, WA, USA.

    Google Scholar 

  • Vapnik, V. N., & Chervonenkis, A. Y. (1971). On the uniform convergence of relative frequencies of events to their probabilities. Theory of Probability and its Applications, 16, 264–280.

    Article  MATH  Google Scholar 

  • Watkin, T. L. H., Rau, A., & Biehl, M. (1993). The statistical mechanics of learning a rule. Reviews of Modern Physics, 65(2), 499–555.

    Article  MathSciNet  Google Scholar 

  • Windrum, P., Fagiolo, G., & Moneta, A. (2007) Empirical validation of agent-based models: Alternatives and prospects. Journal of Artificial Societies and Social Simulation 10(2), 8. http://jasss.soc.surrey.ac.uk/10/2/8.html. Accessed May 2017.

  • Winograd, T. (1972). Understanding natural language. Edinburgh: Edinburgh University Press.

    Google Scholar 

  • Wilensky, U. (1999). NetLogo. Center for connected learning and computer-based modeling. Evanston, IL: Northwestern University. http://ccl.northwestern.edu/netlogo. Accessed May 2017

  • Wood, S. N., & Augustin, N. H. (2002). GAMs with integrated model selection using penalized regression splines and applications to environmental modelling. Ecological Modelling, 157(2–3), 157–177.

    Article  Google Scholar 

  • Yang, G., & Feng, J. (2012). Database semantic interoperability based on information flow theory and formal concept analysis. International Journal of Information Technology and Computer Science, 4(7), 33–42.

    Article  Google Scholar 

Download references

Acknowledgements

We acknowledge funding from the Engineering and Physical Sciences Research Council (award no. 91310127), the European Commission Framework Programme 7 ‘GLAMURS’ project (grant agreement no. 613420) and the Scottish Government Rural Affairs, Food and the Environment Strategic Research Programme, Theme 2: Productive and Sustainable Land Management and Rural Economies. We are also grateful to Bruce Edmonds and Mark Brewer for useful comments on earlier drafts of this chapter; any mistakes are of course our own.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gary Polhill .

Editor information

Editors and Affiliations

Appendices

Further Reading

Shalizi’s (2006) book chapter covers approaches to modelling (and measuring) complex systems in a more formal and comprehensive way, with a focus on more traditional mathematical modelling techniques. However, he also covers issues with validation and penalization of parameters, including discussions of VC theory and Ockham’s razor.

Sowa’s (1999) book on knowledge representation is a good introduction to various issues in the field and covers various formalisms and underlying philosophical questions that the formal representation of knowledge yields. Baader et al.’s (2003) Description Logic Handbook goes in to more details on description logics. Another book, which goes into some depth on controversies in the formal representation of what otherwise seems to be a simple everyday concept, ‘if-then’, is Evans and Over’s (2004) book, and this too is highly recommended.

Since one of the ways of validating ontologies is through engaging with stakeholders, the Companion Modelling school of agent-based modelling, pioneered especially by research teams based in France, is well worth familiarizing yourself with. They have a websiteFootnote 6 and a book (Etienne 2014) as well as several publications illustrating their work. Since they sometimes use ontologies as part of their methodological approach to modelling with stakeholders, the work of authors such as Jean-Pierre Müller, Nicolas Becu and Pascal Perez and their collaborators are particularly worth investigating. Some example articles include Müller (2010), Becu et al. (2003) and Perez et al. (2009). Companion modellers are not the only ones to apply knowledge elicitation to model design, however – see, for example, Bharwani et al. (2015).

Validation has long been a subject of discussion in agent-based modelling, and this chapter has not dedicated space to reviewing the excellent thinking that has already been done on the topic. The interested reader wanting to access some of this literature is advised to look for keywords such as validation, calibration and verification in the Journal of Artificial Societies and Social Simulation, currently the principal journal for publication of agent-based social simulation work. Notable recent articles include Schulze et al. (2017), Drchal et al. (2016), ten Broeke et al. (2016) and Lovelace et al. (2015). Other older articles worth a read are Elsenbroich (2012), Radax and Rengs (2010) and Rossiter et al. (2010). See also some of the debates such as Thompson and Derr’s (2009) critique of Epstein’s (2008) article and Troitzsch’s (2009) response and Moss’s (2008) reflections on Windrum et al.’s (2007) paper. A practical article on one approach to validating agent-based models outwith JASSS is Moss and Edmonds (2005).

Appendix 1: Neural Networks

Though there are variants, typically the excitation, x j , of a node j is given by the weighted sum of its inputs (8.2):

$$ {x}_j=\sum_{i\in \mathrm{inputs}}{w}_{ij}{o}_i $$
(8.2)

where o i (usually in the range [0, 1], though some formalisms use [−1, 1]) is the output of a node i with a connection that inputs to node j and w ij is the strength (weight) of that connection.

Nonlinearity of the behaviour of the node is critical to the power that the neural network has as an information processing system. It is introduced by making the output o j of a node a nonlinear function of its excitation x j . There are a number of ways this can be achieved. Since many learning algorithms rely on the differentiability of the output with respect to the weights, the sigmoid function is typically used:

$$ {o}_j=\frac{1}{1+\exp \left(-{x}_j\right)} $$
(8.3)

So, a neural network essentially consists of a directed graph of nodes, where each of the links has a weight. If the graph is acyclic, the neural network is known as a feed-forward network. (If cyclic, the network is recurrent.) Nodes with no input connections are input nodes; those with no output connections are output nodes. Since they have no input connections and hence no excitation, input nodes are often also not given a nonlinear treatment as per (8.3), though this breaks somewhat with the simulation of a neuron. Similarly, nonlinearity may not be applied to output nodes. If there are N input nodes, and M output nodes, then essentially a feed-forward network without nonlinearity on the output nodes is computing a map** from R N to R M. With nonlinearity, the map** is from R N to [0, 1]M.

Appendix 2: Metrics of and Methods for Validation

Table 8.3 explains various metrics and measures of validation, showing you where to find out more information on them and how to use them with R. For those of you unfamiliar with R, it is a popularly usedFootnote 7 free (as in open-source and in the financial sense) statistical software package, available for Windows, OS-X and Linux.Footnote 8 Each of the examples assumes you are validating against a single variable (unless otherwise stated) for which you have a number of samples from your data and corresponding output from your model. The R variable vdata contains the empirical data to validate against (which must not have been used for calibration – though many of the metrics can of course be applied to the calibration process), whilst the variable model contains the corresponding output from the model. The two variables vdata and model are, in R terms, vectors of equal length. If the model predicted the data perfectly, then for each element i of the two vectors, vdata[i] == model[i]. More information on each of the approaches can be found on Wikipedia,Footnote 9 R documentation and in various machine learning and advanced statistical textbooks.

Table 8.3 Measures of validation

Appendix 3: Expressivity of Various Modelling Approaches

Description logics use a letter-based notation to describe the axioms each logic has (Baader and Nutt 2003; Calvanese and De Giacomo 2003; Baader et al. 2003). Briefly, \( \mathcal{AL} \) is a basic description logic, and (𝒟) is for data properties; \( \mathcal{C} \) provides more complex class axioms than the basic axioms in \( \mathcal{AL} \); is for complex relationship assertions such as irreflexivity (all NetLogo links are irreflexive, e.g. as you cannot link anything to itself); \( \mathcal{O} \) introduces nominals (a bit-like enumerations in Java); inverse relationships; \( \mathcal{N} \) numerical restrictions on properties; and functional properties. Table 8.4 provides an initial indication of the description logic expressivity needed to capture the syntax used to specify the ontologies of various modelling approaches. However, the labels applied in the ‘description logic’ column do not necessarily mean that the full capabilities of the language are necessarily used.

Table 8.4 Comparison of expressivity of ontologies of various modelling approaches

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Polhill, G., Salt, D. (2017). The Importance of Ontological Structure: Why Validation by ‘Fit-to-Data’ Is Insufficient. In: Edmonds, B., Meyer, R. (eds) Simulating Social Complexity. Understanding Complex Systems. Springer, Cham. https://doi.org/10.1007/978-3-319-66948-9_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-66948-9_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-66947-2

  • Online ISBN: 978-3-319-66948-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation