Abstract
An increasing number and size of datasets abiding by the Linked Data paradigm are published everyday. Discovering links between these datasets is thus central to achieve the vision behind the Data Web. Declarative Link Discovery (LD) frameworks rely on complex Link Specification (LS) to express the conditions under which two resources should be linked. Understanding such LS is not a trivial task for non-expert users, particularly when such users are interested in generating LS to match their needs. Even if the user applies a machine learning algorithm for the automatic generation of the required LS, the challenge of explaining the resultant LS persists. Hence, providing explainable LS is the key challenge to enable users who are unfamiliar with underlying LS technologies to use them effectively and efficiently. In this paper, we address this problem by proposing a generic approach that allows a LS to be verbalized, i.e., converted into understandable natural language. We propose a summarization approach to the verbalized LS based on the selectivity of the underlying LS. Our adequacy and fluency evaluations show that our approach can generate complete and easily understandable natural language descriptions even by lay users.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
For a complete description of the vocabulary, see http://nlp.stanford.edu/software/dependencies_manual.pdf.
- 2.
The survey interface can be accessed at https://umfragen.uni-paderborn.de/index.php/186916?lang=en.
- 3.
- 4.
- 5.
- 6.
References
Assessing sentence scoring techniques for extractive text summarization. Expert Systems with Applications (2013)
Allahyari, M., et al.: Text summarization techniques: a brief survey. CoRR (2017)
Carvalho, M.G., Laender, A.H.F., Gonçalves, M.A., da Silva, A.S.: Replica Identification Using Genetic Programming. ACM, New York (2008)
Dalianis, H., Hovy, E.: Aggregation in natural language generation. In: Adorni, G., Zock, M. (eds.) EWNLG 1993. LNCS, vol. 1036, pp. 88–105. Springer, Heidelberg (1996). https://doi.org/10.1007/3-540-60800-1_25
Doddington, G.: Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In: Proceedings of HLT, pp. 138–145 (2002)
Fuchs, N.E.: First-order reasoning for attempto controlled english (2010)
Gerber, D., Ngomo, A.-C.N.: Extracting multilingual natural-language patterns for rdf predicates. In: EKAW, pp. 87–96 (2012)
Gerber, D., Ngonga Ngomo, A.-C.: Bootstrap** the linked data web. In: 1st Workshop on Web Scale Knowledge Extraction @ ISWC 2011 (2011)
Isele, R., Jentzsch, A., Bizer, C.: Efficient multidimensional blocking for link discovery without losing Recall. In: WebDB (2011)
Köpcke, H., Thor, A., Rahm, E.: Comparative evaluation of entity resolution approaches with fever. Proc. VLDB Endow. 2(2), 1574–1577 (2009)
Lehmann, J., Gerber, D., Morsey, M., Ngonga Ngomo, A.-C.: Defacto - deep fact validation. In: ISWC (2012)
Mellish, C., Sun, X.: The semantic web as a linguistic resource: opportunities for natural language generation (2006)
Ngomo, A.-C.N., Auer, S.: Limes - a time-efficient approach for large-scale link discovery on the web of data. In: IJCAI (2011)
Ngonga Ngomo, A.-C., Lehmann, J., Auer, S., Höffner, K.: Raven - active learning of link specifications. In: Proceedings of OM@ISWC (2011)
Ngonga Ngomo, A.-C., Lyko, K.: EAGLE: efficient active learning of link specifications using genetic programming. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 149–163. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-30284-8_17
Piccinini, H., Casanova, M.A., Furtado, A.L., Nunes, B.P.: Verbalization of rdf triples with applications. In: ISWC - Outrageous Ideas Track (2011)
Pohl, A.: The polish interface for linked open data. In: Proceedings of the ISWC 2010 Posters & Demonstrations Track, pp. 165–168 (2011)
Reiter, E., Dale, R.: Building Natural Language Generation Systems. Cambridge University Press, New York (2000)
Sherif, M.A., Ngonga Ngomo, A.-C., Lehmann, J.: Wombat – a generalization approach for automatic link discovery. In: Blomqvist, E., Maynard, D., Gangemi, A., Hoekstra, R., Hitzler, P., Hartig, O. (eds.) ESWC 2017. LNCS, vol. 10249, pp. 103–119. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-58068-5_7
Sun, X., Mellish, C.: An experiment on “free generation” from single rdf triples. In: Association for Computational Linguistics (2007)
Vougiouklis, P., et al.: Neural wikipedian: generating textual summaries from knowledge base triples. J. Web Semant. 52–53, 1–15 (2018)
Wilcock, G., Jokinen, K.: Generating Responses and Explanations from RDF/XML and DAML+OIL (2003)
Acknowledgement
This work has been supported by the BMVI projects LIMBO (GA no. 19F2029C) and OPAL( no. 19F2028A), Eurostars Project SAGE (GA no. E!10882) as well as the H2020 projects SLIPO (GA no.731581).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Ahmed, A.F., Sherif, M.A., Ngomo, AC.N. (2019). LSVS: Link Specification Verbalization and Summarization. In: Métais, E., Meziane, F., Vadera, S., Sugumaran, V., Saraee, M. (eds) Natural Language Processing and Information Systems. NLDB 2019. Lecture Notes in Computer Science(), vol 11608. Springer, Cham. https://doi.org/10.1007/978-3-030-23281-8_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-23281-8_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-23280-1
Online ISBN: 978-3-030-23281-8
eBook Packages: Computer ScienceComputer Science (R0)