Abstract
This paper examines the benefits of both the Rhetorical Representation and Vector Representation for Arabic text summarization. The Rhetorical Representation uses the Rhetorical Structure Theory (RST) for building the Rhetorical Structure Tree (RS-Tree) and extracts the most significant paragraphs as a summary. On the other hand, the Vector Representation uses a cosine similarity measure for ranking and extracting the most significant paragraphs as a summary. The framework evaluates both summaries using precision. Statistical results show that Rhetorical Representation is superior to Vector Representation. Moreover, the rhetorical summary keeps the text in context, without leading to lack of cohesion in which the anaphoric reference is not broken i.e. improving the ability of extracting the semantics behind the text.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Hammo, B.H., Abu-Salem, H., Martha, E.W.: A Hybrid Arabic Text Summarization Technique Based on Text Structure and Topic Identification. Int. J. Comput. Proc. Oriental Lang. (2011)
Alsanie, W., Touir, A., Mathkour, H.: Towards an infrastructure for Arabic text summarization using rhetorical structure theory. M.Sc. Thesis, King Saud University, Riyadh, Saudi Arabia (2005)
Ibrahim, A., Elghazaly, T.: Arabic text summarization using Rhetorical Structure Theory. In: 8th International Conference on Informatics and Systems (INFOS), pp. NLP-34–NLP-38 (2012)
Ibrahim, A., Elghazaly, T.: Rhetorical Representation for Arabic Text. In: ISSR Annual Conference the 46th Annual Conference on Statistics, Computer Science, and Operations Research (2011)
Abd-Elfattah, M., Fuji, R.: Automatic text summarization. In: Proceeding of World Academy of Science, Engineering and Technology, Cairo, Egypt, pp. 192–195 (2008)
Manning, C., Raghavan, P., Schütze, H.: An Introduction to Information Retrieval, p. 181. Cambridge University Press (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ibrahim, A., Elghazaly, T. (2013). Rhetorical Representation and Vector Representation in Summarizing Arabic Text. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds) Natural Language Processing and Information Systems. NLDB 2013. Lecture Notes in Computer Science, vol 7934. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38824-8_53
Download citation
DOI: https://doi.org/10.1007/978-3-642-38824-8_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38823-1
Online ISBN: 978-3-642-38824-8
eBook Packages: Computer ScienceComputer Science (R0)