Abstract
The development of Probabilistic Indexing (PrIx) was originally driven by the need to search for textual information in large collections of untranscribed text images. The spots that result from the PrIx process are not image transcripts, but they provide very rich probabilistic information about the text rendered in the images and in image regions or locations. This chapter presents approaches that exploit this information to go beyond information search applications. Specifically, we present methods that use the PrIx of an image or an image collection to handle tasks that traditionally require actual textual data, such as electronic text. We cover, in order, basic and advanced text analytics, statistical information extraction, and document image classification by textual content.
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Toselli, A.H., Puigcerver, J., Vidal, E. (2024). Beyond Search Applications of Probabilistic Indexing. In: Probabilistic Indexing for Information Search and Retrieval in Large Collections of Handwritten Text Images. The Information Retrieval Series, vol 49. Springer, Cham. https://doi.org/10.1007/978-3-031-55389-9_9
DOI: https://doi.org/10.1007/978-3-031-55389-9_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-55388-2
Online ISBN: 978-3-031-55389-9
eBook Packages: Computer Science (R0)