Abstract
Electronic Health Records (EHR) and the constant adoption of Information Technologies in healthcare have dramatically increased the amount of unstructured data stored. The extraction of key information from this data will bring better caregivers decisions and an improvement in patients’ treatments. With more than 495 million people talking Spanish, the need to adapt algorithms and technologies used in EHR knowledge extraction in English speaking countries, leads to the development of different frameworks. Thus, we present TIDA, a Spanish EHR semantic search engine, to give support to Spanish speaking medical centers and hospitals to convert pure raw data into information understandable for cognitive systems. This paper presents the results of TIDA’s Spanish EHR free-text treatment component with the adaptation of negation and context detection algorithms applied in a semantic search engine with a database with more than 30,000 clinical notes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Savova, G.K., et al.: Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications. Journal of the American Medical Informatics Association 17(5), 507–513 (2010)
Bodenreider, O.: The unified medical language system (UMLS): integrating biomedical terminology. Nucleic Acids Research 32(suppl. 1), D267–D270 (2004)
Watson, I.B.M.: (December 13, 2013), http://www.ibm.com/watson
Cervantes Institute. El espaol: una lengua viva. Informe (2012), http://cvc.cervantes.es/lengua/anuario/anuario_12/i_cervantes/p01.htm (January 8, 2014)
Logica and Nordic Healthcare group. Results from a survey conducted by Logica and Nordic Healthcare Group (January 2012), http://www.logica-group.com/we-are-logica/media-centre/thought-pieces/2012/market-study-of-electronic-medical-record-emr-systems-in-europe/~/media/Global%20site/Media%20Centre%20Items/Thought%20pieces/2012/WPPSEMRJLv16LR.ashx (January 7, 2014)
Hospitales, H.M.: Estadsticas y Resultados Sanitarios (2012), http://www.hmhospitales.com/grupohm/Estadisticas/Paginas/Estadisticas-Generales.aspx (January 7, 2014)
Ferrucci, D., Lally, A.: UIMA: an architectural approach to unstructured information processing in the corporate research environment. Natural Language Engineering 10(3-4), 327–348 (2004)
Apache OpenNLP, http://opennlp.apache.org (November 21, 2013)
Chapman, W.W., et al.: A simple algorithm for identifying negated findings and diseases in discharge summaries. Journal of Biomedical Informatics 34(5), 301–310 (2001)
Marcus, M.P., Marcinkiewicz, M.A., Santorini, B.: Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics 19(2), 313–330 (1993)
Kim, J.-D., et al.: GENIA corpusa semantically annotated corpus for bio-textmining. Bioinformatics 19(suppl. 1), i180–i182 (2003)
ILSP. Hellenic National Corpus (January 9, 2014), http://hnc.ilsp.gr/en/
Harkema, H., et al.: ConText: An algorithm for determining negation, experiencer, and temporal status from clinical reports. Journal of Biomedical Informatics 42(5), 839–851 (2009)
Taul, M., Mart, M.A., Recasens, M.: AnCora: Multilevel Annotated Corpora for Catalan and Spanish. In: LREC (2008)
Lucene, A.: (November 21, 2013), https://lucene.apache.org/core/
Solr, A.: (November 21, 2013), http://lucene.apache.org/solr/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Costumero, R., Gonzalo, C., Menasalvas, E. (2014). TIDA: A Spanish EHR Semantic Search Engine. In: Saez-Rodriguez, J., Rocha, M., Fdez-Riverola, F., De Paz Santana, J. (eds) 8th International Conference on Practical Applications of Computational Biology & Bioinformatics (PACBB 2014). Advances in Intelligent Systems and Computing, vol 294. Springer, Cham. https://doi.org/10.1007/978-3-319-07581-5_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-07581-5_28
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07580-8
Online ISBN: 978-3-319-07581-5
eBook Packages: EngineeringEngineering (R0)