Are Morphosyntactic Taggers Suitable to Improve Automatic Transcription?

Huet, Stéphane; Gravier, Guillaume; Sébillot, Pascale

doi:10.1007/11846406_49

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4188)

Included in the following conference series:

International Conference on Text, Speech and Dialogue

Abstract

This paper studies the value of part-of-speech (POS) tagging for improving speech recognition. We first evaluate the proportion of misrecognized words that can be corrected using POS information; the analysis of a short extract of French radio broadcast news shows that an absolute decrease in word error rate of 1.1% can be expected. We also demonstrate quantitatively that traditional POS taggers are reliable when applied to spoken corpora, including automatic transcriptions. This new result enables us to use POS tag knowledge effectively in a postprocessing stage to improve the quality of transcriptions, especially by correcting agreement errors.
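To make the notion of an absolute decrease in word error rate concrete, here is a minimal sketch of how such a figure is usually computed: word-level edit distance divided by the reference length, compared before and after a correction step. The function name `word_error_rate` and the French example sentences are assumptions made for illustration; they are not taken from the paper or its corpus, and the toy numbers bear no relation to the 1.1% reported above.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


# Invented example: two agreement errors (missing plural marks) that a
# POS-aware postprocessing step might repair.
reference = "les ministres sont arrivés hier"
hypothesis = "les ministre sont arrivé hier"
corrected = "les ministres sont arrivés hier"

wer_before = word_error_rate(reference, hypothesis)
wer_after = word_error_rate(reference, corrected)
absolute_decrease = wer_before - wer_after
print(f"WER before: {wer_before:.1%}, after: {wer_after:.1%}, "
      f"absolute decrease: {absolute_decrease:.1%}")
```

An absolute decrease is the plain difference between the two rates, as opposed to a relative decrease, which would divide that difference by the initial rate.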

Author information

Authors and Affiliations

IRISA, Campus de Beaulieu, F-35042 Cedex, Rennes, France
Stéphane Huet, Guillaume Gravier & Pascale Sébillot

Editor information

Editors and Affiliations

Faculty of Informatics, Masaryk University, Brno, Czech Republic
Petr Sojka
Faculty of Informatics, Masaryk University, Botanická 68a, CZ-602 00, Brno, Czech Republic
Ivan Kopeček
Faculty of Informatics, Department of Computer Graphics and Design, Masaryk University, Botanická 68a, 60200, Brno, Czech Republic
Karel Pala

About this paper

Cite this paper

Huet, S., Gravier, G., Sébillot, P. (2006). Are Morphosyntactic Taggers Suitable to Improve Automatic Transcription? In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2006. Lecture Notes in Computer Science, vol 4188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846406_49

DOI: https://doi.org/10.1007/11846406_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-39090-9
Online ISBN: 978-3-540-39091-6
eBook Packages: Computer Science, Computer Science (R0)
