Real-Time Audio Classification to Determine the Article of a German Noun

Zaiss, Dena; Gilani, Iman Baghernejad Monavar; Tutsch, Dietmar

doi:10.1007/978-3-031-32700-1_12

Dena Zaiss¹¹,
Iman Baghernejad Monavar Gilani¹¹ &
Dietmar Tutsch¹¹

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 674))

Included in the following conference series:

Real-Time Meeting of the Gesellschaft für Informatik

146 Accesses

Abstract

The determination of an article for gender assignment to a noun is different in every language and depends on the certain rules as well as the context of a word in a text. The rules given in the German language help with the article assignment, but are not always consistent and meaningful for all words. In the human brain, this assignment can take place intuitively and in real-time, if it is trained over many years. This work is concerned with verifying whether a convolutional neural network (CNN) can match an article to a word the same way as the human brain, using methods of audio classification in deep learning. The chosen attribute to train the model is the sound of the word in this work, because the sound plays a big role in the article determination.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Accent and Gender Recognition from English Language Speech and Audio Using Signal Processing and Deep Learning

Towards modeling raw speech in gender identification of children using sincNet over ERB scale

Article 08 September 2023

Age group classification and gender recognition from speech with temporal convolutional neural networks

Article Open access 13 January 2022

References

Genus bei Fremdwörtern-Variantengrammatik des Standarddeutschen, http://mediawiki.ids-mannheim.de/VarGra/index.php/Genus_bei_ Fremdwörtern, Accessed 31 May 2022
Doshi, K.: Audio deep learning made simple: sound classification, step-by-step by Ketan Doshi towards data science. https://towardsdatascience.com/audio-deep-learning-made-simple-sound-classification-step-by-step-cebc936bbe5. Accessed 15 May 2022
Yang, Y. -Y., et al.: Torchaudio: building blocks for audio and speech processing. In: ICASSP 2022–2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6982–6986 (2022). https://doi.org/10.1109/ICASSP43922.2022.9747236
McFee, B., et al.: librosa: audio and music signal analysis in python. In: Proceedings of the 14th python in Science Conference, pp. 18–25 (2015). https://doi.org/10.5281/zenodo.4792298
[at] REDAKTION. Convolutional Neural Networks [at] Blog. https://www.alexanderthamm.com/de/blog/convolutional-neural-networks-am-beispiel-der-revolution-der-computervision/. Accessed 31 May 2022
Allibhai, E.: Hold-out vs. cross-validation in machine learning—by Eijaz Allibhai—medium, https://medium.com/@eijaz/holdout-vs-cross-validation-in-machine-learning-7637112d3f8f. Accessed 10 Apr 2022
How to implement early stop** in PyTorch - Quora, https://www.quora.com/How-can-I-implement-early-stop**-in-PyTorch. Accessed 15 Mar 2022
Logan, B.: Mel Frequency Ceptral Coefficient for Music Modeling. Cambridge Research Laboratory, Cambridge (2000)
Google Scholar
Brownlee, J.: Gentle Introduction to the Adam Optimization Algorithm for Deep Learning. Machine Learning Mastery, San Juan, Blog (2021)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Wuppertal, Rainer -Gruenter -Str. 21, 42119, Wuppertal, Germany
Dena Zaiss, Iman Baghernejad Monavar Gilani & Dietmar Tutsch

Authors

Dena Zaiss
View author publications
You can also search for this author in PubMed Google Scholar
Iman Baghernejad Monavar Gilani
View author publications
You can also search for this author in PubMed Google Scholar
Dietmar Tutsch
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dena Zaiss .

Editor information

Editors and Affiliations

Lehrgebiet Kommunikationsnetze, FernUniversität in Hagen, Hagen, Germany
Herwig Unger
Lehrgebiet Kommunikationsnetze, FernUniversität in Hagen, Hagen, Germany
Marcel Schaible

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zaiss, D., Gilani, I.B.M., Tutsch, D. (2023). Real-Time Audio Classification to Determine the Article of a German Noun. In: Unger, H., Schaible, M. (eds) Real-time and Autonomous Systems 2022. Real-Time 2022. Lecture Notes in Networks and Systems, vol 674. Springer, Cham. https://doi.org/10.1007/978-3-031-32700-1_12

Download citation

DOI: https://doi.org/10.1007/978-3-031-32700-1_12
Published: 17 May 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-32699-8
Online ISBN: 978-3-031-32700-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Real-Time Audio Classification to Determine the Article of a German Noun

Abstract

Access this chapter

Similar content being viewed by others

Accent and Gender Recognition from English Language Speech and Audio Using Signal Processing and Deep Learning

Towards modeling raw speech in gender identification of children using sincNet over ERB scale

Age group classification and gender recognition from speech with temporal convolutional neural networks

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Real-Time Audio Classification to Determine the Article of a German Noun

Abstract

Access this chapter

Similar content being viewed by others

Accent and Gender Recognition from English Language Speech and Audio Using Signal Processing and Deep Learning

Towards modeling raw speech in gender identification of children using sincNet over ERB scale

Age group classification and gender recognition from speech with temporal convolutional neural networks

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation