Text Generation and Enhanced Evaluation of Metric for Machine Translation

  • Conference paper
  • First Online:
Data Intelligence and Cognitive Informatics

Part of the book series: Algorithms for Intelligent Systems (AIS)

Abstract

This work demonstrates the use of a recurrent neural network (RNN) for generating grammatically correct new text from given input text and for translating the generated text into Hindi, scored with a modified bilingual evaluation understudy (BLEU) metric. Our system aims to generate grammatically correct new text from the given input sentences or paragraphs and to translate the generated text into Hindi with a high translation score. To ensure grammatically correct sentences, the natural language toolkit (NLTK) is used for grammar correction at the end of the text generation stage. Because a plain RNN is not very effective for text generation, an RNN with gated connections is used for this purpose. The generated text is then passed to the machine translation (MT) module. Human evaluation of MT output is time-consuming, and results differ from one evaluator to another; hence, an automatic assessment of the translation system is needed. The standard BLEU metric does not account for synonyms: a synonym is treated as a separate word. A modified BLEU (M-BLEU) has therefore been developed as the evaluation metric; it adds features such as synonym replacement and a shallow parsing module. The final translation score is reported on the BLEU scale. The system thus produces two outputs: the generated text (English) and the translated text (Hindi) with an improved translation score.
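The abstract names synonym replacement as the key extension in M-BLEU but this page carries no implementation. The following Python sketch illustrates only that one idea, under explicit assumptions: WordNet (via NLTK) is assumed as the synonym source, the function names (synonyms, align_synonyms, m_bleu) are hypothetical, and the shallow parsing module mentioned in the abstract is omitted.

```python
# Minimal sketch of the synonym-replacement idea behind M-BLEU, assuming
# WordNet as the synonym source. Not the authors' implementation; the
# shallow parsing module described in the abstract is not shown here.
import nltk
from nltk.corpus import wordnet
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

nltk.download("wordnet", quiet=True)  # fetch the WordNet corpus once


def synonyms(word):
    """All WordNet lemma names that share a synset with `word`."""
    names = set()
    for syn in wordnet.synsets(word):
        for lemma in syn.lemma_names():
            names.add(lemma.lower().replace("_", " "))
    return names


def align_synonyms(candidate, reference):
    """Rewrite candidate tokens to the matching reference token when the
    two are WordNet synonyms, so BLEU no longer treats a synonym as a
    completely different word."""
    ref_tokens = set(reference)
    aligned = []
    for tok in candidate:
        if tok in ref_tokens:
            aligned.append(tok)
            continue
        match = next((r for r in reference if r in synonyms(tok)), None)
        aligned.append(match if match is not None else tok)
    return aligned


def m_bleu(candidate, reference):
    """Sentence-level BLEU computed after synonym alignment (smoothed to
    avoid zero scores on short sentences)."""
    smooth = SmoothingFunction().method1
    aligned = align_synonyms(candidate, reference)
    return sentence_bleu([reference], aligned, smoothing_function=smooth)


if __name__ == "__main__":
    reference = "this is a large house".split()
    candidate = "this is a big house".split()  # 'big' is a WordNet synonym of 'large'
    smooth = SmoothingFunction().method1
    print("BLEU  :", sentence_bleu([reference], candidate, smoothing_function=smooth))
    print("M-BLEU:", m_bleu(candidate, reference))
```

On this toy pair, the synonym-aligned hypothesis matches the reference exactly, so the M-BLEU score exceeds the plain BLEU score, which is the behaviour the abstract attributes to the synonym-replacement feature.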



Author information

Corresponding author

Correspondence to Sujit S. Amin.


Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Amin, S.S., Ragha, L. (2021). Text Generation and Enhanced Evaluation of Metric for Machine Translation. In: Jeena Jacob, I., Kolandapalayam Shanmugam, S., Piramuthu, S., Falkowski-Gilski, P. (eds) Data Intelligence and Cognitive Informatics. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-15-8530-2_1
