Mongolian Word Segmentation Based on Three Character Level Seq2Seq Models

Liu, Na; Su, **angdong; Gao, Guanglai; Bao, Feilong

doi:10.1007/978-3-030-04221-9_50

Na Liu^16,17,18,
**angdong Su^16,17,
Guanglai Gao^16,17 &
…
Feilong Bao^16,17

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11305))

Included in the following conference series:

International Conference on Neural Information Processing

2508 Accesses

Abstract

Mongolian word segmentation is splitting the Mongolian words into roots and suffixes. It plays an important role in Mongolian related natural language processing tasks. To improve performance and avoid the tedious work of rule-making and statistics over large-scale corpus in early methods, this work takes a Seq2Seq framework to realize Mongolian word segmentation. Since each Mongolian word consisted of several sequential characters, we map Mongolian word segmentation to character-level Seq2Seq task, and further propose three different models from three different prospective to achieve the segmentation goal. The three character-level Seq2Seq models are (1) translation model, (2) true and pseudo map** model, (3) binary choice model. The main differences of these three models are the output sequences and the architectures of the RNNs in segmentation. We employ an improved beam search to optimize the second segmentation model and boost the segmentation process. All the models are trained on a limited dataset, and the second model achieved the state-of-the-art accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: EUR 29.95; Price includes VAT (Germany)

eBook: EUR 42.79; Price includes VAT (Germany)

Softcover Book: EUR 53.49; Price includes VAT (Germany)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Mongolian Word Segmentation Based on BiLSTM-CNN-CRF Model

Unsupervised Joint Monolingual Character Alignment and Word Segmentation

Span Labeling Approach for Vietnamese and Chinese Word Segmentation

References

Hankamer, J.: Finite state morphology and left to right phonology. In: Proceedings of the West Coast Conference on Formal Linguistics, vol. 5, pp. 41–52 (1986)
Google Scholar
Na, L., Junyi, W., Gui**, L.: Query expansion based on mongolian semantics. In: Third World Congress on Software Engineering IEEE Computer Society, pp. 25–28. IEEE, Wuhan (2012)
Google Scholar
**g, W., Hou, H., Bao, F., Jiang, Y.: Template-based model for Mongolian - Chinese machine translation. In: Technologies and Applications of Artificial Intelligence, pp. 352–357. IEEE, Tainan (2016)
Google Scholar
Weihua, W., Feilong, B., Guanglai, G.: Mongolian named entity recognition with bidirectional recurrent neural networks. In: IEEE International Conference on TOOLS with Artificial Intelligence, pp. 495–500. IEEE, San Jose (2017)
Google Scholar
Liu, R., Bao, F., Gao, G., Wang, Y.: Mongolian text-to-speech system based on deep neural network. In: Tao, J., Zheng, T.F., Bao, C., Wang, D., Li, Y. (eds.) NCMMSC 2017. CCIS, vol. 807, pp. 99–108. Springer, Singapore (2018). https://doi.org/10.1007/978-981-10-8111-8_10
Chapter Google Scholar
Hongxu, H., Liu, Q., Nasanurtu, M.: Mongolian word segmentation based on statistical language model. Pattern Recognit. Artif. Intell. 22(1), 108–112 (2009)
Google Scholar
Ming, Y., Hongxu, H.: Researching of Mongolian word segmentation system based on dictionary, rules and language model. M.S. Thesis, Inner Mongolia University, Hohhot, Inner Mongolia, China (2011)
Google Scholar
Jianguo, S., Hongxu, H., Bao, F.: Research on Slavic Mongolian word segmentation based on dictionary and rule. J. Chin. Inf. Process. 29(1), 197–202 (2015)
Google Scholar
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Luong, T., Pham, H., Manning, C.D.: Effective approaches to attention-based neural machine translation. In: Conference on Empirical Methods in Natural Language Processing, EMNLP 2015, pp. 1412–1421 (2015)
Google Scholar
Wu, Y., Schuster, M., Chen, Z., Le, Q.V., Norouzi, M.: Google’s neural machine translation system: bridging the gap between human and machine translation. ar**v preprint ar**v:1609.08144 (2016)
Vaswani, A., et al.: Tensor2tensor for neural machine translation. ar**v preprint ar**v:1803.07416 (2018)
Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. Computer Science. ar**v preprint ar**v:1409.0473(2014)
Shiqi, S., Yong, C., Zhongjun, H., Wei, H., et al.: Minimum risk training for neural machine translation. In: ACL 2016: Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 1683–1692. ACL, Berlin (2016)
Google Scholar
Bahdanau, D., Chorowski, J., Serdyuk, D., Brakel, P., Bengio, Y.: End-to-end attention-based large vocabulary speech recognition. Computer Science, pp. 4945–4949 (2016)
Google Scholar
Arık, S.O., Chrzanowski, M., Coates, A., Diamos, G., Gibiansky, A.: Deep voice: realtime neural text-to-speech. In: International Conference on Machine Learning and Computing, ICMLC 2017, pp. 195–2049. ACM, Singapore (2017)
Google Scholar
Asri, L.E., He, J., Suleman, K.: A sequence-to-sequence model for user simulation in spoken dialogue systems. In: Conference of the International Speech Communication Association, Interspeech, pp. 1151–1155. IEEE, San Francisco (2016)
Google Scholar
Nallapati, R., Zhou, B., Santos, C., Gulcehre, C., **ang, B.: Abstractive text summarization using sequence-to-sequence RNNs and beyond. In: Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, CoNLL 2016, pp. 280–290. ACL, Berlin (2016)
Google Scholar
See, A., Liu, P.J., Manning, C.D.: Get to the point: summarization with pointer-generator networks. In: ACL 2017: Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 1073–1083. ACL, Vancouver (2017)
Google Scholar
Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Advances in Neural Information Processing Systems 27 (NIPS 2014), vol. 4, pp. 3104–3112. MIT Press Cambridge, Montréal (2014)
Google Scholar
Hakkani-Tür, D., Tur, G., Celikyilmaz, A., Chen, Y.N., Gao, J., Deng, L.: Multi-domain joint semantic frame parsing using bi-directional RNN-LSTM. In: Conference of the International Speech Communication Association, Interspeech, pp. 715–719. IEEE, San Francisco (2016)
Google Scholar
Mnih, V., Heess, N., Graves, A., Kavukcuoglu, K.: Recurrent models of visual attention. In: Advances in Neural Information Processing Systems 27 (NIPS 2014), vol. 3, pp. 2204–2212. MIT Press Cambridge, Montréal (2014)
Google Scholar
Neubig, G.: Neural machine translation and sequence-to-sequence models: a tutorial, pp. 41–43. ar**v preprint ar**v: 1703.01619v1 (2017)
Google Scholar

Download references

Acknowledgements

This work was funded by National Natural Science Foundation of China (Grant No. 61762069), Natural Science Foundation of Inner Mongolia Autonomous Region (Grant No. 2017BS0601), Research program of science and technology at Universities of Inner Mongolia Autonomous Region (Grant No. NJZY18237).

Author information

Authors and Affiliations

College of Computer Science, Inner Mongolia University, Hohhot, China
Na Liu, **angdong Su, Guanglai Gao & Feilong Bao
Inner Mongolia Key Laboratory of Mongolian Information Processing Technology, Hohhot, China
Na Liu, **angdong Su, Guanglai Gao & Feilong Bao
Department of Science, Hetao University, Bayannur, China
Na Liu

Authors

Na Liu
View author publications
You can also search for this author in PubMed Google Scholar
**angdong Su
View author publications
You can also search for this author in PubMed Google Scholar
Guanglai Gao
View author publications
You can also search for this author in PubMed Google Scholar
Feilong Bao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to **angdong Su .

Editor information

Editors and Affiliations

The Chinese Academy of Sciences, Bei**g, China
Long Cheng
City University of Hong Kong, Kowloon, Hong Kong
Andrew Chi Sing Leung
Kobe University, Kobe, Japan
Seiichi Ozawa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, N., Su, X., Gao, G., Bao, F. (2018). Mongolian Word Segmentation Based on Three Character Level Seq2Seq Models. In: Cheng, L., Leung, A., Ozawa, S. (eds) Neural Information Processing. ICONIP 2018. Lecture Notes in Computer Science(), vol 11305. Springer, Cham. https://doi.org/10.1007/978-3-030-04221-9_50

Download citation

DOI: https://doi.org/10.1007/978-3-030-04221-9_50
Published: 17 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04220-2
Online ISBN: 978-3-030-04221-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Mongolian Word Segmentation Based on Three Character Level Seq2Seq Models

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Mongolian Word Segmentation Based on BiLSTM-CNN-CRF Model

Unsupervised Joint Monolingual Character Alignment and Word Segmentation

Span Labeling Approach for Vietnamese and Chinese Word Segmentation

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Mongolian Word Segmentation Based on Three Character Level Seq2Seq Models

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Mongolian Word Segmentation Based on BiLSTM-CNN-CRF Model

Unsupervised Joint Monolingual Character Alignment and Word Segmentation

Span Labeling Approach for Vietnamese and Chinese Word Segmentation

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation