Chinese Lexical Semantics
20th Workshop, CLSW 2019, Bei**g, China, June 28–30, 2019, Revised Selected Papers
Article
With the arrival of the big data era, the amount of micro-blog users and texts is constantly increasing, and research on personalized recommendation algorithm for micro-blog texts is becoming more and more urg...
Book and Conference Proceedings
20th Workshop, CLSW 2019, Bei**g, China, June 28–30, 2019, Revised Selected Papers
Chapter and Conference Paper
Embedding is widely used in most natural language processing. e.g., neural machine translation, text classification, text abstraction and sentiment analysis etc. Word-based embedding is faster and character-b...
Chapter and Conference Paper
Text error correction is an essential part of text proofreading. This paper presents a method for generating text error correction suggestion based on SoundShape Code. By converting the target words into Soun...
Chapter and Conference Paper
With the widespread use of social media, social networks have become an important information carrier and platform for users to explore the world. Social networks not only reflect the hot events in society bu...
Chapter and Conference Paper
In the information age, the network technology continues to develop. As an emerging social media, Sina Weibo has a huge user base. Every day, hundreds of millions of users express their opinions on hot events...
Chapter and Conference Paper
Using the named entity’s type tag to construct a unique vector for a class of named entities can solve the problem that named entities are too scattered in the semantic space. In the relation extraction task, ...
Chapter and Conference Paper
Anaphora resolution plays an important role in Chinese micro-blog information mining. Based on the linguistic features of personal pronouns in Chinese micro-blog texts, this paper proposes a multi-strategy me...
Chapter and Conference Paper
Topic models are important tools for mining the potential topics of text. However, the existing topic model is mostly derived from latent Dirichlet allocation (LDA), which requires the number of topics to be s...
Chapter and Conference Paper
The knowledge of Chinese semantic collocation plays an important role in Chinese semantic understanding. Based on the investigation of the existing semantic collocation knowledge base, this article proposes a...
Chapter and Conference Paper
As an integral part of deep learning, attention mechanism and bi-directional long short-term memory (Bi-LSTM) are widely used in the field of NLP (natural language processing) and their effectiveness has been ...
Chapter and Conference Paper
The correct definition and recognition of sentences is the basis of NLP. For the characteristics of Chinese text structure, the theory of NT clause was proposed from the perspective of micro topics. Based on ...
Chapter and Conference Paper
The classification of semantic relations between words is an important part of semantic analysis in natural language research. The automatic achievement of this classification is of significance to constructi...
Chapter and Conference Paper
In text proofreading area, the error detection and error correction are reversible process to each other. In this paper, considering them in the same angle, we put forward an idea of “scattered string concentr...
Chapter and Conference Paper
Feature dimension reduction is an important step in text categorization, but traditional feature dimension reduction method ignores semantic information of features. In order to solve this problem, this paper,...
Chapter and Conference Paper
Due to specific semantic constraints between quantifiers and nouns and the frequent phenomena of quantifier-noun collocation error in real text, this paper proposes a new model of extracting quantifier-noun co...
Chapter and Conference Paper
Traditional Chinese text automatic proofreading technology is mainly focused on one or more pre-set error types, the proofreading performance in a real scene needs to be improved. This paper proposes a real sc...
Chapter and Conference Paper
Automatic error-detecting for Chinese text is one of the important issues in the field of Chinese Information Processing. The current study of text error-detecting focuses on words-level and syntactic-level, b...
Chapter and Conference Paper
This paper presents the construction of a Chinese word sense-tagged corpus. The resulting lexical resource includes mainly three components: 1) a corpus annotated with word senses; 2) a lexicon containing sens...