Semantic Hashtag Relation Classification Using Co-occurrence Word Information

Wireless Personal Communications

Abstract

Users of social networking services (SNS) often express their thoughts and feelings with simple hashtags. A hashtag is related to the other hashtags and images that are used together with it in the user's other posts, so understanding the meaning of personal hashtags can be a way to learn the latent semantics of a person's own vocabulary. Existing methods for learning and analyzing semantics, such as Latent Semantic Analysis, Latent Dirichlet Allocation, and word embedding, need a large-scale corpus to build an elaborate model. A large-scale corpus usually consists of words that many people already use, so these methods capture the latent meaning of words in general use. They have difficulty, however, finding the personal meaning of words used by a particular person, for two reasons. First, the number of words a single person uses is usually very small compared with a large-scale corpus. Second, existing methods rely on occurrence frequency or co-occurrence probability, so the importance, frequency, or probability of a personalized meaning may vanish because of this large difference in the number of words. In this research, we focus on classifying words semantically using a user's hashtag data and the co-occurrence of these hashtags. In our evaluation, the approach improves on previous work by 18% in Precision and by more than 70% in Recall.
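To make the idea of per-user hashtag co-occurrence concrete, the following is a minimal Python sketch that counts how often pairs of hashtags appear together in one user's posts. It is only an illustration under stated assumptions: the function name `hashtag_cooccurrence`, the input format (one list of hashtags per post), and the sample posts are all hypothetical, and this preview does not describe the classification method used in the article.

```python
from collections import Counter
from itertools import combinations


def hashtag_cooccurrence(posts):
    """Count how often each unordered hashtag pair appears in the same post.

    `posts` is assumed to be a list of posts by ONE user, each post given
    as a list of hashtag strings (a hypothetical input format).
    """
    pair_counts = Counter()
    for tags in posts:
        # Deduplicate and sort so each pair has one canonical ordering.
        unique_tags = sorted(set(tags))
        for first, second in combinations(unique_tags, 2):
            pair_counts[(first, second)] += 1
    return pair_counts


# Hypothetical posts from a single user (not data from the article).
posts = [
    ["#coffee", "#monday", "#work"],
    ["#coffee", "#work"],
    ["#beach", "#holiday"],
]
counts = hashtag_cooccurrence(posts)
```

Because the counts come from a single user's posts rather than a large shared corpus, a pairing that is rare in general use can still stand out for that user, which is the motivation the abstract gives for working with personal hashtag data.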

Acknowledgements

This work was supported in part by the National Research Foundation of Korea under Grant Number 2014R1A1A2059527.

Author information

Corresponding authors

Correspondence to Jong-Kook Kim or Joongheon Kim.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Seo, S., Kim, JK., Kim, SI. et al. Semantic Hashtag Relation Classification Using Co-occurrence Word Information. Wireless Pers Commun 107, 1355–1365 (2019). https://doi.org/10.1007/s11277-018-5745-y

  • DOI: https://doi.org/10.1007/s11277-018-5745-y
