On the impact of text similarity functions on hashtag recommendations in microblogging environments

  • Original Article
Social Network Analysis and Mining


Microblogging applications such as Twitter are experiencing tremendous success. Microblog users use hashtags to categorize the messages they post, with the aim of bringing order to the myriad of microblog messages. However, only a small percentage of messages incorporate hashtags, and the hashtags that are used are very heterogeneous, since they may be chosen freely and may consist of any combination of characters. This heterogeneity and the sparse use of hashtags significantly impair search functionality, because messages are not categorized in a homogeneous way. In this paper, we present an approach for recommending hashtags suitable for the message the user is currently entering, with the aim of creating a more homogeneous set of hashtags. Furthermore, we present a detailed study of how the similarity measures used to compute recommendations influence the final set of recommended hashtags.
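
To make the idea of similarity-based hashtag recommendation concrete, the following is a minimal sketch, not the method of the paper. The abstract does not state which similarity measures or ranking scheme are used, so everything here is an assumption for illustration: cosine similarity over raw term counts, a candidate set drawn from the `n` most similar previously posted messages, and hashtags ranked by summed similarity. The names `tokenize`, `hashtags`, `cosine`, and `recommend` are hypothetical helpers.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercased word tokens; hashtags are excluded so they do not affect similarity."""
    return [t for t in re.findall(r"#?\w+", text.lower()) if not t.startswith("#")]


def hashtags(text):
    """All hashtags in a message, lowercased."""
    return re.findall(r"#\w+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-count vectors (assumed measure)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def recommend(message, corpus, k=3, n=5):
    """Recommend up to k hashtags taken from the n messages most similar to `message`."""
    query = Counter(tokenize(message))
    scored = [(cosine(query, Counter(tokenize(m))), m) for m in corpus]
    top = sorted(scored, key=lambda pair: pair[0], reverse=True)[:n]

    # Assumed ranking: each hashtag accumulates the similarity of the messages it occurs in.
    votes = Counter()
    for score, m in top:
        if score > 0:
            for tag in set(hashtags(m)):
                votes[tag] += score
    return [tag for tag, _ in votes.most_common(k)]


corpus = [
    "watching the football match tonight #football",
    "great football game #football #sports",
    "new phone released today #tech",
]
print(recommend("football match tonight", corpus, k=2))  # ['#football', '#sports']
```

Replacing `cosine` with another measure changes which messages count as similar and therefore which hashtags are recommended, which is the kind of effect the study examines.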



