Abstract
Social networks are the most successful Web 2.0 applications, where users share and create over 2.5 quintillion bytes of data daily. This data can be exploited to retrieve many kinds of information which will be used in several applications. In fact, social networks have attracted considerable attention from researchers in different domains. This paper serves as an introduction to social network data analysis. In this work we present the recent and representative works in social network data analysis in an analytical fashion. We also highlight most important applications and used methods in the context of structural data analysis. Then, we list the major tasks and approaches proposed to analyse added-content in social media.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
A measure to capture the vague notion of importance in a graph, used to identify the most significant vertices.
- 2.
- 3.
- 4.
- 5.
References
Aci, M., İnan, C., Avci, M.: A hybrid classification method of k nearest neighbor, Bayesian methods and genetic algorithm. Expert Syst. Appl. 37(7), 5061–5067 (2010)
Aggarwal, C.C.: Social Network Data Analytics, 1st edn. Springer, Heidelberg (2011). https://doi.org/10.1007/978-1-4419-8462-3
Aggarwal, C.C., Han, J., Wang, J., Yu, P.S.: A framework for clustering evolving data streams. In: Proceedings of the 29th International Conference on Very Large Data Bases. VLDB 2003, vol. 29, pp. 81–92. VLDB Endowment, Germany (2003)
Ames, M., Naaman, M.: Why we tag: motivations for annotation in mobile and online media. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 971–980. ACM, New York (2007)
Anderson, A., Huttenlocher, D., Kleinberg, J., Leskovec, J.: Effects of user similarity in social media. In: Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, WSDM 2012, pp. 703–712. ACM, New York (2012)
Aouay, S., Jamoussi, S., Gargouri, F.: Feature based link prediction. In: 11th IEEE/ACS International Conference on Computer Systems and Applications, Qatar, AICCSA, pp. 523–527 (2014)
Bader, B., Harshman, R., Kolda, T.: Temporal analysis of semantic graphs using ASALSAN. In: Seventh IEEE International Conference on Data Mining (ICDM), USA, pp. 33–42 (2007)
Banerjee, A., Dhillon, I., Ghosh, J., Sra, S.: Expectation maximization for clustering on hyperspheres. Technical report, University of Texas at Austin, USA (2003)
Beach, A., et al.: Fusing mobile, sensor, and social data to fully enable context-aware computing. In: Proceedings of the Eleventh Workshop on Mobile Computing Systems and Applications, HotMobile 2010, pp. 60–65. ACM, New York (2010)
Cai, L., Hofmann, T.: Hierarchical document categorization with support vector machines. In: Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, CIKM 2004, pp. 78–87. ACM, New York (2004)
Chang, M., Poon, C.K.: Using phrases as features in email classification. J. Syst. Softw. 82(6), 1036–1045 (2009)
Chen, W., Wang, M.: A fuzzy c-means clustering-based fragile watermarking scheme for image authentication. Expert Syst. Appl. 36(2), 1300–1307 (2009)
Chua, T., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: NUS-WIDE: a real-world web image database from National University of Singapore. In: Proceedings of the ACM International Conference on Image and Video Retrieval, CIVR 2009, pp. 48:1–48:9. ACM, New York (2009)
Clauset, A., Moore, C., Newman, M.E.J.: Hierarchical structure and the prediction of missing links in networks. Nature 453, 98–101 (2008)
Conover, M., Gonçalves, B., Ratkiewicz, J., Flammini, A., Menczer, F.: Predicting the political alignment of Twitter users, pp. 192–199, October 2011
Cooper, M., Foote, J., Girgensohn, A., Wilcox, L.: Temporal event clustering for digital photo collections. ACM Trans. Multimedia Comput. Commun. Appl. 1(3), 269–288 (2005)
Crandall, D.J., Backstrom, L., Huttenlocher, D., Kleinberg, J.: Map** the world’s photos. In: Proceedings of the 18th International Conference on World Wide Web, WWW 2009, pp. 761–770. ACM, New York (2009)
Dai, W., Yang, Q., Xue, G., Yu, Y.: Self-taught clustering. In: Proceedings of the 25th International Conference on Machine Learning, ICML 2008, pp. 200–207. ACM, New York (2008)
Dakiche, N., Tayeb, F.B.S., Slimani, Y., Benatchba, K.: Tracking community evolution in social networks: a survey. Inf. Process. Manag. 56(3), 1084–1102 (2019)
Daumé, I.I.I., Marcu, D.: Domain adaptation for statistical classifiers. J. Artif. Int. Res. 26(1), 101–126 (2006)
Deng, J., Dong, W., Socher, R., Li, L., Li, K., Fei-fei, L.: ImageNet: a large-scale hierarchical image database. In: Computer Society Conference on Computer Vision and Pattern Recognition CVPR, USA, pp. 248–255 (2009)
Dong, J., Zhao, Y., Peng, T.: Ontology classification for semantic-web-based software engineering. IEEE Trans. Serv. Comput. 2, 303–317 (2009)
Du, J., **an, Y., Yang, J.: A survey on social network visualization. In: International Symposium on Social Science (ISSS 2015), China, pp. 275–279 (2015). Atlantis Press
Fortunato, S.: Community detection in graphs. Phys. Rep. 486(3–5), 75–174 (2010)
Fouss, F., Pirotte, A., Renders, J., Saerens, M.: Random-walk computation of similarities between nodes of a graph with application to collaborative recommendation. IEEE Trans. Knowl. Data Eng. 19(3), 355–369 (2007)
Freeman, L.C.: Visualizing social networks. J. Soc. Struct. 1, 4 (2000)
Gallagher, A., Joshi, D., Yu, J., Luo, J.: Geo-location inference from image content and user tags. In: 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, USA, pp. 55–62 (2009)
Girvan, M., Newman, M.E.: Community structure in social and biological networks. Proc. Natl. Acad. Sci. 99(12), 7821–7826 (2002)
Goh, K.I., Oh, E., Kahng, B., Kim, D.: Betweenness centrality correlation in social networks. Phys. Rev. E 67, 017101 (2003)
Hannachi, L., Asfari, O., Benblidia, N., Bentayeb, F., Kabachi, N., Boussaid, O.: Community extraction based on topic-driven-model for clustering users tweets. In: Zhou, S., Zhang, S., Karypis, G. (eds.) ADMA 2012. LNCS (LNAI), vol. 7713, pp. 39–51. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-35527-1_4
Hays, J., Efros, A.A.: IM2GPS: estimating geographic information from a single image. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–8. IEEE Computer Society, Washington, D.C. (2008)
He, H., Wang, H., Yang, J., Yu, P.S.: Blinks: ranked keyword searches on graphs. In: Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, SIGMOD 2007, pp. 305–316. ACM, New York (2007)
Ho, K.T., Bui, Q.V., Bui, M.: Dynamic social network analysis using author-topic model. In: Hodoň, M., Eichler, G., Erfurth, C., Fahrnberger, G. (eds.) I4CS 2018. CCIS, vol. 863, pp. 47–62. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93408-2_4
Irfan, R., et al.: A survey on text mining in social networks. Knowl. Eng. Rev. 30, 157–170 (2015)
Jaffali, S., Jamoussi, S.: Principal component analysis neural network for textual document categorization and dimension reduction. In: 6th International Conference on Sciences of Electronics. Technologies of Information and Telecommunications (SETIT), pp. 835–839. IEEE, Tunisia (2012)
Jain, A.K.: Data clustering: 50 years beyond k-means. Pattern Recogn. Lett. 31(8), 651–666 (2010)
Jo, T.: Neural text categorizer for exclusive text categorization. Int. J. Inf. Sci. 34(1) (2010)
Joshi, D., Luo, J.: Inferring generic activities and events from image content and bags of geo-tags. In: Proceedings of the 2008 International Conference on Content-Based Image and Video Retrieval, CIVR 2008, pp. 37–46. ACM, New York (2008)
Kacholia, V., Pandit, S., Chakrabarti, S., Sudarshan, S., Desai, R., Karambelkar, H.: Bidirectional expansion for keyword search on graph databases. In: Proceedings of the 31st International Conference on Very Large Data Bases, VLDB 2005, pp. 505–516. VLDB Endowment, Norway (2005)
Kamvar, S., Haveliwala, T., Manning, C., Golub, G.: Exploiting the block structure of the web for computing pagerank. Technical report 2003–17, Stanford InfoLab, UK (2003)
Kashfia, S., Alhajj, R.: Emotion and sentiment analysis from Twitter text. J. Comput. Sci. 36, 101003 (2019)
Kavitha, V., Punithavalli, M.: Clustering time series data stream - a literature survey. Int. J. Comput. Sci. Inf. Secur. IJCSIS 8(1), 289–294 (2010)
Khalessizadeh, S.M., Zaefarian, R., Nasseri, S., Ardil, E.: Genetic mining: using genetic algorithm for topic based on concept distribution. Int. J. Math. Comput. Phys. Electr. Comput. Eng. 2(1), 35–38 (2008)
Khemakhem, I.T., Jamoussi, S., Hamadou, A.B.: POS tagging without a tagger: using aligned corpora for transferring knowledge to under-resourced languages. Computación y Sistemas 20(4), 667–679 (2016)
Leicht, E.A., Holme, P., Newman, M.E.J.: Vertex similarity in networks. Phys. Rev. E 73(2), 026–120 (2006)
Li, X., Chen, H.: Recommendation as link prediction in bipartite graphs. Decis. Support Syst. 54(2), 880–890 (2013)
Li, Z.L., Fang, X., Sheng, O.R.L.: A survey of link recommendation for social networks: methods, theoretical foundations, and future research directions. ACM Trans. Manag. Inf. Syst. 9(1), 1:1–1:26 (2017)
Lin, Y., Chi, Y., Zhu, S., Sundaram, H., Tseng, B.L.: FacetNet: a framework for analyzing communities and their evolutions in dynamic networks. In: Proceedings of the 17th International Conference on World Wide Web, WWW 2008, pp. 685–694. ACM, New York (2008)
Liu, D., Hua, X.S., Yang, L., Wang, M., Zhang, H.: Tag ranking. In: Proceedings of the 18th International Conference on World Wide Web, WWW 2009, pp. 351–360. ACM, New York (2009)
Liu, H., Hu, Z., Haddadi, H., Tian, H.: Hidden link prediction based on node centrality and weak ties. EPL (Europhys. Lett.) 101(1), 18004 (2013)
Lü, L., **, C., Zhou, T.: Similarity index based on local paths for link prediction of complex networks. Phys. Rev. E 80, 046–122 (2009)
Macskassy, S.A., Provost, F.: A simple relational classifier. In: Proceedings of the Second Workshop on Multi-Relational Data Mining (MRDM-2003) at KDD-2003, pp. 64–76 (2003)
McPherson, M., Smith-Lovin, L., Cook, J.M.: Birds of a feather: homophily in social networks. Annual Rev. Sociol. 27(1), 415–444 (2001)
Menon, A.K., Elkan, C.: Link prediction via matrix factorization. In: Gunopulos, D., Hofmann, T., Malerba, D., Vazirgiannis, M. (eds.) ECML PKDD 2011. LNCS (LNAI), vol. 6912, pp. 437–452. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-23783-6_28
Miao, D., Duan, Q., Zhang, H., Jiao, N.: Rough set based hybrid algorithm for text classification. Expert Syst. Appl. 36(5), 9168–9174 (2009)
Moody, J., McFarland, D., Bender-deMoll, S.: Dynamic network visualization. Am. J. Sociol. 110(4), 1206–1241 (2005)
Moradabadi, B., Meybodi, M.R.: Link prediction in weighted social networks using learning automata. Eng. Appl. Artif. Intell. 70, 16–24 (2018)
Naaman, M., Harada, S., Wang, Q., Garcia-Molina, H., Paepcke, A.: Context data in geo-referenced digital photo collections. In: Proceedings of the 12th Annual ACM International Conference on Multimedia, MULTIMEDIA 2004, pp. 196–203. ACM, New York (2004)
Neville, J., Jensen, D.: Leveraging relational autocorrelation with latent group models. In: Fifth IEEE International Conference on Data Mining (ICDM 2005), p. 8. IEEE (2005)
Newman, M.E.J.: Clustering and preferential attachment in growing networks. Phys. Rev. E 64, 025–102 (2001)
Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 770–783 (2010)
Papadimitriou, A., Symeonidis, P., Manolopoulos, Y.: Fast and accurate link prediction in social networking systems. J. Syst. Softw. 85(9), 2119–2132 (2012)
Qi, G., Aggarwal, C.C., Huang, T.S.: Community detection with edge content in social media networks. In: Kementsietsidis, A., Salles, M.A.V. (eds.) Proceedings of the 2012 IEEE 28th International Conference on Data Engineering, pp. 534–545. IEEE Computer Society, Washington, D.C. (2012)
Qi, G., Hua, X., Zhang, H.: Learning semantic distance from community-tagged media collection. In: Proceedings of the 17th ACM International Conference on Multimedia, MM 2009, pp. 243–252. ACM, New York (2009)
Raina, R., Battle, A., Lee, H., Packer, B., Ng, A.Y.: Self-taught learning: transfer learning from unlabeled data. In: Proceedings of the 24th International Conference on Machine Learning, ICML 2007, pp. 759–766. ACM, New York (2007)
Rao, Y., Li, X.: A topic-based dynamic clustering algorithm for text stream. In: International Conference on Artificial Intelligence and Industrial Engineering (AIIE 2015), Thailand, pp. 480–483 (2015)
Rocchio, J.J.: Relevance feedback in information retrieval, pp. 313–323 (1971)
Rossetti, G., Cazabet, R.: Community discovery in dynamic networks: a survey. ACM Comput. Surv. 51(2), 35:1–35:37 (2018)
Sapountzi, A., Psannis, K.E.: Social networking data analysis tools and challenges. Future Gen. Comput. Syst. 86, 893–913 (2018)
Seifzadeh, S., Farahat, A.K., Kamel, M.S., Karray, F.: Short-text clustering using statistical semantics. In: Gangemi, A., Leonardi, S., Panconesi, A. (eds.) Proceedings of the 24th International Conference on World Wide Web, pp. 805–810. ACM, New York (2015)
Shen, Z., Ma, K.: MobiVis: a visualization system for exploring mobile data. In: Proceedings of MobiVis: A Visualization System for Exploring Mobile Data, pp. 175–182. IEEE, Japan (2008)
Shen, Z., Ma, K., Eliassi-Rad, T.: Visual analysis of large heterogeneous social networks by semantic and structural abstraction. IEEE Trans. Vis. Comput. Graph. 12(6), 1427–1439 (2006)
Sigurbjörnsson, B., van Zwol, R.: Flickr tag recommendation based on collective knowledge. In: Proceedings of the 17th International Conference on World Wide Web, WWW 2008, pp. 327–336. ACM, New York (2008)
Sun, J., Tao, D., Faloutsos, C.: Beyond streams and graphs: dynamic tensor analysis. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2006, pp. 374–383. ACM, New York (2006)
Tang, F., Gao, Y.: Fast near duplicate detection for personal image collections. In: Proceedings of the 17th ACM International Conference on Multimedia, MM 2009, pp. 701–704. ACM, New York (2009)
Tang, J., Yan, S., Hong, R., Qi, G., Chua, T.: Inferring semantic concepts from community-contributed images and noisy tags. In: Proceedings of the 17th ACM International Conference on Multimedia, MM 2009, pp. 223–232. ACM, New York (2009)
Valverde-Rebaza, J.C., de Andrade Lopes, A.: Exploiting behaviors of communities of Twitter users for link prediction. Social Netw. Analys. Mining 3(4), 1063–1074 (2013)
Wang, X.J., Zhang, L., Li, X., Ma, W.Y.: Annotating images by mining image search results. IEEE Trans. Pattern Anal. Mach. Intell. 30(11), 1919–1932 (2008)
Wang, Z., Song, Y., Zhang, C.: Transferred dimensionality reduction. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008. LNCS (LNAI), vol. 5212, pp. 550–565. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-87481-2_36
Weinberger, K.Q., Slaney, M., Van Zwol, R.: Resolving tag ambiguity. In: Proceedings of the 16th ACM International Conference on Multimedia, MM 2008, pp. 111–120. ACM, New York (2008)
Wu, P., Tretter, D.: Close & closer: social cluster and closeness from photo collections. In: Gao, W., et al. (eds.) ACM Multimedia, pp. 709–712. ACM, New York (2009)
Wu, S., Sun, J., Tang, J.: Patent partner recommendation in enterprise social networks. In: Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, WSDM 2013, pp. 43–52. ACM, New York (2013)
**ang, R., Neville, J.: Collective inference for network data with copula latent Markov networks. In: Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, WSDM 2013, pp. 647–656. ACM, New York (2013)
Xu, G., Zhang, Y., Li, L.: Web Mining and Social Networking: Techniques and Applications, 1st edn. Springer, New York (2011). https://doi.org/10.1007/978-1-4419-7735-9
Xu, X., Zhang, F., Niu, Z.: An ontology-based query system for digital libraries. In: Pacific-Asia Workshop on Computational Intelligence and Industrial Application, China, pp. 222–226 (2008)
Yamamoto, T., Honda, K., Notsu, A., Ichihashi, H.: A comparative study on TIBA imputation methods in FCMdd-based linear clustering with relational data. Adv. Fuzzy Syst. 2011, 265170:1–265170:10 (2011)
Yu, J., Luo, J.: Leveraging probabilistic season and location context models for scene understanding. In: Proceedings of the 2008 International Conference on Content-Based Image and Video Retrieval, CIVR 2008, pp. 169–178. ACM, New York (2008)
Zadrozny, B.: Learning and evaluating classifiers under sample selection bias. In: Proceedings of the Twenty-First International Conference on Machine Learning, ICML 2004, pp. 114–121. ACM, New York (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Jaffali, S., Jamoussi, S., Khelifi, N., Hamadou, A.B. (2020). Survey on Social Networks Data Analysis. In: Rautaray, S., Eichler, G., Erfurth, C., Fahrnberger, G. (eds) Innovations for Community Services. I4CS 2020. Communications in Computer and Information Science, vol 1139. Springer, Cham. https://doi.org/10.1007/978-3-030-37484-6_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-37484-6_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37483-9
Online ISBN: 978-3-030-37484-6
eBook Packages: Computer ScienceComputer Science (R0)