Abstract
Two topic detection techniques—co-word network analysis and topic modeling—were applied to extract topics in the Ted Talks. Ted Talks was chosen for its enormous impact worldwide and the rich descriptive data accompanying each talk that allow us to compare the topics resulting from different methods. The co-word network was built based on the “related_tags” field so that modularity analysis can be performed to classify the tags according to their co-occurrence patterns. Topic modeling was applied to the description field and the full-text transcript separately to detect the topics present in the free-text. The results of network modularity analysis revealed 13 interpretable topics consisting of closely knitted tags. Topic modeling generated 25 topics for the description and 40 for the transcript, respectively. Our results showed that both topic extraction methods were able to successfully identify the range of topics in the TED Talks. While the co-word network gave a broad overview and afforded visualization, the topic model revealed topics with greater granularity. We compared the semantics of the topics produced by different methods and discussed the methodological implications of our research.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Callon, M., Courtial, J.P., Laville, F.: Co-word analysis as a tool for describing the network of interactions between basic and technological research: the case of polymer chemistry. Scientometrics 22(1), 155–205 (1991)
Rip, A., Courtial, J.P.: Co-word maps of biotechnology: an example of cognitive scientometrics. Scientometrics 6(6), 381–400 (1984)
Leydesdorff, L., Nerghes, A.: Co-word maps and topic modeling: a comparison using small and medium-sized corpora (N < 1,000). J. Am. Soc. Inf. Sci. 68(4), 1024–1035 (2017)
Ding, Y., Chowdhury, G.G., Foo, S.: Bibliometric cartography of information retrieval research by using co-word analysis. Inform. Process. Manag. 37(6), 817–842 (2001)
Hu, J., Zhang, Y.: Research patterns and trends of recommendation system in China using co-word analysis. Inform. Process. Manag. 51(4), 329–339 (2015)
Tang, M.C., Cheng, Y.J., Chen, K.H.: A longitudinal study of intellectual cohesion in digital humanities using bibliometric analyses. Scientometrics 113(2), 985–1008 (2017)
Tang, M.C., Teng, W., Lin, M.: Determining the critical thresholds for co-word network based on the theory of percolation transition: a case study in Buddhist studies. J. Documentation 76(2), 462–483 (2019)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)
Blei, D.M., Lafferty, J.D.: Dynamic topic models. In: Proceedings of the 23rd international conference on Machine learning, pp. 113–120, Association for Computing Machinery, New York (2006)
Fan, W.M., Jeng, W., Tang, M.C.: Using data citation to define a knowledge domain: a case study of the Add-Health dataset. J. Assoc. Inform. Sci. Technol. 74(1), 81–98 (2023)
Suominen, A., Toivanen, H.: Map of science with topic modeling: comparison of unsupervised learning and human-assigned subject classification. J. Am. Soc. Inf. Sci. 67(10), 2464–2476 (2016)
Callon, M., Law, J., Rip, A.: Qualitative scientometrics. In: Callon, M., Law, J., Rip, A. (eds.) Map** the Dynamics of Science and Technology, pp. 103–123. Palgrave Macmillan, London (1986)
Blondel, V.D., Guillaume, J.L., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. Stat. Mech.: Theory Exp. 2008(10), 10008 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 Springer Nature Switzerland AG
About this paper
Cite this paper
Hung, LT., Tang, MC., Lin, SC. (2023). Using Co-word Network Community Detection and LDA Topic Modeling to Extract Topics in TED Talks. In: Nah, F., Siau, K. (eds) HCI in Business, Government and Organizations. HCII 2023. Lecture Notes in Computer Science, vol 14039. Springer, Cham. https://doi.org/10.1007/978-3-031-36049-7_11
Download citation
DOI: https://doi.org/10.1007/978-3-031-36049-7_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-36048-0
Online ISBN: 978-3-031-36049-7
eBook Packages: Computer ScienceComputer Science (R0)