Abstract
Recently there has been much excitement about using big data in social research, especially data derived from digital media. This chapter examines leading examples of this type of research from three platforms: Facebook, Twitter, and Wikipedia. It discusses their findings, data sources, and claims to validity. The aim is to assess how a number of landmark studies advance on existing social science research, pushing it in the direction of a more quantitative and more scientific mode of research. The chapter argues that this more scientific approach has advantages and drawbacks: on the positive side, for example, it often makes for rapid cumulative advances in knowledge. On the negative side, several of these studies are poorly theorized in terms of how these platforms fit into existing theories and research about the uses of information and communication technologies. The chapter goes on to examine how big data fits into the advance of social science, arguing that research technologies, quantification, and new sources of data are key drivers of research fronts. The chapter concludes with reflections about how these advantages and limitations shape how these studies are used and how they will (or will not) fit into the disciplinary landscape of the social sciences, and the study of media and communications in particular.
Similar content being viewed by others
References
Asur S, Huberman B (2010) Predicting the future with social media. ar**v:1003.5699; Retrieved from http://arxiv.org/abs/1003.5699
Backstrom L, Boldi P, Rosa M, Ugander J, Vigna S (2012) Four degrees of separation. In: Proceedings of the 3rd annual ACM web science conference (WebSci’12). ACM, New York, pp 33–42
Barberá P (2015) Birds of the same feather flock together. Bayesian point estimation using Twitter data. Polit Anal 23(1):76–91
Benkler Y, Faris R, Roberts H, Zuckerman E (2017) Study: Breitbart-led right-wing media ecosystem altered broader media agenda. Columbia Journalism Rev 1(4.1):7
Bimber B (2003) Information and American democracy: technology in the evolution of political power. Cambridge University Press, Cambridge
Bond R, Farris C, Jones J, Kramer A, Marlow C, Settle J, Fowler J (2012) A 61-million-person experiment in social influence and political mobilization. Nature 489:295–298
boyd D, Crawford K (2012) Critical questions for big data: provocations for a cultural, technological and scholarly phenomenon. Inf Commun Soc 15(5):662–679
Bruns A, Liang YE (2012) Tools and methods for capturing Twitter data during natural disasters. First Monday 17(4). http://firstmonday.org/htbin/cgiwrap/bin/ojs/index.php/fm/article/viewArticle/3937/3193
Cha M, Haddadi H, Benevenuto F, Gummadi K (2010) Measuring user influence in Twitter: the million follower Fallacy. In: Proceedings of the international AAAI conference on weblogs and social media (ICWSM), May. Available at http://twitter.mpi-sws.org/. Last accessed 4 Dec 2012
Collins R (1994) Why the social sciences won’t become high-consensus, rapid-discovery science. Sociol Forum 9(2):155–177
Collins R (1998) The sociology of philosophies: a global theory of intellectual change. Harvard University Press, Cambridge, MA
Eagle N, Greene K (2014) Reality mining: using big data to engineer a better world. MIT Press, Cambridge, MA
Ekbia H et al (2015) Big data, bigger dilemmas: a critical review. J Assoc Inf Sci Technol 66(8):1523–1545
Ford H, Wajcman J (2017) ‘Anyone can edit’, not everyone does: Wikipedia’s infrastructure and the gender gap. Soc Stud Sci 47(4):511–527
Freeman L (2004) The development of social network analysis: a study in the sociology of science. Empirical Press, Vancouver
Generous N, Fairchild G, Deshpande A, Del Valle SY, Priedhorsky R (2014) Global disease monitoring and forecasting with Wikipedia. PLoS Comput Biol 10(11):e1003892
Giles J (2012) Making the links: from E-mails to social networks, the digital traces left life in the modern world are transforming social science. Nature 488:448–450
Golder S, Macy M (2014) Digital footprints: opportunities and challenges for online social research. Annu Rev Sociol 40:6.1–6.24
González-Bailón S, Wang N, Rivero A, Borge-Holthoefer J, Moreno Y (2014) Assessing the bias in samples of large online networks social networks. Soc Networks 38:16–27
Hacking I (1983) Representing and intervening. Cambridge University Press, Cambridge
Hacking I (1992) The self-vindication of the laboratory sciences. In: Pickering A (ed) Science as practice and culture. University of Chicago Press, Chicago, pp 29–64
Hamby P (2013) Did Twitter kill the boys on the bus? Searching for a better way to cover a campaign. Joan Shorenstein Center on the Press, Politics, and Public Policy
Hermida A (2013) Twitter as an ambient news network. In: Weller K, Bruns A, Burgess J, Mahrt M, Puschmann C (eds) Twitter and society. Peter Lang, Oxford, pp 359–372
Hill BM, Shaw A (2013) The Wikipedia gender gap revisited: characterizing survey response bias with propensity score estimation. PLoS One 8(6):e65782
Hindman M (2010) The myth of digital democracy. Princeton University Press, Princeton
Kramer AD, Guillory JE, Hancock JT (2014) Experimental evidence of massive-scale emotional contagion through social networks. Proceedings of the National Academy of Sciences, 201320040
Kwak H, Lee C, Park H, Moon S (2010) What is Twitter, a social network or a news media? In: Proceedings of the 19th international World Wide Web (WWW) conference, 26–30 Apr 2010, Raleigh
Lazer D, Kennedy R, King G, Vespignani A (2014) The parable of Google flu: traps in big data analysis. Science 343(6176):1203–1205
Leskovec J, Horvitz E (2008) Planetary-scale views on a large instant-messaging network. International World Wide Web conference (WWW), Bei**g
Lewis K, Kaufman J, Gonzalez M, Wimmer A, Christakis N (2008) Tastes, ties, and time: a new social network dataset using Facebook.com. Soc Networks 30(4):330–342
Liao H-T (2009) Conflict and consensus in the Chinese version of Wikipedia. IEEE Technol Soc Mag 28(2):49–56
Manyika J, Chui M, Brown B, Bughin J, Dobbs R, Roxburgh C, Byers C (2011) Big data: the next frontier for innovation, competition and productivity. McKinsey Global Institute. Available at: http://www.mckinsey.com/insights/mgi/research/technology_and_innovation/big_data_the_next_frontier_for_innovation. Last accessed 29 Aug 2012
Mayer-Schoenberger V, Cukier K (2013) Big data: a revolution that will transform how we live, work and think. John Murray, London
Mestyán M, Yasseri T, Kertész J (2012) Early prediction of movie box office success based on Wikipedia activity big data. http://arxiv.org/abs/1211.0970. Last accessed 4 Dec 2012
Meyer ET, Schroeder R (2015) Knowledge machines digital transformations of the sciences and humanities. MIT Press, Cambridge, MA
Meyer ET, Schroeder R, Cowls J (2016) The net as a knowledge machine: how the Internet became embedded in research. New Media Soc 18(7):1159–1189
O’Neil C (2016) Weapons of math destruction: how big data increases inequality and threatens democracy. Allen Lane, London
Okoli C, Mehdi M, Mesgari M, Nielsen F, Lanamäki A (2012) The people’s encyclopedia under the gaze of the sages: a systematic review of scholarly research on Wikipedia (available at SSRN)
Pasquale F (2015) The black box society: the secret algorithms that control money and information. Harvard University Press, Cambridge, MA
Porter T (2008) Statistics and statistical methods. In: Porter T, Ross D (eds) The modern social sciences. Cambridge University Press, Cambridge, pp 238–250
Prior M (2007) Post-broadcast democracy: how media choice increases inequality in political involvement and polarizes elections. Cambridge University Press, Cambridge
Rainie L, Wellman B (2012) Networked: the new social operating system. MIT Press, Cambridge, MA
Reagle JM (2010) Good faith collaboration: the culture of Wikipedia. MIT Press, Cambridge, MA
Rule J (2007) Privacy in peril: how we are sacrificing a fundamental right in exchange for security and convenience. Oxford University Press, New York
Savage M, Burrows R (2007) The coming crisis of empirical sociology. Sociology 41(5):885–899
Savage M, Burrows R (2009) Some further reflections on the coming crisis of empirical sociology. Sociology 43(4):762–772
Schroeder R (2007) Rethinking science, technology and social change. Stanford University Press, Stanford
Schroeder R (2014a) Big data: towards a more scientific social science and humanities? In: Graham M, Dutton WH (eds) Society and the Internet. Oxford University Press, Oxford, pp 164–176
Schroeder R (2014b) Big data and the brave new world of social media research. Big Data Soc 1(2):1–11
Schroeder R (2016) Big data and communication research. Oxford Research Encyclopedia of Communication, http://communication.oxfordre.com/
Schroeder R (2018) Social theory after the Internet: media, technology and globalization. UCL Press, London
Schroeder R, Taylor L (2015) Big data and Wikipedia research: social science knowledge across disciplinary divides. Inf Commun Soc 18(9):1039–1056
Segev E, Ahituv N (2010) Popular searches in Google and Yahoo!: a “digital divide” in information uses? Inf Soc 26(1):17–37
Silver N (2012) The signal and the noise: the art and science of prediction. Allen Lane, London
Tancer B (2009) Click: what millions of people are doing online and why it matters. Harper Collins, New York
Tatum C, LaFrance M (2009) Wikipedia as a distributed knowledge laboratory: the case of neoliberalism. In: Jankowski N (ed) e-Research: transformation in scholarly practice. Routledge, Abingdon, pp 310–327
van Dijck J (2013) The culture of connectivity: a critical history of social media. Oxford University Press, Oxford
Waller V (2011) Not just information: who searches for what on the search engine Google? J Am Soc Inf Sci Technol 62(4):761–775
Waterman D (2005) Hollywood’s road to riches. Harvard University Press, Cambridge, MA
Weller K, Bruns A, Burgess J, Mahrt M, Puschmann C (eds) (2013) Twitter and society. Peter Lang, Oxford
West R, Weber I, Castillo C (2012) Drawing a data-driven portrait of Wikipedia editors. In: Proceedings of the 8th international symposium on Wikis and open collaboration (WikiSym’12), Linz, 2012. Available at http://www.wikisym.org/ws2012/bin/view/Main/Program. Last accessed 4 Dec 2012
Wilson R, Gosling S, Graham L (2012) A review of Facebook research in the social sciences. Perspect Psychol Sci 7(3):203–220
Yasseri T, Sumi R, Rung A, Kornai A, Kertész J (2012) Dynamics of conflicts in Wikipedia. PLoS One 7(6):e38869
Zhang X, Zhu F (2011) Group size and incentives to contribute: a natural experiment at Chinese Wikipedia. Am Econ Rev 101:1601–1615
Zimmer M (2010) “But the data is already public”: on the ethics of research in Facebook. Ethics Inf Technol 12(4):313–325
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media B.V., part of Springer Nature
About this entry
Cite this entry
Schroeder, R., Cowls, J. (2018). Big Data Approaches to the Study of Digital Media. In: Hunsinger, J., Klastrup, L., Allen, M. (eds) Second International Handbook of Internet Research. Springer, Dordrecht. https://doi.org/10.1007/978-94-024-1202-4_13-1
Download citation
DOI: https://doi.org/10.1007/978-94-024-1202-4_13-1
Received:
Accepted:
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-024-1202-4
Online ISBN: 978-94-024-1202-4
eBook Packages: Springer Reference Biomedicine and Life SciencesReference Module Biomedical and Life Sciences