A new approach to analyzing patterns of collaboration in co-authorship networks: mesoscopic analysis and interpretation

Velden, Theresa; Haque, Asif-ul; Lagoze, Carl

doi:10.1007/s11192-010-0224-6

A new approach to analyzing patterns of collaboration in co-authorship networks: mesoscopic analysis and interpretation

Published: 17 April 2010

Volume 85, pages 219–242, (2010)
Cite this article

Scientometrics Aims and scope Submit manuscript

Theresa Velden¹,
Asif-ul Haque¹ &
Carl Lagoze¹

1186 Accesses
63 Citations
Explore all metrics

Abstract

This paper focuses on methods to study patterns of collaboration in co-authorship networks at the mesoscopic level. We combine qualitative methods (participant interviews) with quantitative methods (network analysis) and demonstrate the application and value of our approach in a case study comparing three research fields in chemistry. A mesoscopic level of analysis means that in addition to the basic analytic unit of the individual researcher as node in a co-author network, we base our analysis on the observed modular structure of co-author networks. We interpret the clustering of authors into groups as bibliometric footprints of the basic collective units of knowledge production in a research specialty. We find two types of coauthor-linking patterns between author clusters that we interpret as representing two different forms of cooperative behavior, transfer-type connections due to career migrations or one-off services rendered, and stronger, dedicated inter-group collaboration. Hence the generic coauthor network of a research specialty can be understood as the overlay of two distinct types of cooperative networks between groups of authors publishing in a research specialty. We show how our analytic approach exposes field specific differences in the social organization of research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A new network model for the study of scientific collaborations: Romanian computer science and mathematics co-authorship networks

Article 17 May 2016

Functional information characteristics of large-scale research collaboration: network measures and implications

Article 05 November 2014

Network effects and research collaborations: evidence from IMF Working Paper co-authorship

Article 18 April 2022

Notes

We call these ‘seed’ groups as we regard them as entry points into scientific communities, and plan to extend our field studies following links (of cooperation or competition) of these seed labs.
It is worth noting that these specialties do not fall squarely into single sub-disciplines but typically unite the efforts of researchers who identity with different subdisciplines. For field B which we label here as ‘synthetic chemistry’ these are mainly organic chemists, inorganic and organo-metallic chemists as well as polymer chemists. For field A which we label here as belonging to the subdiscipline ‘physical chemistry’ these are physical chemists as well as experimental and theoretical physicist, often with a background in atomic and molecular physics, but also in nuclear physics. For field C these are indeed mostly physical chemists.
When comparing this number to other co-author networks in the literature, remember that this number is calculated for a reduced co-author network (after excluding one-time authors). Hence this number will overestimate the relative size of the giant component for the unreduced network.
Available from Martin Rosvall’s home page at http://www.tp.umu.se/~rosvall/code.html.
Our field studies confirm that in areas of synthetic chemistry the task of having to find someone with the instrumental equipment to conduct certain measurements on your sample is very common.
Institutional Review Board, http://en.wikipedia.org/wiki/Institutional_review_board.

References

Acedo, F., Barroso, C., Casanueva, C., & Galán, J. (2006). Co-authorship in management and organizational studies: An empirical and network analysis. Journal of Management Studies, 43(5), 957.
Article Google Scholar
Adams, J., Gurney, K., & Jackson, L. (2008). Calibrating the zoom—A test of Zitt’s hypothesis. Scientometrics, 75(1), 81–95.
Article Google Scholar
Althouse, B., West, J., Bergstrom, T., & Bergstrom, C. (2008). Differences in impact factor across fields and over time. Arxiv preprint ar**v:0804.3116.
Google Scholar
Barabasi, A., Jeong, H., Neda, Z., et al. (2002). Evolution of the social network of scientific collaborations. Physica A: Statistical Mechanics and its Applications, 311(3–4), 590–614.
Article MATH MathSciNet Google Scholar
Bassecoulard, E., Lelu, A., & Zitt, M. (2007). A modular sequence of retrieval procedures to delineate a scientific field: From vocabulary to citations and back. In Proceedings of 11th international conference on scientometrics and informetrics (ISSI 2007), Madrid, Spain, pp. 25 ff.
Batagelj, V., & Mrvar, A. (2003). Analysis and visualization of large networks. In M. Juenger & P. Mutzel (Eds.), Graph drawing software (pp. 77–103). Berlin: Springer.
Caruana, R., Elhawary, M., Nguyen, N., & Smith, C. (2006). Meta clustering. In Proceedings of the sixth international conference on data mining (ICDM’06).
Crane, D. (1972). Invisible colleges: Diffusion of knowledge in scientific communities. Chicago: The University of Chicago Press.
Google Scholar
Fortunato, S., & Castellano, C. (2007). Community structure in graphs. arxiv: 0712.271.
Freeman, L. C. (1978). Centrality in social networks—Conceptual clarification. Social Networks, 1, 215–239.
Article Google Scholar
Fry, J., & Talja, S. (2007). The intellectual and social organization of academic fields and the sha** of digital resources. Journal of Information Science, 33(2), 115.
Article Google Scholar
Glänzel, W., & de Lange, C. (1997). Modelling and measuring multilateral co-authorship in international scientific collaboration. Part II. A comparative study on the extent and change of international scientific collaboration links. Scientometrics, 40(3), 605–626.
Article Google Scholar
Glänzel, W., & de Lange, C. (2002). A distributional approach to multinationality measures of international scientific collaboration. Scientometrics, 54(1), 75–89.
Article Google Scholar
Guimera, R., Sales-Pardo, M., & Amaral, L. (2007). Classes of complex networks defined by role-to-role connectivity profiles. Nature Physics, 3(1), 63–69.
Article Google Scholar
Knorr Cetina, K. (1999). Epistemic cultures—How the sciences make knowledge. Cambridge: Harvard University Press.
Google Scholar
Kretschmer, H. (1994). Coauthorship networks of invisible colleges and institutionalized communities. Scientometrics, 30(1), 363–369.
Article Google Scholar
Kretschmer, H. (2004). Author productivity and geodesic distance in bibliographic co-authorship networks, and visibility on the Web. Scientometrics, 60(3), 409–420.
Article Google Scholar
Leclerc, M., & Gagné, J. (1994). International scientific cooperation: The continentalization of science. Scientometrics, 31(3), 261–292.
Article Google Scholar
Leydesdorff, L., & Wagner, C. (2008). International collaboration in science and the formation of a core group. Journal of Informetrics, 2(4), 317–325.
Article Google Scholar
Liberman, S., & Wolf, K. B. (1998). Bonding number in scientific disciplines. Social Networks, 20(3), 239–246.
Article Google Scholar
Lievrouw, L. A. (1990). Reconceiling structure and process in the study of scholarly communication. In C. L. Borgman (Ed.), Scholarly communication and bibliometrics. London: Sage Publications.
Google Scholar
Liu, X., Bollen, J., Nelson, M., & Van de Sompel, H. (2005). Co-authorship networks in the digital library research community. Information Processing and Management, 41(6), 1462–1480.
Article Google Scholar
Moed, H., Burger, W., Frankfort, J., & Van Raan, A. (1985). The application of bibliometric indicators: Important field-and time-dependent factors to be considered. Scientometrics, 8(3), 177.
Article Google Scholar
Mogoutov, A., & Kahane, B. (2007). Data search strategy for science and technology emergence: A scalable and evolutionary query for nanotechnology tracking. Research Policy, 36(6), 893–903.
Article Google Scholar
Morris, S. (2005). Manifestation of emerging specialties in journal literature: A growth model of papers, references, exemplars, bibliographic coupling, cocitation, and clustering coefficient distribution. Journal of the American Society for Information Science and Technology, 56(12), 1250–1273.
Article Google Scholar
Morris, S., Goldstein, M., & Deyong, C. (2007). Manifestation of research teams in journal literature: A growth model of papers, authors, collaboration, coauthorship, weak ties, and Lotka’s law. Journal of the American Society for Information Science and Technology, 58(12), 1764–1782.
Article Google Scholar
Nepusz, T., Petróczi, A., Négyessy, L., & Bazsó, F. (2008). Fuzzy communities and the concept of bridgeness in complex networks. Physical Review E, 77(1), 16107.
Article Google Scholar
Newman, M. (2001a). The structure of scientific collaboration networks. PNAS, 98(2), 404–409.
Article MATH Google Scholar
Newman, M. (2001b). Scientific collaboration networks. I. Network construction and fundamental results. Physical Review E, 64(1), 16131.
Article Google Scholar
Newman, M. (2004). Fast algorithm for detecting community structure in networks. Physical Review E, 69, 066133.
Article Google Scholar
Palla, G., Derenyi, I., Farkas, I., & Vicsek, T. (2005). Uncovering the overlap** community structure of complex networks in nature and society. Nature, 435(7043), 814–818.
Article Google Scholar
Radicchi, F., et al. (2004). Defining and identifying communities in networks. PNAS, 101(9), 2658–2663.
Article Google Scholar
Rosvall, M., & Bergstrom, C. (2007). An information-theoretic framework for resolving community structure in complex networks. PNAS, 104(18), 7327.
Article Google Scholar
Schaeffer, S. (2007). Graph clustering. Computer Science Review, 1(1), 27–64.
Article MathSciNet Google Scholar
Seglen, P., & Aksnes, D. (2000). Scientific productivity and group size: A bibliometric analysis of Norwegian microbiological research. Scientometrics, 49(1), 125–143.
Article Google Scholar
Sen, P. (2006). Complexities of social networks: A Physicist’s perspective. Arxiv preprint physics, 0605072.
Snyder, H., & Bonzi, S. (1998). Patterns of self-citation across disciplines (1980–1989). Journal of Information Science, 24(6), 431.
Article Google Scholar
Wagner, C. S., & Leydesdorff, L. (2005). Network structure, self-organization, and the growth of international collaboration in science. Research Policy, 34, 1608–1618.
Article Google Scholar
Whitley, R. (2000). The intellectual and social organization of the sciences. Oxford: Clarendon Press.
Google Scholar
Zitt, M., Bassecoulard, E., & Okubo, Y. (2000). Shadows of the past in international cooperation: Collaboration profiles of the top five producers of science. Scientometrics, 47(3), 627–657.
Article Google Scholar
Zitt, M., Ramanana-Rahary, S., & Bassecoulard, E. (2005). Relativity of citation performance and excellence measures: From cross-field to cross-scale effects of field-normalisation. Scientometrics, 63(2), 373–401.
Article Google Scholar
Zuccala, A. (2006). Modeling the invisible college. Journal of the American Society for Information Science and Technology, 57(2), 152–168.
Article Google Scholar

Download references

Acknowledgments

We are indebted to our field study participants. Further, this research has been made possible through financial support by the National Science Foundation through grants IIS-738543 SGER: Advancing the State of eChemistry, DUE-0840744 NSDL Technical Network Services: A Cyberinfrastructure Platform for STEM Education, and NSF award 0404553. Support also came from Microsoft Corporation for the project ORE-based eChemistry. We are grateful to those that make our work so much more effective by making neat tools and algorithms available on the Web, such as Martin Rosvall (infomap clustering code), Vladimir Batagelj and Andrej Mrvar (pajek), Michael Weseman (plot), and Peter Mcaster (OmniGraffle extensions for pie charts).

Author information

Authors and Affiliations

Computer and Information Science, Cornell University, Ithaca, NY, USA
Theresa Velden, Asif-ul Haque & Carl Lagoze

Authors

Theresa Velden
View author publications
You can also search for this author in PubMed Google Scholar
Asif-ul Haque
View author publications
You can also search for this author in PubMed Google Scholar
Carl Lagoze
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Theresa Velden.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (PDF 433 kb)

Supplementary material 2 (CSV 162 kb)

Supplementary material 1 (ZIP 1,429 kb)

Appendices

Appendix 1: Data

Guided by our informants we developed lexical queries to retrieve from Web of Science the literature covering a specific speciality—sometimes a single term, sometimes a 2–3 line long query using Boolean operators. Our informants validated the data sets, checking whether crucial authors whom they expected to see were actually represented, and whether out-of-scope topics had been accidentally introduced. The data was then further processed to eliminate articles with only one author or unknown authors (indicated by ‘Anon’ in the WoS records), or with only a single reference, as the latter records typically do not refer to original research articles or reviews.

This study is part of a larger dissertation research project. To ensure the anonymity of the human participants of the field study part of this research in accordance with the IRB^{Footnote 6} approved protocol for the dissertation research we cannot disclose the lexical queries that would provide access to the exact data sets discussed in this report. If access to the networks build from the data used in this study is needed for a particular research or validation purpose we are happy to work out a solution that retains the anonymity of our study participants.

Data field A

Time span: 1991–2008, number of records: 53,947 (retrieved on 22 Jun 2009)

Data field B

Time span: 1991–2008, number of records: 12,817 (retrieved on 14 Nov 2008)

Data field C

Time span: 1987–2008, number of records: 30,636 (retrieved on 17 Dec 2008).

We derive the geographical affiliation of a cluster at continent level from the country affiliation listed in the Web of Science record for each publication. We represent each cluster by all the publications any of its authors has been a co-author of. Then we determine the country affiliation that is most often listed for papers published by cluster authors. In cases where the second placed country is listed at least 50% as many times as the most often listed country, and if these two countries belong to different continents, we assign a mixed, two continent geographical affiliation

Appendix 2: Methods

In Guimera et al. (2007) each node of a clustered network was assigned to one out of seven node roles. The seven node roles are defined by the values of two parameters that quantify how a node is connected to the other nodes in its cluster and how the links going outside the cluster are distributed.

The first parameter, z, is simply the inside-its-cluster-degree of a node, normalized by subtracting the average inside-the-cluster degree and then dividing the result by the standard deviation. Nodes that have z < 2.5 were called non-hubs, while nodes with z ≥ 2.5 were called hubs. In other words, hubs are well-connected inside their own cluster with a normalized threshold beyond 2.5 (z > 2.5).

The second parameter is the participation coefficient P that quantifies how a node distributes its outside links among the clusters. For a node i in a network of N clusters it is defined as follows:

$$ P_{i} = 1 - \sum_{\text{s}} \left( {k_{\text{s}}^{i} /k^{i} } \right) $$

where s = (1…N), k ⁱ_s = number of links of node i to nodes in cluster s, k ⁱ = the total degree of node i.

From this definition it follows that P is normalized for each node so that the value is within 0 and 1. A node that connects to only one cluster would have a small P whereas a node that connects to a lot of clusters would have a high P.

Non-hubs are classified into 4 types based on their P values.

P ≤ 0.05: Ultra-peripheral (R1) with its connections restricted to its own cluster
0.05 < P ≤ 0.62: Peripheral (R2) with limited outside connectivity
0.62 < P ≤ 0.8: Satellite connector (R3) with good connectivity with a number of outside clusters
0.8 < P: Kinless (R4) with outside connectivity distributed evenly among all clusters

Similarly hubs are categorized into 3 types based on their P values.

P ≤ 0.30: Provincial hub (R5) that is well-connected only inside its own cluster
0.30 < P ≤ 0.75: Connector hub (R6) that connects to many clusters
0.75 < P: Global hub (R7) that connects to most clusters

We have used the same set of threshold values to assign roles to nodes.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Velden, T., Haque, Au. & Lagoze, C. A new approach to analyzing patterns of collaboration in co-authorship networks: mesoscopic analysis and interpretation. Scientometrics 85, 219–242 (2010). https://doi.org/10.1007/s11192-010-0224-6

Download citation

Received: 16 November 2009
Published: 17 April 2010
Issue Date: October 2010
DOI: https://doi.org/10.1007/s11192-010-0224-6

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A new approach to analyzing patterns of collaboration in co-authorship networks: mesoscopic analysis and interpretation

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A new network model for the study of scientific collaborations: Romanian computer science and mathematics co-authorship networks

Functional information characteristics of large-scale research collaboration: network measures and implications

Network effects and research collaborations: evidence from IMF Working Paper co-authorship

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

Supplementary material 1 (PDF 433 kb)

Supplementary material 2 (CSV 162 kb)

Supplementary material 1 (ZIP 1,429 kb)

Appendices

Appendix 1: Data

Data field A

Data field B

Data field C

Appendix 2: Methods

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

A new approach to analyzing patterns of collaboration in co-authorship networks: mesoscopic analysis and interpretation

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A new network model for the study of scientific collaborations: Romanian computer science and mathematics co-authorship networks

Functional information characteristics of large-scale research collaboration: network measures and implications

Network effects and research collaborations: evidence from IMF Working Paper co-authorship

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

Supplementary material 1 (PDF 433 kb)

Supplementary material 2 (CSV 162 kb)

Supplementary material 1 (ZIP 1,429 kb)

Appendices

Appendix 1: Data

Data field A

Data field B

Data field C

Appendix 2: Methods

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation