The Open Community Data Exchange: Advancing Data Sharing and Discovery in Open Online Community Science

  • Chapter
  • First Online:
Big Data Factories

Part of the book series: Computational Social Sciences ((CSS))

  • 1146 Accesses

Abstract

While online behavior creates an enormous amount of digital data that can be the basis for social science research, to date, the science has been conducted piecemeal, one Internet address at a time, often without social or scholarly impact beyond the site’s own stakeholders. Scientists lack the tools, methods, and practices to combine, compare, contrast, and communicate about online behavior across Internet addresses or over time. In response, we are building the infrastructure for computational social scientists, social scientists, and citizens to make corresponding advances in our understanding of online human interactions. In this chapter, we present our effort to (1) specify the Open Community Data Exchange (OCDX) metadata standard to describe datasets, (2) introduce concepts from the data curation lifecycle to social computing research, and (3) describe candidate infrastructure for creating, editing, viewing, sharing, and analyzing manifests.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Similar content being viewed by others

References

  • Bird, C., Rigby, P.C., Barr, E.T., Hamilton, D.J., German, D.M., Devanbu, P. (2009). The promises and perils of mining git. Proceedings from Mining Software Repositories, 2009. MSR’09. 6th IEEE International Working Conference on Mining Software Repositories.

    Google Scholar 

  • Bishop, J.L. Verleger, M.A. (2013). The flipped classroom: A survey of the research. ASEE National Conference Proceedings, Atlanta.

    Google Scholar 

  • Blincoe, K., Valetto, G., Goggins, S. (2012). Leveraging task contexts for managing developers’ coordination. Proceedings from ACM Conference on Computer Supported Cooperative Work, 2012, Seattle.

    Google Scholar 

  • Chiasson, M., Germonprez, M., & Mathiassen, L. (2009). Pluralist action research: A review of the information systems literature. Information Systems Journal, 19(1. (Jan. 2009), 31–54.

    Article  Google Scholar 

  • Dabbish, L., Stuart, C., Tsay, J., Herbsleb, J. (2012). Social coding in Github: Transparency and collaboration in an open software repository. Proceedings from CSCW’12, Seattle, Washington.

    Google Scholar 

  • Germonprez, M., Kendall, J. E., Kendall, K. E., & Young, B. (2014). Collectivism, creativity, competition, and control in open source software development: Reflections on the emergent governance of the SPDXtextregistered working group. International Journal of Information Systems and Management, 1(1/2. (2014), 125–145.

    Article  Google Scholar 

  • Goggins, S. P., Mascaro, C., & Valetto, G. (2013). Group informatics: A methodological approach and ontology for sociotechnical group research. Journal of the American Society for Information Science and Technology, 64(3. (Mar. 2013), 516–539.

    Article  Google Scholar 

  • Howison, J., & Crowston, K. (2014). Collaboration through open superposition: A theory of the open source way. MIS Quarterly, 38(1).

    Google Scholar 

  • Irwin, A. (1995). Citizen science: A study of people, expertise and sustainable development. New York: Psychology Press.

    Google Scholar 

  • Jisc. (n.d.). DCC curation lifecycle model. Retrieved from: http://www.dcc.ac.uk/resources/curation-lifecycle-model.

  • Maron, D., Missen, C., McNeirney, K., Elnora, K.T. (2015). Lo-fi to hi-fi crowd cataloging: Increasing e-resource records and promoting metadata literacy within WiderNet. Poster presented at the iConference.

    Google Scholar 

  • Moorhead, S. A., Hazlett, D. E., Harrison, L., Carroll, J. K., Irwin, A., & Hoving, C. (2013). A new dimension of health care: Systematic review of the uses, benefits, and limitations of social media for health communication. Journal of Medical Internet Research, 15(4. (Apr. 2013), e85.

    Article  Google Scholar 

  • Morgan, J.T., Halfaker, A., Taraborelli, D., Goggins, S., Hwang, T., Computing, S. (2015). Bridging the data divide. (2015).

    Google Scholar 

  • Nahon, K., & Hemsley, J. (2014). Homophily in the guise of cross-linking: Political blogs and content. American Behavioral Scientist, 58(10. (Sep. 2014), 1294–1313.

    Article  Google Scholar 

  • Ren, Y., Kraut, R., & Kiesler, S. (2007). Applying common identity and bond theory to design of online communities. Organization Studies, 28(3. (Mar. 2007), 377–408.

    Article  Google Scholar 

  • Tandoc, E.C. (2014). Journalism is twerking? How web analytics is changing the process of gatekee**. New Media & Society. (Apr. 2014), 1–17.

    Google Scholar 

  • Weick, K. E. (1989). Theory construction as disciplined imagination. The Academy of Management Review, 14(4. (1989), 516–531.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sean P. Goggins .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Goggins, S.P., Million, A.J., Link, G.J.P., Germonprez, M., Schuster, K. (2017). The Open Community Data Exchange: Advancing Data Sharing and Discovery in Open Online Community Science. In: Matei, S., Jullien, N., Goggins, S. (eds) Big Data Factories. Computational Social Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-59186-5_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-59186-5_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-59185-8

  • Online ISBN: 978-3-319-59186-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation