Order-of-Magnitude Popularity Estimation of Pirated Content

  • Chapter
  • First Online:
Machine Learning Techniques for Online Social Networks

Part of the book series: Lecture Notes in Social Networks ((LNSN))

Abstract

Understanding the spread of information in complex networks is a key problem. Content sharing in popular online social networks such as Facebook and Twitter has been well studied, however, the future trajectory of a cascade has been shown to be inherently unpredictable. Nonetheless, cascade virality has recently been studied as a classification problem, resulting in good prediction accuracy. Herein, we address the important problem of pirated media popularity estimation in torrent applications, such as Project Free TV, Popcorn-Time, and The Pirate Bay. Although pirating software and media is illegal, the practice of pirating is actually growing in popularity. On a large sample of data acquired from The Pirate Bay, we demonstrate high accuracy in the task of identifying whether the popularity of a torrent will continue to grow in the future. Specifically, we achieve close to perfect accuracy in estimating the order-of-magnitude popularity of torrents.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info
Hardcover Book
USD 99.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Adomavicius, G., Tuzhilin, A.: Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans. Knowl. Data Eng. 17(6), 734–749 (2005)

    Article  Google Scholar 

  2. Chatzopoulou, G., Sheng, C., Faloutsos, M.: A first step towards understanding popularity in youtube. In: INFOCOM IEEE Conference on Computer Communications Workshops, 2010, pp. 1–6. IEEE, New york (2010)

    Google Scholar 

  3. Cheng, J., Adamic, L., Dow, P.A., Kleinberg, J.M., Leskovec, J.: Can cascades be predicted? In: Proceedings of the 23rd International Conference on World Wide Web, WWW ’14, pp. 925–936 (2014)

    Google Scholar 

  4. Christin, N., Weigend, A.S., Chuang, J.: Content availability, pollution and poisoning in file sharing peer-to-peer networks. In: Proceedings of the 6th ACM conference on Electronic commerce, EC ’05, pp. 68–77. ACM, New York (2005)

    Google Scholar 

  5. Di, W., Dhungel, P., **aojun, H., Chao, Z., Ross, K.W.: Understanding peer exchange in bittorrent systems. In: IEEE Tenth International Conference on Peer-to-Peer Computing (P2P), pp. 1–8 (2010)

    Google Scholar 

  6. Ding, W., Shang, Y., Guo, L., Hu, X., Yan, R., He, T.: Video popularity prediction by sentiment propagation via implicit network. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM ’15, pp. 1621–1630 (2015)

    Google Scholar 

  7. Fabio, H., Thomas, B., David, H.: The pirate bay 2008-12 dataset. http://www.csg.uzh.ch/publications/data/piratebay/

  8. Falkner, J., Piatek, M., John, J.P., Krishnamurthy, A., Anderson, T.: Profiling a million user dht. In: Proceedings of the 7th ACM SIGCOMM conference on Internet measurement, IMC ’07, pp. 129–134. ACM, New York (2007)

    Google Scholar 

  9. Farzad, A., Rabiee, H.: Modeling topological characteristics of bittorrent-like peer-to-peer networks. IEEE Commun. Lett. 15(8), 896–898 (2011)

    Article  Google Scholar 

  10. Fletcher, G.H.L., Sheth, H.A.: Unstructured peer-to-peer networks: topological properties and search performance. In: Third International Joint Conference on Autonomous Agents and MUlti-Agent Systems. W6: Agents and Peer-to-Peer Computing, pp. 14–27. Springer, Berlin (2004)

    Google Scholar 

  11. Gibbs, S.: Swedish police raid sinks the pirate bay. The Guardian (2014)

    Google Scholar 

  12. Guo, R., Shaabani, E., Bhatnagar, A., Shakarian, P.: Toward order-of-magnitude cascade prediction. In: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015, pp. 1610–1613. ACM, New York (2015)

    Google Scholar 

  13. Han, J., Chung, T., Kim, S., Kwon, T.T., Kim, H.c., Choi, Y.: How prevalent is content bundling in bittorrent. In: Proceedings of the ACM SIGMETRICS Joint International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS ’11, pp. 127–128 (2011)

    Google Scholar 

  14. Herlocker, J.L., Konstan, J.A., Borchers, A., Riedl, J.: An algorithmic framework for performing collaborative filtering. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’99, pp. 230–237. ACM, New York (1999)

    Google Scholar 

  15. Justin, B., Michael, K., Nick, T., Cox, L.P.: An empirical study of seeders in bittorrent. Technical report, Duke University (2006)

    Google Scholar 

  16. Karaganis, J., Renkema, L.: Copy culture in the US & Germany (2013)

    Google Scholar 

  17. Karagiannis, T., Broido, A., Brownlee, N., Claffy, K., Faloutsos, M.: Is P2P dying or just hiding? [P2P traffic measurement]. In: Global Telecommunications Conference, 2004. GLOBECOM ’04, vol. 3, pp. 1532–1538. IEEE, New York (2004)

    Google Scholar 

  18. Liu, Z., Dhungel, P., Wu, D., Zhang, C., Ross, K.W.: Understanding and improving ratio incentives in private communities. In: Proceedings of the 2010 IEEE 30th International Conference on Distributed Computing Systems, ICDCS ’10, pp. 610–621. IEEE Computer Society, Washington (2010)

    Google Scholar 

  19. McKelvey, F.: We like copies, just dont let the others fool you the paradox of the pirate bay. Telev. New Media 16(8), 734–750 (2015)

    Google Scholar 

  20. Menczer, F.: Lexical and semantic clustering by web links. J. Am. Soc. Inf. Sci. Technol. 55, 1261–1269 (2004)

    Article  Google Scholar 

  21. News, B.: The pirate bay ‘breaches’ bt’s ban of the filesharing site (2012)

    Google Scholar 

  22. Pinto, H., Almeida, J.M., Gonçalves, M.A.: Using early view patterns to predict the popularity of youtube videos. In: Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, pp. 365–374. ACM, New York (2013)

    Google Scholar 

  23. Ripeanu, M., Mowbray, M., Andrade, N., Lima, A.: Gifting technologies: a bittorrent case study. First Monday 11 (2006)

    Google Scholar 

  24. Saroiu, S., Gummadi, K.P., Dunn, R.J., Gribble, S.D., Levy, H.M.: An analysis of internet content delivery systems. SIGOPS Oper. Syst. Rev. 36, 315–327 (2002)

    Article  Google Scholar 

  25. Siganos, G., Pujol, J., Rodriguez, P.: Monitoring the bittorrent monitors: a bird’s eye view. In: Moon, S.B., Teixeira, R., Uhlig, S. (eds.) Passive and Active Network Measurement. PAM 2009. Lecture Notes in Computer Science, vol. 5448, pp. 175–184. Springer, Berlin (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Charalampos Chelmis .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Chelmis, C., Zois, DS. (2018). Order-of-Magnitude Popularity Estimation of Pirated Content. In: Özyer, T., Alhajj, R. (eds) Machine Learning Techniques for Online Social Networks. Lecture Notes in Social Networks. Springer, Cham. https://doi.org/10.1007/978-3-319-89932-9_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-89932-9_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-89931-2

  • Online ISBN: 978-3-319-89932-9

  • eBook Packages: Social SciencesSocial Sciences (R0)

Publish with us

Policies and ethics

Navigation