Abstract
Understanding the spread of information in complex networks is a key problem. Content sharing in popular online social networks such as Facebook and Twitter has been well studied, however, the future trajectory of a cascade has been shown to be inherently unpredictable. Nonetheless, cascade virality has recently been studied as a classification problem, resulting in good prediction accuracy. Herein, we address the important problem of pirated media popularity estimation in torrent applications, such as Project Free TV, Popcorn-Time, and The Pirate Bay. Although pirating software and media is illegal, the practice of pirating is actually growing in popularity. On a large sample of data acquired from The Pirate Bay, we demonstrate high accuracy in the task of identifying whether the popularity of a torrent will continue to grow in the future. Specifically, we achieve close to perfect accuracy in estimating the order-of-magnitude popularity of torrents.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Adomavicius, G., Tuzhilin, A.: Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans. Knowl. Data Eng. 17(6), 734–749 (2005)
Chatzopoulou, G., Sheng, C., Faloutsos, M.: A first step towards understanding popularity in youtube. In: INFOCOM IEEE Conference on Computer Communications Workshops, 2010, pp. 1–6. IEEE, New york (2010)
Cheng, J., Adamic, L., Dow, P.A., Kleinberg, J.M., Leskovec, J.: Can cascades be predicted? In: Proceedings of the 23rd International Conference on World Wide Web, WWW ’14, pp. 925–936 (2014)
Christin, N., Weigend, A.S., Chuang, J.: Content availability, pollution and poisoning in file sharing peer-to-peer networks. In: Proceedings of the 6th ACM conference on Electronic commerce, EC ’05, pp. 68–77. ACM, New York (2005)
Di, W., Dhungel, P., **aojun, H., Chao, Z., Ross, K.W.: Understanding peer exchange in bittorrent systems. In: IEEE Tenth International Conference on Peer-to-Peer Computing (P2P), pp. 1–8 (2010)
Ding, W., Shang, Y., Guo, L., Hu, X., Yan, R., He, T.: Video popularity prediction by sentiment propagation via implicit network. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM ’15, pp. 1621–1630 (2015)
Fabio, H., Thomas, B., David, H.: The pirate bay 2008-12 dataset. http://www.csg.uzh.ch/publications/data/piratebay/
Falkner, J., Piatek, M., John, J.P., Krishnamurthy, A., Anderson, T.: Profiling a million user dht. In: Proceedings of the 7th ACM SIGCOMM conference on Internet measurement, IMC ’07, pp. 129–134. ACM, New York (2007)
Farzad, A., Rabiee, H.: Modeling topological characteristics of bittorrent-like peer-to-peer networks. IEEE Commun. Lett. 15(8), 896–898 (2011)
Fletcher, G.H.L., Sheth, H.A.: Unstructured peer-to-peer networks: topological properties and search performance. In: Third International Joint Conference on Autonomous Agents and MUlti-Agent Systems. W6: Agents and Peer-to-Peer Computing, pp. 14–27. Springer, Berlin (2004)
Gibbs, S.: Swedish police raid sinks the pirate bay. The Guardian (2014)
Guo, R., Shaabani, E., Bhatnagar, A., Shakarian, P.: Toward order-of-magnitude cascade prediction. In: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015, pp. 1610–1613. ACM, New York (2015)
Han, J., Chung, T., Kim, S., Kwon, T.T., Kim, H.c., Choi, Y.: How prevalent is content bundling in bittorrent. In: Proceedings of the ACM SIGMETRICS Joint International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS ’11, pp. 127–128 (2011)
Herlocker, J.L., Konstan, J.A., Borchers, A., Riedl, J.: An algorithmic framework for performing collaborative filtering. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’99, pp. 230–237. ACM, New York (1999)
Justin, B., Michael, K., Nick, T., Cox, L.P.: An empirical study of seeders in bittorrent. Technical report, Duke University (2006)
Karaganis, J., Renkema, L.: Copy culture in the US & Germany (2013)
Karagiannis, T., Broido, A., Brownlee, N., Claffy, K., Faloutsos, M.: Is P2P dying or just hiding? [P2P traffic measurement]. In: Global Telecommunications Conference, 2004. GLOBECOM ’04, vol. 3, pp. 1532–1538. IEEE, New York (2004)
Liu, Z., Dhungel, P., Wu, D., Zhang, C., Ross, K.W.: Understanding and improving ratio incentives in private communities. In: Proceedings of the 2010 IEEE 30th International Conference on Distributed Computing Systems, ICDCS ’10, pp. 610–621. IEEE Computer Society, Washington (2010)
McKelvey, F.: We like copies, just dont let the others fool you the paradox of the pirate bay. Telev. New Media 16(8), 734–750 (2015)
Menczer, F.: Lexical and semantic clustering by web links. J. Am. Soc. Inf. Sci. Technol. 55, 1261–1269 (2004)
News, B.: The pirate bay ‘breaches’ bt’s ban of the filesharing site (2012)
Pinto, H., Almeida, J.M., Gonçalves, M.A.: Using early view patterns to predict the popularity of youtube videos. In: Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, pp. 365–374. ACM, New York (2013)
Ripeanu, M., Mowbray, M., Andrade, N., Lima, A.: Gifting technologies: a bittorrent case study. First Monday 11 (2006)
Saroiu, S., Gummadi, K.P., Dunn, R.J., Gribble, S.D., Levy, H.M.: An analysis of internet content delivery systems. SIGOPS Oper. Syst. Rev. 36, 315–327 (2002)
Siganos, G., Pujol, J., Rodriguez, P.: Monitoring the bittorrent monitors: a bird’s eye view. In: Moon, S.B., Teixeira, R., Uhlig, S. (eds.) Passive and Active Network Measurement. PAM 2009. Lecture Notes in Computer Science, vol. 5448, pp. 175–184. Springer, Berlin (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Chelmis, C., Zois, DS. (2018). Order-of-Magnitude Popularity Estimation of Pirated Content. In: Özyer, T., Alhajj, R. (eds) Machine Learning Techniques for Online Social Networks. Lecture Notes in Social Networks. Springer, Cham. https://doi.org/10.1007/978-3-319-89932-9_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-89932-9_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-89931-2
Online ISBN: 978-3-319-89932-9
eBook Packages: Social SciencesSocial Sciences (R0)