Poisoning Complete-Linkage Hierarchical Clustering

Biggio, Battista; Bulò, Samuel Rota; Pillai, Ignazio; Mura, Michele; Mequanint, Eyasu Zemene; Pelillo, Marcello; Roli, Fabio

doi:10.1007/978-3-662-44415-3_5

Battista Biggio²⁰,
Samuel Rota Bulò²¹,
Ignazio Pillai²⁰,
Michele Mura²⁰,
Eyasu Zemene Mequanint²⁰,
Marcello Pelillo²² &
…
Fabio Roli²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8621))

Included in the following conference series:

Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR)

2765 Accesses
17 Citations

Abstract

Clustering algorithms are largely adopted in security applications as a vehicle to detect malicious activities, although few attention has been paid on preventing deliberate attacks from subverting the clustering process itself. Recent work has introduced a methodology for the security analysis of data clustering in adversarial settings, aimed to identify potential attacks against clustering algorithms and to evaluate their impact. The authors have shown that single-linkage hierarchical clustering can be severely affected by the presence of a very small fraction of carefully-crafted poisoning attacks into the input data, highlighting that the clustering algorithm may be itself the weakest link in a security system. In this paper, we extend this analysis to the case of complete-linkage hierarchical clustering by devising an ad hoc poisoning attack. We verify its effectiveness on artificial data and on application examples related to the clustering of malware and handwritten digits.

Download to read the full chapter text

Chapter PDF

Effectiveness of Hard Clustering Algorithms for Securing Cyber Space

Detection of IP Gangs: Strategically Organized Bots

Enhancing Detection of R2L Attacks by Multistage Clustering Based Outlier Detection

Article 29 January 2022

References

Perdisci, R., Corona, I., Giacinto, G.: Early detection of malicious flux networks via large-scale passive DNS traffic analysis. IEEE Trans. Dependable and Secure Comp. 9(5), 714–726 (2012)
Google Scholar
Pouget, F., Dacier, M., Zimmerman, J., Clark, A., Mohay, G.: Internet attack knowledge discovery via clusters and cliques of attack traces. J. of Information Assurance and Security 1(1) (2006)
Google Scholar
Perdisci, R., Ariu, D., Giacinto, G.: Scalable fine-grained behavioral clustering of http-based malware. Computer Networks 57(2), 487–500 (2013)
Article Google Scholar
Rieck, K., Trinius, P., Willems, C., Holz, T.: Automatic analysis of malware behavior using machine learning. J. Comput. Secur. 19(4), 639–668 (2011)
Google Scholar
Hanna, S., Huang, L., Wu, E., Li, S., Chen, C., Song, D.: Juxtapp: A scalable system for detecting code reuse among Android applications. In: Flegel, U., Markatos, E., Robertson, W. (eds.) DIMVA 2012. LNCS, vol. 7591, pp. 62–81. Springer, Heidelberg (2013)
Chapter Google Scholar
Burguera, I., Zurutuza, U., Nadjm-Tehrani, S.: Crowdroid: behavior-based malware detection system for android. In: SPSM 2011, pp. 15–26 (2011)
Google Scholar
Spitzner, L.: Honeypots: Tracking Hackers. Addison-Wesley Professional (2002)
Google Scholar
Biggio, B., Fumera, G., Roli, F.: Security evaluation of pattern classifiers under attack. IEEE Trans. Knowledge and Data Eng. 26(4), 984–996 (2014)
Article Google Scholar
Brückner, M., Kanzow, C., Scheffer, T.: Static prediction games for adversarial learning problems. J. Mach. Learn. Res. 13, 2617–2654 (2012)
MATH MathSciNet Google Scholar
Huang, L., Joseph, A.D., Nelson, B., Rubinstein, B., Tygar, J.D.: Adversarial machine learning. In: ACM Workshop AISec 2011, pp. 43–57 (2011)
Google Scholar
Barreno, M., Nelson, B., Sears, R., Joseph, A.D., Tygar, J.D.: Can machine learning be secure? In: ASIACCS 2006, pp. 16–25 (2006)
Google Scholar
Großhans, M., Sawade, C., Brückner, M., Scheffer, T.: Bayesian games for adversarial regression problems. In: ICML, vol. 28 (2013)
Google Scholar
Dutrisac, J.G., Skillicorn, D.: Hiding clusters in adversarial settings. In: ISI 2008, pp. 185–187 (2008)
Google Scholar
Skillicorn, D.B.: Adversarial knowledge discovery. IEEE Intelligent Systems 24, 54–61 (2009)
Article Google Scholar
Biggio, B., Pillai, I., Rota Bulò, S., Ariu, D., Pelillo, M., Roli, F.: Is data clustering in adversarial settings secure? In: ACM Workshop AISec 2013, pp. 87–98 (2013)
Google Scholar
Biggio, B., Nelson, B., Laskov, P.: Poisoning attacks against support vector machines. In: ICML (2012)
Google Scholar
Kolcz, A., Teo, C.H.: Feature weighting for improved classifier robustness. In: CEAS (2009)
Google Scholar
Jain, A.K., Dubes, R.C.: Algorithms for clustering data. Prentice-Hall, Inc., Upper Saddle River (1988)
MATH Google Scholar
Meilǎ, M.: Comparing clusterings: An axiomatic view. In: ICML, pp. 577–584 (2005)
Google Scholar
Halkidi, M., Batistakis, Y., Vazirgiannis, M.: On clustering validation techniques. Journal of Intelligent Information Systems 17(2-3), 107–145 (2001)
Article MATH Google Scholar
LeCun, Y., Jackel, L., Bottou, L., Brunot, A., Cortes, C., Denker, J., Drucker, H., Guyon, I., Müller, U., Säckinger, E., Simard, P., Vapnik, V.: Comparison of learning algorithms for handwritten digit recognition. In: Int’l Conf. on Artificial Neural Networks, pp. 53–60 (1995)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Cagliari, Italy
Battista Biggio, Ignazio Pillai, Michele Mura, Eyasu Zemene Mequanint & Fabio Roli
FBK-irst, Trento, Italy
Samuel Rota Bulò
Ca’ Foscari University, Venice, Italy
Marcello Pelillo

Authors

Battista Biggio
View author publications
You can also search for this author in PubMed Google Scholar
Samuel Rota Bulò
View author publications
You can also search for this author in PubMed Google Scholar
Ignazio Pillai
View author publications
You can also search for this author in PubMed Google Scholar
Michele Mura
View author publications
You can also search for this author in PubMed Google Scholar
Eyasu Zemene Mequanint
View author publications
You can also search for this author in PubMed Google Scholar
Marcello Pelillo
View author publications
You can also search for this author in PubMed Google Scholar
Fabio Roli
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computing, University of Eastern Finland, 80101, Joensuu, Finland
Pasi Fränti
School of Computer Science, The University of Manchester, Manchester, UK
Gavin Brown
Delft University of Technology, Delft, The Netherlands
Marco Loog
Universidad de Alicante, Spain
Francisco Escolano
Università Ca’ Foscari Venezia, Venezia Mestre, Italy
Marcello Pelillo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Biggio, B. et al. (2014). Poisoning Complete-Linkage Hierarchical Clustering. In: Fränti, P., Brown, G., Loog, M., Escolano, F., Pelillo, M. (eds) Structural, Syntactic, and Statistical Pattern Recognition. S+SSPR 2014. Lecture Notes in Computer Science, vol 8621. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44415-3_5

Download citation

DOI: https://doi.org/10.1007/978-3-662-44415-3_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-44414-6
Online ISBN: 978-3-662-44415-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Poisoning Complete-Linkage Hierarchical Clustering

Abstract

Chapter PDF

Similar content being viewed by others

Effectiveness of Hard Clustering Algorithms for Securing Cyber Space

Detection of IP Gangs: Strategically Organized Bots

Enhancing Detection of R2L Attacks by Multistage Clustering Based Outlier Detection

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Poisoning Complete-Linkage Hierarchical Clustering

Abstract

Chapter PDF

Similar content being viewed by others

Effectiveness of Hard Clustering Algorithms for Securing Cyber Space

Detection of IP Gangs: Strategically Organized Bots

Enhancing Detection of R2L Attacks by Multistage Clustering Based Outlier Detection

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation