Abstract
We present a new stochastic process, called as Social Diffusion Process (SDP), to address the graph modeling. Based on this model, we derive a graph evolution algorithm and a series of graph-based approaches to solve machine learning problems, including clustering and semi-supervised learning. SDP can be viewed as a special case of Matthew effect, which is a general phenomenon in nature and societies. We use social event as a metaphor of the intrinsic stochastic process for broad range of data. We evaluate our approaches in a large number of frequently used datasets and compare our approaches to other state-of-the-art techniques. Results show that our algorithm outperforms the existing methods in most cases. We also applying our algorithm into the functionality analysis of microRNA and discover biologically interesting cliques. Due to the broad availability of graph-based data, our new model and algorithm potentially have applications in wide range.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Kalton, A., Langley, P., Wagstaff, K., Yoo, J.: Generalized clustering, supervised learning, and data assignment. In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 299–304. ACM, New York (2001)
Belkin, M., Niyogi, P.: Laplacian eigenmaps for dimensionality reduction and data representation. Neural computation 15, 1373–1396 (2003)
Shi, J., Malik, J.: Normalized cuts and image segmentation. In: IEEE Conf. Computer Vision and Pattern Recognition (1997)
Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: Analysis and an algorithm. In: Dietterich, T.G., Becker, S., Ghahramani, Z. (eds.) NIPS, pp. 849–856. MIT Press, Cambridge (2001)
Chan, P.K., Schlag, M.D.F., Zien, J.Y.: Spectral K -way ratio-cut partitioning and clustering. In: DAC, pp. 749–754 (1993)
Hagen, L.W., Kahng, A.B.: New spectral methods for ratio cut partitioning and clustering. IEEE Trans. on CAD of Integrated Circuits and Systems 11, 1074–1085 (1992)
Nowicki, K., Snijders, T.A.B.: Estimation and prediction for stochastic blockstructures. Journal of the American Statistical Association 96, 1077–1088 (2001)
Airoldi, E.M., Blei, D.M., Fienberg, S.E., **ng, E.P.: Mixed membership stochastic blockmodels (2008)
Pitman, J.: Combinatorial stochastic processes. Technical report. Springer, New York (2002)
Blei, D., Griffiths, T., Jordan, M.: The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies. Journal of the ACM 57, 21–30 (2010)
Blei, D., Frazier, P.: Distance dependent chinese restaurant processes. In: ICML (2010)
Jordan, M.I., Ghahramani, Z., Jaakkola, T., Saul, L.K.: An introduction to variational methods for graphical models. Machine Learning 37(2), 183–233 (1999)
Ishwaran, H., James, L.F.: Generalized weighted chinese restaurant processes for species sampling mixture models. Statistica Sinica 13, 1211–1236 (2003)
Ahmed, A., **ng, E.P.: Dynamic non-parametric mixture models and the recurrent chinese restaurant process: with applications to evolutionary clustering. In: SDM, pp. 219–230. SIAM, Philadelphia (2008)
van Dongen, S.: Graph Clustering by Flow Simulation. PhD thesis, University of Utrecht (2000)
Merton, R.K.: The matthew effect in science. Science 159, 56–63 (1968)
Rossiter, M.W.: The matthew matilda effect in science. Social Studies of Science 23, 325–341 (1993)
Stanovich, K.E.: Matthew effects in reading: Some consequences of individual differences in the acquisition of literacy. Reading Research Quarterly 21, 360–407 (1986)
Zhu, X., Ghahramani, Z., Lafferty, J.: Semi-supervised learning using gaussian fields and harmonic functions. In: ICML, pp. 912–919 (2003)
Zhou, D., Bousquet, O., Lal, T., Weston, J., Scholkopf, B.: Learning with local and global consistency. In: NIPS, p. 321 (2003)
Bartel, D.P.: Micrornas: target recognition and regulatory functions. Cell 136, 215–233 (2009)
Bearfoot, J.L., et al.: Genetic analysis of cancer-implicated microrna in ovarian cancer. Clinical cancer research: an official Journal of the American Association for Cancer Research 14(22), 7246–7250 (2008)
Blenkiron, C., et al.: Microrna expression profiling of human breast cancer identifies new markers of tumor subtype. Genome Biology 8, R214+ (2007)
Ng, K., Mishra, S.: De novo svm classification of precursor micrornas from genomic pseudo hairpins using global and intrinsic folding measures. Bioinformatics 23, 1321–1330 (2007)
Huang, J.C., Frey, B.J., Morris, Q.: Comparing sequence and expression for predicting microRNA targets using genMIR3. In: Altman, R.B., Dunker, A.K., Hunter, L., Murray, T., Klein, T.E. (eds.) Pacific Symposium on Biocomputing, pp. 52–63. World Scientific, Singapore (2008)
Dahiya, N., Morin, P.: Micrornas in ovarian carcinomas. Endocrine-Related Cancer (2009) (in press), doi:10.1677/ERC–09–0203
Berezikov, E., et al.: Approaches to microrna discovery. Nat. Genet. 38(suppl.), S2–S7 (2006)
Jiang, Q., et al.: mir2disease: a manually curated database for microrna deregulation in human disease. Nucleic Acids Res. 37, D98–D104 (2009)
Hu, G., et al.: Microrna-98 and let-7 confer cholangiocyte expression of cytokine-inducible src homology 2-containing protein in response to microbial challenge. J. Immunol. 183, 1617–1624 (2009)
Abbott, A., et al.: The let-7 microrna family members mir-48, mir-84, and mir-241 function together to regulate developmental timing in caenorhabditis elegans. Dev. Cell 9, 403–414 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Luo, D., Ding, C., Huang, H. (2011). Graph Evolution via Social Diffusion Processes. In: Gunopulos, D., Hofmann, T., Malerba, D., Vazirgiannis, M. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2011. Lecture Notes in Computer Science(), vol 6912. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23783-6_25
Download citation
DOI: https://doi.org/10.1007/978-3-642-23783-6_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23782-9
Online ISBN: 978-3-642-23783-6
eBook Packages: Computer ScienceComputer Science (R0)