Capacity releasing diffusion for speed and locality
Wang D, Fountoulakis K, Henzinger M, Mahoney MW, Rao Satish. 2017. Capacity releasing diffusion for speed and locality. Proceedings of the 34th International Conference on Machine Learning. International Conference on Machine Learning, PMLR, vol. 70, 3598–3607.
Download (ext.)
http://proceedings.mlr.press/v70/wang17b/wang17b.pdf
[Published Version]
Conference Paper
| Published
| English
Author
Wang, Di;
Fountoulakis, Kimon;
Henzinger, MonikaISTA ;
Mahoney, Michael W.;
Rao , Satish
Series Title
PMLR
Abstract
Diffusions and related random walk procedures are of central importance in many areas of machine learning, data analysis, and applied mathematics. Because they spread mass agnostically at each step in an iterative manner, they can sometimes spread mass “too aggressively,” thereby failing to find the “right” clusters. We introduce a novel Capacity Releasing Diffusion (CRD) Process, which is both faster and stays more local than the classical spectral diffusion process. As an application, we use our CRD Process to develop an improved local algorithm for graph clustering. Our local graph clustering method can find local clusters in a model of clustering where one begins the CRD Process in a cluster whose vertices are connected better internally than externally by an O(log2n) factor, where n is the number of nodes in the cluster. Thus, our CRD Process is the first local graph clustering algorithm that is not subject to the well-known quadratic Cheeger barrier. Our result requires a certain smoothness condition, which we expect to be an artifact of our analysis. Our empirical evaluation demonstrates improved results, in particular for realistic social graphs where there are moderately good—but not very good—clusters.
Publishing Year
Date Published
2017-09-01
Proceedings Title
Proceedings of the 34th International Conference on Machine Learning
Publisher
ML Research Press
Volume
70
Page
3598-3607
Conference
International Conference on Machine Learning
Conference Location
Sydney, Australia
Conference Date
2017-08-06 – 2017-08-11
eISSN
IST-REx-ID
Cite this
Wang D, Fountoulakis K, Henzinger M, Mahoney MW, Rao Satish. Capacity releasing diffusion for speed and locality. In: Proceedings of the 34th International Conference on Machine Learning. Vol 70. ML Research Press; 2017:3598-3607.
Wang, D., Fountoulakis, K., Henzinger, M., Mahoney, M. W., & Rao , Satish. (2017). Capacity releasing diffusion for speed and locality. In Proceedings of the 34th International Conference on Machine Learning (Vol. 70, pp. 3598–3607). Sydney, Australia: ML Research Press.
Wang, Di, Kimon Fountoulakis, Monika Henzinger, Michael W. Mahoney, and Satish Rao . “Capacity Releasing Diffusion for Speed and Locality.” In Proceedings of the 34th International Conference on Machine Learning, 70:3598–3607. ML Research Press, 2017.
D. Wang, K. Fountoulakis, M. Henzinger, M. W. Mahoney, and Satish Rao , “Capacity releasing diffusion for speed and locality,” in Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia, 2017, vol. 70, pp. 3598–3607.
Wang D, Fountoulakis K, Henzinger M, Mahoney MW, Rao Satish. 2017. Capacity releasing diffusion for speed and locality. Proceedings of the 34th International Conference on Machine Learning. International Conference on Machine Learning, PMLR, vol. 70, 3598–3607.
Wang, Di, et al. “Capacity Releasing Diffusion for Speed and Locality.” Proceedings of the 34th International Conference on Machine Learning, vol. 70, ML Research Press, 2017, pp. 3598–607.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Link(s) to Main File(s)
Access Level
Open Access
Export
Marked PublicationsOpen Data ISTA Research Explorer
Sources
arXiv 1706.05826