Thermodynamics-inspired unsupervised clustering of objects
ORAL
Abstract
The goal of unsupervised machine learning is to find the underlying structure that describes a dataset. Real-world applications include the clustering of customers by the ads they click on or the movies they watch. We treat unsupervised learning as an optimization problem by designing a morphism to transform data attributes into a mathematical graph and defining a graph entropy that is minimized when only relatively few nodes are highly connected. The thermodynamics of such system are derived and simulated annealing is used to cluster similar data together. The methodology is applied to network traffic patterns and scientific literature and results are discussed.
–
Presenters
-
Jorge Munoz
Intel Corporation, The Datum Institute
Authors
-
Jorge Munoz
Intel Corporation, The Datum Institute