Distance

A metric function (or distance) is a generalization of geometric distance (i.e. Euclidean Distance)


Direct similarity measures are not always reliable for high-dimensional clustering (see Guha1999)


Similarity is the opposite of distance


Non-metric


Resources

  • Strehl, Alexander, Joydeep Ghosh, and Raymond Mooney. "Impact of similarity measures on web-page clustering." 2000. [1]
  • Guha, Sudipto, Rajeev Rastogi, and Kyuseok Shim. "ROCK: A robust clustering algorithm for categorical attributes." 1999. [2]


Source