• Non ci sono risultati.

Dominant Sets and Pairwise Clustering Dominant Sets and Pairwise Clustering

N/A
N/A
Protected

Academic year: 2021

Condividi "Dominant Sets and Pairwise Clustering Dominant Sets and Pairwise Clustering"

Copied!
74
0
0

Testo completo

(1)

Dominant Sets and Pairwise Clustering Dominant Sets and Pairwise Clustering

tra)(k

Marcello Pelillo Marcello Pelillo

Computer Vision and Pattern Recognition Group Computer Vision and Pattern Recognition Group

Dipartimento di Informatica Dipartimento di Informatica Universit

Universitàà CaCa’’ Foscari, VeneziaFoscari, Venezia

joint work with Massimiliano Pavan

(2)

Talk Talk s Outline s Outline

ƒ Dominant sets and their characterization

ƒ Evolutionary game dynamics for clustering

ƒ Experiments on intensity/color/texture image segmentation

ƒ Extension of the framework to hierarchical clustering

ƒ Experiments on the (hierarchical) organization of an image database

(3)

The (Pairwise) Clustering Problem The (Pairwise) Clustering Problem

Given:

Given:

- a set of n “objects”

- an n × n matrix of pairwise similarities Goal:

Goal: Partition the input objects into maximally homogeneous groups (i.e., clusters).

(4)

Applications Applications

Clustering problems abound in many areas of computer science and engineering.

A short list of applications domains:

Image processing and computer vision Computational biology and bioinformatics Information retrieval

Data mining

Signal processing Machine learning

(5)

What is a Cluster?

What is a Cluster?

No universally accepted definition of a “cluster”.

Informally, a cluster should satisfy two criteria:

Internal criterion

Internal criterion: all objects inside a cluster should be highly similar to each other.

External criterion:

External criterion: all objects outside a cluster should be highly dissimilar to the ones inside.

(6)

Clustering as a Graph

Clustering as a Graph - - Theoretic Problem Theoretic Problem

(7)

An Illustrative Example: The Binary Case An Illustrative Example: The Binary Case

Suppose the similarity matrix is a binary matrix.

In this case, the notion of a cluster coincide with that of a maximal clique.

Given an unweighted undirected graph G=(V,E):

A clique is a subset of mutually adjacent vertices

A maximal clique is a clique that is not contained in a larger one

How to generalize the notion of a maximal clique to weighted graphs?

(8)

Basic Definitions Basic Definitions

j i

S

(9)

Assigning Node Weights / 1 Assigning Node Weights / 1

S

j i S - { i }

(10)

Assigning Node Weights / 2

Assigning Node Weights / 2

(11)

Dominant Sets

Dominant Sets

(12)

From Dominant Sets to Local Optima From Dominant Sets to Local Optima

(and Back) / 1

(and Back) / 1

(13)

The Standard Simplex The Standard Simplex

(when

(when n n = 3) = 3)

(14)

From Dominant Sets to Local Optima From Dominant Sets to Local Optima

(and Back) / 2 (and Back) / 2

Generalization of Motzkin-Straus theorem from graph theory

(15)

Talk Talk s Outline s Outline

ƒ Dominant sets and their characterization

ƒƒ Evolutionary game dynamics for clusteringEvolutionary game dynamics for clustering

ƒ Experiments on intensity/color/texture image segmentation

ƒ Extension of the framework to hierarchical clustering

ƒ Experiments on the (hierarchical) organization of an image database

(16)

Replicator Equations

Replicator Equations

(17)

The Fundamental Theorem of Natural Selection

The Fundamental Theorem of Natural Selection

(18)

Grouping by Replicator Equations

Grouping by Replicator Equations

(19)

A MATLAB Implementation

A MATLAB Implementation

(20)

Characteristic Vectors

Characteristic Vectors

(21)

Separating Structure for Clutter

Separating Structure for Clutter

(22)
(23)

Separating Structure from Clutter

Separating Structure from Clutter

(24)
(25)

Talk Talk s Outline s Outline

ƒ Dominant sets and their characterization

ƒ Evolutionary game dynamics for clustering

ƒƒ Experiments on intensity/color/texture image Experiments on intensity/color/texture image segmentation

segmentation

ƒ Extension of the framework to hierarchical clustering

ƒ Experiments on the (hierarchical) organization of an image database

(26)

Image Segmentation Image Segmentation

Image segmentation problem:

Decompose a given image into segments, i.e. regions containing

“similar” pixels.

First step in many

computer vision problems

Example: Segments might be regions of the image depicting the same object.

Semantics Problem: How should we infer objects from segments?

(27)

Image Segmentation

Image Segmentation

(28)

Experimental Setup

Experimental Setup

(29)

Intensity Segmentation Results Intensity Segmentation Results

Dominant sets Ncut

(30)

Intensity Segmentation Results Intensity Segmentation Results

(97 x 115) (97 x 115)

Dominant sets Ncut

(31)

Color Segmentation Results Color Segmentation Results

(125 x 83) (125 x 83)

Original image Dominant sets Ncut

(32)

Texture Segmentation Results Texture Segmentation Results

(approx. 90 x 120)

(approx. 90 x 120)

(33)

Ncut Results

Ncut Results

(34)

Dealing with Large Data Sets

Dealing with Large Data Sets

(35)

Grouping Out

Grouping Out - - of of - - Sample Data Sample Data

(36)
(37)
(38)

Results on Berkeley Database Images Results on Berkeley Database Images

(321 x 481)

(321 x 481)

(39)

Results on Berkeley Database Images Results on Berkeley Database Images

(321 x 481)

(321 x 481)

(40)

Capturing Elongated Structures / 1

Capturing Elongated Structures / 1

(41)

Capturing Elongated Structures / 2

Capturing Elongated Structures / 2

(42)

Closing Closing the Similarity Graph the Similarity Graph

Basic idea

Basic idea: Trasform the original similarity graph G into a “closed”

version thereof (Gclosed), whereby edge-weights take into account chained (path-based) structures.

Unweighted (0/1) case:

Gclosed = Transitive Closure of G

Note:

Note: Gclosed can be obtained from:

A + A2 + … + An

(43)

Weighted Closure of Weighted Closure of G G

Observation

Observation: When G is weighted, the ij-entry of Ak represents the sum of the total weights on the paths of length k between vertices i and j.

Hence, our choice is:

Aclosed = A + A2 + … + An

(44)

Example: Without Closure (

Example: Without Closure ( σ σ = 2) = 2)

(45)

Example: Without Closure (

Example: Without Closure ( σ σ = 4) = 4)

(46)

Example: Without Closure (

Example: Without Closure ( σ σ = 8) = 8)

(47)

Example: With Closure (

Example: With Closure ( σ σ = 0.5) = 0.5)

(48)
(49)

Grouping Edge Elements Grouping Edge Elements

Here, the elements to be grouped are edgels (edge elements).

We used Herault/Horaud (1993) similarities, which combine the following four terms:

1. Co-circularity 2. Smoothness 3. Proximity 4. Contrast

Comparison with Mean-Field Annealing (MFA).

(50)
(51)
(52)
(53)
(54)

Talk Talk s Outline s Outline

ƒ Dominant sets and their characterization

ƒ Evolutionary game dynamics for clustering

ƒ Experiments on intensity/color/texture image segmentation

ƒƒ Extension of the framework to hierarchical clusteringExtension of the framework to hierarchical clustering

ƒ Experiments on the (hierarchical) organization of an image database

(55)

Building a Hierarchy:

Building a Hierarchy:

A Family of Quadratic Programs

A Family of Quadratic Programs

(56)

An Observation

An Observation

(57)

The effects of α

(58)

Bounds for the Regularization Parameter / 1

Bounds for the Regularization Parameter / 1

(59)

Bounds for the Regularization Parameter / 2

Bounds for the Regularization Parameter / 2

(60)

Bounds for the Regularization Parameter / 3

Bounds for the Regularization Parameter / 3

(61)

The Landscape of

The Landscape of f f

αα

(62)

Sketch of the Hierarchical Clustering Algorithm

Sketch of the Hierarchical Clustering Algorithm

(63)

Pseudo

Pseudo - - code of the Algorithm code of the Algorithm

(64)

Results on the IRIS dataset / 1

Results on the IRIS dataset / 1

(65)

Results on the IRIS dataset / 2

Results on the IRIS dataset / 2

(66)

Luo and Hancock

Luo and Hancock s Similarities (CVPR s Similarities (CVPR 01) 01)

(67)

Klein and Kimia

Klein and Kimia s Similarities (SODA s Similarities (SODA 01) 01)

(68)

Gdalyahu and Weinshall

Gdalyahu and Weinshall s Similarities (PAMI 01) s Similarities (PAMI 01)

(69)

Factorization Results

Factorization Results

(Perona and Freeman, 98)

(Perona and Freeman, 98)

(70)

Typical-cut Results (From Gdalyahu, 1999)

(71)

Conclusions

Conclusions

(72)

On On - - going and Future Projects going and Future Projects

ƒƒ Grouping with asymmetric affinities: Game theoryGrouping with asymmetric affinities: Game theory

ƒƒ Clustering on hypergraphs (high-Clustering on hypergraphs (high-order relations)order relations)

ƒƒ Graph matching, object recognition and trackingGraph matching, object recognition and tracking

ƒƒ Video segmentationVideo segmentation

(73)

Other Applications of Dominant

Other Applications of Dominant - - Set Clustering Set Clustering

Bioinformatics Bioinformatics

Identification of protein binding sites Identification of protein binding sites

R. Zauhar, M. Bruist, Univ. of Sciences Philadelphia, USA (2005) Clustering gene expression profiles

Clustering gene expression profiles

T. Li et al, Fudan University, Hong Kong (2005) Security and video surveillance

Security and video surveillance

Detection of anomalous activities in video streams Detection of anomalous activities in video streams

R. Hamid et al., Georgia Institute of Tehcnology, USA (2005) Detection of malicious activities in the internet

Detection of malicious activities in the internet F. Pouget, Insitut Eurécom (2006)

Content

Content-based image retrieval-based image retrieval

G. Giacinto, F. Roli, University of Cagliari, Italy (2007)

(74)

References References

M. Pavan, M. Pelillo. A new graph-theoretic approach to clustering and

segmentation. Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Madison, WI, 2003.

M. Pavan, M. Pelillo. Dominant sets and hierarchical clustering. Proc. IEEE International Conference on Computer Vision, Nice, France, 2003.

M. Pavan, M. Pelillo. Efficient out-of-sample extension of dominant-set clusters.

In: Advances in Neural Information Processing Systems 17 (MIT Press, 2005).

A. Torsello, S. Rota Bulo’, M. Pelillo.Grouping with asymmetric affinities: A game-theoretic perspective. Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York, NY, 2006.

M. Pavan, M. Pelillo. Dominant sets and pairwise clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(1):167-172, 2007.

Riferimenti

Documenti correlati

Stress must thus be laid on the need for general practitioners to adopt a correct approach to changes in calcium-phosphorus metabolism and especially second- ary increase in

Il quarto testo viene anch’esso dal primo libro dei Re, da una parte narrativa al capitolo 20 che descrive una guerra tra il Regno di Israele del Nord guidato da Acab (marito

Constitution enacted specific strategies for enforcing both representation and decision- making XX. Weimar parliamentary democracy witnessed an intense debate also on

Per precisare questo fatto in termini più quantitativi, si può fare riferimento alle Tabelle 1 e 2, che rappresentano in forma compatta tutte le marche flessive delle forme finite

Each panel contains two data sets, one for the case in which only blazars are considered in the model pre- dictions (solid lines) and one for the fit including the new source

Comitato Scientifico / Scientific Committee Giuseppe Amoruso Politecnico di Milano Paolo Belardi Università degli Studi di Perugia Stefano Bertocci Università degli Studi di Firenze