18.S096: Spectral Clustering and Cheeger’s Inequality

A new set of Lecture notes for my course is available here. This one is about spectral clustering and Cheeger’s Inequality. In a nutshell spectral clustering can be seen as attempting to cluster a graph by clustering the corresponding points of its Diffusion Maps embedding and Cheeger’s Inequality provides a guarantee of performance (for the case of two clusters). Take a look!

As usual, I will document the open problems here. I remind that there is a much more detailed description of the problems on the notes, including description of partial progress.

Open Problem 3.1.

Cheeger’s inequality (see my notes) allows to efficiently approximate the Cheeger number of a graph up to a square root factor. It means in particular that, given {G = (V,E)} and {\phi} we can efficiently between the cases where: {h_G\leq \phi} or {h_G \geq 2\sqrt{\phi}}. Can this be improved in the following sense?

Does there exists a constant {c>0} such that it is {NP}-hard to, given {\phi}, and {G} distinguish between the cases

  1. {h_G \leq \phi}, and

  2. {h_G \geq c\sqrt{\phi}}?

It turns out that this is a consequence of an important conjecture in Theoretical Computer Science, the Small-Set Expansion Hypothesis. (see this nice survey and references within). In particular, this conjecture is known to imply the Unique-Games Conjecture, that we will discuss in future lectures. (see my notes for references)

Open Problem 3.2. (Certifying that matrices are PSD)

Given a symmetric matrix {M} with small condition number, is there a quasi-linear time (on {n} and the number of non-zero entries of {M}) procedure that certifies that {M\succeq 0}. More specifically, the procedure can be randomized in the sense that it may, with some probably not certify that {M\succeq 0} even if that is the case, what is important is that it never produces erroneous certificates (and that it has a bounded-away-from-zero probably of succeeding, provided that {M\succeq 0}).

The Cholesky decomposition produces such certificates, but we do not know how to compute it in quasi-linear time. Note also that the power method can be used in {\alpha I - M} to produce certificates that have arbitrarily small probability of being false certificates. Later in these lecture we will discuss the practical relevance of such a method as a tool to quickly certify solution produced by heuristics.

Open Problem 3.3.

Given a graph {G=(V,E,W)}, a natural way of evaluating {k}-way clusterign is via the {k}-way expansion constant:

\rho_G(k) = \min_{S_1,\dots,S_k} \max_{l=1,\dots,k} \left\{ \frac{\mathrm{cut}(S)}{\mathrm{vol}(S)} \right\},

where the maximum is over all choice of {k} disjoint subsets of {V} (but not necessarily forming a partition).

Let {G=(V,E,W)} be a graph and {k} a positive integer, is the following true?

\rho_G(k) \leq \mathrm{polylog}(k) \sqrt{\lambda_k}. \ \ \ \ \ (2)

This is a Theorem by Lee et al if we replace the poly-logarithmic factor by a quadratic factor.

Interestingly, this ( with the poly-logarithmic factor) is known not to hold if we ask that the subsets form a partition (meaning that every vertex belongs to at least one of the sets), see notes and references therein. Note also that no dependency on {k} would contradict the Small-Set Expansion Hypothesis above.


Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s