The second set of Lecture notes for my course is now available here. This week’s notes are about graphs, embedding of graphs in Euclidean space (focusing in Diffusion Maps) and relations between behavior of a graph based semi-supervised learning method and Sobolev Embedding Theorem. Given the nature of these topics, these notes have a lot more images than normal, take a look!
The notes also describe three open problems that I would like to document here, there is a much more detailed description of the problems on the notes, including description of partial progress.
Open Problem 2.1.
Given a positive integer , the Ramsey number
denotes the smallest integer
such that any graph on
nodes has either a clique or an independent set of size
. The following are open questions:
- What is
? It is known that
.
- What are the asymptotics of $R(r)$? The current best bounds are subexponential improvements of
.
- Construct a family of graphs
with increasing number of nodes for which there exists
such that
.
The current world’s record for deterministic constructions of Ramsey graphs produces graphs for which the the clique or independence number is (see here and here), while we know (by the bounds above) that there exist graphs for which this number is
.
Open Problem 2.2. (Erdos-Hajnal conjecture)
For any finite graph $H$, there exists a constant $\delta_H > 0$ such that any graph on $n$ nodes that does not contain $H$ as a subgraph (is an $H$-free graph) must have
,
where
is the maximum among the clique and independence number of
.
I think this is one of the most fascinating conjectures in graph theory. It is known (see this survey) that , for some constant
. Note that this lower bound already shows that
-free graphs need to have considerably larger
. This is an amazing local to global effect, where imposing a constraint on small groups of vertices are connected (being
-free is a local property) creates extremely large cliques or independence sets — much larger than
as in random Erd\H{o}s-Reny\’i graphs (see, for example, the notes).
Open Problem 2.3. (The planted clique problem)
Let be a random graph constructed by taking a
and planting a clique of size
:
- Is there a polynomial time algorithm that is able to find the largest clique of
(with high probability) for
? For example, for
?
- Is there a polynomial time algorithm that is able to distinguish, with high probability,
from a draw of
for
? For example, for
?
- Is there a quasi-linear time algorithm able to find the largest clique of
(with high probability) for
, for some
? (This question was posed here)
Questions 1 and 2 are central questions in average case complexity. In fact, the hypothesis that finding planted cliques for is hard is used as the basis of several cryptographic protocols and average case hardness results, see the notes for some references and a brief explanation of why the problem is easy if
.