Graphon
[[File:Exchangeable random graph from graphon.png|thumb|300px|A realization of an exchangeable random graph defined by a graphon. The graphon is shown as a magenta heatmap (lower right). A random graph of size is generated by independently assigning to each vertex a latent random variable
(values along vertical axis) and
including each edge independently with probability .
For example, edge (green, dotted) is present with probability
; the green boxes in the right square represent the
values of and . The upper left
panel shows the graph realization as an adjacency matrix.]]
In graph theory and statistics, a graphon (also known as a graph limit) is a symmetric measurable function , that is important in the study of dense graphs. Graphons arise both as a natural notion for the limit of a sequence of dense graphs, and as the fundamental defining objects of exchangeable random graph models. Graphons are tied to dense graphs by the following pair of observations: the random graph models defined by graphons give rise to dense graphs almost surely, and, by the regularity lemma, graphons capture the structure of arbitrary large dense graphs.
Statistical formulation
A graphon is a symmetric measurable function . Usually a graphon is understood as defining an exchangeable random graph model according to the following scheme:
- Each vertex of the graph is assigned an independent random value
- Edge is independently included in the graph with probability .
A random graph model is an exchangeable random graph model if and only if it can be defined in terms of a (possibly random) graphon in this way.
The model based on a fixed graphon is sometimes denoted ,
by analogy with the Erdős–Rényi model of random graphs.
A graph generated from a graphon in this way is called a -random graph.
It follows from this definition and the law of large numbers that, if , exchangeable random graph models are dense almost surely.
= Examples =
The simplest example of a graphon is for some constant . In this case the associated exchangeable random graph model is the Erdős–Rényi model that includes each edge independently with probability .
If we instead start with a graphon that is piecewise constant by:
- dividing the unit square into blocks, and
- setting equal to on the block,
the resulting exchangeable random graph model is the
community stochastic block model, a generalization of the Erdős–Rényi model.
We can interpret this as a random graph model consisting of distinct Erdős–Rényi graphs with parameters respectively, with bigraphs between them where each possible edge between blocks and is included independently with probability .
Many other popular random graph models can be understood as exchangeable random graph models defined by some graphon, a detailed survey is included in Orbanz and Roy.
= Jointly exchangeable adjacency matrices =
A random graph of size can be represented as a random adjacency matrix. In order to impose consistency (in the sense of projectivity) between random graphs of different sizes it is natural to study the sequence of adjacency matrices arising as the upper-left sub-matrices of some infinite array of random variables; this allows us to generate by adding a node to and sampling the edges for
Following the fundamental importance of exchangeable sequences in classical probability, it is natural to look for an analogous notion in the random graph setting. One such notion is given by jointly exchangeable matrices; i.e. random matrices satisfying
:
for all permutations
There is a representation theorem for jointly exchangeable random adjacency matrices, analogous to de Finetti’s representation theorem for exchangeable sequences. This is a special case of the Aldous–Hoover theorem for jointly exchangeable arrays and, in this setting, asserts that the random matrix
- Sample
u_j\sim U[0,1] independently X_{ij}=X_{ji}=1 independently at random with probabilityW(u_i,u_j),
where
= Graphon estimation =
Due to identifiability issues, it is impossible to estimate either the graphon function
Analytic formulation
Any graph on
This matrix corresponds to a step function
defined by partitioning
and for each
entry of
This function
In general, if we have a sequence of graphs
If these graphs converge (according to some suitable definition of convergence), then we expect the limit of these graphs to correspond to the limit of these associated functions.
This motivates the definition of a graphon (short for "graph function") as a symmetric measurable function
= Examples =
== Constant graphon ==
Take a sequence of
Intuitively, as
In the space of graphons, it turns out that such a sequence converges almost surely to the constant
== Half graphon ==
Take the sequence
the adjacency matrix
As
Matching this intuition, the sequence
== Complete bipartite graphon ==
Take the sequence
If we order the vertices by placing all vertices in one part at the beginning
and placing the vertices of the other part at the end,
the adjacency matrix of
For example, the adjacency matrix of
As
so that this sequence of graphs converges to a "complete bipartite" graphon
defined by
If we instead order the vertices of
the adjacency matrix has a chessboard structure of zeros and ones.
For example, under this ordering, the adjacency matrix of
As
the adjacency matrices become a finer and finer chessboard.
Despite this behavior, we still want the limit of
This means that when we formally define convergence for a sequence of graphs, the definition of a limit should be agnostic to relabelings of the vertices.
== Limit of W-random graphs ==
Take a random sequence
Then just like in the first example from this section, it turns out that
= Recovering graph parameters from graphons =
Given graph
This is because
in
corresponds to a region
of area
Similar reasoning shows that the triangle density in
= Notions of convergence =
There are many different ways to measure the distance between two graphs.
If we are interested in metrics that "preserve" extremal properties of graphs,
then we should restrict our attention to metrics that identify random graphs as similar.
For example, if we randomly draw two graphs independently from an Erdős–Rényi model
Naively, given two graphs on the same vertex set, one might define their distance as the number of edges that must be added or removed to get from one graph to the other, i.e. their edit distance. However, the edit distance does not identify random graphs as similar; in fact, two graphs drawn independently from
There are two natural metrics that behave well on dense random graphs in the sense that we want.
The first is a sampling metric, which says that two graphs are close if their distributions of subgraphs are close.
The second is an edge discrepancy metric, which says two graphs are close when their edge densities are close on all their corresponding subsets of vertices.
Miraculously, a sequence of graphs converges with respect to one metric precisely when it converges with respect to the other.
Moreover, the limit objects under both metrics turn out to be graphons.
The equivalence of these two notions of convergence mirrors how various notions of quasirandom graphs are equivalent.{{r|qrandom}}
== Homomorphism densities ==
One way to measure the distance between two graphs
That is, for each graph
If these numbers are close for every graph
intuitively
Rather than dealing directly with subgraphs, however, it turns out to be
easier to work with graph homomorphisms.
This is fine when dealing with large, dense graphs, since in this scenario
the number of subgraphs and the number of graph homomorphisms from a fixed graph are asymptotically equal.
Given two graphs
homomorphism density
In other words,
Graphons offer a simple way to compute homomorphism densities.
Indeed, given a graph
where the integral is multidimensional, taken over the unit hypercube
This follows from the definition of an associated graphon, by considering when the above integrand is equal to
We can then extend the definition of homomorphism density to arbitrary graphons
for any graph
Given this setup, we say a sequence of graphs
Although not evident from the definition alone, if
simultaneously.
== Cut distance ==
Take two graphs
Because these graphs share the same vertices,
one way to measure their distance is to restrict to subsets
As a preliminary formalization of this notion of distance, for any pair of graphs
In other words, the labeled cut distance encodes the maximum discrepancy of the edge densities between
We can generalize this concept to graphons by expressing the edge density
where
This motivates the following more general definition.
Definition 1. For any symmetric, measurable functionf : [0,1]^2 \to \mathbb{R} , define the cut norm off to be the quantity
\lVert f \rVert_\square = \sup_{S, T\subseteq [0,1]} \left| \int_S \int_{T} f(x,y) \; \mathrm{d}x \, \mathrm{d}y \right| taken over all measurable subsets
S, T of the unit interval.{{r|Lovasz:2013}}
This captures our earlier notion of labeled cut distance, as we have the equality
This distance measure still has one major limitation: it can assign nonzero distance to two isomorphic graphs.
To make sure isomorphic graphs have distance zero, we should compute the minimum cut norm over all possible "relabellings" of the vertices.
This motivates the following definition of the cut distance.
Definition 2. For any pair of graphonsU andW , define their cut distance to be
\delta_\square(U, W) = \inf_{\varphi} \lVert U - W^\varphi \rVert_\square where
W^\varphi(x,y) = W(\varphi(x), \varphi(y)) is the composition ofW with the map\varphi , and the infimum is taken over all measure-preserving bijections from the unit interval to itself.{{r|whatis}}
The cut distance between two graphs is defined to be the cut distance between their associated graphons.
We now say that a sequence of graphs
== Equivalence of convergence ==
As it turns out, for any sequence of graphs
Counting Lemma. For any pair of graphonsU andW , we have
|t(F, U) - t(F, W)| \le e(F) \delta_\square(U, W) for all graphs
F .
The name "counting lemma" comes from the bounds that this lemma gives on homomorphism densities
Inverse Counting Lemma. For every real number\varepsilon > 0 , there exist a real number\eta > 0 and a positive integerk such that for any pair of graphonsU andW with
|t(F, U) - t(F, W)| \le \eta for all graphs
F satisfyingv(F) \le k ,we must have
\delta_\square(U, W) < \varepsilon .
This lemma shows that left-convergence implies convergence under the cut distance.
= The space of graphons =
We can make the cut-distance into a metric by taking the set of all graphons and identifying two graphons
The resulting space of graphons is denoted
This space turns out to be compact.
Moreover, it contains the set of all finite graphs, represented by their associated graphons, as a dense subset.
These observations show that the space of graphons is a completion of the space of graphs with respect to the cut distance. One immediate consequence of this is the following.
Corollary 1. For every real number\varepsilon > 0 , there is an integerN such that for every graphonW , there is a graphG with at mostN vertices such that\delta_\square(W, W_G) < \varepsilon .
To see why, let
Applications
= Regularity lemma =
Compactness of the space of graphons
| last1=Lovász | first1=László | authorlink1=László Lovász
| last2=Szegedy | first2=Balázs
| title=Szemerédi's lemma for the analyst
| journal=Geometric and Functional Analysis
| date=2007
| volume=17
| pages=252–270
| doi=10.1007/s00039-007-0599-6
| s2cid=15201345}}
Szemeredi's regularity lemma can be translated into the language of graphons as follows. Define a step function to be a graphon
The proof of compactness requires only the weak regularity lemma:
Weak Regularity Lemma for Graphons. For every graphonW and\varepsilon > 0 , there is a step functionW' with at most\lceil 4^{1/\varepsilon^2} \rceil steps such that\lVert W - W' \rVert_\square \le \varepsilon .
but it can be used to prove stronger regularity results, such as the strong regularity lemma:
Strong Regularity Lemma for Graphons. For every sequence\mathbf{\varepsilon} = (\varepsilon_0, \varepsilon_1, \dots) of positive real numbers, there is a positive integerS such that for every graphonW , there is a graphonW' and a step functionU withk < S steps such that\lVert W - W' \rVert_1 \le \varepsilon_0 and\lVert W' - U \rVert_\square \le \varepsilon_k.
The proof of the strong regularity lemma is similar in concept to Corollary 1 above. It turns out that every graphon
= Sidorenko's conjecture =
{{Main|Sidorenko's conjecture}}
The analytic nature of graphons allows greater flexibility in attacking inequalities related to homomorphisms.
For example, Sidorenko's conjecture is a major open problem in extremal graph theory, which asserts that for any graph
Since this quantity is the expected number of labeled subgraphs of
the conjecture can be interpreted as the claim
that for any bipartite graph
Many approaches to Sidorenko's conjecture formulate the problem as an integral inequality on graphons, which then allows the problem to be attacked using other analytical approaches. {{r|norms}}
Generalizations
Graphons are naturally associated with dense simple graphs. There are extensions of this model to dense directed weighted graphs, often referred to as decorated graphons.{{r|decorate}} There are also recent extensions to the sparse graph regime, from both the perspective of random graph models and graph limit theory.
References
| last1 = Chung | first1 = Fan R. K. | author1-link = Fan Chung
| author2-link = Ronald Graham | last2 = Graham | first2 = Ronald L.
| last3 = Wilson | first3 = R. M.
| title = Quasi-random graphs
| journal = Combinatorica
| year = 1989
| volume = 9
| issue = 4
| pages = 345–362
| doi = 10.1007/BF02125347 | doi-access = free }}
|last1 = Haupt |first1 = Andreas
|last2 = Schultz |first2 = Thomas
|first3 = Mohammed |last3 = Khatami
|first4 = Ngoc |last4 = Tran
|title = Advances in Mathematical Sciences
|editor-first1 = Bahar |editor-last1 = Acu
|editor-first2 = Donatella | editor-last2 = Danialli
|editor-first3 = Marta | editor-last3 = Lewicka
| editor-first4 = Arati | editor-last4 = Pati
| editor-first5 = Saraswathy |editor-last5 = RV
|editor-first6 = Miranda |editor-last6 = Teboh-Ewungkem
|series = Association for Women in Mathematics Series
|volume = 21
|chapter = Classification on Large Networks: A Quantitative Bound via Motifs and Graphons (Research)
|publisher = Springer, Cham
|doi = 10.1007/978-3-030-42687-3_7 |doi-access=free
|isbn = 978-3-030-42687-3
|date = July 17, 2020
|pages = 107–126
| arxiv = 1710.08878 }}
| last1 = Hatami | first1 = H.
| title = Graph norms and Sidorenko's conjecture
| journal = Israel Journal of Mathematics
| year = 2010
| volume = 175
| issue = 1
| pages = 125–150
| doi = 10.1007/s11856-010-0005-1 | doi-access = | arxiv = 0806.0047 }}
| last1 = Sidorenko | first1 = A. | author1-link = Alexander Sidorenko
| title = A correlation inequality for bipartite graphs
| journal = Graphs and Combinatorics
| year = 1993
| volume = 9
| issue = 2–4
| pages = 201–204
| doi = 10.1007/BF02988307 | doi-access =}}
| last1 = Glasscock | first1 = D.
| title = What is a graphon
| journal = Notices of the American Mathematical Society
| year = 2015
| volume = 62
| issue = 1
| pages = 46–48
| arxiv = 1611.00718}}