1. Introduction
While networks appear in many different applications, many real-world networks were found to share some important characteristics. First of all, often their degree distribution is heavy-tailed, which is sometimes denoted as the network being scale-free. Second, they often have a high clustering coefficient, implying that it is likely that two neighbors of a vertex are connected themselves as well. For this reason, random graph models that can achieve both scale-freeness and a high clustering coefficient have been at the center of attention over the last years.
One example of such a model is the popular hyperbolic random graph (HRG) Krioukov et al. (Reference Krioukov, Papadopoulos, Kitsak, Vahdat and Boguná2010), which has for example been used to model the network of world wide trade García-Pérez et al., (Reference García-Pérez, Boguñá, Allard and Serrano2016) or the Internet on the Autonomous Systems level Boguñá, Papadopoulos, and Krioukov (Reference Boguñá, Papadopoulos and Krioukov2010); Kleinberg (Reference Kleinberg2007). This random graph model embeds the vertices in an underlying hyperbolic space and connects them with probabilities depending on their distances, where nearby vertices are more likely to connect. The triangle inequality then ensures the presence of many triangles, while the hyperbolic space ensures the presence of a scale-free degree distribution. Recently, the geometric inhomogeneous random graph (GIRG) was proposed as a generalization of HRG. It combines power-law distributed weights with Euclidean space, making the model simpler to analyze Bringmann, Keusch, and Lengler (2015).
While the HRG and the GIRG have been designed to exhibit high clustering and a scale-free degree distribution, the question remains whether other properties of this model match real-world data. For this reason, many properties of the GIRG or HRG have been analyzed mathematically, such as the maximum clique size Bläsius, Friedrich, and Krohmer (Reference Bläsius, Friedrich and Krohmer2018), number of $k$ -cliques Michielan and Stegehuis (Reference Michielan and Stegehuis2022), spectral gap Kiwi and Mitsche (Reference Kiwi and Mitsche2018) and separator size Bläsius, Friedrich, and Krohmer (Reference Bläsius, Friedrich and Krohmer2016); Lengler and Todorovic (Reference Lengler and Todorovic2017).
In this paper, we focus on another network property: the number of maximal cliques, that is, cliques that are not part of any larger clique. Cliques in general are an important indicator for structural properties of a network. Indeed, the number of large cliques is a measure of the tendency of a network to cluster into groups. Small cliques of size 3 (triangles), on the other hand, can form an indication of the transitivity of a network or its clustering coefficient.
To study these structural clique-based properties, however, all cliques of a given size need to be listed, which can be a computationally expensive process. To list all network cliques, it suffices to list only all maximal cliques, as all smaller cliques can be generated from at least one maximal clique. For this reason, enumerating all maximal cliques of a graph is at the heart of our understanding of cliques in general.
For enumerating all maximal cliques, an output-polynomial algorithm Tsukiyama et al. (Reference Tsukiyama, Ide, Ariyoshi and Shirakawa1977) exists, which can enumerate all maximal cliques efficiently if the graph contains only few of them. This creates a link between enumeration and counting: if the maximal clique count is low, then it is possible to efficiently enumerate them. There also exist highly efficient implementations to enumerate all maximal cliques Eppstein, Löffler, and Strash (Reference Eppstein, Löffler and Strash2010, Reference Eppstein, Löffler and Strash2013); Eppstein and Strash (Reference Eppstein and Strash2011). However, for a given graph, it is usually not known a priori how many maximal cliques it has. If this number is large, enumerating all maximal cliques can still take exponential time. However, in practice, enumerating the number of maximal cliques often takes a short amount of time for many real-world instances as well as in realistic network models Bläsius and Fischbeck (Reference Bläsius and Fischbeck2022). In this paper, we therefore focus on the number of maximal cliques in the GIRG random graph, that is, the maximal clique count. As the GIRG possesses the two main characteristics that are essential to many real-world networks, scale-freeness and an underlying geometry, we believe that investigating the number of maximal cliques in the GIRG can provide insights into in why enumerating the number of maximal cliques can often be done efficiently for many real-world networks.
To investigate the influence of the different properties of scale-freeness and clustering, we investigate the number of maximal cliques in three steps. First, we investigate a model without heavy-tailed degrees and with a small clustering coefficient, the Erdős–Rényi model $G(n, p)$ ; see Section 2. We then investigate the GIRG model (Section 3), which has both clustering and scale-free degrees. Finally, in Section 4, we investigate the inhomogeneous random graph (IRG), a model that is scale-free but has a small clustering coefficient. We complement our theoretical bounds with experiments in Section 5. In all models, we will be interested in the large $n$ limit. That is, we investigate how the number of maximal cliques scales in the number of nodes $n$ when $n$ grows large. Our main findings can be summarized as follows; also see Table 1 for an overview of our results.
-
• There is a strong dependence on the density of the network. For the Erdős–Rényi model ( $G(n, p)$ ) we obtain a linear upper bound for sparse graphs ( $O(n)$ edges) and a polynomial upper bound for non-dense graphs ( $O(n^{2 - \varepsilon })$ edges for any $\varepsilon \gt 0$ ). For dense graphs on the other hand ( $\Omega (n^2)$ edges), we obtain a super-polynomial lower bound. If the density is high enough, our lower bound is even exponential.
-
• This insight carries over to the IRG and GIRG models. Though they are overall sparse, they contain sufficiently large dense subgraphs that allow us to obtain super-polynomial lower bounds.
-
• In the IRG model with power-law exponent $\tau \in (2, 3)$ the small maximal cliques localize: asymptotically maximal cliques of constant size $k\gt 2$ are formed by $k - 2$ hubs of high degree proportional to $n^{1 / (\tau - 1)}$ and two vertices of lower degree proportional to $n^{(\tau -2)/(\tau -1)}$ .
-
• We complement our theoretical lower bounds with experiments showing that the super-polynomial growth becomes only relevant for very large networks.
Discussion and related work
Although cliques themselves have been studied extensively in the literature, there is, to the best of our knowledge, only little previous work on the number of maximal cliques in network models. In fact, the only theoretical analysis we are aware of is the recent preprint by Yamaji (Reference Yamaji2023), giving bounds for HRGs and random geometric graph (RGG), which are also shown in Table 1. Interestingly, this includes the upper bound of $\exp (O(n^{\frac{3 - \tau }{6} + \varepsilon }))$ for the HRG model. In contrast to that, we give the asymptotically larger lower bound $\exp (\Omega (n^{\frac{3 - \tau }{4} - \varepsilon }))$ for the corresponding GIRG variant. Thus, there is an asymptotic difference between the HRG and the GIRG model.
This is surprising as the GIRG model is typically perceived as a generalization of the HRG model. More precisely, there is a mapping between the two models such that for every HRG with average degree $d_{\textrm{HRG}}$ there exist GIRGs with average degree $d_{\textrm{GIRG}}$ and $D_{\textrm{GIRG}}$ with $d_{\textrm{GIRG}} \le d_{\textrm{HRG}} \le D_{\textrm{GIRG}}$ that are sub- and supergraphs of the HRG, respectively. Moreover, $d_{\textrm{GIRG}}$ and $D_{\textrm{GIRG}}$ are only a constant factor apart and experiments indicate that $d_{\textrm{HRG}} = d_{\textrm{GIRG}}\cdot (1 + o(1))$ , that is, every HRG has a corresponding GIRG that is missing only a sublinear number of edges Bläsius et al., (Reference Bläsius, Freiberger, Friedrich, Katzmann, Montenegro-Retana and Thieffry2022). In the case of maximal cliques, however, this minor difference between the models leads to an asymptotic difference.
Besides this theoretical analysis, it has been observed empirically that the number of maximal cliques in most real-world networks as well as in the GIRG and the IRG model is smaller than the number of edges of the graph Bläsius and Fischbeck (Reference Bläsius and Fischbeck2022). This indicates linear scaling in the graph size with low constant factors and small lower-order terms, which seems to be a stark contradiction to the super-polynomial lower bounds we prove here. We resolve this contradiction with our experiments in Section 5, where we observe that the graph size has to be quite large before the asymptotic behavior kicks in, that is, we observe the super-polynomial scaling as predicted by our theorems but on such a low level that it is overshadowed by the linear lower-order terms.
Notation and setting
In the rest of this paper, we will be interested in results in the large $n$ limit, where $n$ denotes the number of nodes in the random graph. We therefore use classical asymptotic notation, in terms of the graph size $n$ . For any two non-negative functions $f(n),g(n)$ we will write $f(n) \in o(g(n))$ if $\lim _{n \to \infty } f(n)/g(n) = 0$ ; $f(n) \in O(g(n))$ if $\limsup _{n \to \infty } f(n)/g(n) \lt \infty$ ; $f(n) \in \Omega (g(n))$ if $\liminf _{n \to \infty } f(n)/g(n) \gt 0$ ; $f(n) \in \Theta (g(n))$ if $f(n) \in O(g(n))$ and $f(n) \in \Omega (g(n))$ . Moreover, we will say that a sequence of events $\{\mathcal{E}_n\}_{n \geq 1}$ happens with high probability (w.h.p.) if $\lim _{n \to \infty } \mathbb{P}\left (\mathcal{E}_n\right )=1$ .
2. Erdős–Rényi random graph
An Erdős–Rényi random graph Gilbert (Reference Gilbert1959); Erdős and Rényi (Reference Erdős and Rényi2022) $G(n, p)$ has $n$ vertices, and each pair of vertices is connected independently with probability $p$ . We give bounds on the number of maximal cliques in a $G(n, p)$ depending on $p$ . Roughly speaking, we give super-polynomial lower bounds for the dense regime and polynomial upper bounds for a sparser regime. Specifically, we first give a general lower bound that is super-polynomial if $p$ is non-vanishing for growing $n$ , that is, if $p \in \Omega (1)$ . Note that $p \in \Omega (1)$ yields a dense graph with a quadratic number of edges in expectation. For super-dense graph with $p = 1 - c/n$ for a constant $c$ , we strengthen this lower bound to exponential. In contrast to this, we give a polynomial upper if $p \in O(n^{-a})$ for any constant $a \gt 0$ . For sparse graphs with $p \in O(n^{-1})$ , yielding graphs with $\Theta (n)$ edges in expectation, our upper bound on the number of maximal cliques is linear. We start with the general lower bound.
Theorem 2.1. Let $N$ be the number of maximal cliques in a $G(n, p)$ . Then, for $n$ sufficiently large,
Proof. Let $N_k$ be the number of maximal cliques of size $k$ . To estimate ${\mathbb{E}}[N_k]$ , note that the probability that a fixed subset $C \subseteq V$ of $|C| = k$ vertices forms a clique is $p^{k(k-1)/2}$ . Moreover, it is maximal if none of the other $n - k$ vertices is connected to all $k$ vertices of $C$ , which happens with probability $(1 - p^k)^{n - k}$ . As the two events are independent and there are $\binom{n}{k}$ vertex sets of size $k$ , we obtain
Using that $\binom{n}{k} \ge (n / k)^k$ and increasing the exponents of the probabilities, we obtain
We now set $k=\log (n)/\log (1/p) = -\log (n)/\log (p)$ , which yields $p^k = n^{-1}$ . Thus, in the above bound, the term $n^k p^{k^2/2}$ simplifies to $n^k n^{-k/2} = n^{k/2}$ . Moreover, the term $(1 - p^k)^{n}$ simplifies to $(1 - 1/n)^n$ , which converges to $1/e$ for $n \to \infty$ . Thus, we obtain
Changing the base of the second factor yields
As there are clearly at least as many maximal cliques as maximal cliques of size $k$ , claimed bound for ${\mathbb{E}}[N]$ follows.
This means that in a dense Erdős–Rényi random graph (constant $p$ ), the expected number of maximal cliques is super-polynomial in $n$ . In the following, we show that, when the graph gets even denser, the number of maximal cliques even grows exponentially. For this, we prove the existence of an induced subgraph that has many maximal cliques. Specifically, we aim to find a large co-matching, that is, the complement graph of a matching (or equivalently, a co-matching).
Lemma 2.2. Let $G$ be a co-matching on $2k$ vertices. Then $G$ has $2^k$ maximal cliques.
Proof. The complement $\overline G$ of $G$ is a matching with $k$ edges. The maximal independent sets of $\overline G$ are the vertex sets that contain for each edge exactly one of its vertices. Thus, $\overline G$ has $2^k$ maximal independent sets, which implies that $G$ has $2^k$ maximal cliques.
With this, we can show an exponential lower bound for super-dense Erdős–Rényi graphs.
Theorem 2.3. For every $c\gt 0$ , there exists a $\zeta \gt 0$ and $n'\gt 0$ such that $G(n,1-c/n)$ contains at least $2^{\zeta n}$ cliques with high probability for all $n\geq n'$ .
Proof. A co-matching in $G(n,1-c/n)$ corresponds to an induced matching in $G(n,c/n)$ . Now fix $M\gt 1$ . Then, by [van der Hofstad, (2017), Theorem 5.12], with high probability the Erdős–Rényi random graph contains a linear number of vertices of degree at most $M$ and at least $1$ . Denote the reduced graph with only vertices of degree at most $M$ by $G_{\leq M}$ , which has a linear number of edges. Now we construct an induced matching of linear size in $G_{\leq M}$ as follows. Start with any edge $\{u,v\}$ in $G_{\leq M}$ , and add it to the matching. Then, remove $u$ , $v$ and all neighbors of $u$ and $v$ from $G_{\leq M}$ . This removes at most $2M^2$ edges from $G_{\leq M}$ , as all degrees are bounded by $M$ . Then, pick another edge and continue this process until $G_{\leq M}$ contains no more edges. As this process removes only a constant number of edges after picking a new edge, at least a linear number of edges will be added before the process finishes. Thus, there is an induced matching of at least $\zeta n$ with high probability, which yields the claim due to Lemma2.2.
Next we consider less dense Erdős–Rényi graphs with $p \in O(n^{-a})$ for a constant $a \in (0, 1]$ and prove a polynomial upper bound on the number of maximal cliques. The degree of the polynomial depends on $a$ . For sparse graphs with $p \in O(n^{-1})$ , our bound is linear.
Theorem 2.4. Let $p = (c / n)^a$ for constants $c \gt 0$ and $a \in (0, 1]$ and let $N$ be the number of maximal cliques in a $G(n, p)$ . Then ${\mathbb{E}}[N] \in O(n^x)$ with
Proof. As in Theorem2.1, let $N_k$ be the number of maximal cliques of size $k$ . Note that the number of maximal cliques is upper bounded by the number of (potentially non-maximal) cliques. Thus, we obtain
Using that $\binom{n}{k} \le (en / k)^k$ , inserting $p = (c / n)^a$ , and rearranging yields
We first argue that we can focus on the case where $k$ is constant as the above term vanishes sufficiently quickly for growing $k$ . For this, note that $a{k(k-1)} /{2} - k \ge k$ if $k \ge 4/a + 1$ . Thus, as $c / n \lt 1$ for sufficiently large $n$ , the second factor of Equation (3) is upper bounded by $(c / n)^k$ . For $k \ge 4/a + 1$ , it then follows that ${\mathbb{E}}[N_k] \le ({c^2e}/ (kn) )^k$ . For sufficiently large $n$ , the fraction is smaller than $1$ and thus the sum over all $N_k$ for larger values of $k$ is upper bounded by a constant due to the convergence of the geometric series.
Focusing on $k \in \Theta (1)$ and ignoring constant factors, we obtain
To evaluate the maximum, note that $x(k)$ describes a parabola with its maximum at $k_0 = 1/a + 1/2$ . However, $k_0$ may not be integral. To determine the integer $k$ that maximizes $x(k)$ , note that for $a \in [\frac{1}{i}, \frac{1}{i - 1}]$ with $i \in \mathbb N^+$ , we get $k_0 \in [i - \frac{1}{2}, i + \frac{1}{2}]$ . Thus, $i$ is the closest integer to $k_0$ . As the parabola is symmetric at its maximum $k_0$ , the exponent $x(k)$ is maximized for the integer $k = i = \lceil \frac{1}{a} \rceil$ . Substituting $k(k-1)/2 =\binom{k}{2}$ yields the claim.
3. Geometric inhomogeneous random graphs (GIRGs)
While the Erdős–Rényi random graph is homogeneous, and does not contain geometry, we now investigate the number of maximal cliques in a model that contains both these properties, the GIRG Bringmann, Keusch, and Lengler (2015). We will use similar notation as in Bringmann, Keusch, and Lengler (2015), except for the parameters $\alpha$ and $\beta$ , which we will replace by $1/T$ and $\tau$ , respectively, to be more consistent with the literature on other similar models Krioukov et al. (Reference Krioukov, Papadopoulos, Kitsak, Vahdat and Boguná2010). In this model, each vertex $v$ has a weight, $w_v$ and a position $x_v$ . The weights are independent copies of a power-law random variable $W$ with exponent $\tau$ , that is,
for all $w\geq 1$ . We impose the condition $\tau \in (2,3)$ , to ensure that the weights have finite mean but unbounded variance. The parameter $\mu$ denotes the mean of this distribution and can be computed as $\mu = (\tau -2)^{-1}$ . The vertex positions $x_1,\ldots ,x_n$ are independent copies of a uniform random variable on the $d$ -dimensional torus $\mathbb{T}^d = \mathbb{R}^d/\mathbb{Z}^d$ .
An edge between any two vertices $u,v \in V$ of the GIRG appears independently with a probability $p_{uv}$ determined by the weights and the positions of the vertices:
where $\|\cdot \|$ denotes the maximum norm on the torus, $\mu$ is a parameter controlling the average degree, and $0 \lt T \lt 1$ is the temperature and controls the influence of the geometry. We say that $T=0$ is the threshold case of the GIRG. That is, when $T=0$ ,
In general, we will be interested in results for the GIRG model when the number of nodes, $n$ , tends to infinity. We will then often refer to the family of GIRGs generated for varying $n$ by $G^{(n)}$ , where we assume that all other parameters ( $\mu ,\ \tau ,\ \gamma ,\ d$ ) remain fixed.
In the following, we first give a lower bound for the threshold case (Section 3.1). The proof makes use of the toroidal structure of the ground space. To prove that this is not essential to obtain a super-polynomial number of maximal cliques, we additionally give a lower bound for a variant of the model where the ground space is a 2-dimensional unit square with Euclidean norm (Section 3.2). Finally, in Section 3.3, we show how to extend these results to the general case with non-zero temperatures.
3.1 Threshold case
Here we show that a $d$ -dimensional threshold GIRG $G = (V, E)$ has, with high probability, a super-polynomial number of maximal cliques. To achieve this, we proceed as follows to show that $G$ has a large co-matching as induced subgraph (also see Lemma2.2). We consider the vertex set $S \subseteq V$ containing all vertices whose weight lies between a lower bound $w_\ell$ and an upper bound $w_u$ . As a co-matching is quite dense, it makes sense to think of these as rather large weights. We then define disjoint regions $B_1, \dots , B_{2k}$ . For $i \in [k]$ , we call $B_i$ and $B_{i + k}$ a pair of opposite regions. These regions will satisfy the following three properties. First, every $B_i$ contains a vertex from $S$ with high probability. Second, pairs of vertices from $S$ in opposite regions are not connected. And third, vertices from $S$ that do not lie in opposite regions are connected. Note that these properties imply the existence of a co-matching on $2k$ vertices, as choosing an arbitrary vertex of $S$ for each region $B_i$ makes it so that each chosen vertex has exactly one partner from the opposite region to which it is not connected, while it is connected to the vertices from all other regions.
In the following we first give a parameterized definition of the regions $B_i$ and then show how to choose the parameters for the above strategy to work; also see Figure 1. Each $B_i$ is an axis-aligned box, that is, the cross product of intervals. Let $g(n), h(n) \gt 0$ such that $1 / (g(n) + h(n))$ is an even number. Think of $h(n)$ of as the height of each box and of $g(n)$ as the gap between the boxes, yielding $2k = 1 / (g(n) + h(n))$ boxes. Now we define $B_i = [(i - 1) \cdot (g(n) + h(n)), (i - 1) \cdot (g(n) + h(n)) + h(n)] \times [0, \frac{1}{2} - g(n)]^{d - 1}$ for $i \in [2k]$ . We call the resulting regions $B_1, \dots , B_{2k}$ the evenly spaced boxes of height $h(n)$ and gap $g(n)$ , see Figure 1 for an illustration for $d=1$ and $d=2$ . As before, $B_i$ and $B_{i + k}$ for $i \in [k]$ are opposite boxes.
With this, note that the distance between any pair of points in opposite boxes is at least $u = \frac{1}{2} - h(n)$ (recall that we assume the infinity norm). Moreover, the distance between any pair of points in non-opposite regions is at most $\ell = \frac{1}{2} - g(n)$ . This yields the following lemma.
Lemma 3.1. Let $B_1, \dots , B_{2k}$ be evenly spaced boxes of height $h(n)$ and gap $g(n)$ in $\mathbb T^d$ . Let $w_\ell = (\frac{1}{2} - g(n))^{d / 2} \sqrt{\mu n}$ and $w_u = (\frac{1}{2} - h(n))^{d / 2} \sqrt{\mu n}$ . If a GIRG on $n$ vertices with $T=0$ , $\tau \in (2,3)$ and $\mu$ places one vertex of weight in $[w_\ell , w_u)$ in each box $B_i$ , then these vertices form a co-matching.
Proof. As observed above, the vertices in opposite boxes have distance at least $u = \frac{1}{2} - h(n)$ . Moreover, the vertices considered here have weight less than $w_u = u^{d / 2} \sqrt{\mu n}$ . As $w_u^2 / (\mu n u^d) = 1$ , these vertices are not connected because the weight interval $[w_\ell , w_u)$ is open at $w_u$ (see Equation (6)). Similarly, vertices in non-opposite boxes have distance at most $\ell = \frac{1}{2} - g(n)$ and weight at least $w_\ell = \ell ^{d / 2} \sqrt{\mu n}$ . As $w_\ell ^2 / (\mu n \ell ^d) = 1$ , such vertices are connected. Hence, we get a co-matching.
It now remains to choose $g(n)$ and $h(n)$ appropriately. First observe that, for the weight range in Lemma 3.1 to be non-empty, we need $w_\ell \lt w_u$ and thus $g \gt h$ . Beyond that, we want to achieve the following three goals. First, the weight range needs to be sufficiently large such that we actually have a sufficient number of vertices in this range. For this, we want to choose $g(n)$ substantially larger than $h(n)$ . Second, we want to make each box $B_i$ sufficiently large for it to contain a vertex with high probability. For this, we mainly want $h(n)$ to be large. Third, we want the number of boxes $2k = 1 / (g(n) + h(n))$ to be large to obtain a large co-matching. For this, we want $g(n)$ and $h(n)$ to be small.
Note that the restrictions of choosing $h(n)$ large, $g(n)$ larger than $h(n)$ , and $g(n) + h(n)$ small are obviously conflicting. In the following, we show how to balance these goals out to obtain a co-matching of polynomial size. We start by estimating the number of vertices in the given weight range in the following lemma, which is slightly more general then we need.
Lemma 3.2. Let the vertex weights independently be sampled as in (4) , with $\tau \in (2,3)$ . Let $a, b \gt 0$ be constants and let $g(n), h(n)$ be functions of $n$ such that $g(n), h(n) \in o(1)$ . Let $S$ be the set of vertices with weight in $[(a - g(n))^b \sqrt{\mu n}, (a - h(n))^b \sqrt{\mu n})$ . Then
Proof. Recall from (4) that the cumulative distribution function for the weights is $F(x) = 1 - x^{1 - \tau }$ . Thus, we get
We can now use the Taylor expansion of $f(x) = (a - x)^c$ at $0$ to obtain the bound $f(x) = a^c - c a^{c - 1} x \pm O(x^2)$ , which is valid for $x \in o(1)$ . Since $g(n), h(n) \in o(1)$ we can thus bound the above term in parentheses for $c = b(1 - \tau )$ as
Note that, if additionally $h(n) \in o(g(n))$ , we can write the last factor as $g(n)(1 - h(n)/g(n) \pm O(g(n))) = g(n)(1 \pm o(1))$ and obtain the following corollary.
Corollary 3.3. Let $a, b \gt 0$ be constants and let $g(n), h(n)$ be functions of $n$ such that $g(n) \in o(1)$ and $h(n) \in o(g(n))$ . Let $S$ be the set of vertices with weight in $[(a - g(n))^b \sqrt{\mu n}, (a - h(n))^b \sqrt{\mu n})$ . Then
Consider again the weights $w_\ell$ and $w_u$ as given in Lemma3.1 and let $S$ be the set of vertices in $[w_\ell , w_u)$ . Then Corollary3.3 in particular implies that $S$ contains $\Theta (g(n) \cdot n^{\frac{3 - \tau }{2}})$ vertices in expectation.
With this, we turn to our second goal mentioned above, namely that each box $B_i$ should be sufficiently large.
Lemma 3.4. Let $B_1, \dots , B_{2k}$ be evenly spaced boxes of height $h(n)$ and gap $g(n)$ in $\mathbb T^d$ . If $g(n) \in o(1)$ then each box $B_i$ has volume $h / 2^{d - 1} \cdot (1 - o(1))$ .
Proof. Recall that the height of $B_i$ is $h(n)$ while its extent in all other dimensions is $\ell = \frac{1}{2} - g(n)$ . Thus its volume is $h(n) \cdot (\frac{1}{2} - g(n))^{d - 1} = h(n) / 2^{d - 1} \cdot (1 - 2g(n))^{d - 1}$ . The claim follows from the fact that $(1 - 2g(n))^{d - 1}$ approaches $1$ from below for $n \to \infty$ as $g(n) \in o(1)$ and $d$ constant.
Corollary3.3 and Lemma3.4 together tell us that the expected number of vertices in each box that have a weight in the desired range is in $\Theta (h(n) \cdot g(n) \cdot n^{\frac{3 - \tau }{2}})$ . Recall we want to choose $h(n)$ and $g(n)$ as small as possible such that each box still contains a vertex with high probability. We set $h(n) = c\cdot n^{-\frac{3 - \tau }{4}}$ and $g(n) = c\cdot n^{-\frac{3 - \tau }{4} + \varepsilon }$ for arbitrary constants $c \gt 0$ and $\varepsilon \gt 0$ . Note that this satisfies the condition $h(n) \in o(g(n))$ of Corollary3.3 and yields an expected number of $\Theta (n^\varepsilon )$ vertices with the desired weight in each box. Since the number of vertices in a given box follows a binomial distribution and since $n^{\varepsilon } \in \omega (\log (n))$ , we can apply a Chernoff bound to conclude that actual number of vertices matches the expected value (up to constant factors) with probability $1 - O(n^{-c'})$ for any $c' \gt 0$ [Bläsius et al., (Reference Bläsius, Freiberger, Friedrich, Katzmann, Montenegro-Retana and Thieffry2022) Corollaries 2.3 and 2.4]. Together with a union bound, it follows that every box contains $\Theta (n^{\varepsilon })$ vertices (and thus at least one vertex) with probability $1 - O(2k \cdot n^{-c'})$ . By choosing $g(n), h(n)$ , and $k$ appropriately, we obtain the following theorem.
Theorem 3.5. Let $G^{(n)}$ be a $d$ -dimensional GIRG with $T=0$ , $\mu \gt 0$ and $\tau \in (2,3)$ and let $s \gt 0$ and $\varepsilon \gt 0$ be arbitrary constants. Then, with high probability, $G^{(n)}$ contains a co-matching of size $s \cdot n^{\frac{3 - \tau }{4} - \varepsilon }$ as induced subgraph.
Proof. Let $B_1, \dots , B_{2k}$ be evenly spaced boxes of height $h(n) = c\cdot n^{-\frac{3 - \tau }{4}}$ and gap $g(n) = c\cdot n^{-\frac{3 - \tau }{4} + \varepsilon }$ (for appropriately chosen $c \gt 0$ , which will be determined later). Let $w_\ell$ and $w_u$ be defined as in Lemma3.1. As argued above, Corollary3.3 and Lemma3.4 imply that, with high probability, each box $B_i$ includes at least one vertex with weight in $[w_\ell , w_u)$ . By Lemma3.1 any set that contains exactly one vertex of each box forms a co-matching of size $2k$ .
Recall that $2k = 1 / (g(n) + h(n))$ . Thus, we can choose $c$ such that $2k = s \cdot n^{\frac{3 - \tau }{4} - \varepsilon }$ . Again, by the above argumentation, it follows that every box contains at least one vertex with probability $1 - O(2k n^{-c'}) = 1 - O(n^{\frac{3 - \tau }{4} - \varepsilon - c'})$ for any constant $c' \gt 0$ . Choosing $c'$ sufficiently large then yields the claim.
This theorem together with Lemma2.2 directly imply the following corollary.
Corollary 3.6. Let $G^{(n)}$ be a $d$ -dimensional GIRG with $T = 0$ , $\mu \gt 0$ and $\tau \in (2,3)$ , and let $b \gt 0$ and $\varepsilon \gt 0$ be arbitrary constants. Then, with high probability, the number of maximal cliques in $G^{(n)}$ is at least $b^{n^{(3 - \tau )/4 - \varepsilon }}$ .
Figure 2a shows this lower bound for $b=2$ against $n$ . Interestingly, while Corollary3.6 shows that the number of maximal cliques grows super-polynomially in $n$ , for $\tau \gt 2$ , this growth may still be slower than the linear slope $n$ for large geometric networks. This is of particular importance as the smaller order terms of the number of maximal cliques contain terms of at least $\Theta (n)$ . Indeed, the number of maximal 2-cliques is lower bounded by the number of vertices of degree 1, which scales linearly by Equation (4). Thus, for practical purposes, the dominant term could be the linear term instead of the super-polynomial term, especially if the degree exponent is close to 3.
3.2 GIRG with 2-dimensional square
Our previous lower bound for the number of maximal cliques relies on the toroidal structure of the underlying space. We now show that even if the vertex positions are constrained to be positioned in the square $[0,1]^2$ instead, the GIRG still contains a super-polynomial number of maximal cliques. In this setting, we will also switch from the infinity norm to the 2-norm. We will discuss possible extensions to other norms in Section 6.
Theorem 3.7. For any $\varepsilon \gt 0$ and $b\gt 0$ , a 2-dimensional GIRG $G^{(n)}$ on $n$ vertices with vertex positions uniformly distributed over $[0,1]^2$ equipped with the 2-norm and $T=0$ , $\mu \gt 0$ , and $\tau \in (2,3)$ contains with high probability at least
maximal cliques for some $C\gt 0$ .
Proof. Let $S$ be the set of vertices with weights within $[a\sqrt{\mu n(1-c\cdot n^{-\beta })},a\sqrt{\mu n}]$ for some $0\lt a\lt 1/4$ , $\beta \gt 0$ (and appropriately chosen $c \gt 0$ , which will be determined later). By Corollary3.3,
Let $\mathcal{C}$ be a circle on $[0,1]^2$ of constant radius $R\lt 1/4$ . We now create an even number of areas $B_1,\dots ,B_{2k}$ of height $h(n)$ , evenly distributed over $\mathcal{C}$ as illustrated in Figure 3a. That is, we consider $2k$ identical and evenly spaced circular segments $B_1,\dots , B_{2k}$ of height $h$ and chord length $q$ . We ensure that any pair of vertices in two opposite areas $B_i$ and $B_{i+k}$ are disconnected. That is, the distance $t$ between the two ends of these areas should equal
We also ensure that any pair of vertices in non-opposite areas connect. This means that the distance $\ell$ between the rightmost part of $B_i$ and the leftmost part of any non-opposite $B_j$ is at most
The width $q$ of one of the $B_i$ ’s is given by
Thus, the area of $B_i$ is given by
for some constant $c_1\gt 0$ . The probability that a given area contains no vertices from $S$ is given by
for some $c_2\gt 0$ .
We now calculate the maximal number of areas that we can pack on $\mathcal{C}$ . The circumference of $\mathcal{C}$ is $2\pi R$ . The arc length of a single area is at most $q\pi /2$ . Furthermore, the arc length of the section with a chord of length $\ell$ , $a(\ell )$ , is given by
using the Taylor series of $\sin ^{-1}(1-x)$ around $x=0$ . Now take $h(n)=c\cdot n^{-\gamma (n)}$ . Then, by Equation (18), $a(\ell )$ scales as
Now the arc length between two adjacent sections $B_i$ and $B_{i+1}$ is equal to $\pi R-a(\ell )$ . This means that the arc length between $B_i$ and $B_{i+1}$ scales as $\pi R-(\pi R-R\Theta \sqrt{c\cdot n^{-\beta }+c\cdot n^{-\gamma (n)})})=R\Theta (\sqrt{c\cdot n^{-\beta }+c\cdot n^{-\gamma (n)}})$ .
The maximal value of the number of possible areas $2k$ , is the total circumference of $\mathcal{C}$ divided by the arc length of an interval $B_i$ and the arc length between $B_i$ and $B_{i+1}$ , which yields
Thus, by choosing $c$ correctly, we can let $2k= s\cdot \min (n^{\beta /2},n^{\gamma (n)/2})$ for any $s\gt 0$ . When all $i\in [2k]$ contain at least one vertex in $S$ , any set of $2k$ vertices with exactly one vertex in each $B_i$ forms a co-matching, as illustrated in Figure 3b. Furthermore E quation(17) shows that with high probability, all $B_i$ are non-empty, as long as $n^{(3-\tau )/2-\beta }h(n)^{3/2}\to \infty$ as $n\to \infty$ .
We therefore choose $\beta =(3-\tau )/5-\varepsilon$ and $\gamma (n)=(3-\tau )/5$ . Then, with high probability there is a co-matching of size $2k =s\cdot n^{(3-\tau )/10-\varepsilon }$ . Thus, by Lemma2.2 and choosing $s$ sufficiently large yields that for fixed $b\gt 0$ the number of maximal cliques can be bounded from below by
Figure 2b shows the lower bound of Theorem3.7 against $n$ . As for the toroidal case, the super-polynomial growth may be dominated by lower-order linear terms.
3.3 Non-threshold case
We now show how our constructions extend to the non-threshold GIRG, where the connection probability is given by Equation (5) instead of Equation (6).
Theorem 3.8. Let $G^{(n)}$ be a $d$ -dimensional GIRG on $n$ vertices with $T \gt 0$ , $\mu \gt 0$ and $\tau \in (2,3)$ and let $s \gt 0$ be an arbitrary constant. Then, there exists an $\varepsilon \gt 0$ such that, with high probability, $G$ contains a co-matching of size $s \cdot n^{(3 - \tau )/5} \cdot (\varepsilon \log n)^{-(1/2)}$ as induced subgraph.
Proof. As before, we consider $2k$ boxes $B_1, \ldots , B_{2k}$ with height $h(n)$ and gap $g(n)$ , though now we choose
for a constant $\varepsilon \gt 0$ , which we determine below. We again focus on the vertex set $S$ containing all vertices with weights in $[w_\ell , w_u]$ , though our choice for $w_u$ is slightly different. In particular, we choose
Our goal now is to show that, with high probability, there exists at least one co-matching that contains one vertex from each box. That is, if $M$ denotes the number of such co-matchings, we want to show that $M \gt 0$ with high probability.
We start by bounding the number of vertices from $S$ that lie in a given box $B_i$ , denoted by $S(B_i)$ . Since $T$ and $d$ are constants and $(\varepsilon \log n)^{1/2} \in \omega (1)$ , we have $(T/d + 1)h(n) \in o(g(n))$ , allowing us to bound $\mathbb{E}[|S|]$ using Corollary3.3, which yields
Moreover, since the vertices are distributed uniformly at random in the ground space, the expected fraction of vertices from $S$ that lie in the box $B_i$ is proportional to its volume, which is $\Theta (h)$ according to Lemma3.4. It follows that
Analogous to the proof of Lemma3.4 we can apply a Chernoff bound to conclude that the number of vertices in $S(B_i)$ matches the expected value (up to constant factors) with probability $1 - O(n^{-c})$ for any $c \gt 0$ . Note that the number of boxes is given by
which is at most $n$ . Thus, applying the union bound yields that with high probability every box contains $n' \in \Theta (n^{(3 - \tau )(1/2 - 2/5)} (\varepsilon \log n)^{1/2})$ vertices. In the following, we implicitly condition on this event to happen. Now recall that a co-matching consisting of one vertex from each box forms if each vertex is adjacent to the vertices in all other boxes, except the vertex from the opposite box.
Despite the temperature, vertices in non-opposite boxes are still adjacent with probability $1$ , since the weight of two such vertices $i$ and $j$ is at least $w_\ell$ and their distance is at most $\frac{1}{2} - g(n)$ and, thus, according to Equation (5)
In contrast to the threshold case, however, the probability for vertices in opposite boxes to be adjacent is no longer $0$ . Since two such vertices $i$ and $j$ have distance at least $\frac{1}{2} - h(n)$ and weight at most $w_u = (\frac{1}{2} - (T/d + 1)h)^{d/2}\sqrt{\mu n}$ , we can bound the probability for them to be adjacent using Equation (5), which yields
Since $1 - x \le e^{-x}$ , we obtain $p_{ij} \le e^{-2h(n)}$ .
With this we are now ready to bound the probability $\mathbb{P}\left (M \gt 0\right )$ , that at least one co-matching forms that contains one vertex from each box. To this end, we need to find one non-edge in each pair of opposite boxes, that is, each such pair needs to contain two vertices (one from each box) that are not adjacent. Conversely, the only way to not find a co-matching is if there exists one pair of opposite boxes such that all vertices in one box are adjacent to all vertices in the other. This happens with probability at most $(p_{ij})^{(n')^2}$ . Since there are $k$ pairs of opposite boxes, applying the union bound yields
Now recall that $n' \in \Theta (n^{(3 - \tau )(1/2 - 2/5)} (\varepsilon \log n)^{1/2})$ and that $h(n) = 1/(2s) \cdot n^{-(3 - \tau )/5}$ . Consequently, we obtain
Moreover, since $k \in O(n^{(3 - \tau )/5})$ (see Equation 22), we have
meaning, for sufficiently large $n$ , we can choose $\varepsilon$ such that $\mathbb{P}\left (M = 0\right ) \in O(n^{-1})$ and, conversely, $\mathbb{P}\left (M \gt 0\right ) = 1 - O(n^{-1})$ . So with high probability there exists at least one co-matching of size
where the inequality holds for sufficiently large $n$ .
Together with Lemma2.2 we obtain the following corollary.
Corollary 3.9. Let $G^{(n)}$ be a $d$ -dimensional GIRG on $n$ vertices with $T \gt 0$ , $\mu \gt 0$ , and $\tau \in (2,3)$ and let $b \gt 0$ be an arbitrary constant. Then there exists an $\varepsilon \gt 0$ such that, with high probability, the number of maximal cliques in $G^{(n)}$ is at least
We can extend Theorem3.7 to non-zero temperature in a very similar fashion (proof is in Appendix A)
Theorem 3.10. For any $\varepsilon \gt 0$ and $b\gt 1$ , a 2-dimensional GIRG $G^{(n)}$ on $n$ vertices with $T\gt 0$ and vertex positions uniformly distributed over $[0,1]^2$ contains with high probability at least
maximal cliques.
4. Inhomogeneous random graphs (IRGs)
We now turn to a random graph model that is scale-free, but does not contain a source of geometry, the IRG, or Chung–Lu random graph Chung and Lu (Reference Chung and Lu2002). We show that also in this model, the number of maximal cliques scales super-polynomially in the network size $n$ . Again, every vertex $i$ draws its weight $w_i$ independently from the power-law distribution Equation (4), where we will again assume that $\tau \in (2,3)$ . Then, all pairs of vertices $u$ and $v$ connect independently with probability:
where $\mu$ controls the expected average degree.
To show a lower bound on the number of maximal cliques, we make use of the fact that an IRG contains a not too small rather dense subgraph with high probability. The following theorem is obtained by looking just at the subgraph induced by vertices with weights in a certain range. We chose the specific range to satisfy three criteria. First, the range is sufficiently large, such that the subgraph contains many vertices. Second, the range is sufficiently small such that all vertex pairs in the subgraph are connected with a similar probability. And third, the weights are large enough such that a densely connected subgraph forms, but not so large that the vertices merge into a single clique.
Theorem 4.1. Let $G^{(n)}$ be an IRG on $n$ vertices with $\tau \in (2, 3)$ and $\mu \gt 0$ and let $b \gt 1$ and $\varepsilon \in (0, \frac{3 - \tau }{4})$ be arbitrary constants. Then, the expected number of maximal cliques in $G^{(n)}$ is in
Proof. We show that already the subgraph $G'$ induced by the vertices in a certain weight range has the claimed expected number of maximal cliques. To define $G'$ , we consider weights in $[w_\ell , w_u]$ with $w_\ell = \sqrt{(1 - g(n)) \mu n}$ and $w_u = \sqrt{(1 - h(n)) \mu n}$ . To abbreviate notation, let
For constants $a$ and $c$ we determine later, we choose $g(n)$ and $h(n)$ as:
Note that $w_\ell \lt w_u$ if and only if $a \gt 1$ . Let $n'$ be the number of vertices in $G'$ . From Lemma3.2 it follows:
As every vertex has independently the same probability to be in $G'$ , a Chernoff bound implies that $n' \in \Theta (n^\varepsilon \gamma (n) \log n)$ holds with high probability. Thus, in the following, we implicitly condition on this event to happen.
To give a lower bound on the number of maximal cliques in $G'$ , we only count the number $N_k$ of maximal cliques of size $k$ with
We note that this is the same constant $c$ as in the definition of $h(n)$ above. As the number of maximal cliques in $G'$ is a lower bound for the number of maximal cliques in $G^{(n)}$ , we lower bound the expectation of $N_k$ , by the expected number of maximal cliques in $G'$ ,
In the following, we give estimates for the three terms individually.
We start with the event that $C$ is a clique. Due to the lower and upper bound on the weights in $G'$ , it follows that any pair of vertices in $G'$ is connected with probability at least $p_\ell = 1 - g(n)$ and at most $p_u = 1 - h(n)$ . Thus, for a fixed subset $C$ of vertices of size $|C| = k$ , the probability that all $k$ vertices are pairwise connected is at least $p_\ell ^{k (k - 1) / 2} = (1 - g(n))^{k (k - 1) / 2}$ . As $g(n)$ goes to $0$ for growing $n$ and $1 - x \in \Omega (\exp ({-}x))$ in this case, we get
For $C$ to be a maximal clique (conditioning on it being a clique), additionally no other vertex can be connected to all vertices from $C$ . This probability is at least $(1 - p_u^k)^{n' - k} = (1 - (1 - h(n))^k)^{n' - k}$ . As $1 - x \le \exp ({-}x)$ , it follows that $(1 - h(n))^k \le \exp ({-}hk) = n^{-3\varepsilon }$ , where the last equality follows from plugging in the values we chose for $h(n)$ and $k$ . Again using $1 - x \in \Omega (\exp ({-}x))$ for sufficiently small $x$ , we can conclude that
Finally, for the binomial coefficient, we get
Our goal is to show that $\log ({\mathbb{E}}[N_k])\geq n^{(3-\tau )/4-\varepsilon }\log (n)\log (b)$ . Thus, plugging Equations (29), (30), and (31) into the logarithm of (28) yields that we need to show that for every constant $b \gt 1$ , we can choose the constants $a \gt 1$ and $c$ in the definitions of $g(n)$ and $h(n)$ such that
This can be achieved by simply plugging in the values for $n'$ , $k$ , and $g(n)$ . For the first (and only positive) term, we obtain
and thus for sufficiently large $n$
For the negative terms, we start with the latter and obtain
This is asymptotically smaller than the positive term and can thus be ignored. For the other negative term, first note that $gk = 3 a \varepsilon \log n$ . Thus, we obtain
Together with the positive term, we obtain that for sufficiently large $n$ , it holds
With this, we can choose $a \gt 1$ such that the first factor is positive and we can choose $c$ such that $\varepsilon ^2 / c = \log b$ , which proves (32), as then
for some $C\gt 0$ .
Figure 2c shows that the lower bound provided by Theorem4.1 may still be smaller than linear for networks that are quite large, especially when $\tau \approx 3$ .
4.1 Small maximal cliques are rare
We now focus on the maximal cliques of a fixed size in the IRG. How many maximal cliques of size $k$ are present in an IRG?
Let $N(K_k)$ denote the number of maximal cliques of size $k$ . Furthermore, let $M_n(\varepsilon )$ denote
Thus, $M_n(\varepsilon )$ is the set of sets of $k$ vertices such that two vertices have weight proportional to $n^{(\tau -2)/(\tau -1)}$ , and all other vertices have weights proportional to $n^{1/(\tau -1)}$ . Denote the number of maximal $k$ -cliques with sets of vertices in $M_n(\varepsilon )$ by $N(K_k,M_n(\varepsilon ))$ . Then, the following theorem shows that these ‘typical’ maximal cliques are asymptotically all maximal cliques. Furthermore, it shows that all maximal cliques of size $k\gt 2$ occur equally frequently in scaling, and they also appear on the same types of vertices. Here we use $\stackrel{\mathbb{P}}{\longrightarrow }$ to denote convergence in probability.
Theorem 4.2 (Maximal clique localization). Let $G^{(n)}$ be an IRG on $n$ vertices with $\tau \in (2, 3)$ and $\mu \gt 0$ . For any fixed $k\geq 3$ ,
-
(i) For any $\varepsilon _n$ such that $\lim _{n\to \infty }\varepsilon _n=0$ ,
(34) \begin{equation} \frac{N\big (K_k,M_n\left (\varepsilon _n\right )\big ) }{N(K_k)}{\stackrel{\mathbb{P}}{\longrightarrow }} 1. \end{equation} -
2. Furthermore, for any fixed $0\lt \varepsilon \lt 1$ ,
(35) \begin{equation} {\mathbb{E}}\left [{N(K_k,M_n(\varepsilon ))}\right ]=\Theta (n^{(3-\tau )(2\tau -3)/(\tau -1)}). \end{equation}
Theorem4.2(i) states that asymptotically all maximal $k$ -cliques are formed between two vertices of weights proportional to $n^{(\tau -2)/(\tau -1)}$ and all other vertices of weights proportional to $n^{1/(\tau -1)}$ . Theorem4.2(ii) then shows that there are proportional to $n^{(3-\tau )(2\tau -3)/(\tau -1)}$ such maximal $k$ -cliques. As visualized in Figure 4, this scaling is significantly smaller than the scaling of the total number of $k$ -cliques, which scales as $n^{k/2(3-\tau )}$ Hofstad, Leeuwaarden, and Stegehuis (2021). Interestingly, the scaling of the number of maximal cliques is $k$ -independent, contrary to the total number of cliques. In particular, the number of $k$ maximal cliques is always $o(n)$ , contrary to the number of $k$ -cliques which scales larger than $n$ when $\tau \lt 3-2/k$ . This shows once more that the large number of maximal cliques in the IRG is caused by extremely large maximal cliques, as fixed-size maximal cliques are only linearly many.
To prove this theorem, we need the following technical lemma, which is proven in Appendix B:
Lemma 4.3. When $\tau \in (2,3)$ , then
Furthermore, we need a lemma that bounds the probability that a given clique on vertices of weights $x_1\leq x_2\dots \leq x_k$ is maximal:
Lemma 4.4. Let $G$ be an IRG with $\tau \in (2, 3)$ and $\mu \gt 0$ . Then, the probability that a given clique between $k$ vertices of weights $x_1\leq x_2\dots \leq x_k$ is maximal is bounded by
for some $0\lt C_1\leq C_2\lt \infty$ .
Proof. When $x_{1}\leq x_2\leq \dots \leq x_k$ , we can compute the probability that this $k$ clique is part of a larger clique with a randomly chosen vertex as:
for some $c_1,\dots , c_k\gt 0.$ When $x_1\leq x_2\leq \dots \leq x_k$ , this term becomes
The ratio between two consecutive terms of this summation equals
Now as $x_{l}\leq x_{{l+1}}$ and $\tau \in (2,3)$ , this ratio is larger than 1 for $l\geq 2$ , and smaller than one for $l=1$ . This means that the summation can be dominated by
for some $C\gt 0$ .
Thus, the probability that a clique on vertices with weights $x_1,\dots , x_k$ is maximal can be upper bounded by:
We lower bound the probability that the clique is maximal by using that
Thus,
Now we are ready to prove Theorem4.2:
Proof of Theorem 4.2. Fix $\ell _i\leq u_i$ for $i\in [k]$ . We now compute the expected number of maximal $k$ -cliques in which the vertices have weights $n^{(\tau -2)/(\tau -1)}[\ell _i,u_i]$ for $i=1,2$ , and $n^{1/(\tau -1)}[\ell _i,u_i]$ for $i\geq 3$ .
We bound the expected number of such maximal copies of $K_k$ by:
where $I(K_k, \boldsymbol{v})$ is the indicator that a maximal $k$ -clique is present on vertices $\boldsymbol{v}$ , and the sum over $\boldsymbol{v}$ is over all possible sets of $k$ vertices. Now the probability that a clique is maximal can be upper bounded as in Lemma4.4.
We bound the minimum in Equation (45) by:
-
(a) $x_ix_j/n$ for $\{i,j\}=\{1,2\}$ or $i=1,j\geq 3$ ;
-
(b) 1 for $i,j\geq 3$ .
Making the change of variables $x_i=y_in^{1/(\tau -1)}$ for $i=3,\dots ,k$ and $x_i=y_i/n^{(\tau -2)/(\tau -1)}$ otherwise, we obtain the bound
for some $\tilde{K}\gt 0$ . Because the weights are sampled i.i.d. from a power-law distribution, the maximal weight $w_{\max }$ satisfies that for any $\eta _n\to 0$ , $w_{\max }\leq n^{1/(\tau -1)}/\eta _n$ with high probability. Thus, we may assume that $u_i\leq 1/\eta _n$ when $i\geq 3$ . Now suppose that at least one vertex has weight smaller than $\varepsilon _n n^{(\tau -2)/(\tau -1)}$ for $i=1,2$ or smaller than $\varepsilon _n n^{1/(\tau -1)}$ for $i\geq 3$ . This corresponds to taking $u_i=\varepsilon _n$ and $\ell _i=0$ for at least one $i$ , or at least one integral in Equation (45) with interval $[0,\varepsilon _n]$ . Similarly, when vertex 1 or 2 has weight higher than $1/\varepsilon _n n^{(\tau -2)/(\tau -1)}$ , this corresponds to taking $\ell _i=1/\varepsilon _n$ and $u_i=\infty$ for $i=1$ or 2, or at least one integral in (45) with interval $[1/\varepsilon _n,\infty ]$ . Lemma4.3 then shows that these integrals tends to zero when choosing $u_i=\eta _n$ fixed for $i\geq 3$ and $\varepsilon _n\to 0$ . Thus, choosing $\eta _n\to 0$ sufficiently slowly compared to $\varepsilon _n$ yields that
where
Let $\bar{\Gamma }_n(\varepsilon _n,\eta _n)$ be the complement of $\Gamma _n(\varepsilon _n,\eta _n)$ . Denote the number of maximal cliques with vertices in $\bar{\Gamma }_n(\varepsilon _n,\eta _n)$ by $N(K_k,\bar{\Gamma }_n(\varepsilon _n,\eta _n))$ . Since $w_{\max }\leq n^{1/(\tau -1)}/\eta _n$ with high probability, $\Gamma _n(\varepsilon _n,\eta _n)=M_n(\varepsilon _n)$ with high probability. Therefore, with high probability,
where $N\big (K_k,\bar{M}_n\left (\varepsilon _n\right )\big )$ denotes the number of maximal $k$ -cliques on vertices not in $M_n\left (\varepsilon _n\right )$ . By (46) and the Markov inequality, we have for all $\epsilon \gt 0$
Furthermore, Lemma4.3 combined with the lower bound in (45) shows that when choosing $u_i=1/\varepsilon$ and $\ell _i=\varepsilon$ for some fixed $\varepsilon \gt 0$ for all $i$ ,
Thus, for fixed $\varepsilon \gt 0$ ,
which shows that
as required. This completes the proof of Theorem4.2.
5. Experiments
As mentioned in the introduction, empirical evidence suggests that the number of maximal cliques in IRGs and GIRGs is small Bläsius and Fischbeck (Reference Bläsius and Fischbeck2022). In fact, all generated networks with $n ={50}\,\textrm{k}$ nodes and expected average degree $10$ have fewer maximal cliques than edges. This stands in stark contrast to our super-polynomial lower bounds. This discrepancy probably comes from the fact that $n ={50}\,\textrm{k}$ is low enough that a linear lower-order term dominates the super-polynomial terms. In this section, we complement our theoretical lower bounds with experimentsFootnote 1 with an $n$ that is sufficiently large to make the super-polynomial terms dominant. Additionally, we consider dense and super-dense Erdős–Rényi graphs.
5.1 Cliques in the dense subgraph of GIRGs and IRGs
Our theoretical lower bounds are based on the existence of a dense subgraph among the vertices with weights $\Theta (\sqrt{n})$ . To experimentally observe the super-polynomial scaling, we generate IRGs and GIRGs restricted to vertices of high weight. This restriction lets us consider much larger values of $n$ . In the following, we first describe the exact experiment setup, before describing and discussing the results.
Experiment setup
We generate IRGs and GIRGs with varying number of vertices $n$ and deterministic power-law weights where the $v$ th vertex has weight
Note that the minimum weight is $w_n = 1$ .
We use the power-law exponents $\tau \in \{2.2, 2.5, 2.8\}$ and for GIRGs we consider the temperatures $T \in \{0, 0.4, 0.8\}$ and dimension $d = 1$ . For each parameter setting, we consider two subgraphs: The subgraph induced by vertices with $0.5 \sqrt{n} \le w_i \le \sqrt{n}$ and within the larger interval $0.5 \sqrt{n} \le w_i \le n$ . In preliminary experiments, we also tried constant factors other than $0.5$ , yielding comparable results.
As connection probability for the IRGs between the $u$ th and $v$ th vertex, we use $\min \{1, w_u w_v / n\}$ , that is, vertices of weight $1$ have connection probability $1 / n$ and vertices of weight at least $\sqrt{n}$ are deterministically connected. For GIRGs, we choose the constant factor $\mu$ in Equation (5) such that we obtain the same expectedFootnote 2 average degree as for the corresponding IRG in the considered subgraph. For each of these configurations, we generate 10 graphs. Figure 5 shows the average.
General observations
One can clearly see in Figure 5 (top row) that the scaling of the number of cliques depending on the graph size is super-polynomial (upward curves in a plot with logarithmic axes). Thus, on the one hand, this agrees with our theoretical analysis. On the other hand, the plots also explain why previous experiments Bläsius and Fischbeck (Reference Bläsius and Fischbeck2022) showed a small number of cliques: While the scaling is super-polynomial, the constant factors are quite low. In the top-left plot for $\tau = 2.5$ , more than 200 M nodes are necessary to get just barely above 1 M maximal cliques in the dense subgraph. For $\tau = 0.8$ this is even more extreme with $n ={200}\,\textrm{T}$ yielding only 10 k maximal cliques. Thus, unless we deal with huge graphs, the maximal cliques in the dense part of the graph are dominated by the number of cliques in the sparser parts, despite the super-polynomial growth of the former.
Effect of the power-law exponent $\tau$
The top plots of Figure 5 show that a smaller power-law exponent $\tau$ leads to more maximal cliques. The bottom plots show the number of cliques with respect to the size of the dense subgraph and not with respect to the size of the full graph. One can see that the difference for the different power-law exponents solely comes from the fact that the dense subgraph is larger for smaller $\tau$ . For the same size of the dense subgraph, the scaling is almost independent of the power-law exponent.
Effect of the geometry
In the left plots of Figure 5, we can see that geometry leads to fewer maximal cliques. For $T = 0$ , the super-polynomial scaling is only barely noticeable. Higher temperatures lead to a larger number of cliques, and we get even more cliques for IRGs. Interestingly, the scaling is slower for IRGs when additionally considering the core of vertices with weight more than $\sqrt{n}$ (see next paragraph).
Effect of the core
When not capping the weight at $\sqrt{n}$ but also considering vertices of even higher weight (right plots), we can observe the following. The overall picture remains similar, with a slightly increased number of cliques. However, this increase is higher for GIRGs than it is for IRGs. A potential explanation for this is the following. For IRGs, the core forms a clique and adding a large clique to the graph does not change the overall number of maximal cliques by too much. For GIRGs, however, it depends on the constant $\mu$ controlling the average degree whether this subgraph forms a clique or not. Thus, for the same average degree, the maximum clique is probably somewhat smaller for GIRGs and thus adding the vertices of weight at least $\sqrt{n}$ leads to more additional cliques than in IRGs.
5.2 Cliques in the dense and super-dense Erdős–Rényi graphs
Here we count the cliques for dense Erdős–Rényi graphs with constant connection probabilities $p \in \{0.6, 0.7, 0.8, 0.9\}$ and super-dense Erdős–Rényi graphs with connection probability $p = 1 - c / n$ for $c \in \{1, 2, 4, 8\}$ . Note that the complement of a super-dense Erdős–Rényi graph has constant expected average degree. The scaling of the number of cliques with respect to the number of vertices is shown in Figure 6, where each point represents 20 samples.
Note that for constant $p$ , the left plot with logarithmic $y$ -axis is curved downward, indicating sub-exponential scaling, while the middle plot with logarithmic $x$ - and $y$ -axis is bent upward, indicating super-polynomial scaling. This is in line with our lower bound in Theorem2.1.
For the super-dense case, the right plot indicates exponential scaling, in line with Theorem2.3.
6. Conclusion and discussion
In this paper, we have investigated the number of maximal cliques in three random graph models: the Erdős–Rényi random graph, the IRG and the GIRG. We have shown that sparse Erdős–Rényi random graphs only contain a polynomial amount of maximal cliques, but in the other two sparse models, the number of maximal cliques scales at least super-polynomially in the network size. This is caused by the degree-heterogeneity in these models, as many large maximal cliques are present close to the core of these random graphs. We prove that there only exist a linear amount of small maximal cliques. Interestingly, these small maximal cliques are almost always formed by two low-degree vertices, whereas all other vertices are hubs of high degree.
We have then shown that this dominant super-polynomial behavior of the number of maximal cliques often only kicks for extreme network sizes, and that experimentally, lower-order linear terms instead drive the scaling of the number of maximal cliques until large values of the network size. This explains the dichotomy between the theoretical super-polynomial lower bounds for these models, and the observation that in real-world networks, the amount of maximal cliques is often quite small.
Several of our results only constitute lower bounds for the number of maximal cliques. We believe that relatively close upper bounds can be constructed in a similar fashion, but leave this open for further research.
While Theorem3.7 only holds for 2-norms, we believe that the theorem can be extended to any $L^p$ -norm for $p\neq 1,\infty$ , by looking at the $L^p$ norm-cycle instead of the regular cycle. For $p=1,\infty$ this approach fails, shortest distance paths to non-opposing segments pass through the center of the cycle. Therefore, opposing segments are just as close as many non-opposing ones. Whether Theorem3.7 also holds with 1 or $\infty$ norms is therefore a question for further research. We also believe that this approach also extends to the underlying space $[0,1]^d$ for general $d$ , where instead of looking at a cycle inside $[0,1]^2$ , one studies a $d$ -ball inscribed in $[0,1]^d$ instead.
Funding
This work has received funding through NWO Veni 202.001 and NWO M2 grant 0.379.
Competing interests
The authors declare none.
Data availability
The code used to perform the experiments that support our findings is available at https://github.com/thobl/maximal-cliques-scale-free-rand-graph.
Appendix A. Proof of Theorem3.10
Lemma A.1. Let $(A_i)_{i\in [k]}$ be a set of areas of size $A$ , and let $S$ be a set of vertices, such that $A|S|\gt n^\varepsilon$ for some $\varepsilon \gt 0$ . Then, for any $0\lt \lambda \lt 1$ and $k\lt \exp (\lambda A|S|)$ , with high probability all areas contain at least $(1-\lambda )A|S|$ vertices.
Proof. The Chernoff bound gives for the number of vertices from $S$ within area $A$ , $N_{S,A}$ :
This implies that when $A|S|\gt n^{\varepsilon }$ for some $\varepsilon \gt 0$ , then, with high probability, all areas contain at least $(1-\lambda )A|S|$ vertices.
We follow the same construction of areas and sets as in the proof of Theorem3.7. By Equation (20), this creates $2k = s \cdot n^{\min (\beta /2,\gamma (n)/2)}$ areas of size $A=n^{-3/2\gamma (n)}$ , with on average ${\mathbb{E}}[{|S|}]=n^{(3-\tau )/2-\beta }$ vertices. Thus, LemmaA.1 shows that as long as $\beta +3/2\gamma (n) \lt (3-\tau )/2$ , then all areas contain with high probability at least
vertices for some $c_1\gt 0$ .
From Equation (5), it follows that any set of vertices that contains one in each given area still satisfies the requirement that all vertices in non-opposite boxes connect, as in non-opposite boxes, the connection probability equals 1 by Equation (14). Now to form a co-matching, vertices in opposite boxes should not connect.
With high probability, a positive proportion of vertices in two opposing areas have distance at least $t+h=a+n^{-\gamma (n)}$ , by the uniform distribution within areas, and the fact that a positive proportion of the two areas have distance $t+h$ .
By Equation (5), the probability that vertices $i,j\in S$ at distance at least $a+n^{-\gamma (n)}$ are connected is bounded by
Similarly as in Equation (23),
Using that $n'= c_1n^{(3-\tau )/2-\beta -3/2\gamma (n)}$ therefore yields
Thus, choosing $\beta =\gamma (n)=(3-\tau )/5-\varepsilon$ ensures that there is a co-matching of size $k=s \cdot n^{(3-\tau )/10-\varepsilon }$
B. Proof of Lemma 4.3
Proof. This integral equals
Now the first integral is bounded by
as $2-\tau \gt -1$ , and $k-1-\tau \gt -1$ for $k\geq 3$ as well. We now turn to the second integral. The second integral is finite if
This results in
W.l.o.g. we assume that $x_3\gt x_4\gt \dots \gt x_k$ . Then, the inner integral evaluates to
We now show that all these terms evaluate to a finite integral when plugged into Equation (60). Indeed,
as the index $l-k$ remains at least 3. Therefore, Equation (36) is finite as well.