The 1971 Fortuin–Kasteleyn–Ginibre inequality for two monotone functions on a distributive lattice is well known and has seen many applications in statistical mechanics and other fields of mathematics. In 2008, one of us (Sahi) conjectured an extended version of this inequality for all n > 2 monotone functions on a distributive lattice. Here, we prove the conjecture for two special cases: for monotone functions on the unit square in whose upper level sets are k-dimensional rectangles and, more significantly, for arbitrary monotone functions on the unit square in . The general case for , remains open.
I. INTRODUCTION
For functions f, g on a probability space (L, μ), their expectation and correlation are defined by
Now suppose further that L is a distributive lattice8 and that the probability measure μ satisfies
In this situation, if f, g are positive monotone (decreasing) functions9 on L, then one has
The first inequality is obvious, while the second is the celebrated FKG inequality of Fortuin–Kasteleyn–Ginibre1 that plays an important role in several areas of mathematics/physics. We will refer to a distributive lattice L with probability measure μ satisfying (1.2) as an FKG poset.
In formulating (1.2), we have tacitly assumed that the poset L is a discrete set. However, the FKG inequality also has important continuous versions, which can be proved by a discrete approximation. For example, if Qk = [0, 1]k is the unit hypercube in equipped with the partial order: x ≥ y if and only if xi ≥ yi for all i, then the FKG inequality holds for the Lebesgue measure and, more generally, for any absolutely continuous measure whose density function satisfies (1.2).
In Ref. 5, Sahi introduced a sequence of multilinear functionals En(f1, …, fn), n = 1, 2, 3, …, generalizing E1 and E2 (see Definition 3.1) and proposed the following conjecture:
Sahi5 proved the conjecture for the lattice {0, 1} × {0, 1} and for a certain subclass of positive monotone functions on the general power set lattice {0, 1}k equipped with a product measure. Since the functionals En satisfy the following “branching” property (Ref. 5, Theorem 6)
the inequalities (1.4) form a hierarchy in the following sense: if Cn denotes the n-function positivity conjecture, then Cn implies Cn−1 for n > 2.
The work of Sahi was inspired by that of Richards4 who first had the idea of generalizing the FKG inequality to more than two functions. A natural first candidate for such an inequality is the cumulant (Ursell function) κn, but an easy example shows that the inequality already fails for κ3. Nevertheless, Richards [Ref. 4, Conjecture 2.5] conjectured the existence of such a hierarchy of inequalities, although without an explicit formula for En.
Indeed for n = 3, 4, 5, Sahi’s functional En coincides with the “conjugate” cumulant introduced by Richards [Ref. 4, formula (2.2)], although for n ≥ 6 one has . We note also that Ref. 4 contains two “proofs” of the positivity of —one for a discrete lattice and the other for a continuous analog. However, it seems to us that both proofs have essential gaps. Thus, beyond the special cases treated in Ref. 5, Conjecture 1.1 remains a conjecture, even for n = 3, 4, 5.
In this paper, we provide further evidence in support of Conjecture 1.1. We consider the continuous case of the Lebesgue measure on the unit hypercube Qk = [0, 1]k in , and we prove the inequalities (1.4) for the following two additional cases:
for arbitrary positive monotone functions on the unit square in and
for monotone characteristic functions of k-dimensional rectangles in [0, 1]k and, by multlinearity of En, for functions whose level sets are (not necessarily homothetic) rectangles.
First, we treat the case of three functions on in Sec. II. This introduces several key ideas, including a reduction to a non-linear inequality involving decreasing sequences. In Sec. III, we define En for arbitrary n and prove Conjecture 1.1, first for characteristic functions of k-dimensional rectangles and then for general monotone functions on , that is, we extend Sec. II to all n > 3. This requires additional ideas involving the symmetric group Sn and an intricate induction on n. Subsections III A and III B are written in complete generality, and we hope these ideas will help in the eventual resolution of Conjecture 1.1.
Since the FKG inequality has many applications in probability, combinatorics, statistics, and physics, it reasonable to suppose that the generalized inequality will likewise prove to be useful in one or more of these areas. Although we do not have a compelling application in mind, we feel that it is important to find such an application. Indeed, the right application might provide additional insight into Conjecture 1.1 and perhaps even suggest a line of attack.
To end this section, we tantalize the reader with an interesting reformulation of the inequalities En ≥ 0 in terms of a formal power series from Ref. 5. First, if F(x) is a positive function on a probability space L, then it is natural to define the geometric mean of F by
Now, suppose F(x, t) is a power series of the form
Then, is a well defined power series, and formula (1.6) gives
where the constants cj are certain algebraic expressions in various .
(Ref. 5 , Conjecture 4). If the f1(x), f2(x), … is a sequence of positive monotone functions on an FKG poset, then cn ≥ 0 for all n.
II. THE INEQUALITY FOR THREE FUNCTIONS
For three functions, the multilinear functional En introduced in Ref. 5 is given by
We note that E3 is different from the cumulant (Ursell function), which is given by
We will consider the functional E3 for functions on the unit hypercube,
equipped with the Lebesgue measure and the usual partial order: x ≤ x′ if and only if for all i. We say that a real valued function f on Qk is monotone (decreasing) if x ≤ x′ implies f(x) ≥ f(x′). We note that the FKG inequality is usually stated for monotonically increasing functions, but this is a somewhat arbitrary choice. Indeed, FKG and our theorems for decreasing functions are equivalent to the corresponding results for increasing functions. For a general FKG poset, this follows by reversing the partial order, and for Qk, this follows by the change of variables xi ↦ 1 − xi. We also note that monotonicity for Q1 has the usual 1-variable meaning of a decreasing function.
If f, g, h are positive monotone functions on [0, 1]2, then E3(f, g, h) ≥ 0.
The generalization of Theorem 2.1 to n functions is given in Theorem 3.6. We now reduce Theorem 2.1 to characteristic functions χS, S ⊂ Qk. These are defined by χS(x) = 1 if x ∈ S and χS(x) = 0 if x ∉ S. We will say S is monotone if χS is monotone.
It suffices to prove Theorem 2.1 for χS, χT, χU for all monotone S, T, U.
Any positive f can be written as an integral over the characteristic functions of its upper level sets. Thus, , with ξs(x) = 1 if f(x) > s and 0 otherwise (see the “layer cake principle” in Ref. 2). If f is monotone, then ξs is monotone for every s. Since E3 is multi-linear in f, g, h, this reduces Theorem 2.1 to the case of monotone characteristic functions.■
We now describe a further reduction of Theorem 2.1 to a discrete family of characteristic functions. Let be the set of decreasing m-tuples of integers, each between 0 and m,
For each , we define a monotone subset Sa of Q2 = [0, 1]2 as follows. Divide Q2 uniformly into m2 little squares, write Di,j for the square with top right vertex (i/m, j/m), and set
Then, Sa is a monotone subset of Q2, and, conversely, any monotone union of Di,j is of this form.
It suffices to prove Theorem 2.1 for χa, χb, χc ; ; for all m.
By Lemma 2.2, it suffices to consider monotone characteristic functions χS, χT, χU. Divide Q2 uniformly into m2 little squares Di,j as before, and let Sm, Tm, Um be the unions of the Di,j contained in S, T, U, respectively; then, these are monotone subsets of Q2 of the form (2.5). Moreover, , etc., converge to χS, χSχT, etc., in L1 as m → ∞. Thus, if , then we get .■
A. Proof of the three function inequality in two dimensions
We now prove Theorem 2.1 for χa, χb, χc, which suffices by Lemma 2.3. To simplify the notation, we work directly with a, b, c and we define the product ab, expectation , etc., as follows:
Then, we have χab = χaχb, , E2(a, b) = E2(χa, χb), E3(a, b, c) = E3(χa, χb, χc).
In particular, by the FKG inequality, we obtain the following lemma:
For all a, b in , we have E2(a, b) ≥ 0.■
To study E3(a, b, c), we consider certain perturbations of a. We say that has a descent at i if ai > ai+1, and in this case, we can define three new sequences a− = a−,i, a+ = a+,i, a⋆ = a⋆,i, also in , in which the following changes, and only these, are made to a:
If a has a descent at i, but b does not, then we have .
If a, b have a descent at i and bi+1 ≤ ai+1, then a⋆b = ab.
For all a, b, c in , we have E3(a, b, c) ≥ 0.
Let be the set of triples (a, b, c) in for which E3(a, b, c) attains its minimum, and let be the subset of for which the quantity attains its maximum.
If b, say, also has a descent at i, then by symmetry we may assume bi+1 ≤ ai+1. Then, by Proposition 2.8, E3(a⋆, b, c) ≤ E3(a, b, c), and we again reach a contradiction since .
This proves Theorem 2.1 for χa, χb, χc and, thus, by Lemma 2.3, in general.
III. THE INEQUALITY FOR n FUNCTIONS
A. The definition of En
In this subsection and Subsection III B, we work with arbitrary functions on a probability space. We start by recalling the definition of the multilinear functional En(f1, …, fn) from Ref. 5. This involves the decomposition of a permutation σ in the symmetric group Sn as a product of disjoint cycles,
For σ as in (3.1), we write Cσ for the number of cycles in σ and we set
Then, the following definition is due to Sahi.5
Using (3.3), one can easily verify that E1, E2, E3 coincide with their earlier definitions. We note that the factor of 2 in the term 2E(f1f2f3) in formula (2.1) comes from the two 3-cycles (123) and (213). More generally, En will have repeated terms because Eσ is unchanged if we rearrange the indices within a cycle. For example, for n = 4, we have
We now give an explicit formula for En in a special case.
We note that the above formula implies that En is positive, i.e., Conjecture 1.1 holds for the Lebesgue measure on [0, 1]. While it is easy enough to give a direct proof the lemma, we prefer to postpone the proof to Subsection III B where we will derive it as a consequence of a more general result.
B. Algebraic properties of En
We first prove a recursive formula relating En to En−1.
Lemma 3.2 is now an easy consequence.
We next establish a useful formula for the partial sum Pc of En over the set of permutations containing a fixed cycle c.
C. Proof of the n function inequality for rectangles in any dimension
By a rectangle in dimension k or a k-rectangle, we mean a subset of [0, 1]k of the form
If fi are characteristic functions of k-rectangles, then En(f1, …, fn) ≥ 0.
If f is the characteristic function of a rectangle, then any level set of f is either the same rectangle or empty. However, using the layer-cake principle2 and multilinearity as in the Proof of Lemma 2.2, we obtain the following immediate extension of the previous result.
If f1, …, fn are positive, monotone functions whose level sets are (not necessarily homothetic) rectangles, then En(f1, …, fn) ≥ 0.■
D. Proof of the n function inequality in two dimensions
Our main result is as follows:
If f1, …, fn are positive and monotone on [0, 1]2, then En(f1, …, fn) ≥ 0.
As before, we can deduce this from the special case of χa as in (2.5).
It suffices to prove Theorem 3.6 for , , for all m.
This is proved along the same lines as Lemmas 2.2 and 2.3.■
In this section, we work with and to simplify the notation for a1, …, an in , we set
Then, we have .
To study the positivity of En, we first consider a special case.
We now prove the generalization of Proposition 2.6.
We shall prove the next three theorems together by induction on n.
Let us write A(n), B(n), and C(n) for the assertions of Theorems 3.10, 3.11, and 3.12. Then, A(1), B(1) are vacuously true, while C(1) is evident. Therefore, it suffices to prove the implications A(n − 1) ∧ C(n − 1) ⟹ A(n) and A(n) ⟹ B(n) ⟹ C(n) for all n ≥ 2.
B(n) ⟹ C(n): This argument is similar to the Proof of Theorem 2.9. Let be the set of n-tuples a = (a1, …, an) in for which En(a) achieves its minimum, and let be the subset of for which achieves its maximum on . We claim that for a in each ai is a constant sequence; by Proposition 3.8, this clearly implies C(n), En(a) ≥ 0.
If the claim is not true, then one of the sequences has a descent at some i. First suppose that only one sequence, by symmetry an = a, has a descent at i. By Proposition 3.9 and minimality of En(a), we deduce En(a) = En(a1, …, an−1, a±). Thus, replacing a by a+ preserves En(a) but increases λ(a), which is a contradiction. If two sequences have a descent at i, then by symmetry we may assume these are an−1 = b, an = c with bi+1 ≤ ci+1. Now, B(n), (3.17), implies that replacing c by c⋆ does not increase En(a), but it does increase λ(a), which is a contradiction.■
This proves Theorem 3.6 for and, thus, by Lemma 3.7, in general.
ACKNOWLEDGMENTS
This work was partially supported by the NSF (Grant Nos. DMS-1939600 and DMS-2001537) and the Simons Foundation (Grant No. 509766). The hospitality of the Institute for Advanced Study is gratefully acknowledged.
AUTHOR DECLARATIONS
Conflict of Interest
The authors have no conflicts to disclose.
DATA AVAILABILITY
Data sharing is not applicable to this article as no new data were created or analyzed in this study.
APPENDIX: THE EQUIVALENCE OF CONJECTURES 1.1 AND 1.2
We start by recalling some basic facts about partitions and permutations. For more background and details involving these ideas, we refer the reader to Ref. 3.
A partition λ of n, of length l, is a weakly decreasing sequence of positive integers,
we say that the λj are the parts of λ, and we write and .
The conjugation action of Sn permutes the indices in the cycle decomposition (3.1) of an element σ. Thus, the class of σ is uniquely determined by its “cycle type,” i.e., the partition λ whose parts are the cycle lengths of σ, arranged in decreasing order. Moreover, if denotes the number of parts of size i, then the conjugacy class of cycle type λ contains n!/zλ elements, where
For a function f on a probability space, we define its moments by the formula
We have .
If f is as above and u is a parameter, then we can define the formal logarithm
We have
For a set of functions on a probability space, the following are equivalent:
For all n, we have En(f1, …, fn) ≥ 0 if .
The power series has positive coefficients if .
The previous theorem proves the equivalence of Conjectures 1.1 and 1.2. In particular, our Theorem 3.6 implies Conjecture 1.2 for the Lebesgue measure on the unit square in .
REFERENCES
A distributive lattice is a partially ordered set, closed under join (supremum) ∨ and meet (infimum) ∧ such that each operation distributes over the other. A key example is the power set of a set, partially ordered by inclusion.
In this paper, we use positive as a synonym for non-negative and monotone for monotone decreasing. By reversing the partial order, our results and conjectures hold equally for monotone increasing functions. We note further that the positivity requirement on functions is redundant for the second inequality but essential for the first.